
Question [URGENT] a STRANGE problem about using unity_CameraInvProjection

Discussion in 'Shaders' started by gannchuukaju, Mar 18, 2022.

  1. gannchuukaju

    gannchuukaju

    Joined:
    Jun 16, 2021
    Posts:
    8
Recently I searched many articles about how to do full-screen raymarching. The key step is to transform points (actually a quad with 4 vertices generated by Blit) from NDC space to view space and compute the direction of the frustum corner rays; the interpolated result then gives the raymarching direction from the camera to each pixel. The theory is easy, but... the specific methods really cause a headache. Here is method 1:
    Code (CSharp):
    v2f vert(appdata v)
    {
        v2f o;
        VertexPositionInputs vertexPos = GetVertexPositionInputs(v.vertex.xyz);
        o.vertex = vertexPos.positionCS;
        o.uv = v.uv;
        // uv remapped to NDC xy in [-1,1]; z = 0; and a mysterious w of -1
        float3 viewDir = mul(unity_CameraInvProjection, float4(v.uv * 2.0 - 1.0, 0, -1)).xyz;
        o.viewDir = mul(unity_CameraToWorld, float4(viewDir, 0)).xyz;
        return o;
    }
    You can see this method in https://github.com/BenSnow6/Drone-f...Test/Image Effect Test/ImageEffectTest.shader
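    (For completeness, the fragment stage then just renormalizes the interpolated vector before marching. A rough sketch of the usage; the march loop itself is made up, since it isn't the point here:)
    Code (CSharp):
    half4 frag(v2f i) : SV_Target
    {
        // Interpolation across the quad denormalizes the vector, so renormalize it.
        float3 rayDir = normalize(i.viewDir);
        float3 rayOrigin = _WorldSpaceCameraPos;
        // ... raymarch from rayOrigin along rayDir ...
        return half4(rayDir * 0.5 + 0.5, 1); // debug: visualize the direction
    }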
    I think this is a very strange way to get viewDir, especially after I saw articles about reconstructing posCS using the screen [0,1] UV and rawDepth. And here is method 2:
    Code (CSharp):
    float ndcZ = SAMPLE_DEPTH_TEXTURE(_CameraDepthTexture, i.uv);
    // NDC to world pos: worldPos = _InvVP * ndc, where on the C# side
    // _InvVP = Matrix4x4.Inverse(GL.GetGPUProjectionMatrix(camera.projectionMatrix, false) * camera.worldToCameraMatrix);
    float4 fwp = mul(_InvVP, float4(i.uv * 2 - 1, ndcZ, 1));
    fwp /= fwp.w; // perspective divide
    return fwp;
    You can also see this method in https://github.com/LGhassen/Scatter...tererShaders/Assets/Shaders/DepthCommon.cginc
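    (A typical follow-up, my sketch rather than anything from that file: turning the reconstructed world position into a per-pixel ray.)
    Code (CSharp):
    float3 toScene = fwp.xyz - _WorldSpaceCameraPos;
    float3 rayDir = normalize(toScene);   // per-pixel raymarching direction
    float sceneDist = length(toScene);    // march no further than the depth buffer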
    And here are my questions:
    1. I can understand why method 2 divides by fwp.w (https://answers.unity.com/questions/1657972/what-does-unity-camerainvprojection-actually-ishow.html), but why doesn't method 1 divide by something like o.viewDir.w? That really confuses me.
    2. In method 2, ndc.w is set to 1 because after the w-division ndc.w is 1. But why is ndc.w set to -1 in method 1? I tried modifying ndc.z and nothing changed, while the perspective is totally wrong when ndc.w is set to 1. That is so strange.
    3. What is the difference between GL.GetGPUProjectionMatrix(), the camera.XXMatrix properties, and UNITY_MATRIX_I_P?
     
  2. MelvMay

    MelvMay

    Unity Technologies

    Joined:
    May 24, 2013
    Posts:
    11,500
    gannchuukaju likes this.
  3. bgolus

    bgolus

    Joined:
    Dec 7, 2012
    Posts:
    12,352
    The unity_Camera* matrices don't match the various UNITY_MATRIX_* matrices used for rendering.

    UNITY_MATRIX_V is the view matrix, which transforms from world space to view space (aka eye space or camera space); in Unity, view space uses -Z forward. This is equivalent to the C# camera.worldToCameraMatrix, which does not match the game object's transform matrix, as it ignores scale, and Unity's transforms are +Z forward. UNITY_MATRIX_I_V is the inverse of that.
    UNITY_MATRIX_P is the projection matrix, which transforms from view space to clip space using the current graphics API's projection model. This is equivalent to GL.GetGPUProjectionMatrix(camera.projectionMatrix, false). There is no built-in UNITY_MATRIX_I_P for the inverse!

    unity_CameraToWorld is the camera to world space matrix, which, like the view matrix, has no scale from the camera's game object, but still uses +Z forward. So these two matrices are not the same! This is equivalent to the C# Matrix4x4.TRS(camera.transform.position, camera.transform.rotation, Vector3.one). unity_WorldToCamera is the inverse.
    unity_CameraProjection is the default OpenGL projection matrix, which transforms from view space to OpenGL clip space. This is equivalent to camera.projectionMatrix, as Unity defines all of the projection matrices on the camera in OpenGL form until they're sent to the GPU, at which point it transforms them into the projection matrix for the current platform via the above mentioned GL.GetGPUProjectionMatrix() function. Note, OpenGL's clip space is uniquely -1 to 1 near to far on z, where all other APIs use 1 to 0 near to far. So on OpenGL platforms UNITY_MATRIX_P and unity_CameraProjection do match, but they do not on all other APIs. There are also cases where Unity will flip the Y axis of the projection matrix for other APIs to make them match OpenGL ... which renders upside down compared to all other APIs. unity_CameraInvProjection is the inverse.

    Using unity_CameraInvProjection can be nice because it works "the same" regardless of which API is in use when calculating a view direction from the screen UV. But the depth texture isn't the same depending on whether it's OpenGL or not. It also only gets you the view space view direction, not the world space one.
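    Something like this (an untested sketch using the UNITY_REVERSED_Z macro) handles that depth difference, remapping the raw depth sample to the OpenGL-style [-1,1] NDC z that unity_CameraInvProjection expects:
    Code (CSharp):
    float rawDepth = SAMPLE_DEPTH_TEXTURE(_CameraDepthTexture, i.uv);
    #if UNITY_REVERSED_Z
        // D3D-style reversed depth: 1 at the near plane, 0 at the far plane.
        float zGL = 1.0 - 2.0 * rawDepth;
    #else
        // OpenGL-style depth: 0 at the near plane, 1 at the far plane.
        float zGL = 2.0 * rawDepth - 1.0;
    #endif
    float4 viewPos = mul(unity_CameraInvProjection, float4(i.uv * 2.0 - 1.0, zGL, 1.0));
    viewPos /= viewPos.w; // view space position, -Z forward (OpenGL convention)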

    Using a manually calculated _InvVP has the advantage of going straight to world space in one step, and you can use the raw depth texture value as the input. But of course it requires an additional script.
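    That script can be as small as this (a minimal sketch for the built-in render pipeline; the component and property names are just illustrative, matching the _InvVP used above):
    Code (CSharp):
    using UnityEngine;

    [RequireComponent(typeof(Camera))]
    public class InvVPUploader : MonoBehaviour
    {
        public Material effectMaterial; // the raymarching / reconstruction material

        void OnRenderImage(RenderTexture src, RenderTexture dst)
        {
            Camera cam = GetComponent<Camera>();
            // API-specific GPU projection (matches UNITY_MATRIX_P), not the
            // OpenGL-convention camera.projectionMatrix.
            Matrix4x4 proj = GL.GetGPUProjectionMatrix(cam.projectionMatrix, false);
            Matrix4x4 invVP = Matrix4x4.Inverse(proj * cam.worldToCameraMatrix);
            effectMaterial.SetMatrix("_InvVP", invVP);
            Graphics.Blit(src, dst, effectMaterial);
        }
    }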
     
    gannchuukaju likes this.
  4. gannchuukaju

    gannchuukaju

    Joined:
    Jun 16, 2021
    Posts:
    8
  5. gannchuukaju

    gannchuukaju

    Joined:
    Jun 16, 2021
    Posts:
    8
    Thank you, Mr. Bgolus! I have seen your high-quality answers in many threads. Now I understand that:
    1. camera.worldToCameraMatrix == UNITY_MATRIX_V
    2.
    · GL.GetGPUProjectionMatrix(camera.projectionMatrix, false) == UNITY_MATRIX_P (in the current graphics API)
    · unity_CameraProjection == camera.projectionMatrix == UNITY_MATRIX_P (on the OpenGL API)
    3.
    · camera.cameraToWorldMatrix == Matrix4x4.TRS(camera.transform.position, camera.transform.rotation, new Vector3(1, 1, -1))
    · unity_CameraToWorld == Matrix4x4.TRS(camera.transform.position, camera.transform.rotation, Vector3.one)
    So if we transform from view space to world space with unity_CameraToWorld, the Z of the view-space vector (right-handed coordinate system) must be negated first.
    4. We can use unity_CameraInvProjection to go back to view space when we don't need depth, e.g. when computing a screen-based raymarching direction. But when we do need depth data, e.g. for reconstructing posCS or posWS, the Z range of the different graphics APIs must be considered. (A quick sanity check of these relations is sketched below.)
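    A minimal sketch for checking points 1-3 by eye in the console (the class name is just illustrative):
    Code (CSharp):
    using UnityEngine;

    public class MatrixCheck : MonoBehaviour
    {
        void Start()
        {
            Camera cam = Camera.main;
            Debug.Log("worldToCameraMatrix (== UNITY_MATRIX_V):\n" + cam.worldToCameraMatrix);
            Debug.Log("GPU projection, current API (== UNITY_MATRIX_P):\n" +
                      GL.GetGPUProjectionMatrix(cam.projectionMatrix, false));
            Debug.Log("cameraToWorldMatrix:\n" + cam.cameraToWorldMatrix);
            Debug.Log("TRS, -Z scale (should match cameraToWorldMatrix):\n" +
                      Matrix4x4.TRS(cam.transform.position, cam.transform.rotation, new Vector3(1, 1, -1)));
            Debug.Log("TRS, +Z forward (== unity_CameraToWorld):\n" +
                      Matrix4x4.TRS(cam.transform.position, cam.transform.rotation, Vector3.one));
        }
    }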
    -------------------------------------------------------------------------------------------------------------------------------------------------------
    But I am still confused about why "float3 viewDir = mul(unity_CameraInvProjection, float4(v.uv * 2.0 - 1.0, 0, -1)).xyz" is correct.
     
  6. bgolus

    bgolus

    Joined:
    Dec 7, 2012
    Posts:
    12,352
    It's a very odd setup, I agree. There's an old graphics programmer idiom I like to pull out at times like this.

    "Make sure you have an even number of sign errors."

    The unity_CameraInvProjection matrix converts from OpenGL clip space to a -Z forward view space. Using a w of -1, I think, results in the equivalent of a +Z forward vector, which the +Z forward unity_CameraToWorld matrix then turns into a proper world space vector.
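    To make that concrete, here's the algebra for a symmetric perspective projection (my sketch, with t = cot(fovY/2), a = aspect, n/f = near/far planes):
    Code (CSharp):
    //          | a/t   0     0            0          |
    // invP  =  |  0   1/t    0            0          |
    //          |  0    0     0           -1          |
    //          |  0    0  -(f-n)/(2fn)  (f+n)/(2fn)  |
    //
    // invP * (x, y, 0, w) = (a*x/t,  y/t,  -w,  w*(f+n)/(2fn))
    //
    // w = +1  =>  xyz = (a*x/t, y/t, -1)  : -Z forward, the view space convention
    // w = -1  =>  xyz = (a*x/t, y/t, +1)  : the same ray with z negated, which is
    //             exactly what the +Z forward unity_CameraToWorld wants as input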
     
    gannchuukaju likes this.
  7. gannchuukaju

    gannchuukaju

    Joined:
    Jun 16, 2021
    Posts:
    8
    Yes, you are right! I just did this:
    Code (CSharp):
    float3 viewDir = mul(unity_CameraInvProjection, float4(v.uv * 2.0 - 1.0, 0, 1)).xyz;
    viewDir.z = -viewDir.z;
    o.viewDir = mul(unity_CameraToWorld, float4(viewDir, 0)).xyz;
    It is equivalent to "float3 viewDir = mul(unity_CameraInvProjection, float4(v.uv * 2.0 - 1.0, 0, -1)).xyz"; that's amazing!
    Now only one last question remains: why not divide by mul(unity_CameraInvProjection, float4(v.uv * 2.0 - 1.0, 0, -1)).w?
    Here was the reasoning in my draft: to compute the view-space position, you still need to divide by viewDir.w (i.e., by mul(unity_CameraInvProjection, float4(v.uv * 2.0 - 1.0, 0, 1)).w). But that division disappears in method 1.
    Here is my conjecture:
    I notice that applying unity_CameraInvProjection to an NDC point seems to produce a w component of the form A·z + B = 1/z', where (x, y, z) is the position in NDC space and (x', y', z') the position in view space.
    So to compute the real view-space position, multiplying the result by z' is necessary. "Multiply by z'" means "divide by 1/z'", and since A·z + B = 1/z' and mul(unity_CameraInvProjection, float4(v.uv * 2.0 - 1.0, 0, 1)).w == A·z + B, "multiply by z'" is equivalent to "divide by viewDir.w".
    But if we only need a view-space direction, skipping the multiplication by z' is fine, because dividing x'y'z' by any common scalar still yields the same direction. Oh, that is really exciting if the conjecture is correct!
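    (Checking this against the symmetric GL projection sketched earlier — my arithmetic, not taken from any of the linked sources:)
    Code (CSharp):
    // invP * (x, y, z, 1) has  w' = A*z + B,
    //   with  A = -(f-n)/(2fn)  and  B = (f+n)/(2fn).
    //
    // Near plane (z = -1): w' = 1/n.   Far plane (z = +1): w' = 1/f.
    // So w' = 1/(-z_view) is positive for anything in front of the camera, and
    // dividing a direction vector by a positive scalar never changes the direction.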
     
  8. bgolus

    bgolus

    Joined:
    Dec 7, 2012
    Posts:
    12,352
    You're overthinking the problem.

    In the example shader the first thing it does to that vector in the fragment shader is this:
    Code (CSharp):
    float3 rayDir = normalize(input.viewVector);
    normalize(viewVec.xyz) is equal to normalize(viewVec.xyz / n) where n is any positive value. So there's no reason to do a divide if it isn't going to affect the results.

    But also, because the input w was -1, I believe the output w is also going to be negative, and dividing by that would undo the whole "trick". You'd have an odd number of sign errors. ;)
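    (Indeed, with the same symmetric projection as before — my arithmetic:)
    Code (CSharp):
    // invP * (x, y, 0, -1) has  w' = -(f+n)/(2fn),  which is negative for any
    // valid near/far planes. Dividing by it would flip the whole vector,
    // undoing the sign trick.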
     
    gannchuukaju likes this.
  9. gannchuukaju

    gannchuukaju

    Joined:
    Jun 16, 2021
    Posts:
    8
    Oh, so that's it! Cheers! :)