|
Science Forum Index » Statistics - Math Forum » condition Gaussian distribution on a hyperplane
Page 1 of 1
|
| Author |
Message |
| shaobo hou |
Posted: Fri Jan 19, 2007 5:15 am |
|
|
|
Guest
|
Hi, given a D dimensional multivariate Gaussian distribution and an arbitrary hyperplane in this D dimensional space, is it possible to compute the conditional Gaussian distribution on the hyperplane?
I know that if the hyperplane's axes corresponds to a subset of the axes of the D dimensional space, then this is can be computed using the standard formula for conditioning a Gaussian distribution.
However if the hyperplane is arbitrarily oriented then it seems to suggest that the original Gaussian distribution need to be rotated (so the axes matches the hyperplane's axes) and then apply the standard formula. But I can't quite figure out how to generate this rotation matrix from just the hyperplane informations.
Can anyone help me? Or suggest a better way?
thanks
Shaobo |
|
|
| Back to top |
|
| shaobo hou |
Posted: Fri Jan 19, 2007 10:22 am |
|
|
|
Guest
|
Actually, the hyperplane can have dimensionality of D-K, where K < D.
What do you mean by "direction cosines of the arbitrary vector"? And assume that even though the other rows are arbitrary, they still needed to be generated in order to create the D x D orthonormal matrix, how is this done?
thanks
Shaobo |
|
|
| Back to top |
|
| Ray Koopman |
Posted: Fri Jan 19, 2007 3:47 pm |
|
|
|
Guest
|
shaobo hou wrote:
Quote: Hi, given a D dimensional multivariate Gaussian distribution and an arbitrary hyperplane in this D dimensional space, is it possible to compute the conditional Gaussian distribution on the hyperplane?
I know that if the hyperplane's axes corresponds to a subset of the axes of the D dimensional space, then this is can be computed using the standard formula for conditioning a Gaussian distribution.
However if the hyperplane is arbitrarily oriented then it seems to suggest that the original Gaussian distribution need to be rotated (so the axes matches the hyperplane's axes) and then apply the standard formula. But I can't quite figure out how to generate this rotation matrix from just the hyperplane informations.
Can anyone help me? Or suggest a better way?
thanks
Shaobo
If by an arbitrary hyperplane you mean the (D-1)-space orthogonal
to an arbitrary vector, then the rotation matrix would be any D x D
orthonormal matrix whose determinant is +1 and whose first row
contains the direction cosines of the arbitrary vector. The D-1 other
rows are arbitrary. |
|
|
| Back to top |
|
| Ray Koopman |
Posted: Fri Jan 19, 2007 5:21 pm |
|
|
|
Guest
|
shaobo hou wrote:
Quote: Actually, the hyperplane can have dimensionality of D-K, where K < D.
What do you mean by "direction cosines of the arbitrary vector"? And assume that even though the other rows are arbitrary, they still needed to be generated in order to create the D x D orthonormal matrix, how is this done?
thanks
Shaobo
How are you specifying your hyperplane? |
|
|
| Back to top |
|
| shaobo hou |
Posted: Sat Jan 20, 2007 2:48 am |
|
|
|
Guest
|
| a set of orthogonal axes and an origin |
|
|
| Back to top |
|
| shaobo hou |
Posted: Sat Jan 20, 2007 2:49 am |
|
|
|
Guest
|
Quote:
How are you specifying your hyperplane?
oops, accidentally replied to the wrong post.
The hyperplane is specified by a set of orthogonal axes and an origin |
|
|
| Back to top |
|
| Ray Koopman |
Posted: Sat Jan 20, 2007 6:20 pm |
|
|
|
Guest
|
shaobo hou wrote:
Quote:
How are you specifying your hyperplane?
oops, accidentally replied to the wrong post.
The hyperplane is specified by a set of orthogonal axes and an origin
Let A be the (D-K) by D matrix of direction cosines of the hyperplane
axes with the original axes, and let b be the hyperplane origin in the
original space. Then the projection of a point x in the original space
onto the hyperplane coordinates is A(x-b). |
|
|
| Back to top |
|
| shaobo hou |
Posted: Sun Jan 21, 2007 4:51 am |
|
|
|
Guest
|
isn't projecting the gaussian onto the hyperlane in this way, equivalent to marginalising the gaussian, which is not really what I want.
What I want is more like finding the intersection of the gaussian with the hyperplane, which can be done by conditioning the gaussian given some of the parameters are known. Provided I can rotate the gaussian so that points projected onto the hyperplane are treated as unknown parameters and the rest as known parameters.
Quote: Let A be the (D-K) by D matrix of direction cosines
of the hyperplane
axes with the original axes, and let b be the
hyperplane origin in the
original space. Then the projection of a point x in
the original space
onto the hyperplane coordinates is A(x-b).
|
|
|
| Back to top |
|
| shaobo hou |
Posted: Mon Jan 22, 2007 12:51 am |
|
|
|
Guest
|
Quote: You want the distribution of Y|Z=0.
Yes, P(Y|Z=0) is basically what I am looking for, sorry I couldn't describe it clearly in the first post. I am guessing that the actual order of the rows in B is not important.
Is there an analytical solution or even robust numerical solution to this problem? |
|
|
| Back to top |
|
| Ray Koopman |
Posted: Mon Jan 22, 2007 3:56 am |
|
|
|
Guest
|
shaobo hou wrote:
Quote: isn't projecting the gaussian onto the hyperlane in this way,
equivalent to marginalising the gaussian, which is not really
what I want.
What I want is more like finding the intersection of the gaussian
with the hyperplane, which can be done by conditioning the gaussian
given some of the parameters are known. Provided I can rotate the
gaussian so that points projected onto the hyperplane are treated
as unknown parameters and the rest as known parameters.
Let A be the (D-K) by D matrix of cosines of the hyperplane axes with
the original axes, let B be any nonsingular K by D matrix whose rows
are orthogonal to the rows of A, and let c be the hyperplane origin
in the original space. Then the joint distribution of Y = A(X-c) and
Z = B(X-c) is multivariate normal, as is the conditional distribution
of Y|Z. You want the distribution of Y|Z=0. |
|
|
| Back to top |
|
| shaobo hou |
Posted: Mon Jan 22, 2007 10:40 am |
|
|
|
Guest
|
| thanks very much, I think I'll take some time to study what you posted. |
|
|
| Back to top |
|
| Ray Koopman |
Posted: Mon Jan 22, 2007 3:40 pm |
|
|
|
Guest
|
shaobo hou wrote:
Quote: You want the distribution of Y|Z=0.
Yes, P(Y|Z=0) is basically what I am looking for, sorry I couldn't
describe it clearly in the first post. I am guessing that the actual
order of the rows in B is not important.
Is there an analytical solution or even robust numerical solution
to this problem?
y = A(x - c) m_y = A (m_x - c)
z = B(x - c) m_z = B (m_x - c)
S_yy = A S_xx A' S_yz = A S_xx B'
S_zy = B S_xx A' S_zz = B S_xx B'
m_y|z = m_y + S_yz S_zz^-1 (z - m_z)
S_yy|z = S_yy - S_yz S_zz^-1 S_zy |
|
|
| Back to top |
|
| |