Main Page | Report this Page
 
   
Science Forum Index  »  Statistics - Math Forum  »  condition Gaussian distribution on a hyperplane
Page 1 of 1    
Author Message
shaobo hou
Posted: Fri Jan 19, 2007 5:15 am
Guest
Hi, given a D dimensional multivariate Gaussian distribution and an arbitrary hyperplane in this D dimensional space, is it possible to compute the conditional Gaussian distribution on the hyperplane?

I know that if the hyperplane's axes corresponds to a subset of the axes of the D dimensional space, then this is can be computed using the standard formula for conditioning a Gaussian distribution.

However if the hyperplane is arbitrarily oriented then it seems to suggest that the original Gaussian distribution need to be rotated (so the axes matches the hyperplane's axes) and then apply the standard formula. But I can't quite figure out how to generate this rotation matrix from just the hyperplane informations.

Can anyone help me? Or suggest a better way?

thanks

Shaobo
shaobo hou
Posted: Fri Jan 19, 2007 10:22 am
Guest
Actually, the hyperplane can have dimensionality of D-K, where K < D.

What do you mean by "direction cosines of the arbitrary vector"? And assume that even though the other rows are arbitrary, they still needed to be generated in order to create the D x D orthonormal matrix, how is this done?

thanks

Shaobo
Ray Koopman
Posted: Fri Jan 19, 2007 3:47 pm
Guest
shaobo hou wrote:
Quote:
Hi, given a D dimensional multivariate Gaussian distribution and an arbitrary hyperplane in this D dimensional space, is it possible to compute the conditional Gaussian distribution on the hyperplane?

I know that if the hyperplane's axes corresponds to a subset of the axes of the D dimensional space, then this is can be computed using the standard formula for conditioning a Gaussian distribution.

However if the hyperplane is arbitrarily oriented then it seems to suggest that the original Gaussian distribution need to be rotated (so the axes matches the hyperplane's axes) and then apply the standard formula. But I can't quite figure out how to generate this rotation matrix from just the hyperplane informations.

Can anyone help me? Or suggest a better way?

thanks

Shaobo

If by an arbitrary hyperplane you mean the (D-1)-space orthogonal
to an arbitrary vector, then the rotation matrix would be any D x D
orthonormal matrix whose determinant is +1 and whose first row
contains the direction cosines of the arbitrary vector. The D-1 other
rows are arbitrary.
Ray Koopman
Posted: Fri Jan 19, 2007 5:21 pm
Guest
shaobo hou wrote:
Quote:
Actually, the hyperplane can have dimensionality of D-K, where K < D.

What do you mean by "direction cosines of the arbitrary vector"? And assume that even though the other rows are arbitrary, they still needed to be generated in order to create the D x D orthonormal matrix, how is this done?

thanks

Shaobo

How are you specifying your hyperplane?
shaobo hou
Posted: Sat Jan 20, 2007 2:48 am
Guest
a set of orthogonal axes and an origin
shaobo hou
Posted: Sat Jan 20, 2007 2:49 am
Guest
Quote:

How are you specifying your hyperplane?


oops, accidentally replied to the wrong post.

The hyperplane is specified by a set of orthogonal axes and an origin
Ray Koopman
Posted: Sat Jan 20, 2007 6:20 pm
Guest
shaobo hou wrote:
Quote:

How are you specifying your hyperplane?


oops, accidentally replied to the wrong post.

The hyperplane is specified by a set of orthogonal axes and an origin

Let A be the (D-K) by D matrix of direction cosines of the hyperplane
axes with the original axes, and let b be the hyperplane origin in the
original space. Then the projection of a point x in the original space
onto the hyperplane coordinates is A(x-b).
shaobo hou
Posted: Sun Jan 21, 2007 4:51 am
Guest
isn't projecting the gaussian onto the hyperlane in this way, equivalent to marginalising the gaussian, which is not really what I want.

What I want is more like finding the intersection of the gaussian with the hyperplane, which can be done by conditioning the gaussian given some of the parameters are known. Provided I can rotate the gaussian so that points projected onto the hyperplane are treated as unknown parameters and the rest as known parameters.

Quote:
Let A be the (D-K) by D matrix of direction cosines
of the hyperplane
axes with the original axes, and let b be the
hyperplane origin in the
original space. Then the projection of a point x in
the original space
onto the hyperplane coordinates is A(x-b).
shaobo hou
Posted: Mon Jan 22, 2007 12:51 am
Guest
Quote:
You want the distribution of Y|Z=0.

Yes, P(Y|Z=0) is basically what I am looking for, sorry I couldn't describe it clearly in the first post. I am guessing that the actual order of the rows in B is not important.

Is there an analytical solution or even robust numerical solution to this problem?
Ray Koopman
Posted: Mon Jan 22, 2007 3:56 am
Guest
shaobo hou wrote:
Quote:
isn't projecting the gaussian onto the hyperlane in this way,
equivalent to marginalising the gaussian, which is not really
what I want.

What I want is more like finding the intersection of the gaussian
with the hyperplane, which can be done by conditioning the gaussian
given some of the parameters are known. Provided I can rotate the
gaussian so that points projected onto the hyperplane are treated
as unknown parameters and the rest as known parameters.

Let A be the (D-K) by D matrix of cosines of the hyperplane axes with
the original axes, let B be any nonsingular K by D matrix whose rows
are orthogonal to the rows of A, and let c be the hyperplane origin
in the original space. Then the joint distribution of Y = A(X-c) and
Z = B(X-c) is multivariate normal, as is the conditional distribution
of Y|Z. You want the distribution of Y|Z=0.
shaobo hou
Posted: Mon Jan 22, 2007 10:40 am
Guest
thanks very much, I think I'll take some time to study what you posted.
Ray Koopman
Posted: Mon Jan 22, 2007 3:40 pm
Guest
shaobo hou wrote:
Quote:
You want the distribution of Y|Z=0.

Yes, P(Y|Z=0) is basically what I am looking for, sorry I couldn't
describe it clearly in the first post. I am guessing that the actual
order of the rows in B is not important.

Is there an analytical solution or even robust numerical solution
to this problem?

y = A(x - c) m_y = A (m_x - c)

z = B(x - c) m_z = B (m_x - c)

S_yy = A S_xx A' S_yz = A S_xx B'

S_zy = B S_xx A' S_zz = B S_xx B'

m_y|z = m_y + S_yz S_zz^-1 (z - m_z)

S_yy|z = S_yy - S_yz S_zz^-1 S_zy
 
Page 1 of 1       All times are GMT - 5 Hours
The time now is Tue Oct 14, 2008 6:50 am