Projections have a wide variety of applications, such as Gram-Schmidt Orthogonalization and dimensionality reduction. In this article, we are going to talk about orthogonal projections. We’ll discuss oblique projections in the future :)
tl;dr
The orthogonal projection πU(x) of a vector x ∈ R² onto the 1D subspace spanned by a basis vector b is
πU(x) = (⟨x, b⟩ / ⟨b, b⟩) b = (bbᵀ / bᵀb) x
Orthogonal projections map a vector to the closest point in the subspace, and they never increase length: ‖πU(x)‖ ≤ ‖x‖.
They are widely used in machine learning and scientific applications.
Projection matrices are always symmetric and idempotent (P² = P).
Orthogonal Projections
In machine learning, data are usually represented as matrices, or equivalently as collections of vectors. Data in one or two dimensions are easy to visualize and work with. However, as the number of dimensions grows, it becomes increasingly difficult to visualize the data and more expensive to compute with it. We can address this problem by 'projecting' the high-dimensional data onto a lower-dimensional space that is easier to work with, while losing as little of the important information as possible. This process is also called dimensionality reduction.
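To make the idea concrete, here is a minimal NumPy sketch (the data matrix X and the direction b are just made-up examples, and it uses the projection formula we derive below) that reduces a handful of 2D points to a single coordinate each by projecting them onto the line spanned by b:

```python
import numpy as np

# A toy "dataset": four points in R^2, one row per point.
X = np.array([[2.0, 1.0],
              [1.0, 3.0],
              [-1.0, 0.5],
              [4.0, 2.0]])

# Direction spanning the 1D subspace we project onto (any nonzero vector works).
b = np.array([1.0, 1.0])

# Coordinate of each point along b: lambda = (b . x) / (b . b).
coords = X @ b / (b @ b)           # shape (4,), the 1D representation of the data

# The projections themselves, back in R^2 (each point's closest vector on the line).
projections = np.outer(coords, b)  # shape (4, 2)

print(coords)
print(projections)
```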
Let's consider projections onto a 1D subspace to understand them better.
Projections onto 1D subspace
Consider this figure:
As shown in the figure, b is a basis vector that spans a 1D subspace. The vector x belongs to R² (a 2-dimensional space), and we are trying to project x onto the 1D subspace spanned by b. An orthogonal projection is a specific type of linear transformation that maps a vector onto a subspace in such a way that the difference between the vector and its projection is orthogonal (perpendicular) to the subspace. In other words, the projection keeps the part of x that lies along the subspace and drops the part that is perpendicular to it.
The shortest distance from a point to a line is measured along the perpendicular, so to find the projection we minimize the distance between the vector x and the vectors in the subspace.
Let πU(x) be the vector in the subspace closest to x. The shortest distance is then the length of the segment joining x and πU(x):
‖x − πU(x)‖
Because πU(x) is the closest point, this segment is orthogonal to the basis vector b, which means their inner product (the dot product in this case) is 0:
⟨x − πU(x), b⟩ = 0 (eq. 1)
Now let's concentrate on the vector πU(x). Since it lies in the subspace, it can be written as a linear combination of the basis vector b (here, simply a scalar multiple). So,
πU(x) = λb, for some scalar λ
Substituting this into eq. 1, we get
⟨x − λb, b⟩ = 0
Now we can expand this equation using the bilinearity of the inner product and solve for λ. We obtain
⟨x, b⟩ − λ⟨b, b⟩ = 0  ⟹  λ = ⟨x, b⟩ / ⟨b, b⟩
Remember that the λ we obtained is the scalar that scales the basis vector b to give the vector πU(x). So to get πU(x), we multiply λ by the basis vector. Writing the inner products as dot products, we get
πU(x) = λb = (⟨x, b⟩ / ⟨b, b⟩) b = (bᵀx / bᵀb) b
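As a quick sanity check of this formula, here is a minimal NumPy sketch (the particular x and b are arbitrary) that computes λ and πU(x), and verifies that the residual x − πU(x) is orthogonal to b as eq. 1 demands:

```python
import numpy as np

x = np.array([2.0, 1.0])   # the vector we want to project
b = np.array([3.0, 1.0])   # basis vector spanning the 1D subspace

lam = (b @ x) / (b @ b)    # lambda = <x, b> / <b, b>
proj = lam * b             # pi_U(x) = lambda * b

print(lam, proj)
print(np.dot(x - proj, b)) # ~0: the residual is orthogonal to b (eq. 1)
```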
If we look closely at the formula πU(x) = (bᵀx / bᵀb) b, we can see why this is a linear transformation of the vector x. To see it, let's rearrange the terms:
πU(x) = b (bᵀx) / (bᵀb) = (bbᵀ / bᵀb) x
This has the form Px, where P is the transformation matrix
P = bbᵀ / (bᵀb) (eq. 2)
Thus P projects x onto the subspace spanned by b.
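Continuing the sketch above (with the same made-up x and b), we can build P explicitly and check that Px gives exactly the projection λb:

```python
import numpy as np

x = np.array([2.0, 1.0])
b = np.array([3.0, 1.0])

# Projection matrix P = b b^T / (b^T b); a 2x2 matrix in this example.
P = np.outer(b, b) / (b @ b)

print(P)
print(P @ x)                                       # the projection of x
print(np.allclose(P @ x, (b @ x) / (b @ b) * b))   # True: same as lambda * b
```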
From eq. 2 we can also infer that P is symmetric, because
Pᵀ = (bbᵀ / bᵀb)ᵀ = bbᵀ / bᵀb = P
Also, P is idempotent, meaning that projecting twice is the same as projecting once:
P² = (bbᵀ / bᵀb)(bbᵀ / bᵀb) = b(bᵀb)bᵀ / (bᵀb)² = bbᵀ / bᵀb = P
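Both properties are easy to confirm numerically (a minimal sketch, reusing the same made-up b):

```python
import numpy as np

b = np.array([3.0, 1.0])
P = np.outer(b, b) / (b @ b)

print(np.allclose(P, P.T))    # True: P is symmetric
print(np.allclose(P @ P, P))  # True: P is idempotent (P^2 = P)
```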
A word of caution about the name: P is not an orthogonal matrix (an orthogonal matrix satisfies PᵀP = I, whereas our P is not even invertible). The 'orthogonal' in orthogonal projection refers to the error x − πU(x) being orthogonal to the subspace. One consequence is that projecting can never make a vector longer. Let's see why.
Since P is symmetric and idempotent,
‖πU(x)‖² = (Px)ᵀ(Px) = xᵀPᵀPx = xᵀP²x = xᵀPx = ⟨x, πU(x)⟩ ≤ ‖x‖ ‖πU(x)‖
where the last step is the Cauchy-Schwarz inequality. Cancelling one factor of ‖πU(x)‖ gives ‖πU(x)‖ ≤ ‖x‖: the projection is never longer than the original vector, and the two norms are equal exactly when x already lies in the subspace.
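One last numerical sketch (same made-up b) showing that the projection shrinks a vector that lies outside the subspace and leaves a vector already inside the subspace untouched:

```python
import numpy as np

b = np.array([3.0, 1.0])
P = np.outer(b, b) / (b @ b)

x = np.array([2.0, 1.0])   # a generic vector, not on the line spanned by b
y = 2.5 * b                # a vector that already lies in the subspace

print(np.linalg.norm(P @ x), np.linalg.norm(x))  # projection is shorter than x
print(np.allclose(P @ y, y))                     # True: y is unchanged by P
```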
Next up is projections onto general subspaces, which I will try to discuss in the next article.
Till then, Bye 👋
Corrections and feedback are appreciated :)