represents the point on W that is the closest to x
We can see this as the same way if a vector is inconsistent and we need the solution so we can just find the solution closest to the orignal vector
Any x which is a solution to Ax=b0 is the closest in minimizing the least squares error, we call such a solution a best fit solution or a least squares error solution
Finding the best-fit solution to Ax=b
it is possible to find the best-fit solution to an inconsistent system Ax=b without finding b0 by projection
recall that row(A) and null(A) are “orthogonal to eachother”. that is:
Null(A)=(Row(A))⊥
similarly
Null(AT)=(Row(AT))⊥=Col(A)T
Since the perp part b−b0 is orthogonal to Col(A), b−b0 is in Null(AT) so we can write