User:KYN/SingularValues
From Wikipedia, the free encyclopedia
- Maybe this should be merged into singular value decomposition?
Let M be an m×n matrix with elements from the field K. Let Sm − 1 and Sn − 1 denote the sets of unit 2-norm vectors in Km and Kn, respectively. Define the function
- σ(u,v) = uTMv
for vectors and
. The function σ maps from
to K. Since both Sm − 1 and Sn − 1 are compact sets, it follows that their product also is compact. Furthermore, since σ is continuous, it follows that the range of σ is compact. Consequently, σ assumes a largest value for at least one pair of vectors
and
. This largest value is denoted σ1 and the corresponding vectors are denoted u1 and v1.
Given u1 and v1, let and
denote the subsets of Sm − 1 and Sn − 1 which are orthogonal to u1 and v1, respectively. Consider now the restriction of σ to the set
. Again it turns out that the domain of this σ is compact, hence it assumes a largest value for some vectors u2 and v2.
Given u2 and v2, let and
denote the subsets of
and
which are orthogonal to u2 and v2, respectively. By restricting σ to the corresponding domain, a largest value σ is found for some vectors u3 and v3.
Clearly, this procedure of defining smaller and smaller subsets of normalized vectors in Km and Kn which are orthogonal to all vectors uk and vk previously found can be repeated until we reach the sets and
, where p = min(m,n), from which the value σp is found as the largest value of σ.
In the end, we have generated the set of scalars together with two sets of vectors
and
. It turns out that the scalars σk are the singular values of M and that uk and vk are the corresponding left and right singular vectors. To see this, let us reexamine u1,v1,σ1. The two vectors were found by minimizing the function σ with
and
. This can be done by finding u and v for which the gradient of σ with respect to u and v equals a linear combination of the Lagrange multipliers corresponding to the conditions
. This gives,
- Mv1 = 2λ1u1 + 0
- MTu1 = 0 + 2λ2v1
Multiplying the first equation from left by and the second equation from left by
and taking
into account gives
or σ1 = 2λ1 = 2λ2. Inserted into the above equations this gives
- Mv1 = σ1
- MTu1 = σ1
This proves that u1,v2 are in fact left and right singular vectors of M and that σ1 is their corresponding singular value. Similar results can be derived for the remaining vectors and values derived above.
This definition of singular values and singular vectors may not be of much practical use, but it interesting from the point of view that it proves the existence of these entities.