Subsection 3.3.4 Householder QR factorization algorithm
ΒΆfit widthLet A be an mΓn with mβ₯n. We will now show how to compute AβQR, the QR factorization, as a sequence of Householder transformations applied to A, which eventually zeroes out all elements of that matrix below the diagonal. The process is illustrated in Figure 3.3.4.1.
Original matrix[(Ο11u21),Ο1]= Housev(Ξ±11a21)(Ξ±11aT12a21A22):=(Ο11aT12βwT120A22βu21wT12)ββMove forwardβ³ΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓΓ0ΓΓΓ0ΓΓΓ0ΓΓΓ0ΓΓΓΓΓΓΓ0ΓΓΓ0ΓΓΓ0ΓΓΓ0ΓΓΓΓΓΓΓ0ΓΓΓ0ΓΓΓ0ΓΓΓ0ΓΓΓΓΓΓΓ0ΓΓΓ00ΓΓ00ΓΓ00ΓΓΓΓΓΓ0ΓΓΓ00ΓΓ00ΓΓ00ΓΓΓΓΓΓ0ΓΓΓ00ΓΓ00ΓΓ00ΓΓΓΓΓΓ0ΓΓΓ00ΓΓ000Γ000ΓΓΓΓΓ0ΓΓΓ00ΓΓ000Γ000ΓΓΓΓΓ0ΓΓΓ00ΓΓ000Γ000ΓΓΓΓΓ0ΓΓΓ00ΓΓ000Γ0000ΓΓΓΓ0ΓΓΓ00ΓΓ000Γ0000
Aβ(Ξ±11aT12a21A22).
Let
[(Ο11u21),Ο1]=Housev((Ξ±11a21))
be the Householder transform computed from the first column of A. Then applying this Householder transform to A yields
(Ξ±11aT12a21A22):=(Iβ1Ο1(1u21)(1u21)H)(Ξ±11aT12a21A22)=(Ο11aT12βwT120A22βu21wT12),
where wT12=(aT12+uH21A22)/Ο1. Computation of a full QR factorization of A will now proceed with the updated matrix A22.
fit width
Homework 3.3.4.1.
Show that
(I00Iβ1Ο1(1u21)(1u21)H)=(Iβ1Ο1(01u21)(01u21)H).
Solution
\begin{equation*}
\begin{array}{rcl}
\FlaTwoByTwo
{ I } { 0 }
{ 0 }
{ I - \frac{1}{\tau_1}
\FlaTwoByOneSingleLine
{ 1 }
{ u_{21} }
\FlaTwoByOneSingleLine
{ 1 }
{ u_{21} }^H
}
\amp=\amp
I -
\FlaTwoByTwo
{0 } { 0 }
{ 0 }
{ \frac{1}{\tau_1}
\FlaTwoByOneSingleLine
{ 1 }
{ u_{21} }
\FlaTwoByOneSingleLine
{ 1 }
{ u_{21} }^H
} \\
\amp=\amp
I -
\frac{1}{\tau_1}
\FlaTwoByTwo
{0 } { 0 }
{ 0 }
{
\FlaTwoByOneSingleLine
{ 1 }
{ u_{21} }
\FlaTwoByOneSingleLine
{ 1 }
{ u_{21} }^H
} \\
\amp=\amp
I -
\frac{1}{\tau_1}
\FlaThreeByThreeBR
{0 } { 0 }{0}
{ 0 }{1}{u_{21}^H}
{ 0}{u_{21}}{ u_{21} u_{21}^H}
\\
\amp=\amp
\left( I - \frac{1}{\tau_1}
\FlaThreeByOneB
{ 0 }
{ 1 }
{ u_{21} }
\FlaThreeByOneB
{ 0 }
{ 1 }
{ u_{21} }^H \right).
\end{array}
\end{equation*}
Aβ(RTLRTR0ABR)=(R00r01R020Ξ±11aT120a21A22),
where RTL and R00 are kΓk upper triangular matrices. Let
[(Ο11u21),Ο1]=Housev((Ξ±11a21)).
and update
A:=(I00(Iβ1Ο1(1u21)(1u21)H))(R00r01R020Ξ±11aT120a21A22)=(Iβ1Ο1(01u21)(01u21)H)(R00r01R020Ξ±11aT120a21A22)=(R00r01R020Ο11aT12βwT1200A22βu21wT12),
where, again, wT12=(aT12+uH21A22)/Ο1.
Let
Hk=(Iβ1Ο1(0k1u21)(0k1u21)H)
be the Householder transform so computed during the (k+1)st iteration. Then upon completion matrix A contains
R=(RTL0)=Hnβ1β―H1H0ΛA
where ΛA denotes the original contents of A and RTL is an upper triangular matrix. Rearranging this we find that
ΛA=H0H1β―Hnβ1R
which shows that if Q=H0H1β―Hnβ1 then ΛA=QR.
Typically, the algorithm overwrites the original matrix A with the upper triangular matrix, and at each step u21 is stored over the elements that become zero, thus overwriting a21. (It is for this reason that the first element of u was normalized to equal "1".) In this case Q is usually not explicitly formed as it can be stored as the separate Householder vectors below the diagonal of the overwritten matrix. The algorithm that overwrites A in this manner is given in Figure 3.3.4.2.
[A,t]=HQR_unb_var1(A)Aβ(ATLATRABLABR) and tβ(tTtB) ATL is 0Γ0 and tT has 0 elementswhile n(ABR)>0 (ATLATRABLABR)β(A00a01A02aT10Ξ±11aT12A20a21A22) and (tTtB)β(t0Ο1t2) [(Ξ±11a21),Ο1]:=[(Ο11u21),Ο1]=Housev(Ξ±11a21)Update (aT12A22):=(Iβ1Ο1(1u21)(1uH21))(aT12A22) via the steps wT12:=(aT12+aH21A22)/Ο1 (aT12A22):=(aT12βwT12A22βa21wT12) (ATLATRABLABR)β(A00a01A02aT10Ξ±11aT12A20a21A22) and (tTtB)β(t0Ο1t2)endwhile
[A,t]=HQR_unb_var1(A)
denotes the operation that computes the QR factorization of mΓn matrix A, with mβ₯n, via Householder transformations. It returns the Householder vectors and matrix R in the first argument and the vector of scalars "Οi" that are computed as part of the Householder transformations in t.
Homework 3.3.4.2.
Given AβRmΓn show that the cost of the algorithm in Figure 3.3.4.2 is given by
CHQR(m,n)β2mn2β23n3 flops.
Solution
The bulk of the computation is in
\begin{equation*}
w_{12}^{T} = ( a_{12}^{T} + u_{21}^{H} A_{22} ) /
\tau_1
\end{equation*}
and
\begin{equation*}
A_{22} - u_{21} w_{12}^{T} \text{.}
\end{equation*}
During the \(k \)th iteration (when \(R_{TL} \) is \(k \times k \)), this means a matrix-vector multiplication (\(u_{21}^H A_{22} \)) and rank-1 update with matrix \(A_{22} \) which is of size approximately \((m-k) \times (n-k) \) for a cost of \(4 (m-k)(n-k) \) flops. Thus the total cost is approximately
\begin{equation*}
\begin{array}{l}
\sum_{k=0}^{n-1} 4 (m-k)(n-k) \\
~~~ = ~~~~\\
4 \sum_{j=0}^{n-1} (m - n + j )j \\
~~~ = ~~~~ \\
4 ( m-n) \sum_{j=0}^{n-1} j +
4 \sum_{j=0}^{n-1} j^2\\
~~~ = ~~~~ \\
2 ( m-n) n ( n-1 ) +
4 \sum_{j=0}^{n-1} j^2 \\
~~~ \approx ~~~~ \\
2 ( m-n) n^2 +
4 \int_0^n x^2 dx \\
~~~ = ~~~~ \\
2 m n^2 - 2 n^3 +
\frac{4}{3} n^3 \\
~~~ = ~~~~ \\
2 m n^2 - \frac{2}{3} n^3.
\end{array}
\end{equation*}
Homework 3.3.4.3.
Implement the algorithm given in Figure 3.3.4.2 as
function [ A_out, t ] = HQR( A )by completing the code in
Assignments/Week03/matlab/HQR.m
. Input is an mΓn matrix A. Output is the matrix A_out with the Householder vectors below its diagonal and R in its upper triangular part. You may want to use Assignments/Week03/matlab/test_HQR.m
to check your implementation.
Solution
See Assignments/Week03/answers/HQR.m
. Warning: it only checks if \(R \) is computed correctly.