Subsection 1.2.6 Equivalence of vector norms
¶Homework 1.2.6.1.
Fill out the following table:
In this course, norms are going to be used to reason that vectors are "small" or "large". It would be unfortunate if a vector were small in one norm yet large in another norm. Fortunately, the following theorem excludes this possibility:
Theorem 1.2.6.1. Equivalence of vector norms.
Let \(\| \cdot \|: \C^m \rightarrow \mathbb R \) and \(\vert \vert \vert \cdot \vert \vert \vert: \C^m \rightarrow \mathbb R \) both be vector norms. Then there exist positive scalars \(\sigma \) and \(\tau \) such that for all \(x \in \Cm \)
Proof.
The proof depends on a result from real analysis (sometimes called "advanced calculus") that states that \(\sup_{x \in S} f( x ) \) is attained for some vector \(x \in S \) as long as \(f \) is continuous and \(S \) is a compact (closed and bounded) set. For any norm \(\| \cdot \| \text{,}\) the unit ball \(\| x \| = 1 \) is a compact set. When a supremum is an element in \(S \text{,}\) it is called the maximum instead and \(\sup_{x \in S} f( x ) \) can be restated as \(\max_{x \in S} f( x ) \text{.}\)
Those who have not studied real analysis (which is not a prerequisite for this course) have to take this on faith. It is a result that we will use a few times in our discussion.
We prove that there exists a \(\tau \) such that for all \(x \in \C^m \)
leaving the rest of the proof as an exercise.
Let \(x \in \C^m \) be an arbitary vector. W.l.o.g. assume that \(x \neq 0 \text{.}\) Then
The desired \(\tau \) can now be chosen to equal \(\max_{\| y \| = 1} \vert \vert \vert y \vert \vert \vert \text{.}\)
Homework 1.2.6.2.
Complete the proof of Theorem 1.2.6.1.
We need to prove that
From the first part of the proof of Theorem 1.2.6.1, we know that there exists a \(\rho > 0 \) such that
and hence
We conclude that
where \(\sigma = 1 / \rho \text{.}\)
Example 1.2.6.2.
Let \(x \in \R^2 \text{.}\) Use the picture
to determine the constant \(C \) such that \(\| x \|_1 \leq C \| x \|_\infty \text{.}\) Give a vector \(x \) for which \(\| x \|_1 = C \| x \|_\infty \text{.}\)For \(x \in \R^2 \) and the \(C \) you determined in the first part of this problem, prove that \(\| x \|_1 \leq C \| x \|_\infty \text{.}\)
Let \(x \in \Cm \text{.}\) Extrapolate from the last part the constant \(C \) such that \(\| x \|_1 \leq C \| x \|_\infty \) and then prove the inequality. Give a vector \(x \) for which \(\| x \|_1 = C \| x \|_\infty \text{.}\)
-
Consider the picture
The red square represents all vectors such that \(\| x \|_\infty = 1 \) and the white square represents all vectors such that \(\| x \|_1 = 2 \text{.}\)
All points on or outside the red square represent vectors \(y \) such that \(\| y \|_\infty \ge 1 \text{.}\) Hence if \(\| y \|_1 = 2 \) then \(\| y \|_\infty \geq 1 \text{.}\)
-
Now, pick any \(z \neq 0 \text{.}\) Then \(\left\| 2 z / \| z \|_1 \right\|_1= 2) \text{.}\) Hence
\begin{equation*} \left\| 2 z / \| z \|_1 \right\|_\infty \ge 1 \end{equation*}which can be rewritten as
\begin{equation*} \| z \|_1 \le 2 \| z \|_\infty. \end{equation*}Thus, \(C = 2 \) works.
Now, from the picture it is clear that \(x = \left( \begin{array}{c} 1 \\ 1 \end{array} \right) \) has the property that \(\| x \|_1 = 2 \| x \|_\infty \text{.}\) Thus, the inequality is "tight."
-
We now prove that \(\| x \|_1 \leq 2 \| x \|_\infty \) for \(x \in \R^2 \text{:}\)
\begin{equation*} \begin{array}{l} \| x \|_1 \\ ~~~ = ~~~~ \lt \mbox{ definition} \gt \\ \vert \chi_0 \vert + \vert \chi_1 \vert \\ ~~~ \leq ~~~~ \lt \mbox{ algebra} \gt \\ \max(\vert \chi_0 \vert, \vert \chi_1 \vert ) + \max(\vert \chi_0 \vert, \vert \chi_1 \vert ) \\ ~~~ = ~~~~ \lt \mbox{ algebra} \gt \\ 2 \max(\vert \chi_0 \vert, \vert \chi_1 \vert ) \\ ~~~ = ~~~~ \lt \mbox{ definition} \gt \\ 2 \| x \|_\infty. \end{array} \end{equation*} -
From the last part we extrapolate that \(\| x \|_1 \leq m \| x \|_\infty \text{.}\)
\begin{equation*} \begin{array}{l} \| x \|_1 \\ ~~~ = ~~~~ \lt \mbox{ definition} \gt \\ \sum_{i=0}^{m-1} \vert \chi_i \vert \\ ~~~ \leq ~~~~ \lt \mbox{ algebra} \gt \\ \sum_{i=0}^{m-1} \left( \max_{j=0}^{m-1} \vert \chi_j \vert \right) \\ ~~~ = ~~~~ \lt \mbox{ algebra} \gt \\ m \max_{j=0}^{m-1} \vert \chi_j \vert \\ ~~~ = ~~~~ \lt \mbox{ definition} \gt \\ m \| x \|_\infty. \end{array} \end{equation*}Equality holds (i.e., \(\| x \|_1 = m \| x \|_\infty \)) for \(x = \left( \begin{array}{c} 1 \\ 1 \\ \vdots \\ 1 \end{array} \right) \text{.}\)
Some will be able to go straight for the general result, while others will want to seek inspiration from the picture and/or the specialized case where \(x \in \R^2 \text{.}\)
Homework 1.2.6.3.
Let \(x \in \Cm \text{.}\) The following table organizes the various bounds:
For each, determine the constant \(C_{x,y} \) and prove the inequality, including that it is a tight inequality.
Hint: look at the hint!
\(\| x \|_1 \leq \sqrt{m} \| x \|_2 \text{:}\)
This is the hardest one to prove. Do it last and use the following hint:
Consider \(y = \left( \begin{array}{c} \chi_0 / \vert \chi_0 \vert \\ \vdots \\ \chi_{m-1} / \vert \chi_{m-1} \vert \end{array} \right) \) and employ the Cauchy-Schwarz inequality.
(How do you modify this hint to cover the case where one or more elements equal zero?)
\(\| x \|_1 \leq \sqrt{m} \| x \|_2 \text{:}\)
Consider \(y = \left( \begin{array}{c} \psi_0 \\ \vdots \\ \psi_{m-1} \end{array} \right) = \left( \begin{array}{c} \chi_0 / \vert \chi_0 \vert \\ \vdots \\ \chi_{m-1} / \vert \chi_{m-1} \vert \end{array} \right). \) Then
We also notice that \(\| y \|_2 = \sqrt{m} \text{.}\)
From the Cauchy-Swartz inequality we know that
The problem with the above argument is that one of more \(\chi_i \) may equal zero. The argument can be fixed by choosing
or
To demonstrate that equality is attained for at least one non-zero vector, we choose
then \(\| x \|_1 = m \) and \(\| x \|_2 = \sqrt{m}\) so that \(\| x \|_1 = \sqrt{m} \| x \|_2 \text{.}\)
\(\| x \|_1 \leq m \| x \|_\infty \text{:}\)
See Example 1.2.6.2.
\(\| x \|_2 \leq \| x \|_1 \text{:}\)
Taking the square root of both sides yields \(\| x \|_2 \leq \| x \|_1 \text{.}\)
If we now choose
then \(\| x \|_2 = \| x \|_1 \text{.}\)
\(\| x \|_2 \leq \sqrt{m} \| x \|_\infty \text{:}\)
Taking the square root of both sides yields \(\| x \|_2 \leq \sqrt{m} \| x \|_\infty \text{.}\)
Consider
then \(\| x \|_2 = \sqrt{m} \) and \(\| x \|_\infty = 1\) so that \(\| x \|_2 = \sqrt{m} \| x \|_\infty \text{.}\)
\(\| x \|_\infty \leq \| x \|_1 \text{:}\)
Consider
Then \(\| x \|_\infty = 1 = \| x \|_1 \text{.}\)
\(\| x \|_\infty \leq \| x \|_2 \text{:}\)
Taking the square root of both sides yields \(\| x \|_\infty \leq \| x \|_2 \text{.}\)
Consider
Then \(\| x \|_\infty = 1 = \| x \|_2 \text{.}\)
Remark 1.2.6.3.
The bottom line is that, modulo a constant factor, if a vector is "small" in one norm, it is "small" in all other norms. If it is "large" in one norm, it is "large" in all other norms.