Hudyma Alex
Hudyma Alex
Hudyma Alex
by
Alex Hudyma
Lakehead University
Thunder Bay, Ontario, Canada
copyright c (2015) Alex Hudyma
Abstract
This project is designed to give readers a basic understanding on the topic of Linear
Functional Anaylsis. After covering preliminaries, we will examine different spaces and
their properties, bounded linear operators, duality, and finish with the Hahn-Banach
Theorem - a very powerful theorem that is a cornerstone of functional analysis and has
applications in other areas of mathematics as well.
i
Acknowledgements
I would like to thank my supervisor, Monica Ilie, for her guidance with this project,
and the coordinator of Math 4301, Razvan Anisca. I would also like to thank the students
and faculty of the Lakehead University Department of Mathematical Sciences for a great
four years.
ii
Contents
Abstract i
Acknowledgements ii
Chapter 1. Introduction 1
Chapter 2. Preliminaries 2
1. Linear Algebra 2
2. Metric Spaces 3
3. Lebesgue Integration 6
Chapter 3. Spaces 9
1. Normed Spaces 9
2. Banach Spaces 13
3. Inner Product Spaces 13
4. Hilbert Spaces 15
Chapter 4. Operators 17
1. Continuous Linear Transformations 17
2. The Space B(X, Y ) 18
3. Isometries, Isomorphisms, and Inverses 21
Chapter 5. Duality 22
1. Dual Spaces 22
2. Sublinear Functionals and Seminorms 23
3. The Hahn-Banach Theorem 25
Chapter 6. Concluding Remarks 28
Bibliography 29
iii
CHAPTER 1
Introduction
This project will introduce the methods of linear functional analysis. Our basic goal
here is to perform analysis on infinite-dimension vector spaces. Because not all of our
linear algebra properties hold for infinite spaces, we extend ideas like the length of a
vector into an abstract setting: the norm. Much like in real and complex analysis, we will
give additional structures to certain spaces to yield special properties. Through different
examples, we will gain an understanding of normed and inner product spaces, bounded
linear operators, duality, and the Hahn-Banach theorem.
1
CHAPTER 2
Preliminaries
1. Linear Algebra
Here, we will recall some essential concepts from Linear Algebra for use throughout
the project. We will be working with the sets R, C, and N. Note that R and C are both
algebraic fields, and when giving examples, theorems, or definitions that apply equally to
R and C, we will simply use F to denote either set.
Definition 2.1. A vector space over F is a non-empty set V with two operations,
vector addition and scalar multiplication, where vector addition is a function from V ×
V → V defined by (x, y) = x + y, and scalar multiplication is a function from F × V
→ V defined by (α, x) = αx. The following properties hold for any α, β ∈ F and any
x, y, z ∈ V .
(i) x + y = y + x, x + (y + z) = (x + y) + z;
(ii) there exists a unique 0 ∈ V such that for all x ∈ V , x + 0 = 0 + x = x;
(iii) for all x ∈ V , there exists a unique −x ∈ V such that x + (−x) = (−x) + x = 0;
(iv) 1x = x, α(βx) = (αβ)x;
(v) α(x + y) = αx + αy, (α + β)x = αx + βx.
Rather than proving Definition 2.1 (i)-(v) for U , we can use the subspace test:
Let U be a non-empty subset of a vector space V . For all α ∈ F and x, y ∈ U , if
(i) x + y ∈ U ;
(ii) αx ∈ U
then U is a subspace of V . Note that the set {0} ⊂ V is a linear subspace.
2. Metric Spaces
Metric spaces are an abstract setting for the discussion of concepts from analysis
(such as convergence and continuity). The basic tool used here is the notion of a distance
function, or a metric, which we will define in this section.
Definition 2.6. A metric on a set M is a function d : M × M → R such that for all
x, y, z ∈ M :
(i) d(x, y) ≥ 0;
(ii) d(x, y) = 0 ⇔ x = y;
(iii) d(x, y) = d(y, x);
(iv) d(x, z) ≤ d(x, y) + d(y, z) (the triangle inequality).
Chapter 2. Preliminaries 4
If d is a metric on M , then we called the pair (M, d) a metric space. Unless the set M
consists of a single point, it can have more than one metric, and we often say “the metric
space M ” rather than (M, d).
Example 2.7. For an integer k ≥ 1, the function d : Fk × Fk → R defined by
k
!1/2
X
2
d(x, y) = |xj − yj | ,
j=1
Note that if x ∈ A, then x is a closure point of A (for every > 0, d(x, x) = 0 < ),
hence A ⊂ Ā. However, the converse is not necessarily true, as we see in the following
theorem.
Theorem 2.11. Let (M, d) be a metric space and A ⊂ M .
(i) Ā is closed and is the smallest closed set that contains A.
(ii) A is closed, if and only if A = Ā.
(iii) A is closed if and only if the following implication holds:
if {xn } is a sequence in A and x ∈ M such that xn → x, then x ∈ A.
(iv) x ∈ Ā if and only if inf{d(x, y) : y ∈ A} = 0.
(v) A is dense if and only if for any element x ∈ M , there exists a sequence {yn } in
A such that yn → x.
Chapter 2. Preliminaries 5
We can extend the idea of a continuous function from real analysis to describe general
metric spaces.
Example 2.14. For k ≥ 1, the space Fk with the standard metric is complete.
Definition 2.18. Let (M, d) be a compact metric space. Then CF (M ) denotes the
set of continuous functions f : M → F. We will define the uniform metric on CF (M ) by
Note that most properties of CF (M ) hold for both real and complex cases, so generally,
we will omit the subscript and just write C(M ). Furthermore, if M is a bounded interval
[a, b] ⊂ R, we will write C[a, b]. Also note that (C(M ), d) is a complete metric space.
So far we have seen only the uniform metric on C[a, b], but there are other metrics
Rb
that are defined by integrals. If a f (x)dx is the usual Riemann integral of f ∈ C[a, b],
then for 1 ≤ p < ∞, the function dp : C[a, b] × C[a, b] → R defined by
Z b 1/p
p
dp (f, g) = |f (x) − g(x)| dx
a
is a metric on C[a, b]. However, (C[a, b], dp ) is not a complete metric space, due to
some drawbacks of the Riemann integral. Thus, we will introduce here the Lebesgue
Integral, which is more powerful and works on a wider class of functions. We will look at
metric spaces which are complete using the metric dp , and will introduce the notion of a
“measure” of a set (AKA the “length;” for example, the bounded interval [a, b] has length
b − a). Note that many sets such as R have “infinite measure,” and so we introduce the
+
extended sets of real numbers: R = R ∪ {−∞, ∞} and R = [0, ∞) ∪ {∞}. The standard
algebraic operations apply, with the products 0 · ∞ = 0 · (−∞) = 0, and the operations
∞ − ∞ and ∞/∞ forbidden (like dividing by zero in R).
Definition 2.20. A σ-algebra is a collection, Σ, of subsets of X satisfying:
(i) ∅, X ∈ Σ;
(ii) S ∈ Σ ⇒ X \ S ∈ Σ;
∞
S
(iii) Sn ∈ Σ, n = 1, 2, . . . , ⇒ Sn ∈ Σ.
n=1
Note that the sets of measure zero in (R, ΣL , µL ) are the sets A with the following
property: for any > 0, there exists a sequence of intervals Ij ⊂ R, j = 1, 2, . . . , such that
∞
[ X∞
A⊂ Ij and `(Ij ) < .
j=1 j=1
Now we would like to define a metric on L1 (X), and the obvious choice is
Z
d1 (f, g) = |f − g|dµ,
X
1
for all f, g ∈ L (X). However, this doesn’t fit criteria (ii) of Definition 2.6 for being a
metric. Unfortunately, L1 (X) has functions f, g such that f = g almost everywhere, but
Chapter 2. Preliminaries 8
f 6= g (which may happen on a set of measure zero). Hence we may have d1 (f, g) = 0,
but f 6= g. So, we will regard “equivalent” any two functions f, g ∈ L1 (X) which are
equal almost everywhere. We will define an equivalence relation ≡ on L1 (X) by
f ≡ g ⇐⇒ f (x) = g(x) almost everywhere for all x ∈ X.
This partitions L1 (X) into a space of equivalence classes, which we will denote by L1 (X).
The space L1 (X) is a vector space and we see that d1 (f, g) = 0 if and only if f ≡ g. Thus
d1 meets all criteria to be a metric for L1 (X), which we will use from now on instead of
L1 (X). We should also note that when we say the “function” f ∈ L1 (X), we mean some
representative of the appropriate equivalence class.
Example 2.29. Take the counting measure (N, Σc , µc ). Any function f : N → F can
be seen as an F-valued sequence {an } (with an = f (n) for n ≥ 1) such that every sequence
P∞ function. It follows, then, that {an } is integrable with respect
is a measurable
1
to µc if
and only if n=1 |an | < ∞. The space of these sequences will be denoted by ` .
Definition 2.30. Let (X, Σ, µ) = (Rk , ΣL , µL ) for some k ≥ 1. If f ∈ L1 (X), then f
is Lebesgue integrable.
Theorem 2.31. If I = [a, b] is a bounded interval and f : I → R is bounded and
Riemann integrable on I, then f is Lebesgue integrable on I, and the values of the two
integrals are the same. Particularly, continuous functions on I are Lebesgue integrable,
and we use the same notation as the Riemann integral.
Definition 2.32. We define the space Lp (X) as follows:
Z
L (X) = {f : f is measurable and ( |f |p dµ)1/p < ∞} when 1 ≤ p < ∞.
p
X
We also define the corresponding sets L (X) by identifying functions in Lp (X) which
p
are equal almost everywhere, and considering corresponding spaces of equivalence classes.
Example 2.33. For 1 ≤ p < ∞, the function dp : Lp (X) × Lp (X) → R defined by
Z 1/p
p
dp (f, g) = |f − g| dµ
X
p
is the standard metric on L (X).
Example 2.34 (Counting Measure). Let 1 ≤ p ≤ ∞ and (X, Σ, µ) = (N, Σc , µc ).
Then the space Lp (N) consists of the set of sequences {αn } in F with the properties:
∞
!1/p
X
p
|αn | < ∞ for 1 ≤ p < ∞,
n=1
Spaces
1. Normed Spaces
In R2 and R3 , we can easily picture the length of a vector. However, when it comes to
other, possibly infinite-dimensional vector spaces, we assign a set of axioms to describe
the length of a vector in these spaces. These axioms define the “norm” of a vector as we
see in the next definition.
Definition 3.1. Let X be a vector space over F. Then a norm on X is a function
k·k : X → R such that for all x, y ∈ X and α ∈ F,
(i) kxk ≥ 0;
(ii) kxk = 0 if and only if x = 0;
(iii) kαxk = |α|kxk;
(iv) kx + yk ≤ kxk + kyk (the triangle inequality).
Definition 3.2. If X is a vector space with a norm on it, then X is a normed vector
space (also just called a normed space).
Example 3.3. The function k·k : Fn → R defined by
n
!1/2
X
k(x1 , x2 , . . . , xn )k = |xj |2
j=1
Proof.
Definition 3.7. Let X be a vector space and k·k1 and k·k2 be two different norms
on X. Then k·k1 is equivalent to k·k2 if there exists an M, m > 0 such that for all x ∈ X,
Note that the notion “equivalent to” defines an equivalence relation on the set of all
norms on X.
Theorem 3.8. Let X be a finite-dimensional vector space with norm k·k and basis
Pn P 1/2
n 2
{e1 , e2 , . . . , en }. Define another norm on X by k j=1 λj ej k1 = j=1 |λj | . Then
the norms k·k and k·k1 are equivalent.
Chapter 3. Spaces 11
n
!1/2
X
Proof. Let M = kej k . Since {e1 , e2 , . . . , en } forms a basis for X, we have
j=1
M > 0. Additionally,
Xn n
X
k λj ej k ≤ kλj ej k
j=1 j=1
n
X
= |λj |kej k
j=1
n
!1/2 n
!1/2
X X
≤ |λj |2 kej k2
j=1 j=1
n
X
= Mk λj ej k1 .
j=1
Proof. From the definition of equivalency, there exists an M, m > 0 such that
mk·k1 ≤ k·k2 ≤ M k·k1 , and there exists a K, k > 0 such that kk·k2 ≤ k·k3 ≤ M k·k2
for all x ∈ X. So:
(i) Letting N = M1 and n = m1 gives N, n > 0 such that N k·k2 ≤ k·k1 ≤ nk·k2 for
all x ∈ X. Thus k·k1 is equivalent to k·k2 .
(ii) Letting L = KM and l = km gives L, l > 0 such that lk·k1 ≤ k·k3 ≤ Lk·k1 for
all x ∈ X. Hence k·k3 is equivalent to k·k1 .
Corollary 3.10. If k·k and k·k2 are any two norms on a finite-dimensional vector
space X, then they are equivalent.
Chapter 3. Spaces 12
Proof. Let {e1 , e2 , . . . , en } be a basis for X, and let k·k1 be the norm defined in
Theorem 3.8. Then by Theorem 3.8, both k·k and k·k2 are equivalent to k·k1 , and by
Lemma 3.9, k·k2 is equivalent to k·k.
Theorem 3.11. Let X be a vector space and k·k and k·k1 be equivalent norms on X
such that d and d1 are metrics defined by d(x, y) = kx − yk and d1 (x, y) = kx − yk1 . Then
(X, d) is complete if and only if (X, d1 ) is complete.
A proof of this theorem can be found in [1]. Based on this idea, when we have two
equivalent norms on the same metric space, we may use whichever is easier to work with
to show that the space is complete.
Lemma 3.12. Let X be a finite-dimensional vector space over F with basis {e1 , e2 , . . . , en }.
Pn P 1/2
n 2
If k·k1 is a norm on X defined by k j=1 λj ej k1 = j=1 |λj | , then X is a complete
metric space.
So we get |λj,k − λj,m |2 ≤ 2 for all k, m ≥ N and 1 ≤ j ≤ n. Thus {λj,m }∞ m=1 is a Cauchy
sequence in F. Now since F is a complete metric space, there exists a λj ∈ F such that
λj,m → λj as m → ∞. By the definition of sequence limits, there exists an Nj ∈ N such
that for all m ≥ Nj ,
2
|λj,m − λj |2 ≤ .
n
Now let N0 = max(N1 , N2 , . . . , Nn ) and write x = nj=1 λj ej . Then for m ≥ N0 ,
P
n n
X X 2
kxm − xk21 = 2
|λj,m − λj | ≤ = 2 .
j=1 j=1
n
Proof. Let {e1 , e2 , . . . en } be a basis for X and let k·k1 be the norm on X defined
in Lemma 3.11. Then by Corollary 3.10, the norms k·k and k·k1 are equivalent, and by
Lemma 3.12 the metric space X under the norm k·k1 is complete. Thus by Theorem 3.11,
X with norm k·k is also complete.
Chapter 3. Spaces 13
2. Banach Spaces
When we look at infinite-dimensional vector spaces, we find that there may be two
norms which are not equivalent. So, we may not assume that theorems from the previous
section hold, and will now see what we can say about infinite-dimensional vector spaces.
Theorem 3.14. Let X be a normed vector space and S a linear subspace of X. Then
S̄ is a linear subspace of X.
Proof. Take x, y ∈ S̄ and α ∈ F. Now since x and y are in S̄, by Definition 2.10
(iv), there exist sequences {xn } and {yn } in S such that lim xn = x and lim yn = y. We
n→∞ n→∞
know that S is a linear subspace, and so xn + yn ∈ S for all n ∈ N. Hence
x + y = lim (xn + yn ) ∈ S̄.
n→∞
Similarly, αxn ∈ S for all n ∈ N, so
αx = lim αxn ∈ S̄.
n→∞
Proof.
(i) This follows from Corollary 3.13.
(ii) This follows from Theorem 2.19.
(iii) This follows from Theorem 2.35.
(iv) This is a special case of part (iii) (by taking counting measure on N).
(v) This proof can be found in [1].
Recall that in R3 , we can calculate the angle θ between two vectors using the scalar
product. Much like we did with the length of a vector and the norm, we can extend this
idea to other spaces with higher dimensions by finding a set of axioms that the scalar
product satisfies, and using them to form a definition for the general case.
Chapter 3. Spaces 14
Definition 3.17. Let X be a real vector space. An inner product on X is a function
(·, ·) : X × X → R such that for all x, y, z ∈ X and α, β ∈ R,
(i) (x, x) ≥ 0;
(ii) (x, x) = 0 if and only if x = 0;
(iii) (αx + βy, z) = α(x, z) + β(y, z);
(iv) (x, y) = (y, x).
Pk
Example 3.18. The function (·, ·) : Rk × Rk → R defined by (x, y) = n=1 xn yn is
the standard inner product on Rk .
Note that the above definition and example only apply to real vector spaces. We have
a separate definition for complex inner products that is slightly different.
Definition 3.19. Let X be a complex vector space. An inner product on X is a
function (·, ·) : X × X → C such that for all x, y, z ∈ X and α, β ∈ C,
(i) (x, x) ∈ R and (x, x) ≥ 0;
(ii) (x, x) = 0 if and only if x = 0;
(iii) (αx + βy, z) = α(x, z) + β(y, z);
(iv) (x, y) = (y, x).
Pk
Example 3.20. The function (·, ·) : Ck × Ck → C defined by (x, y) = n=1 xn y n is
the standard inner product on Ck .
Definition 3.21. If X is a real or complex valued vector space with an inner product
(·, ·), then X is an inner product space.
We notice that since the definitions are so similar, we will continue to generalize
theorems and ideas of inner product spaces to F, differentiating between complex and
real inner products when necessary. Generally, an inner product can be defined on any
finite-dimensional vector space.
Example 3.22. Let X be a k-dimensional vector space with basis {e1 , e2 , . . . , ek },
and x, y ∈ X such that x = kn=1 λn en and y = kn=1 µn en . Then the function (·, ·) :
P P
A proof of Theorem 3.30 can be found in [1]. We will now give a couple definitions
and theorems here for use later in the project.
Definition 3.31. Let X be an inner product space. Then the vectors x, y ∈ X are
orthogonal if (x, y) = 0.
Definition 3.32. Let X be an inner product space with A ⊂ X. The orthogonal
complement of A is the set
A⊥ = {x ∈ X : (x, a) = 0 for all a ∈ A}.
So we see that the set A⊥ is the set of vectors in X which are orthogonal to every
vector in A (if A = ∅, then A⊥ = X).
Chapter 3. Spaces 16
3 ⊥
Example 3.33. If X = R and A = {(a1 , a2 , 0) : a1 , a2 ∈ R}, then A = {(0, 0, x3 ) :
x3 ∈ R}, since by definition, x ∈ A⊥ if and only if for any a = (a1 , a2 , 0) we have
(x, a) = x1 a1 + x2 a2 = 0. If we want this equality to hold for every a ∈ A, we must have
x1 = x2 = 0.
CHAPTER 4
Operators
Since we’ve covered some properties of normed spaces, we can now talk about functions
mapping from one normed space into another. The simplest to work with of these maps
will be linear (recall the definition of a linear transformation: a function T : V → W
such that for all α, β ∈ F and x, y ∈ V , T (αx + βy) = αT (x) + βT (y)), and the most
important will be continuous. We will give examples, as well as look at the space of all
continuous linear transformations. First however, we will give alternate characterizations
of continuity for linear transformations.
Theorem 4.1. Let X and Y be normed linear spaces and T : X → Y be a linear
transformation. Then the following are equivalent:
(i)T is uniformly continuous;
(ii)T is continuous;
(iii)T is continuous at 0;
(iv) there exists a real number k > 0 such that kT (x)k ≤ k whenever x ∈ X and
kxk ≤ 1;
(v) there exists a real number k > 0 such that kT (x)k ≤ kkxk for all x ∈ X.
While the first two implications are quite obvious, the proof of this theorem can be
found in [1].
Example 4.2. Define T : C[0, 1] → F by T (f ) = f (0). Then if f ∈ C[0, 1], we get
kT (f )k = |f (0)| ≤ sup{|f (x)| : x ∈ [0, 1]} = kf k.
So using part (v) of Theorem 4.1 with k = 1, we see that T is continuous, and also that
this was a much easier way of proving continuity than the epsilon-delta methods that we
are used to.
Theorem 4.3. Let X be a finite-dimensional normed space, Y be any normed linear
space, and T : X → Y be a linear transformation. Then T is continuous.
We just saw that B(X, Y ) is a vector space, so we will now look to define a norm. It is
important to note that proofs here may have three different norms from different spaces
in the same equation. However, we will use the symbol k·k as usual for all three norms,
as it will be easy to determine which norm we are referring to based on the space that
the element is from. We should also note that for a normed linear space X, the set of
bounded linear operators from X to X will simply be denoted B(X).
Chapter 4. Operators 19
Lemma 4.8. Let X and Y be normed spaces and define k·k : B(X, Y ) → R by kT k =
sup {kT (x)k : kxk ≤ 1}. Then k·k is a norm on B(X, Y ), so B(X, Y ) is a normed space.
Proof. Before checking the axioms of a norm for k·k, we must note the following
consequence of Theorem 4.1:
sup{kT (x)k : kxk ≤ 1} = inf{k : kT (x)k ≤ kkxk ∀ x ∈ X},
Lemma 4.9. Let X be a normed linear space with W a dense subspace of X, and
let Y be a Banach Space with S ∈ B(X, Y ). If x ∈ X and {xn }, {yn } are sequences
in W such that lim xn = lim yn = x, then {S(xn )} and {S(yn )} both converge, and
n→∞ n→∞
lim S(xn ) = lim S(yn ).
n→∞ n→∞
In the next theorem, we explore what conditions are required for B(X, Y ) to be a
Banach Space.
Theorem 4.10. If X is a normed linear space and Y is a Banach space, then B(X, Y )
is also a Banach Space.
Proof. By the Lemma 4.8, we know that B(X, Y ) is a normed space, so we must
show that it is complete. Let {Tn } be a Cauchy sequence in B(X, Y ). Then {Tn } is
bounded, so there exists an M > 0 such that kTn k ≤ M for all n ∈ N. Take x ∈ X. Since
we have
kTn x − Tm xk = k(Tn − Tm )(x)k ≤kTn − Tm kkxk,
and {Tn } is Cauchy, it follows that {Tn x} is a Cauchy sequence in Y . Now since Y is
a Banach space, it is complete, and hence {Tn x} is convergent. So we define a transfor-
mation T : X → Y by T (x) = lim Tn (x). We want to show that T is a bounded linear
n→∞
operator, and that {Tn } converges to T . First showing that T is linear, we see
T (x + y) = lim Tn (x + y) = lim (Tn x + Tn y) = lim Tn x + lim Tn y = T x + T y
n→∞ n→∞ n→∞ n→∞
Duality
1. Dual Spaces
Here we will study another space of functions known as the dual space.
Definition 5.1. Let X be a normed space. Linear transformations from X to F are
called linear functionals. The space B(X, F) is called that dual space of X and is denoted
by X 0 .
Corollary 5.2. If X is a normed vector space, then X 0 is a Banach space.
Proof. We know that the space F is complete, hence by Theorem 4.10, X 0 = B(X, F)
is a Banach space.
Definition 5.3. We define the Kronecker delta, denoted δjk . For any integers j, k,
1, if j = k
δjk =
0, if j 6= k
Theorem 5.4. If X is a finite dimensional normed linear space with basis {v1 , v2 , . . . , vn },
then X 0 has a basis {f1 , f2 , . . . , fn } such that fj (vk ) = δjk for 1 ≤ j, k ≤ n. In particular,
dimX 0 = dimX.
Proof. Let x ∈ X. Since {v1 , v2 , P . . . , vn } is a basis for X, we know there exist unique
scalars α1 , α2 , . . . , αn such that x = nk=1 αk vk . For j = 1, . . . , n, we define a function
fj : X → F by fj (x) = αj for x ∈ X. It can be verified that fj is a linear transformation
such that fj (vk ) = δjk . Also, fj is continuous by Theorem 4.3, and so fj ∈ X 0 . Now to
showP that {f1 , f2 , . . . , fn } is a basis for X 0 , suppose that β1 , β2 , . . . , βn are scalars such
that nj=1 βj fj = 0. Then
n
X n
X
0= βj fj (vk ) = βj δjk = βk , 1 ≤ k ≤ n,
j=1 j=1
Proof. First, we will prove the existence of y. If f (x) = 0 for all x ∈ H, then y = 0
works. Otherwise, Kerf = {x ∈ H : f (x) = 0} is a proper closed subspace of H, so by
Theorem 3.30, (Kerf )⊥ 6= {0}. Hence there exists a z ∈ (Kerf )⊥ such that f (z) = 1.
z
Now we define y = (since z 6= 0), and take an arbitrary x ∈ H. Then since f is
kzk2
linear,
f (x − f (x)z) = f (x) − f (x)f (z) = 0,
so x−f (x)z ∈ Kerf . But z ∈ (Kerf )⊥ , so (x−f (x)z, z) = 0. Thus (x, z)−f (x)(z, z) = 0,
so by rearranging we see (x, z) = f (x)kzk2 , and therefore
z
f (x) = (x, ) = (x, y).
kzk2
Now if kxk ≤ 1, then using the Cauchy-Schwarz inequality we get
|f (x)| = |(x, y)| ≤ kxkkyk ≤ kyk,
y
so f (x) ≤ kyk. Furthermore, if x = then kxk = 1 and
kyk
|f (y)| (y, y)
kf k ≥ |f (x)| = = = kyk.
kyk kyk
Hence we also have f (x) ≥ kyk, and so f (x) = kyk.
Next, we will prove uniqueness. If y and w are such that
f (x) = (x, y) = (x, w)
for all x ∈ H, then we must have (x, y − w) = 0 for all x ∈ H. So by Lemma 3.25, we
have y − w = 0, meaning that y = w. Hence y is unique.
Theorem 5.5 gives a representation of all elements of the dual space of a general Hilbert
space. So, in a sense, H0 can be identified with H.
If we have a vector space X, many times in functional analysis we will have a linear
functional fW : W → F defined on a subspace W ⊂ X, but to use this functional, we
need it to be defined on the whole set X. So, we will now see if we can extend the domain
from W to X, and get the following definition.
Definition 5.6. Let X be a vector space, W a linear subspace of X, and fW a linear
functional on W . Then a linear functional fX on X is an extension of fW if fX (w) = fW (w)
for all w ∈ W .
Chapter 5. Duality 24
Definition 5.7. Let X be a real vector space. A sublinear functional on X is a
function p : X → R such that:
(i) p(x + y) ≤ p(x) + p(y) x, y ∈ X.
(ii) p(αx) = αp(x) x ∈ X, α ≥ 0.
Note that from this definition, it is clear that if p is a sublinear functional on X, then
p(0) = 0,
−p(−x) ≤ p(x) for all x ∈ X,
(2.1) −p(y − x) ≤ p(x) − p(y) ≤ p(x − y) for all x, y ∈ X.
In addition, if we know that p satisfies p(−x) = p(x) for all x ∈ X, then p(x) ≥ 0 for all
x ∈ X, and (1.1) becomes:
Notice the distinctions between the definition of a sublinear functional and seminorm.
We have that if X is real, then a seminorm p is a sublinear functional (but the converse
is not necessarily true). Also notice that if we add the condition p(x) = 0 implies x = 0,
then the seminorm p meets the criteria of a norm.
Also, the definition implies that if p is a seminorm, then
p(0) = 0,
We will state some definitions and lemmas here for use in our proof of the Hahn-Banach
theorem. This theorem allows us to construct extensions of functionals on general spaces,
and show the existence of elements in a dual space.
Definition 5.10. Suppose that M is a non-empty set, and ≺ is an ordering on M.
Then ≺ is a partial order on M if
(i) x ≺ x for all x ∈ M;
(ii) x ≺ y and y ≺ x ⇒ x = y;
(iii) x ≺ y and y ≺ z ⇒ x ≺ z;
and we say M is a partially ordered set.
In addition, if ≺ is defined for all pairs of elements (so for any x, y ∈ M, either x ≺ y
or y ≺ x holds), then ≺ is a total order and M is a totally ordered set.
Furthermore, if M is a partially ordered set, then y ∈ M is a maximal element of M
if y ≺ x ⇒ y = x. If N ⊂ M, then y ∈ M is an upper bound for N if x ≺ y for all x ∈ N .
Example 5.11. We use partial and total orders in mathematics all the time.
(i) The usual ordering, ≤ on R is a total order.
(ii) A partial order on R2 is given by (x1 , x2 ) ≺ (y1 , y2 ) ⇐⇒ x1 ≤ y1 and x2 ≤ y2 .
(iii) If S is an arbitrary set and M is the set of all subsets of S, then set inclusion
(A ⊂ B for A, B ∈ M) is a partial order on M.
Lemma 5.12. Let X be a real vector space with a proper linear subspace, W . Let p be
a sublinear functional on X and fW be a linear functional on W such that fW (w) ≤ p(w)
for all w ∈ W . Suppose that z1 ∈
/ W , and let
W1 = Sp{z1 } ⊕ W = {αz1 + w : α ∈ R, w ∈ W }.
Then there exists a ξ1 ∈ R and fW1 : W1 → R that satisfies
(3.1) fW1 (αz1 + w) = αξ1 + fW (w) ≤ p(αz1 + w), α ∈ R, w ∈ W.
Since fW is a linear functional, fW1 is also linear, and for w ∈ W we have fW1 (w) =
fW (w), so fW1 is an extension of fW .
Proof. For any u, v ∈ W , we have
fW (u) + fW (v) = fW (u + v) ≤ p(u + v) ≤ p(u − z1 ) + p(v + z1 ),
which gives
fW (u) − p(u − z1 ) ≤ −fW (u) + p(v + z1 ).
Hence
ξ1 = inf {−fW (v) + p(v + z1 )} > −∞,
v∈W
and
−ξ1 + fW (u) ≤ p(u − z1 ), ξ1 + fW (v) ≤ p(v + z1 ), u, v ∈ W.
Chapter 5. Duality 26
Now multiplying the first inequality by a β > 0 gives
−βξ1 + fW (βu) ≤ p(β(u − z1 )),
and if we let α = −β and w = βu, we get the desired inequality (3.1) when α < 0.
Similarly, the second inequality gives (3.1) when α > 0, and for α = 0, (3.1) follows
immediately from the way p is defined.
Lemma 5.13 (Zorn’s Lemma). Let M be a non-empty, partially ordered set such that
every totally ordered subset of M has an upper bound. Then there exists a maximal
element in M.
Theorem 5.14 (The Hahn-Banach Theorem). Let X be a real vector space, with a
sublinear functional p defined on X. Suppose that W is a linear subspace of X, and fW
is a linear functional on W satisfying
fW (w) ≤ p(w), w ∈ W.
Then fW has an extension fX on X such that
fX (x) ≤ p(x), x ∈ X.
Proof. Let E denote the set of linear functionals f on X that satisfy the following:
(i) f is defined on a linear subspace Df such that W ⊂ Df ⊂ X;
(ii) f (w) = fW (w) for w ∈ W ;
(iii) f (x) ≤ p(x) for x ∈ Df .
So E is the set of all extensions f on fW to general subspaces Df ⊂ X which satisfy
the hypothesis of the theorem on their domain. We will apply Zorn’s lemma to our set
E, and then show that the maximal element of E is our desired functional. So first let us
verify that E satisfies the criteria of Zorn’s lemma. We know that fW ∈ E, so E 6= ∅. Now
define a relation ≺ on E as follows: for any f, g ∈ E,
f ≺ g ⇐⇒ Df ⊂ Dg and f (x) = g(x) for all x ∈ Df .
So f ≺ g iff g is an extension of f . One can verify that ≺ is a partial order on E (though
it is not a total order, since there can be functionals f, g ∈ E which are both extensions
of fW but neither is an extension of the other). Now, take a totally ordered set G ⊂ E.
By definition, G is totally ordered iff for any two arbitrary functionals f, g ∈ G, one is an
extension of the other. We will construct an upper bound for G in E. Define a set
[
ZG = Df .
f ∈G
Now using the total ordering of G, it can be verified that Zg is a linear subspace of X.
So we will define a linear functional fG on ZG as follows. Given a z ∈ ZG , there exists a
ξ ∈ E such that z ∈ Dξ . Next define fG (z) = ξ(z) (this definition does not depend on ξ,
since if η is a functional in G with z ∈ Dη , we must have ξ(z) = η(z) by the total ordering
of G). Again, we can verify fG is linear by the total ordering of G. Also, since ξ ∈ E, we
Chapter 5. Duality 27
have fG (z) = ξ(z) ≤ p(z), and if z ∈ W then fG (z) = fW (z). Thus fG ∈ E and f ≺ fG
for all f ∈ G, and we have that fG is an upper bound for G.
Now since G was an arbitrary, totally ordered subset of E, we conclude by Zorn’s
lemma that E has a maximal element, fmax . Now suppose that the domain Df max 6= X.
Then by Lemma 5.12, fmax has an extension which also lies in E. However, this would
contradict the maximality of fmax in E. So we must have Df max = X, and hence fX = fmax
is our desired extension.
CHAPTER 6
Concluding Remarks
In this paper, we have given a basic introduction to the branch of mathematics known
as linear functional analysis. We discussed the fundamental ideas of normed spaces, inner
product spaces, operators, dual spaces, and the Hahn-Banach Theorem. These concepts
have applications in many other areas of mathematics, including applied studies such as
quantum mechanics and differential equations.
There is plenty of opportunity for further study in this field. While I only proved
the Hahn-Banach theorem for real vector spaces, there are other versions with different
hypotheses, including a version for normed spaces. This is a tool used in many proofs
in all areas of mathematics. Staying in functional analysis, some further topics of study
include weak convergence, the adjoint of an operator, compact operators, and spectral
theory.
28
Bibliography
[1] Bryan P. Rynne and Martin A. Youngston. Linear Functional Analysis. Springer, second edition,
2008. 12, 13, 15, 17
29