Categories:
Mathematics,
Physics.
Sturm-Liouville theory
Sturm-Liouville theory extends
the concept of Hermitian matrix eigenvalue problems
to linear second-order ordinary differential equations.
It states that, given suitable boundary conditions,
any such equation can be rewritten using the Sturm-Liouville operator,
and that the corresponding eigenvalue problem,
known as a Sturm-Liouville problem,
will give real eigenvalues and a complete set of eigenfunctions.
General operator
Consider the most general form of a second-order linear
differential operator L^, where p0(x), p1(x), and p2(x)
are real functions of x∈[a,b] and are nonzero for all x∈]a,b[:
L^{u(x)}≡p2(x)u′′(x)+p1(x)u′(x)+p0(x)u(x)
Analogously to matrices,
we now define its adjoint operator L^† as follows:
⟨L^†f∣g⟩≡⟨f∣L^g⟩
What is L^†, given the above definition of L^?
We start from the inner product ⟨f∣L^g⟩:
⟨f∣L^g⟩=∫abf∗(x)L^{g(x)}dx=∫ab(f∗p2)g′′+(f∗p1)g′+(f∗p0)gdx=[(f∗p2)g′+(f∗p1)g]ab−∫ab(f∗p2)′g′+(f∗p1)′g−(f∗p0)gdx=[f∗(p2g′+p1g)−(f∗p2)′g]ab+∫ab((fp2)′′−(fp1)′+(fp0))∗gdx=[f∗(p2g′+(p1−p2′)g)−(f∗)′p2g]ab+∫ab(L^†{f})∗gdx
The newly-formed operator on f must be L^†,
but there is an additional boundary term.
To fix this, we demand that p1(x)=p2′(x)
and that [p2(f∗g′−(f∗)′g)]ab=0, leaving:
⟨f∣L^g⟩=[f∗(p2g′+(p1−p2′)g)−(f∗)′p2g]ab+⟨L^†f∣g⟩=[p2(f∗g′−(f∗)′g)]ab+⟨L^†f∣g⟩=⟨L^†f∣g⟩
Let us look at the expression for L^† we just found,
with the restriction p1=p2′ in mind:
L^†{f}=(p2f)′′−(p1f)′+(p0f)=(p2′′f+2p2′f′+p2f′′)−(p1′f+p1f′)+(p0f)=p2f′′+(2p2′−p1)f′+(p2′′−p1′+p0)f=p2f′′+p1f′+p0f=L^{f}
So L^ is self-adjoint, i.e. L^† is the same as L^!
Indeed, every such second-order linear operator is self-adjoint
if it satisfies the constraints p1=p2′ and [p2(f∗g′−(f∗)′g)]ab=0.
But what if p1=p2′?
Let us multiply L^ by an unknown p(x)=0
and divide by p2(x)=0:
p2pL^{u}=pu′′+pp2p1u′+pp2p0u
We now demand that the derivative p′(x) of the unknown p(x) satisfies:
p′(x)=p(x)p2(x)p1(x)⟹p2(x)p1(x)dx=p(x)1dp
Taking the indefinite integral of this differential equation
yields an expression for p(x):
∫p2(x)p1(x)dx=∫p1dp=ln(p(x))⟹p(x)=exp(∫p2(x)p1(x)dx)
We define an additional function q(x)
based on the last term of (p/p2)L^ shown above:
q(x)≡p(x)p2(x)p0(x)=p2(x)p0(x)exp(∫p2(x)p1(x)dx)
When rewritten using p and q,
the modified operator (p/p2)L^ looks like this:
p2pL^{u}=pu′′+p′u′+qu=(pu′)′+qu
This is the self-adjoint form from earlier!
So even if p1=p2′, any second-order linear operator
with p2(x)=0 can easily be made self-adjoint.
The resulting general form is called the Sturm-Liouville operator L^SL,
for nonzero p(x):
L^SL{u(x)}=(p(x)u′(x))′+q(x)u(x)=L^SL†{u(x)}
Still subject to the constraint [p(f∗g′−(f∗)′g)]ab=0
such that ⟨f∣L^SLg⟩=⟨L^SL†f∣g⟩.
Eigenvalue problem
An eigenvalue problem of L^SL
is called a Sturm-Liouville problem (SLP).
The goal is to find the eigenvalues λ
and corresponding eigenfunctions u(x) that fulfill:
L^SL{u(x)}=−λw(x)u(x)
Where w(x) is a real weight function satisfying w(x)>0 for x∈]a,b[.
By convention, the trivial solution u=0 is not valid.
Some authors have the opposite sign for λ and/or w.
In our derivation of L^SL above,
we imposed the constraint [p(f∗g′−(f′)∗g)]ab=0 to ensure that
⟨L^SL†f∣g⟩=⟨f∣L^SLg⟩.
Consequently, to have a valid SLP,
the boundary conditions (BCs) on u must be such that,
for any two (possibly identical) eigenfunctions um and un, we have:
[p(x)(um∗(x)un′(x)−(um′(x))∗un(x))]ab=0
There are many boundary conditions that satisfy this requirement.
Some notable ones are listed non-exhaustively below.
Verify for yourself that these work:
- Dirichlet BCs: u(a)=u(b)=0
- Neumann BCs: u′(a)=u′(b)=0
- Robin BCs: α1u(a)+β1u′(a)=α2u(b)+β2u′(b)=0 with α1,2,β1,2∈R
- Periodic BCs: p(a)=p(b), u(a)=u(b), and u′(a)=u′(b)
- Legendre “BCs”: p(a)=p(b)=0
If this is fulfilled, Sturm-Liouville theory gives us
useful information about λ and u.
By definition, the following must be satisfied
for two arbitrary eigenfunctions um and un:
0=L^SL{um∗}+λm∗wum∗=L^SL{un}+λnwun
We multiply each by the other eigenfunction,
subtract the results, and integrate:
0=∫abum∗(L^SL{un}+λnwun)−un(L^SL{um∗}+λm∗wum∗)dx=∫abum∗L^SL{un}−unL^SL{um∗}+(λn−λm∗)um∗wundx=⟨um∣L^SLun⟩−⟨L^SLum∣un⟩+(λn−λm∗)⟨um∣wun⟩
The operator L^SL is self-adjoint of course,
so the first two terms vanish, leaving us with:
0=(λn−λm∗)⟨um∣wun⟩
When m=n, we get ⟨un∣wun⟩>0,
so the equation is only satisfied if λn∗=λn,
meaning the eigenvalue λn is real for any n.
When m=n, then λn−λm∗
may or may not be zero depending on the degeneracy.
If there is no degeneracy, then λn−λm∗=0,
meaning ⟨um∣wun⟩=0, i.e. the eigenfunctions are orthogonal.
In case of degeneracy, manual orthogonalization is needed,
which is guaranteed to be doable using the Gram-Schmidt method.
In conclusion, an SLP has real eigenvalues
and orthogonal eigenfunctions: for all m, n:
λn∈R⟨um∣wun⟩=Anδnm
When solving a differential eigenvalue problem,
knowing that all eigenvalues are real is a huge simplification,
so it is always worth checking whether you are dealing with an SLP.
Another useful fact:
it turns out that SLPs always have an infinite number of discrete eigenvalues.
Furthermore, there always exists a lowest eigenvalue λ0>−∞,
called the ground state.
Complete basis
Not only are an SLP’s eigenfunctions orthogonal,
they also form a complete basis, meaning any well-behaved f(x)
can be expanded as a generalized Fourier series with coefficients an:
f(x)=n=0∑∞anun(x)forx∈]a,b[
This series converges faster if f satisfies the same BCs as un;
in that case the expansion is also valid for the inclusive interval x∈[a,b].
To find an expression for the coefficients an,
we multiply the above generalized Fourier series by um∗w
and integrate it to get inner products on both sides:
um∗wf∫abum∗wfdx⟨um∣wf⟩=n=0∑∞anum∗wun=∫ab(n=0∑∞anum∗wun)dx=n=0∑∞an⟨um∣wun⟩
Because the eigenfunctions of an SLP are mutually orthogonal,
the summation disappears:
⟨um∣wf⟩=n=0∑∞an⟨um∣wun⟩=n=0∑∞anAnδnm=amAm
After isolating this for am, we see that
the coefficients are given by the projection of the target
function f onto the normalized eigenfunctions um/Am:
an=An⟨un∣wf⟩=⟨un∣wun⟩⟨un∣wf⟩
As a final remark, we can see something interesting
by rearranging the generalized Fourier series
after inserting the expression for an:
f(x)=n=0∑∞An1⟨un∣wf⟩un(x)=∫ab(n=0∑∞An1un∗(ξ)w(ξ)f(ξ)un(x))dξ=∫abf(ξ)(n=0∑∞An1un∗(ξ)w(ξ)un(x))dξ
Upon closer inspection, the parenthesized summation
must be the Dirac delta function δ(x)
for the integral to work out.
In fact, this is the underlying requirement for completeness:
n=0∑∞An1un∗(ξ)w(ξ)un(x)=δ(x−ξ)
References
- O. Bang,
Applied mathematics for physicists: lecture notes, 2019,
unpublished.