---
title: "Sturm-Liouville theory"
sort_title: "Sturm-Liouville theory"
date: 2021-02-23
categories:
- Mathematics
- Physics
layout: "concept"
---

**Sturm-Liouville theory** extends
the concept of Hermitian matrix eigenvalue problems
to linear second-order ordinary differential equations.

It states that, given suitable boundary conditions,
any such equation can be rewritten using the **Sturm-Liouville operator**,
and that the corresponding eigenvalue problem,
known as a **Sturm-Liouville problem**,
will give real eigenvalues and a complete set of eigenfunctions.


## General operator

Consider the most general form of a second-order linear
differential operator $$\hat{L}$$, where $$p_0(x)$$, $$p_1(x)$$, and $$p_2(x)$$
are real functions of $$x \in [a,b]$$ and are nonzero for all $$x \in \,\,]a, b[$$:

$$\begin{aligned}
    \hat{L} \{u(x)\}
    \equiv p_2(x) \: u''(x) + p_1(x) \: u'(x) + p_0(x) \: u(x)
\end{aligned}$$

Analogously to matrices,
we now define its **adjoint** operator $$\hat{L}^\dagger$$ as follows:

$$\begin{aligned}
    \inprod{\hat{L}^\dagger f}{g}
    \equiv \inprod{f}{\hat{L} g}
\end{aligned}$$

What is $$\hat{L}^\dagger$$, given the above definition of $$\hat{L}$$?
We start from the inner product $$\inprod{f}{\hat{L} g}$$:

$$\begin{aligned}
    \inprod{f}{\hat{L} g}
    &= \int_a^b f^*(x) \hat{L}\{g(x)\} \dd{x}
    = \int_a^b (f^* p_2) g'' + (f^* p_1) g' + (f^* p_0) g \dd{x}
    \\
    &= \Big[ (f^* p_2) g' + (f^* p_1) g \Big]_a^b - \int_a^b (f^* p_2)' g' + (f^* p_1)' g - (f^* p_0) g \dd{x}
    \\
    &= \Big[ f^* (p_2 g' + p_1 g) - (f^* p_2)' g \Big]_a^b + \int_a^b \! \Big( (f p_2)'' - (f p_1)' + (f p_0) \Big)^* g \dd{x}
    \\
    &= \Big[ f^* \big( p_2 g' + (p_1 - p_2') g \big) - (f^*)' p_2 g \Big]_a^b + \int_a^b \Big( \hat{L}^\dagger\{f\} \Big)^* g \dd{x}
\end{aligned}$$

The newly-formed operator on $$f$$ must be $$\hat{L}^\dagger$$,
but there is an additional boundary term.
To fix this, we demand that $$p_1(x) = p_2'(x)$$
and that $$\big[ p_2 (f^* g' - (f^*)' g) \big]_a^b = 0$$, leaving:

$$\begin{aligned}
    \inprod{f}{\hat{L} g}
    &= \Big[ f^* \big( p_2 g' + (p_1 - p_2') g \big) - (f^*)' p_2 g \Big]_a^b + \inprod{\hat{L}^\dagger f}{g}
    \\
    &= \Big[ p_2 \big( f^* g' - (f^*)' g \big) \Big]_a^b + \inprod{\hat{L}^\dagger f}{g}
    \\
    &= \inprod{\hat{L}^\dagger f}{g}
\end{aligned}$$

Let us look at the expression for $$\hat{L}^\dagger$$ we just found,
with the restriction $$p_1 = p_2'$$ in mind:

$$\begin{aligned}
    \hat{L}^\dagger \{f\}
    &= (p_2 f)'' - (p_1 f)' + (p_0 f)
    \\
    &= (p_2'' f + 2 p_2' f' + p_2 f'') - (p_1' f + p_1 f') + (p_0 f)
    \\
    &= p_2 f'' + (2 p_2' - p_1) f' + (p_2'' - p_1' + p_0) f
    \\
    &= p_2 f'' + p_1 f' + p_0 f
    \\
    &= \hat{L}\{f\}
\end{aligned}$$

So $$\hat{L}$$ is **self-adjoint**, i.e. $$\hat{L}^\dagger$$ is the same as $$\hat{L}$$!
Indeed, every such second-order linear operator is self-adjoint
if it satisfies the constraints $$p_1 = p_2'$$ and $$\big[ p_2 (f^* g' - (f^*)' g) \big]_a^b = 0$$.

But what if $$p_1 \neq p_2'$$?
Let us multiply $$\hat{L}$$ by an unknown $$p(x) \neq 0$$
and divide by $$p_2(x) \neq 0$$:

$$\begin{aligned}
    \frac{p}{p_2} \hat{L} \{u\}
    = p u'' + p \frac{p_1}{p_2} u' + p \frac{p_0}{p_2} u
\end{aligned}$$

We now demand that the derivative $$p'(x)$$ of the unknown $$p(x)$$ satisfies:

$$\begin{aligned}
    p'(x)
    = p(x) \frac{p_1(x)}{p_2(x)}
    \quad \implies \quad
    \frac{p_1(x)}{p_2(x)} \dd{x}
    = \frac{1}{p(x)} \dd{p}
\end{aligned}$$

Taking the indefinite integral of this differential equation
yields an expression for $$p(x)$$:

$$\begin{aligned}
    \int \frac{p_1(x)}{p_2(x)} \dd{x}
    = \int \frac{1}{p} \dd{p}
    = \ln\!\big( p(x) \big)
    \quad \implies \quad
    \boxed{
        p(x)
        = \exp\!\bigg( \int \frac{p_1(x)}{p_2(x)} \dd{x} \bigg)
    }
\end{aligned}$$

We define an additional function $$q(x)$$
based on the last term of $$(p / p_2) \hat{L}$$ shown above:

$$\begin{aligned}
    \boxed{
        q(x)
        \equiv p(x) \frac{p_0(x)}{p_2(x)}
    }
    = \frac{p_0(x)}{p_2(x)} \exp\!\bigg( \int \frac{p_1(x)}{p_2(x)} \dd{x} \bigg)
\end{aligned}$$

When rewritten using $$p$$ and $$q$$,
the modified operator $$(p / p_2) \hat{L}$$ looks like this:

$$\begin{aligned}
    \frac{p}{p_2} \hat{L} \{u\}
    = p u'' + p' u' + q u
    = (p u')' + q u
\end{aligned}$$

This is the self-adjoint form from earlier!
So even if $$p_1 \neq p_2'$$, any second-order linear operator
with $$p_2(x) \neq 0$$ can easily be made self-adjoint.
The resulting general form is called the **Sturm-Liouville operator** $$\hat{L}_\mathrm{SL}$$,
for nonzero $$p(x)$$:

$$\begin{aligned}
    \boxed{
        \begin{aligned}
            \hat{L}_\mathrm{SL} \{u(x)\}
            &= \hat{L}_\mathrm{SL}^\dagger \{u(x)\}
            \\
            &= \Big( p(x) \: u'(x) \Big)' + q(x) \: u(x)
        \end{aligned}
    }
\end{aligned}$$

Still subject to the constraint $$\big[ p (f^* g' - (f^*)' g) \big]_a^b = 0$$
such that $$\inprod{f}{\hat{L}_\mathrm{SL} g} = \inprod{\hat{L}_\mathrm{SL}^\dagger f}{g}$$.


## Eigenvalue problem

An eigenvalue problem of $$\hat{L}_\mathrm{SL}$$
is called a **Sturm-Liouville problem** (SLP).
The goal is to find the **eigenvalues** $$\lambda$$
and corresponding **eigenfunctions** $$u(x)$$ that fulfill:

$$\begin{aligned}
    \boxed{
        \hat{L}_\mathrm{SL}\{u(x)\} = - \lambda \: w(x) \: u(x)
    }
\end{aligned}$$

Where $$w(x)$$ is a real weight function satisfying $$w(x) > 0$$ for $$x \in \,\,]a, b[$$.
By convention, the trivial solution $$u = 0$$ is not valid.
Some authors have the opposite sign for $$\lambda$$ and/or $$w$$.

In our derivation of $$\hat{L}_\mathrm{SL}$$ above,
we imposed the constraint $$\big[ p (f^* g' - (f')^* g) \big]_a^b = 0$$ to ensure that
$$\inprod{\hat{L}_\mathrm{SL}^\dagger f}{g} = \inprod{f}{\hat{L}_\mathrm{SL} g}$$.
Consequently, to have a valid SLP,
the boundary conditions (BCs) on $$u$$ must be such that,
for any two (possibly identical) eigenfunctions $$u_m$$ and $$u_n$$, we have:

$$\begin{aligned}
    \Big[ p(x) \big( u_m^*(x) \: u_n'(x) - \big(u_m'(x)\big)^* u_n(x) \big) \Big]_a^b = 0
\end{aligned}$$

There are many boundary conditions that satisfy this requirement.
Some notable ones are listed non-exhaustively below.
Verify for yourself that these work:

+ **Dirichlet BCs**: $$u(a) = u(b) = 0$$
+ **Neumann BCs**: $$u'(a) = u'(b) = 0$$
+ **Robin BCs**: $$\alpha_1 u(a) + \beta_1 u'(a) = \alpha_2 u(b) + \beta_2 u'(b) = 0$$ with $$\alpha_{1,2}, \beta_{1,2} \in \mathbb{R}$$
+ **Periodic BCs**: $$p(a) = p(b)$$, $$u(a) = u(b)$$, and $$u'(a) = u'(b)$$
+ **Legendre "BCs"**: $$p(a) = p(b) = 0$$

If this is fulfilled, Sturm-Liouville theory gives us
useful information about $$\lambda$$ and $$u$$.
By definition, the following must be satisfied
for two arbitrary eigenfunctions $$u_m$$ and $$u_n$$:

$$\begin{aligned}
    0
    &= \hat{L}_\mathrm{SL}\{u_m^*\} + \lambda_m^* w u_m^*
    \\
    &= \hat{L}_\mathrm{SL}\{u_n\} + \lambda_n w u_n
\end{aligned}$$

We multiply each by the other eigenfunction,
subtract the results, and integrate:

$$\begin{aligned}
    0
    &= \int_a^b u_m^* \big(\hat{L}_\mathrm{SL}\{u_n\} + \lambda_n w u_n\big)
    - u_n \big(\hat{L}_\mathrm{SL}\{u_m^*\} + \lambda_m^* w u_m^*\big) \dd{x}
    \\
    &= \int_a^b u_m^* \hat{L}_\mathrm{SL}\{u_n\} - u_n \hat{L}_\mathrm{SL}\{u_m^*\}
    + (\lambda_n - \lambda_m^*) u_m^* w u_n \dd{x}
    \\
    &= \inprod{u_m}{\hat{L}_\mathrm{SL} u_n} - \inprod{\hat{L}_\mathrm{SL} u_m}{u_n}
    + (\lambda_n - \lambda_m^*) \inprod{u_m}{w u_n}
\end{aligned}$$

The operator $$\hat{L}_\mathrm{SL}$$ is self-adjoint of course,
so the first two terms vanish, leaving us with:

$$\begin{aligned}
    0
    &= (\lambda_n - \lambda_m^*) \inprod{u_m}{w u_n}
\end{aligned}$$

When $$m = n$$, we get $$\inprod{u_n}{w u_n} > 0$$,
so the equation is only satisfied if $$\lambda_n^* = \lambda_n$$,
meaning the eigenvalue $$\lambda_n$$ is real for any $$n$$.
When $$m \neq n$$, then $$\lambda_n - \lambda_m^*$$
may or may not be zero depending on the degeneracy.
If there is no degeneracy, then $$\lambda_n - \lambda_m^* \neq 0$$,
meaning $$\inprod{u_m}{w u_n} = 0$$, i.e. the eigenfunctions are orthogonal.
In case of degeneracy, manual orthogonalization is needed,
which is guaranteed to be doable using the [Gram-Schmidt method](/know/concept/gram-schmidt-method/).

In conclusion, an SLP has **real eigenvalues**
and **orthogonal eigenfunctions**: for all $$m$$, $$n$$:

$$\begin{aligned}
    \boxed{
        \lambda_n \in \mathbb{R}
    }
    \qquad\qquad
    \boxed{
        \inprod{u_m}{w u_n}
        = A_n \delta_{nm}
    }
\end{aligned}$$

When solving a differential eigenvalue problem,
knowing that all eigenvalues are real is a huge simplification,
so it is always worth checking whether you are dealing with an SLP.

Another useful fact:
it turns out that SLPs always have an infinite number of *discrete* eigenvalues.
Furthermore, there always exists a *lowest* eigenvalue $$\lambda_0 > -\infty$$,
called the **ground state**.


## Complete basis

Not only are an SLP's eigenfunctions orthogonal,
they also form a **complete basis**, meaning any well-behaved $$f(x)$$
can be expanded as a **generalized Fourier series** with coefficients $$a_n$$:

$$\begin{aligned}
    \boxed{
        f(x)
        = \sum_{n = 0}^\infty a_n u_n(x)
        \quad \mathrm{for} \: x \in \,\,]a, b[
    }
\end{aligned}$$

This series converges faster if $$f$$ satisfies the same BCs as $$u_n$$;
in that case the expansion is also valid for the inclusive interval $$x \in [a, b]$$.

To find an expression for the coefficients $$a_n$$,
we multiply the above generalized Fourier series by $$u_m^* w$$
and integrate it to get inner products on both sides:

$$\begin{aligned}
    u_m^* w f
    &= \sum_{n = 0}^\infty a_n u_m^* w u_n
    \\
    \int_a^b u_m^* w f \dd{x}
    &= \int_a^b \bigg( \sum_{n = 0}^\infty a_n u_m^* w u_n \bigg) \dd{x}
    \\
    \inprod{u_m}{w f}
    &= \sum_{n = 0}^\infty a_n \inprod{u_m}{w u_n}
\end{aligned}$$

Because the eigenfunctions of an SLP are mutually orthogonal,
the summation disappears:

$$\begin{aligned}
    \inprod{u_m}{w f}
    &= \sum_{n = 0}^\infty a_n \inprod{u_m}{w u_n}
    = \sum_{n = 0}^\infty a_n A_n \delta_{nm}
    = a_m A_m
\end{aligned}$$

After isolating this for $$a_m$$, we see that
the coefficients are given by the projection of the target
function $$f$$ onto the normalized eigenfunctions $$u_m / A_m$$:

$$\begin{aligned}
    \boxed{
        a_n
        = \frac{\inprod{u_n}{w f}}{A_n}
        = \frac{\inprod{u_n}{w f}}{\inprod{u_n}{w u_n}}
    }
\end{aligned}$$

As a final remark, we can see something interesting
by rearranging the generalized Fourier series
after inserting the expression for $$a_n$$:

$$\begin{aligned}
    f(x)
    &= \sum_{n = 0}^\infty \frac{1}{A_n} \inprod{u_n}{w f} u_n(x)
    \\
    &= \int_a^b \bigg(\sum_{n = 0}^\infty \frac{1}{A_n} u_n^*(\xi) \: w(\xi) \: f(\xi) \: u_n(x) \bigg) \dd{\xi}
    \\
    &= \int_a^b f(\xi) \bigg(\sum_{n = 0}^\infty \frac{1}{A_n} u_n^*(\xi) \: w(\xi) \: u_n(x) \bigg) \dd{\xi}
\end{aligned}$$

Upon closer inspection, the parenthesized summation
must be the [Dirac delta function](/know/concept/dirac-delta-function/) $$\delta(x)$$
for the integral to work out.
In fact, this is the underlying requirement for completeness:

$$\begin{aligned}
    \boxed{
        \sum_{n = 0}^\infty \frac{1}{A_n} u_n^*(\xi) \: w(\xi) \: u_n(x) = \delta(x - \xi)
    }
\end{aligned}$$


## References
1.  O. Bang,
    *Applied mathematics for physicists: lecture notes*, 2019,
    unpublished.