The nonlinear Schrödinger (NLS) equation
is a nonlinear 1+1D partial differential equation
that appears in many areas of physics.
It is often given in its dimensionless form,
where it governs the envelope u(z,t)
of an underlying carrier wave,
with t the transverse coordinate,
and r=±1 a parameter determining
which of two regimes the equation is intended for:
i∂z∂u+∂t2∂2u+r∣u∣2u=0
Many variants exist, depending on the conventions used by authors.
The NLS equation is used to describe pulses in fiber optics (as derived below),
waves over deep water, local opening of DNA chains, and much more.
Very roughly speaking, it is a valid description of
“all” weakly nonlinear, slowly modulated waves in physics.
We only consider fiber optics here;
the NLS equation can be derived in many other ways.
We start from the most general form of the
electromagnetic wave equation,
after assuming the medium cannot be magnetized (μr=1):
∇×(∇×E)=−μ0ε0∂t2∂2E−μ0∂t2∂2P
Using the vector identity
∇×(∇×E)=∇(∇⋅E)−∇2E
and Gauss’s law∇⋅E=0,
and splitting the polarization P
into linear and nonlinear contributions
PL and PNL:
∇2E−μ0ε0∂t2∂2E=μ0∂t2∂2PL+μ0∂t2∂2PNL
In general, PL is given by the convolution
of E with a second-rank response tensor χ(1):
PL(r,t)=ε0∫−∞∞χ(1)(t−t′)⋅E(r,t′)dt′
In PNL we only include third-order nonlinearities,
since higher orders are usually negligible,
and second-order nonlinear effects only exist in very specific crystals.
So we “only” need to deal with a fourth-rank response tensor χ(3):
In practice, two phenomena contribute to χ(3):
the Kerr effect due to electrons’ response to E,
and Raman scattering due to nuclei’s response,
which is slower because of their mass.
But if the light pulses are sufficiently long (>1ps in silica),
both effects can be treated as fast, so:
Where δ is the Dirac delta function.
To keep things simple,
we consider linearly x-polarized light E=x^∣E∣,
such that the tensor can be replaced with its scalar element χxxxx(3).
Then:
PNL=ε0χxxxx(3)(E⋅E)E
For the same reasons, the linear polarization is reduced to:
PL=ε0χxx(1)E
Next, we decompose E as follows,
consisting of a carrier wave e−iω0t
at a constant frequency ω0,
modulated by an envelope E
that is assumed to be slowly-varying compared to the carrier,
plus the complex conjugate E∗eiω0t:
E(r,t)=x^21(E(r,t)e−iω0t+E∗(r,t)eiω0t)
Note that no generality has been lost in this step.
Inserting it into the polarizations:
The terms with 3ω0 represent third-harmonic generation,
and only matter if the carrier is phase-matched
to the tripled wave, which is generally not the case,
so they can be ignored.
Now, if we decompose the polarizations in the same was as E:
Then it is straightforward to see that their envelope functions are given by:
PLPNL=ε0χxx(1)E=43ε0χxxxx(3)∣E∣2E
The forward carrier e−iω0t
and the backward carrier eiω0t
can be regarded as separate channels,
which only interact via PNL.
From now on, we only consider the forward-propagating wave,
so all terms containing eiω0t are dropped;
by taking the complex conjugate of the resulting equations,
the backward-propagating counterparts can always be recovered,
so no information is really lost.
Therefore, the main wave equation becomes:
Where we have used our assumption that E is slowly-varying
to treat ∣E∣2 as a constant,
in order to move it outside the t-derivative.
We thus arrive at:
0=(∇2E−c2εr∂t2∂2E)e−iω0t
Where c=1/μ0ε0 is the phase velocity of light in a vacuum,
and the relative permittivity εr is defined as shown below.
Note that this is a mild abuse of notation,
since the symbol εr is usually reserved for linear materials:
εr≡1+χxx(1)+43χxxxx(3)∣E∣2
Next, we take the Fourier transformt→ω of the wave equation,
again treating ∣E∣2 (inside εr) as a constant.
The constant s=±1 is included here
to deal with the fact that different authors use different sign conventions:
Now all the x- and y-dependence is on the left,
and the z-dependence is on the right.
We have placed the εr-term on the left too
because it depends relatively strongly on (x,y)
to describe the fiber’s internal structure,
and weakly on z due to nonlinear effects.
Meanwhile, β0 is on the right because that will lead to
a nicer equation for A later.
Note that both sides are functions of Ω.
Based on the aforementioned dependences,
in order for this equation to have a solution for all (x,y,z),
there must exist a quantity β(Ω) that is constant in space,
such that we obtain two separated equations for F and A:
Note that we replaced Ω with ω in F’s equation
(and redefined β and εr accordingly).
This is not an innocent detail:
the idea is that ωεr/c
would be the light’s wavenumber if it had not been trapped in a waveguide,
and that β is the confined wavenumber,
also known as the propagation constant.
If we had kept Ω,
the meaning of β would not be so straightforward.
The difference between β(ω) and β0
is simply that β0≡β(ω0).
Our ansatz for separating the variables contained β0,
such that the full carrier wave eiβ0z−iω0t was represented
(with e−iω0t now hidden inside the Fourier transform).
But later, to properly describe how light behaves inside the fiber,
the full dispersion relation β(ω) will be needed.
Multiplying by F and A,
we get the following set of equations,
implicitly coupled via β:
The equation for F must be solved first.
To do so, we treat the nonlinearity as a perturbation
to be neglected initially.
In other words, we first solve the following eigenvalue problem for β2,
where n(x,y) is the linear refractive index,
with n2=1+Re{χxx(1)}≈εr:
∂x2∂2F+∂y2∂2F+(c2ω2n2−β2)F=0
This gives us the allowed values of β;
see step-index fiber for an example solution.
Now we add the small index change Δn(x,y) due to nonlinear effects:
εr=(n+Δn)2≈n2+2nΔn
Then it can be shown using first-order
perturbation theory
that the eigenfunction F is not really affected,
and the eigenvalue β2 is shifted by Δ(β2), given by:
Δ(β2)=c22ω2∬−∞∞∣F∣2dxdy∬−∞∞nΔn∣F∣2dxdy
But we are more interested in the wavenumber shift Δβ
than the eigenvalue shift Δ(β2).
They are related to one another as follows:
β2+Δ(β2)=(β+Δβ)2≈β2+2βΔβ
Furthermore, we assume that the fiber only consists of materials
with similar refractive indices, or in other words,
that it confines the light using only a small index difference,
in which case we can treat n as a constant and move it outside the integral.
Then Δβ becomes:
Δβ=βc2ω2n∬−∞∞∣F∣2dxdy∬−∞∞Δn∣F∣2dxdy
Recall that β is the wavenumber of the confined mode:
by solving the unperturbed F-equation,
it can be shown that β’s value is somewhere
between the bulk wavenumbers of the fiber materials.
Since we just approximated n as a constant,
this means that ωn/c≈β, leading us to
the general “final” form of Δβ,
with all the arguments shown for clarity:
Δβ(ω)=cAmodeω∬−∞∞Δn(x,y,ω)∣F(x,y)∣2dxdy
Where we have defined the mode areaAmode as shown below.
In order for Amode to be in units of area,
F must be dimensionless,
and consequently A has (SI) units of an electric field.
Amode≡∬−∞∞∣F∣2dxdy
Now we finally turn our attention to the equation for A.
Before perturbation, it was:
0=2iβ0∂z∂A+(β2−β02)A
Where β≈β0, so we can replace
β2−β02 with 2β0(β−β0).
Also including Δβ, we get:
0=i∂z∂A+(β+Δβ−β0)A
Usually, we do not know a full expression for β(ω),
so it makes sense to expand it around the carrier frequency ω0 as follows,
where βn=dnβ/dωn∣ω=ω0:
Spectrally, the broader the light pulse, the more terms must be included.
Recall that earlier, in order to treat χ(3) as instantaneous,
we already assumed a temporally broad
(spectrally narrow) pulse.
Hence, for simplicity, we can cut off this Taylor series at β2,
which is good enough in many cases.
Inserting the expansion into A’s equation:
0=i∂z∂A+isβ1(−isΩ)A−2s2β2(−isΩ)2A+Δβ0A
Which we have rewritten in preparation for taking the inverse Fourier transform,
by introducing s and by replacing Δβ(ω)
with Δβ0≡Δβ(ω0)
in order to remove all explicit dependence on ω,
i.e. we only keep the first term of Δβ’s Taylor expansion.
After transforming and using s2=1,
we get the following equation for A(z,t):
0=i∂z∂A+isβ1∂t∂A−2β2∂t2∂2A+Δβ0A
The next step is to insert our expression for Δβ0,
for which we must first choose a specific form for Δn
according to which effects we want to include.
Earlier, we approximated εr≈n2,
so if we instead say that εr=(n+Δn)2,
then Δn should include absorption and nonlinearity.
The most commonly used form for Δn is therefore:
Δn(x,y,ω)=n2(ω)I(x,y,ω)+i2ωcα(ω)
Where I is the intensity (i.e. power per unit area) of the light,
n2 is the material’s Kerr coefficient in units of inverse intensity,
and α is the attenuation coefficient
consisting of linear and nonlinear contributions
(see multi-photon absorption).
Specifically, they are given by:
For simplicity we set Im{χxxxx(3)}=0,
which is a good approximation for silica fibers.
Inserting this form of Δn into Δβ0
and neglecting the (x,y)-dependence of Δn yields:
Where we have defined the parameter γ0≡γ(ω0) like so,
involving the effective mode areaAeff,
which contains all information about F needed for solving A’s equation:
Note the ω-dependence of Aeff:
so far we have conveniently ignored that F also depends on ω,
because it is a parameter in its eigenvalue equation.
This is valid for spectrally narrow pulses, so we will stick with it.
Just beware that some people make the ad-hoc generalization
γ0→γ(ω), which is not correct in general
(this is an advanced topic, see Lægsgaard).
Substituting Δβ0 into the main problem
yields a prototype of the NLS equation:
The factor ε0cn/2 looks familiar from the intensity I.
This, combined with Amode
and the fact that A is an electric field,
suggests that we can redefine A→A′
such that ∣A′∣2 is the optical power in watts.
Hence we make the following transformation:
2ε0cnAmode∣A∣2→∣A∣2
We can divide away the transformation factors
from all other terms in the equation, since they are linear,
leading to the full nonlinear Schrödinger equation:
0=i∂z∂A+isβ1∂t∂A−2β2∂t2∂2A+i2αA+γ0∣A∣2A
This can be reduced by switching to a coordinate system
where the time axis slides along the propagation axis at a speed sv,
so we define Z≡z and T≡t−sz/v such that:
We insert this and set v=vg,
where vg=1/β1 is the light’s group velocity:
0=i∂Z∂A−2β2∂T2∂2A+i2αA+γ0∣A∣2A
The NLS equation’s name is due to its similarity
to the Schrödinger equation of quantum physics,
if you set α=0 and treat γ0∣A∣2 as a potential.
In fiber optics, the equation is usually rearranged
to highlight that Z (or z) is the propagation direction:
∂Z∂A=−i2β2∂T2∂2A−2αA+iγ0∣A∣2A
Next, we want to reduce the equation to its dimensionless form.
To do so, we make the following coordinate transformation,
where A~, Z~ and T~ are unitless,
and Ac, Zc and Tc are dimensioned scale parameters
to be determined later:
A~(Z~,T~)=AcA(Z,T)Z~=ZcZT~=TcT
We insert this into the NLS equation,
after setting α=0 according to convention:
The goal is to remove those constant factors.
In other words, we demand:
2Tc2β2Zc=−1γ0Ac2Zc=r
Where r≡±1, whose sign choice will be explained shortly.
Note that we have two equations for three unknowns
(Ac, Zc and Tc),
so one of the parameters needs to fixed manually.
For example, we could choose our “input power”
Ac≡1W, and then:
Because Tc must be real,
we should choose r≡−sgn(γ0β2).
We thus arrive at:
0=i∂Z~∂A~+∂T~2∂2A~+rA~2A~
In fiber optics, γ0>0 for all materials,
meaning r represents the dispersion regime,
so r=1 is called anomalous dispersion
and r=−1normal dispersion.
In some other fields, where β2<0 always,
r=1 is called a focusing nonlinearity
and r=−1 a defocusing nonlinearity.
The famous bright solitons only exist for r=1,
so many authors only show that case.