summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorPrefetch2024-09-15 18:51:18 +0200
committerPrefetch2024-09-15 18:51:18 +0200
commit8d5e0c951c8e8c1124678922bfd6962f8cdb056c (patch)
tree09a848a6001a8ea99ed8b9d151cffcd8e5b663c7
parent05f779c72d987d0c2a060f93c9493c86b109587c (diff)
Expand knowledge baseHEADmaster
-rw-r--r--source/know/concept/nonlinear-schrodinger-equation/index.md675
1 files changed, 675 insertions, 0 deletions
diff --git a/source/know/concept/nonlinear-schrodinger-equation/index.md b/source/know/concept/nonlinear-schrodinger-equation/index.md
new file mode 100644
index 0000000..506600e
--- /dev/null
+++ b/source/know/concept/nonlinear-schrodinger-equation/index.md
@@ -0,0 +1,675 @@
+---
+title: "Nonlinear Schrödinger equation"
+sort_title: "Nonlinear Schrodinger equation" # sic
+date: 2024-09-15
+categories:
+- Physics
+- Mathematics
+- Fiber optics
+- Nonlinear optics
+layout: "concept"
+---
+
+The **nonlinear Schrödinger (NLS) equation**
+is a nonlinear 1+1D partial differential equation
+that appears in many areas of physics.
+It is used to describe pulses in fiber optics (as derived below),
+waves over deep water, local opening of DNA chains, and more.
+It is often given as:
+
+$$\begin{aligned}
+ \boxed{
+ i \pdv{u}{z} + \pdvn{2}{u}{t} + |u|^2 u
+ = 0
+ }
+\end{aligned}$$
+
+Which is its dimensionless form,
+governing the envelope $$u(z, t)$$
+of an underlying carrier wave,
+with $$t$$ being the transverse coordinate.
+Notably, the NLS equation has **soliton** solutions,
+where $$u$$ maintains its shape over great distances.
+
+
+
+## Derivation
+
+We only consider fiber optics here;
+the NLS equation can be derived in many other ways.
+We start from the most general form of the
+[electromagnetic wave equation](/know/concept/electromagnetic-wave-equation/),
+after assuming the medium cannot be magnetized ($$\mu_r = 1$$):
+
+$$\begin{aligned}
+ \nabla \cross \big( \nabla \cross \vb{E} \big)
+ = - \mu_0 \varepsilon_0 \pdvn{2}{\vb{E}}{t} - \mu_0 \pdvn{2}{\vb{P}}{t}
+\end{aligned}$$
+
+Using the vector identity
+$$\nabla \cross (\nabla \cross \vb{E}) = \nabla (\nabla \cdot \vb{E}) - \nabla^2 \vb{E}$$
+and [Gauss's law](/know/concept/maxwells-equations/) $$\nabla \cdot \vb{E} = 0$$,
+and splitting the polarization $$\vb{P}$$
+into linear and nonlinear contributions
+$$\vb{P}_\mathrm{L}$$ and $$\vb{P}_\mathrm{NL}$$:
+
+$$\begin{aligned}
+ \nabla^2 \vb{E} - \mu_0 \varepsilon_0 \pdvn{2}{\vb{E}}{t}
+ &= \mu_0 \pdvn{2}{\vb{P}_\mathrm{L}}{t} + \mu_0 \pdvn{2}{\vb{P}_\mathrm{NL}}{t}
+\end{aligned}$$
+
+In general, $$\vb{P}_\mathrm{L}$$ is given by the convolution
+of $$\vb{E}$$ with a second-rank response tensor $$\chi^{(1)}$$:
+
+$$\begin{aligned}
+ \vb{P}_\mathrm{L}(\vb{r}, t)
+ = \varepsilon_0 \int_{-\infty}^\infty \chi^{(1)}(t - t') \cdot \vb{E}(\vb{r}, t') \dd{t'}
+\end{aligned}$$
+
+In $$\vb{P}_\mathrm{NL}$$ we only include third-order nonlinearities,
+since higher orders are usually negligible,
+and second-order nonlinear effects only exist in very specific crystals.
+So we "only" need to deal with a fourth-rank response tensor $$\chi^{(3)}$$:
+
+$$\begin{aligned}
+ \vb{P}_\mathrm{NL}(\vb{r}, t)
+ = \varepsilon_0 \iiint_{-\infty}^\infty \chi^{(3)}(t \!-\! t_1, t \!-\! t_2, t \!-\! t_3)
+ \:\vdots\: \vb{E}(\vb{r}, t_1) \vb{E}(\vb{r}, t_2) \vb{E}(\vb{r}, t_3) \dd{t_1} \dd{t_2} \dd{t_3}
+\end{aligned}$$
+
+In practice, two phenomena contribute to $$\chi^{(3)}$$:
+the *Kerr effect* due to electrons' response to $$\vb{E}$$,
+and *Raman scattering* due to nuclei's response,
+which is slower because of their mass.
+But if the light pulses are sufficiently long (>1ps in silica),
+both effects can be treated as fast, so:
+
+$$\begin{aligned}
+ \chi^{(3)}(t \!-\! t_1, t \!-\! t_2, t \!-\! t_3)
+ &= \chi^{(3)} \delta(t - t_1) \delta(t - t_2) \delta(t - t_3)
+\end{aligned}$$
+
+Where $$\delta$$ is the [Dirac delta function](/know/concept/dirac-delta-function/).
+To keep things simple,
+we consider linearly $$x$$-polarized light $$\vb{E} = \vu{x} |\vb{E}|$$,
+such that the tensor can be replaced with its scalar element $$\chi^{(3)}_{xxxx}$$.
+Then:
+
+$$\begin{aligned}
+ \vb{P}_\mathrm{NL}
+ = \varepsilon_0 \chi^{(3)}_{xxxx} \big( \vb{E} \cdot \vb{E} \big) \vb{E}
+\end{aligned}$$
+
+For the same reasons, the linear polarization is reduced to:
+
+$$\begin{aligned}
+ \vb{P}_\mathrm{L}
+ &= \varepsilon_0 \chi^{(1)}_{xx} \vb{E}
+\end{aligned}$$
+
+Next, we decompose $$\vb{E}$$ as follows,
+consisting of a carrier wave $$e^{-i \omega_0 t}$$
+at a constant frequency $$\omega_0$$,
+modulated by an envelope $$E$$
+that is assumed to be slowly-varying compared to the carrier,
+plus the complex conjugate $$E^* e^{i \omega_0 t}$$:
+
+$$\begin{aligned}
+ \vb{E}(\vb{r}, t)
+ &= \vu{x} \frac{1}{2} \Big( E(\vb{r}, t) e^{- i \omega_0 t} + E^*(\vb{r}, t) e^{i \omega_0 t} \Big)
+\end{aligned}$$
+
+Note that no generality has been lost in this step.
+Inserting it into the polarizations:
+
+$$\begin{aligned}
+ \mathrm{P}_\mathrm{L}
+ &= \vu{x} \frac{1}{2} \varepsilon_0 \chi^{(1)}_{xx} \Big( E(\vb{r}, t) e^{- i \omega_0 t} + E^*(\vb{r}, t) e^{i \omega_0 t} \Big)
+ \\
+ \vb{P}_\mathrm{NL}
+ &= \vu{x} \frac{1}{8} \varepsilon_0 \chi^{(3)}_{xxxx} \Big( E e^{- i \omega_0 t} + E^* e^{i \omega_0 t} \Big)^{3}
+ \\
+ &= \vu{x} \frac{1}{8} \varepsilon_0 \chi^{(3)}_{xxxx}
+ \Big( E^3 e^{- i 3 \omega_0 t} + 3 E^2 E^* e^{- i \omega_0 t} + 3 E (E^*)^2 e^{i \omega_0 t} + (E^*)^3 e^{i 3 \omega_0 t} \Big)
+\end{aligned}$$
+
+The terms with $$3 \omega_0$$ represent *third-harmonic generation*,
+and only matter if the carrier is phase-matched
+to the tripled wave, which is generally not the case,
+so they can be ignored.
+Now, if we decompose the polarizations in the same was as $$\vb{E}$$:
+
+$$\begin{aligned}
+ \vb{P}_\mathrm{L}(\vb{r}, t)
+ &= \vu{x} \frac{1}{2} \Big( P_\mathrm{L}(\vb{r}, t) e^{- i \omega_0 t} + P_\mathrm{L}^*(\vb{r}, t) e^{i \omega_0 t} \Big)
+ \\
+ \vb{P}_\mathrm{NL}(\vb{r}, t)
+ &= \vu{x} \frac{1}{2} \Big( P_\mathrm{NL}(\vb{r}, t) e^{- i \omega_0 t} + P_\mathrm{NL}^*(\vb{r}, t) e^{i \omega_0 t} \Big)
+\end{aligned}$$
+
+Then it is straightforward to see that their envelope functions are given by:
+
+$$\begin{aligned}
+ P_\mathrm{L}
+ &= \varepsilon_0 \chi^{(1)}_{xx} E
+ \\
+ P_\mathrm{NL}
+ &= \frac{3}{4} \varepsilon_0 \chi^{(3)}_{xxxx} |E|^2 E
+\end{aligned}$$
+
+The forward carrier $$e^{- i \omega_0 t}$$
+and the backward carrier $$e^{i \omega_0 t}$$
+can be regarded as separate channels,
+which only interact via $$P_\mathrm{NL}$$.
+From now on, we only consider the forward-propagating wave,
+so all terms containing $$e^{i \omega_0 t}$$ are dropped;
+by taking the complex conjugate of the resulting equations,
+the backward-propagating counterparts can always be recovered,
+so no information is really lost.
+Therefore, the main wave equation becomes:
+
+$$\begin{aligned}
+ 0
+ &= \bigg(
+ \nabla^2 E - \mu_0 \varepsilon_0 \pdvn{2}{E}{t} - \mu_0 \pdvn{2}{P_\mathrm{L}}{t} - \mu_0 \pdvn{2}{P_\mathrm{NL}}{t}
+ \bigg) e^{-i \omega_0 t}
+ \\
+ &= \bigg(
+ \nabla^2 E - \Big( 1 + \chi^{(1)}_{xx} + \frac{3}{4} \chi^{(3)}_{xxxx} |E|^2 \Big) \mu_0 \varepsilon_0 \pdvn{2}{E}{t}
+ \bigg) e^{-i \omega_0 t}
+\end{aligned}$$
+
+Where we have used our assumption that $$E$$ is slowly-varying
+to treat $$|E|^2$$ as a constant,
+in order to move it outside the $$t$$-derivative.
+We thus arrive at:
+
+$$\begin{aligned}
+ 0
+ &= \bigg( \nabla^2 E - \frac{\varepsilon_r}{c^2} \pdvn{2}{E}{t} \bigg) e^{-i \omega_0 t}
+\end{aligned}$$
+
+Where $$c = 1 / \sqrt{\mu_0 \varepsilon_0}$$ is the phase velocity of light in a vacuum,
+and the relative permittivity $$\varepsilon_r$$ is defined as shown below.
+Note that this is a mild abuse of notation,
+since the symbol $$\varepsilon_r$$ is usually reserved for linear materials:
+
+$$\begin{aligned}
+ \varepsilon_r
+ \equiv 1 + \chi^{(1)}_{xx} + \frac{3}{4} \chi^{(3)}_{xxxx} |E|^2
+\end{aligned}$$
+
+Next, we take the [Fourier transform](/know/concept/fourier-transform/)
+$$t \to (\omega\!-\!\omega_0)$$ of the wave equation,
+once again treating $$|E|^2$$ (inside $$\varepsilon_r$$) as a constant.
+The constant $$s = \pm 1$$ is included here
+to deal with the fact that different authors use different sign conventions:
+
+$$\begin{aligned}
+ 0
+ &= \hat{\mathcal{F}}\bigg\{ \nabla^2 E - \frac{\varepsilon_r}{c^2} \pdvn{2}{E}{t} \bigg\}
+ \\
+ &= \int_{-\infty}^\infty
+ \bigg( \nabla^2 E - \frac{\varepsilon_r}{c^2} \pdvn{2}{E}{t} \bigg)
+ e^{i s (\omega - \omega_0) t} \dd{t}
+ \\
+ &= \nabla^2 E + s^2 \frac{\varepsilon_r}{c^2} (\omega - \omega_0)^2 E
+\end{aligned}$$
+
+We use $$s^2 = 1$$ and define $$\Omega \equiv \omega - \omega_0$$
+as the frequency shift relative to the carrier wave:
+
+$$\begin{aligned}
+ 0
+ &= \nabla^2 E + \frac{\Omega^2 \varepsilon_r}{c^2} E
+\end{aligned}$$
+
+This is a so-called *Helmholtz equation* in 3D,
+which we will solve using separation of variables,
+by assuming that its solution can be written as:
+
+$$\begin{aligned}
+ E(\vb{r}, \Omega)
+ &= F(x, y) \: A(z, \Omega) \: e^{i \beta_0 z}
+\end{aligned}$$
+
+Where $$\beta_0$$ is the wavenumber of the carrier,
+which will be determined later.
+Inserting this ansatz into the Helmholtz equation yields:
+
+$$\begin{aligned}
+ 0
+ &= \Big( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \Big) A e^{i \beta_0 z}
+ + \pdvn{2}{}{z} \Big( A e^{i \beta_0 z} \Big) F
+ + \frac{\Omega^2 \varepsilon_r}{c^2} F A e^{i \beta_0 z}
+ \\
+ &= \bigg( \Big( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \Big) A
+ + \Big( \pdvn{2}{A}{z} + 2 i \beta_0 \pdv{A}{z} - \beta_0^2 A \Big) F
+ + \frac{\Omega^2 \varepsilon_r}{c^2} F A \bigg) e^{i \beta_0 z}
+\end{aligned}$$
+
+We divide by $$F A \: e^{i \beta_0 z}$$
+and rearrange the terms in a specific way:
+
+$$\begin{aligned}
+ \Big( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \Big) \frac{1}{F} + \frac{\Omega^2 \varepsilon_r}{c^2}
+ &= - 2 i \beta_0 \pdv{A}{z} \frac{1}{A} + \beta_0^2
+\end{aligned}$$
+
+Now all the $$x$$- and $$y$$-dependence is on the left,
+and the $$z$$-dependence is on the right.
+We have placed the $$\varepsilon_r$$-term on the left too
+because it depends relatively strongly on $$(x, y)$$
+to describe the fiber's internal structure,
+and weakly on $$z$$ due to nonlinear effects.
+Meanwhile, $$\beta_0$$ is on the right because that will lead to
+a nicer equation for $$A$$ later.
+
+Note that both sides are functions of $$\Omega$$.
+Based on the aforementioned dependences,
+in order for this equation to have a solution for all $$(x, y, z)$$,
+there must exist a quantity $$\beta(\Omega)$$ that is constant in space,
+such that we obtain two separated equations for $$F$$ and $$A$$:
+
+$$\begin{aligned}
+ \beta(\omega)
+ &= \bigg( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \bigg) \frac{1}{F} + \frac{\omega^2 \varepsilon_r}{c^2}
+ \\
+ \beta(\Omega)
+ &= - 2 i \beta_0 \pdv{A}{z} \frac{1}{A} + \beta_0^2
+\end{aligned}$$
+
+Note that we replaced $$\Omega$$ with $$\omega$$ in $$F$$'s equation
+(and redefined $$\beta$$ and $$\varepsilon_r$$ accordingly).
+This is not an innocent detail:
+the idea is that $$\omega \sqrt{\varepsilon_r} / c$$
+would be the light's wavenumber if it had not been trapped in a waveguide,
+and that $$\beta$$ is the *confined* wavenumber,
+also known as the **propagation constant**.
+If we had kept $$\Omega$$,
+the meaning of $$\beta$$ would not be so straightforward.
+
+The difference between $$\beta(\omega)$$ and $$\beta_0$$
+is simply that $$\beta_0 \equiv \beta(\omega_0)$$.
+Our ansatz for separating the variables contained $$\beta_0$$,
+such that the full carrier wave $$e^{i \beta_0 z - i \omega_0 t}$$ was represented
+(with $$e^{- i \omega_0 t}$$ now hidden inside the Fourier transform).
+But later, to properly describe how light behaves inside the fiber,
+the full dispersion relation $$\beta(\omega)$$ will be needed.
+
+Multiplying by $$F$$ and $$A$$,
+we get the following set of equations,
+implicitly coupled via $$\beta$$:
+
+$$\begin{aligned}
+ \boxed{
+ \begin{aligned}
+ 0
+ &= \pdvn{2}{F}{x} + \pdvn{2}{F}{y} + \bigg( \frac{\omega^2 \varepsilon_r}{c^2} - \beta^2 \bigg) F
+ \\
+ 0
+ &= 2 i \beta_0 \pdv{A}{z} + \big( \beta^2 - \beta_0^2 \big) A
+ \end{aligned}
+ }
+\end{aligned}$$
+
+The equation for $$F$$ must be solved first.
+To do so, we treat the nonlinearity as a perturbation
+to be neglected initially.
+In other words, we first solve the following eigenvalue problem for $$\beta^2$$,
+where $$n(x, y)$$ is the linear refractive index,
+with $$n^2 = 1 + \Real\{\chi^{(1)}_{xx}\} \approx \varepsilon_r$$:
+
+$$\begin{aligned}
+ \pdvn{2}{F}{x} + \pdvn{2}{F}{y} + \bigg( \frac{\omega^2 n^2}{c^2} - \beta^2 \bigg) F
+ = 0
+\end{aligned}$$
+
+This gives us the allowed values of $$\beta$$;
+see [step-index fiber](/know/concept/step-index-fiber/) for an example solution.
+Now we add the small index change $$\Delta{n}(x, y)$$ due to nonlinear effects:
+
+$$\begin{aligned}
+ \varepsilon_r
+ = (n + \Delta{n})^2
+ \approx n^2 + 2 n \: \Delta{n}
+\end{aligned}$$
+
+Then it can be shown using first-order
+[perturbation theory](/know/concept/time-independent-perturbation-theory/)
+that the eigenfunction $$F$$ is not really affected,
+and the eigenvalue $$\beta^2$$ is shifted by $$\Delta(\beta^2)$$, given by:
+
+$$\begin{aligned}
+ \Delta(\beta^2)
+ = \frac{2 \omega^2}{c^2} \frac{\displaystyle \iint_{-\infty}^\infty n \: \Delta{n} \: |F|^2 \dd{x} \dd{y}}
+ {\displaystyle \iint_{-\infty}^\infty |F|^2 \dd{x} \dd{y}}
+\end{aligned}$$
+
+But we are more interested in the *wavenumber* shift $$\Delta{\beta}$$
+than the *eigenvalue* shift $$\Delta(\beta^2)$$.
+They are related to one another as follows:
+
+$$\begin{aligned}
+ \beta^2 + \Delta(\beta^2)
+ = (\beta + \Delta{\beta})^2
+ \approx \beta^2 + 2 \beta \Delta{\beta}
+\end{aligned}$$
+
+Furthermore, we assume that the fiber only consists of materials
+with similar refractive indices, or in other words,
+that it confines the light using only a small index difference,
+in which case we can treat $$n$$ as a constant and move it outside the integral.
+Then $$\Delta{\beta}$$ becomes:
+
+$$\begin{aligned}
+ \Delta{\beta}
+ = \frac{\omega^2 n}{\beta c^2} \frac{\displaystyle \iint_{-\infty}^\infty \Delta{n} \: |F|^2 \dd{x} \dd{y}}
+ {\displaystyle \iint_{-\infty}^\infty |F|^2 \dd{x} \dd{y}}
+\end{aligned}$$
+
+Recall that $$\beta$$ is the wavenumber of the confined mode:
+by solving the unperturbed $$F$$-equation,
+it can be shown that $$\beta$$'s value is somewhere
+between the bulk wavenumbers of the fiber materials.
+Since we just approximated $$n$$ as a constant,
+this means that $$\omega n / c \approx \beta$$, leading us to
+the general "final" form of $$\Delta{\beta}$$,
+with all the arguments shown for clarity:
+
+$$\begin{aligned}
+ \boxed{
+ \Delta{\beta}(\omega)
+ = \frac{\omega}{c \mathcal{A}_\mathrm{mode}(\omega)}
+ \iint_{-\infty}^\infty \Delta{n}(x, y, \omega) \: |F(x, y, \omega)|^2 \dd{x} \dd{y}
+ }
+\end{aligned}$$
+
+Where we have defined the *mode area* $$\mathcal{A}_\mathrm{mode}$$ as shown below.
+In order for $$\mathcal{A}_\mathrm{mode}$$ to be in units of area,
+$$F$$ must be dimensionless,
+and consequently $$A$$ has (SI) units of an electric field.
+
+$$\begin{aligned}
+ \mathcal{A}_\mathrm{mode}(\omega)
+ \equiv \iint_{-\infty}^\infty |F(x, y, \omega)|^2 \dd{x} \dd{y}
+\end{aligned}$$
+
+Now we finally turn our attention to the equation for $$A$$.
+Before perturbation, it was:
+
+$$\begin{aligned}
+ 0
+ &= 2 i \beta_0 \pdv{A}{z} + \big( \beta^2 - \beta_0^2 \big) A
+\end{aligned}$$
+
+Where $$\beta \approx \beta_0$$, so we can replace
+$$\beta^2 - \beta_0^2$$ with $$2 \beta_0 (\beta - \beta_0)$$.
+Also including $$\Delta{\beta}$$, we get:
+
+$$\begin{aligned}
+ 0
+ &= i \pdv{A}{z} + \big( \beta + \Delta{\beta} - \beta_0 \big) A
+\end{aligned}$$
+
+Usually, we do not know a full expression for $$\beta(\omega)$$,
+so it makes sense to expand it around the carrier frequency $$\omega_0$$ as follows,
+where $$\beta_n = \idvn{n}{\beta}{\omega} |_{\omega = \omega_0}$$:
+
+$$\begin{aligned}
+ \beta(\omega)
+ &= \beta_0
+ + (\omega - \omega_0) \beta_1
+ + (\omega - \omega_0)^2 \frac{\beta_2}{2}
+ + (\omega - \omega_0)^3 \frac{\beta_3}{6}
+ + \: ...
+\end{aligned}$$
+
+Spectrally, the broader the light pulse, the more terms must be included.
+Recall that earlier, in order to treat $$\chi^{(3)}$$ as instantaneous,
+we already assumed a temporally broad
+(spectrally narrow) pulse.
+Hence, for simplicity, we can cut off this Taylor series at $$\beta_2$$,
+which is good enough for many cases.
+Inserting the expansion into $$A$$'s equation:
+
+$$\begin{aligned}
+ 0
+ &= i \pdv{A}{z} + i \frac{\beta_1}{s} (-i s \Omega) A - \frac{\beta_2}{2 s^2} (- i s \Omega)^2 A + \Delta{\beta}_0 A
+\end{aligned}$$
+
+Which we have rewritten as preparation for taking the inverse Fourier transform,
+by introducing $$s$$ and by replacing $$\Delta{\beta}(\omega)$$
+with $$\Delta{\beta_0} \equiv \Delta{\beta}(\omega_0)$$
+in order to remove all explicit dependence on $$\omega$$.
+After transforming and using $$s^2 = 1$$,
+we get the following equation for $$A(z, t)$$:
+
+$$\begin{aligned}
+ 0
+ &= i \pdv{A}{z} + i s \beta_1 \pdv{A}{t} - \frac{\beta_2}{2} \pdvn{2}{A}{t} + \Delta{\beta}_0 A
+\end{aligned}$$
+
+The next step is to insert our expression for $$\Delta{\beta}_0$$,
+for which we must first choose a specific form for $$\Delta{n}$$
+according to which effects we want to include.
+Earlier, we approximated $$\varepsilon_r \approx n^2$$,
+so if we instead say that $$\varepsilon_r = (n \!+\! \Delta{n})^2$$,
+then $$\Delta{n}$$ should include absorption and nonlinearity.
+A simple and commonly used form for $$\Delta{n}$$ is therefore:
+
+$$\begin{aligned}
+ \Delta{n}
+ = n_2 I + i \frac{\alpha c}{2 \omega}
+\end{aligned}$$
+
+Where $$I$$ is the intensity (i.e. power per unit area) of the light,
+$$n_2$$ is the material's *Kerr coefficient* in units of inverse intensity,
+and $$\alpha$$ is the attenuation coefficient
+consisting of linear and nonlinear contributions
+(see [multi-photon absorption](/know/concept/multi-photon-absorption/)).
+Specifically, they are given by:
+
+$$\begin{aligned}
+ n_2
+ = \frac{3 \Real\{\chi^{(3)}_{xxxx}\}}{4 \varepsilon_0 c n^2}
+ \qquad
+ \alpha
+ = \frac{\omega \Imag\{\chi^{(1)}_{xx}\}}{c n}
+ + \frac{3 \omega \Imag\{\chi^{(3)}_{xxxx}\}}{2 \varepsilon_0 c^2 n^2} I
+ \qquad
+ I
+ = \frac{\varepsilon_0 c n}{2} |E|^2
+\end{aligned}$$
+
+For simplicity, we set $$\Imag\{\chi^{(3)}_{xxxx}\} = 0$$,
+which is a good approximation for fibers made of silica.
+Inserting this form of $$\Delta{n}$$ into $$\Delta{\beta_0}$$ then yields:
+
+$$\begin{aligned}
+ \Delta{\beta}_0
+ &= i \frac{\alpha}{2} \frac{\mathcal{A}_\mathrm{mode}}{\mathcal{A}_\mathrm{mode}}
+ + \frac{\omega_0 \varepsilon_0 c n n_2}{2 c \mathcal{A}_\mathrm{mode}} |A|^2 \iint_{-\infty}^\infty |F|^4 \dd{x} \dd{y}
+ \\
+ &= i \frac{\alpha}{2}
+ + \gamma_0 \frac{\varepsilon_0 c n}{2} \mathcal{A}_\mathrm{mode} |A|^2
+\end{aligned}$$
+
+Where we have defined the nonlinear parameter $$\gamma_0$$ like so,
+involving the **effective mode area** $$\mathcal{A}_\mathrm{eff}$$,
+which contains all information about $$F$$ needed for solving $$A$$'s equation:
+
+$$\begin{aligned}
+ \boxed{
+ \gamma_0
+ = \gamma(\omega_0)
+ \equiv \frac{\omega_0 n_2}{c \mathcal{A}_\mathrm{eff}}
+ }
+ \qquad \qquad
+ \boxed{
+ \mathcal{A}_\mathrm{eff}(\omega_0)
+ \equiv \frac{\displaystyle \bigg( \iint_{-\infty}^\infty |F|^2 \dd{x} \dd{y} \bigg)^2}
+ {\displaystyle \iint_{-\infty}^\infty |F|^4 \dd{x} \dd{y}}
+ }
+\end{aligned}$$
+
+Substituting $$\Delta{\beta_0}$$ into the main problem
+yields a prototype of the NLS equation:
+
+$$\begin{aligned}
+ 0
+ &= i \pdv{A}{z} + i s \beta_1 \pdv{A}{t} - \frac{\beta_2}{2} \pdvn{2}{A}{t} + i \frac{\alpha}{2} A
+ + \gamma_0 \frac{\varepsilon_0 c n}{2} \mathcal{A}_\mathrm{mode} |A|^2 A
+\end{aligned}$$
+
+The factor $$\varepsilon_0 c n / 2$$ looks familiar from the intensity $$I$$.
+This, combined with $$\mathcal{A}_\mathrm{mode}$$
+and the fact that $$A$$ is an electric field,
+suggests that we can redefine $$A \to A'$$
+such that $$|A'|^2$$ is the optical power in watts.
+Hence we make the following transformation:
+
+$$\begin{aligned}
+ \frac{\varepsilon_0 c n}{2} \mathcal{A}_\mathrm{mode} |A|^2
+ \:\:\to\:\:
+ |A|^2
+\end{aligned}$$
+
+We can divide away the transformation factors
+from all other terms in the equation, since they are linear,
+leading to the full *nonlinear Schrödinger equation*:
+
+$$\begin{aligned}
+ \boxed{
+ 0
+ = i \pdv{A}{z} + i s \beta_1 \pdv{A}{t} - \frac{\beta_2}{2} \pdvn{2}{A}{t} + i \frac{\alpha}{2} A + \gamma_0 |A|^2 A
+ }
+\end{aligned}$$
+
+This can be reduced by switching to a coordinate system
+where the time axis slides along the propagation axis at a speed $$s v$$,
+so we define $$Z \equiv z$$ and $$T \equiv t - s z / v$$ such that:
+
+$$\begin{aligned}
+ \pdv{A}{z}
+ &= \pdv{A}{Z} \pdv{Z}{z} + \pdv{A}{T} \pdv{T}{z}
+ = \pdv{A}{Z} - \frac{s}{v} \pdv{A}{T}
+ \\
+ \pdv{A}{t}
+ &= \pdv{A}{Z} \pdv{Z}{t} + \pdv{A}{T} \pdv{T}{t}
+ = \pdv{A}{T}
+\end{aligned}$$
+
+We insert this and set $$v = v_g$$,
+where $$v_g = 1 / \beta_1$$ is the light's group velocity:
+
+$$\begin{aligned}
+ \boxed{
+ 0
+ = i \pdv{A}{Z} - \frac{\beta_2}{2} \pdvn{2}{A}{T} + i \frac{\alpha}{2} A + \gamma_0 |A|^2 A
+ }
+\end{aligned}$$
+
+The NLS equation's name is due to its similarity
+to the Schrödinger equation of quantum physics,
+if you set $$\alpha = 0$$ and treat $$\gamma_0 |A|^2$$ as a potential.
+In fiber optics, the equation is usually rearranged
+to highlight that $$Z$$ (or $$z$$) is the propagation direction:
+
+$$\begin{aligned}
+ \pdv{A}{Z}
+ = - i \frac{\beta_2}{2} \pdvn{2}{A}{T} - \frac{\alpha}{2} A + i \gamma_0 |A|^2 A
+\end{aligned}$$
+
+Next, we want to reduce the equation to its dimensionless form.
+To do so, we make the following coordinate transformation,
+where $$\tilde{A}$$, $$\tilde{Z}$$ and $$\tilde{T}$$ are unitless,
+and $$A_c$$, $$Z_c$$ and $$T_c$$ are dimensioned scale parameters
+to be determined later:
+
+$$\begin{aligned}
+ \tilde{A}(\tilde{Z}, \tilde{T})
+ = \frac{A(Z, T)}{A_c}
+ \qquad\qquad
+ \tilde{Z}
+ = \frac{Z}{Z_c}
+ \qquad\qquad
+ \tilde{T}
+ = \frac{T}{T_c}
+\end{aligned}$$
+
+We insert this into the NLS equation,
+after setting $$\alpha = 0$$ according to convention:
+
+$$\begin{aligned}
+ 0
+ = i \frac{A_c}{Z_c} \pdv{\tilde{A}}{\tilde{Z}}
+ - \frac{\beta_2}{2} \frac{A_c}{T_c^2} \pdvn{2}{\tilde{A}}{\tilde{T}}
+ + \gamma_0 A_c^3 \big|\tilde{A}\big|^2 \tilde{A}
+\end{aligned}$$
+
+Multiplying by $$Z_c / A_c$$ to make all terms dimensionless leads us to:
+
+$$\begin{aligned}
+ 0
+ = i \pdv{\tilde{A}}{\tilde{Z}}
+ - \frac{\beta_2 Z_c}{2 T_c^2} \pdvn{2}{\tilde{A}}{\tilde{T}}
+ + \gamma_0 A_c^2 Z_c \big|\tilde{A}\big|^2 \tilde{A}
+\end{aligned}$$
+
+The goal is to remove those constant factors.
+In other words, we demand:
+
+$$\begin{aligned}
+ \frac{\beta_2 Z_c}{2 T_c^2}
+ = \mp 1
+ \qquad\qquad
+ \gamma_0 A_c^2 Z_c
+ = 1
+\end{aligned}$$
+
+Where the choice of $$\mp$$ will be explained shortly.
+Note that we only have two equations for three unknowns
+($$A_c$$, $$Z_c$$ and $$T_c$$),
+so one of the parameters needs to fixed manually.
+For example, we could choose $$Z_c = 1\:\mathrm{m}$$, and then:
+
+$$\begin{aligned}
+ A_c
+ = \frac{1}{\sqrt{\gamma Z_c}}
+ \qquad\qquad
+ T_c
+ = \sqrt{\frac{\mp \beta_2 Z_c}{2}}
+\end{aligned}$$
+
+Note that this requires that $$\gamma_0 > 0$$,
+which is true for the vast majority of materials,
+and that we choose the sign $$\mp$$ such that $$\mp \beta_2 > 0$$.
+We thus arrive at:
+
+$$\begin{aligned}
+ \boxed{
+ 0
+ = i \pdv{\tilde{A}}{\tilde{Z}}
+ \pm \pdvn{2}{\tilde{A}}{\tilde{T}}
+ + \big|\tilde{A}\big|^2 \tilde{A}
+ }
+\end{aligned}$$
+
+Because soliton solutions only exist
+in the *anomalous dispersion* regime $$\beta_2 < 0$$,
+most authors just write $$+$$.
+There are still plenty of interesting effects
+in the *normal dispersion* regime $$\beta_2 > 0$$,
+hence we write $$\pm$$ for the sake of completeness.
+
+
+
+## References
+
+1. G.P. Agrawal,
+ *Nonlinear fiber optics*, 6th edition,
+ Elsevier.
+2. O. Bang,
+ *Nonlinear mathematical physics: lecture notes*,
+ 2020, unpublished.