Categories: Fiber optics, Mathematics, Nonlinear optics, Physics.

Nonlinear Schrödinger equation

The nonlinear Schrödinger (NLS) equation is a nonlinear 1+1D partial differential equation that appears in many areas of physics. It is often given in its dimensionless form, where it governs the envelope $u(z, t)$ of an underlying carrier wave, with $t$ the transverse coordinate, and $r = \pm 1$ a parameter determining which of two regimes the equation is intended for:

\begin{aligned} \boxed{ i \pdv{u}{z} + \pdvn{2}{u}{t} + r |u|^2 u = 0 } \end{aligned}

Many variants exist, depending on the conventions used by authors. The NLS equation is used to describe pulses in fiber optics (as derived below), waves over deep water, local opening of DNA chains, and much more. Very roughly speaking, it is a valid description of “all” weakly nonlinear, slowly modulated waves in physics.

It exhibits an incredible range of behaviors, from “simple” effects such as dispersive broadening, self-phase modulation and first-order solitons, to weirder and more complicated phenomena like modulational instability, optical wave breaking and periodic higher-order solitons. It is also often modified to include additional physics, further enriching its results with e.g. self-steepening and soliton self-frequency shifting.

We only consider fiber optics here; the NLS equation can be derived in many other ways. We start from the most general form of the electromagnetic wave equation, after assuming the medium cannot be magnetized ( $\mu_r = 1$ ):

\begin{aligned} \nabla \cross \big( \nabla \cross \vb{E} \big) = - \mu_0 \varepsilon_0 \pdvn{2}{\vb{E}}{t} - \mu_0 \pdvn{2}{\vb{P}}{t} \end{aligned}

Using the vector identity $\nabla \cross (\nabla \cross \vb{E}) = \nabla (\nabla \cdot \vb{E}) - \nabla^2 \vb{E}$ and Gauss’s law $\nabla \cdot \vb{E} = 0$ , and splitting the polarization $\vb{P}$ into linear and nonlinear contributions $\vb{P}_\mathrm{L}$ and $\vb{P}_\mathrm{NL}$ :

\begin{aligned} \nabla^2 \vb{E} - \mu_0 \varepsilon_0 \pdvn{2}{\vb{E}}{t} &= \mu_0 \pdvn{2}{\vb{P}_\mathrm{L}}{t} + \mu_0 \pdvn{2}{\vb{P}_\mathrm{NL}}{t} \end{aligned}

In general, $\vb{P}_\mathrm{L}$ is given by the convolution of $\vb{E}$ with a second-rank response tensor $\chi^{(1)}$ :

\begin{aligned} \vb{P}_\mathrm{L}(\vb{r}, t) = \varepsilon_0 \int_{-\infty}^\infty \chi^{(1)}(t - t') \cdot \vb{E}(\vb{r}, t') \dd{t'} \end{aligned}

In $\vb{P}_\mathrm{NL}$ we only include third-order nonlinearities, since higher orders are usually negligible, and second-order nonlinear effects only exist in very specific crystals. So we “only” need to deal with a fourth-rank response tensor $\chi^{(3)}$ :

\begin{aligned} \vb{P}_\mathrm{NL}(\vb{r}, t) = \varepsilon_0 \iiint_{-\infty}^\infty \chi^{(3)}(t \!-\! t_1, t \!-\! t_2, t \!-\! t_3) \:\vdots\: \vb{E}(\vb{r}, t_1) \vb{E}(\vb{r}, t_2) \vb{E}(\vb{r}, t_3) \dd{t_1} \dd{t_2} \dd{t_3} \end{aligned}

In practice, two phenomena contribute to $\chi^{(3)}$ : the Kerr effect due to electrons’ response to $\vb{E}$ , and Raman scattering due to nuclei’s response, which is slower because of their mass. But if the light pulses are sufficiently long (>1ps in silica), both effects can be treated as fast, so:

\begin{aligned} \chi^{(3)}(t \!-\! t_1, t \!-\! t_2, t \!-\! t_3) &= \chi^{(3)} \delta(t - t_1) \delta(t - t_2) \delta(t - t_3) \end{aligned}

Where $\delta$ is the Dirac delta function. To keep things simple, we consider linearly $x$ -polarized light $\vb{E} = \vu{x} |\vb{E}|$ , such that the tensor can be replaced with its scalar element $\chi^{(3)}_{xxxx}$ . Then:

\begin{aligned} \vb{P}_\mathrm{NL} = \varepsilon_0 \chi^{(3)}_{xxxx} \big( \vb{E} \cdot \vb{E} \big) \vb{E} \end{aligned}

For the same reasons, the linear polarization is reduced to:

\begin{aligned} \vb{P}_\mathrm{L} &= \varepsilon_0 \chi^{(1)}_{xx} \vb{E} \end{aligned}

Next, we decompose $\vb{E}$ as follows, consisting of a carrier wave $e^{-i \omega_0 t}$ at a constant frequency $\omega_0$ , modulated by an envelope $E$ that is assumed to be slowly-varying compared to the carrier, plus the complex conjugate $E^* e^{i \omega_0 t}$ :

\begin{aligned} \vb{E}(\vb{r}, t) &= \vu{x} \frac{1}{2} \Big( E(\vb{r}, t) e^{- i \omega_0 t} + E^*(\vb{r}, t) e^{i \omega_0 t} \Big) \end{aligned}

Note that no generality has been lost in this step. Inserting it into the polarizations:

\begin{aligned} \mathrm{P}_\mathrm{L} &= \vu{x} \frac{1}{2} \varepsilon_0 \chi^{(1)}_{xx} \Big( E(\vb{r}, t) e^{- i \omega_0 t} + E^*(\vb{r}, t) e^{i \omega_0 t} \Big) \\ \vb{P}_\mathrm{NL} &= \vu{x} \frac{1}{8} \varepsilon_0 \chi^{(3)}_{xxxx} \Big( E e^{- i \omega_0 t} + E^* e^{i \omega_0 t} \Big)^{3} \\ &= \vu{x} \frac{1}{8} \varepsilon_0 \chi^{(3)}_{xxxx} \Big( E^3 e^{- i 3 \omega_0 t} + 3 E^2 E^* e^{- i \omega_0 t} + 3 E (E^*)^2 e^{i \omega_0 t} + (E^*)^3 e^{i 3 \omega_0 t} \Big) \end{aligned}

The terms with $3 \omega_0$ represent third-harmonic generation, and only matter if the carrier is phase-matched to the tripled wave, which is generally not the case, so they can be ignored. Now, if we decompose the polarizations in the same was as $\vb{E}$ :

\begin{aligned} \vb{P}_\mathrm{L}(\vb{r}, t) &= \vu{x} \frac{1}{2} \Big( P_\mathrm{L}(\vb{r}, t) e^{- i \omega_0 t} + P_\mathrm{L}^*(\vb{r}, t) e^{i \omega_0 t} \Big) \\ \vb{P}_\mathrm{NL}(\vb{r}, t) &= \vu{x} \frac{1}{2} \Big( P_\mathrm{NL}(\vb{r}, t) e^{- i \omega_0 t} + P_\mathrm{NL}^*(\vb{r}, t) e^{i \omega_0 t} \Big) \end{aligned}

Then it is straightforward to see that their envelope functions are given by:

\begin{aligned} P_\mathrm{L} &= \varepsilon_0 \chi^{(1)}_{xx} E \\ P_\mathrm{NL} &= \frac{3}{4} \varepsilon_0 \chi^{(3)}_{xxxx} |E|^2 E \end{aligned}

The forward carrier $e^{- i \omega_0 t}$ and the backward carrier $e^{i \omega_0 t}$ can be regarded as separate channels, which only interact via $P_\mathrm{NL}$ . From now on, we only consider the forward-propagating wave, so all terms containing $e^{i \omega_0 t}$ are dropped; by taking the complex conjugate of the resulting equations, the backward-propagating counterparts can always be recovered, so no information is really lost. Therefore, the main wave equation becomes:

\begin{aligned} 0 &= \bigg( \nabla^2 E - \mu_0 \varepsilon_0 \pdvn{2}{E}{t} - \mu_0 \pdvn{2}{P_\mathrm{L}}{t} - \mu_0 \pdvn{2}{P_\mathrm{NL}}{t} \bigg) e^{-i \omega_0 t} \\ &\approx \bigg( \nabla^2 E - \Big( 1 + \chi^{(1)}_{xx} + \frac{3}{4} \chi^{(3)}_{xxxx} |E|^2 \Big) \mu_0 \varepsilon_0 \pdvn{2}{E}{t} \bigg) e^{-i \omega_0 t} \end{aligned}

Where we have used our assumption that $E$ is slowly-varying to treat $|E|^2$ as a constant, in order to move it outside the $t$ -derivative. We thus arrive at:

\begin{aligned} 0 &= \bigg( \nabla^2 E - \frac{\varepsilon_r}{c^2} \pdvn{2}{E}{t} \bigg) e^{-i \omega_0 t} \end{aligned}

Where $c = 1 / \sqrt{\mu_0 \varepsilon_0}$ is the phase velocity of light in a vacuum, and the relative permittivity $\varepsilon_r$ is defined as shown below. Note that this is a mild abuse of notation, since the symbol $\varepsilon_r$ is usually reserved for linear materials:

\begin{aligned} \varepsilon_r \equiv 1 + \chi^{(1)}_{xx} + \frac{3}{4} \chi^{(3)}_{xxxx} |E|^2 \end{aligned}

Next, we take the Fourier transform $t \to \omega$ of the wave equation, again treating $|E|^2$ (inside $\varepsilon_r$ ) as a constant. The constant $s = \pm 1$ is included here to deal with the fact that different authors use different sign conventions:

\begin{aligned} 0 &= \hat{\mathcal{F}}\bigg\{ \bigg( \nabla^2 E - \frac{\varepsilon_r}{c^2} \pdvn{2}{E}{t} \bigg) e^{-i \omega_0 t} \bigg\} \\ &= \int_{-\infty}^\infty \bigg( \nabla^2 E - \frac{\varepsilon_r}{c^2} \pdvn{2}{E}{t} \bigg) e^{i s (\omega - \omega_0) t} \dd{t} \\ &= \nabla^2 E + s^2 (\omega - \omega_0)^2 \frac{\varepsilon_r}{c^2} E \end{aligned}

We use $s^2 = 1$ and define $\Omega \equiv \omega - \omega_0$ as the frequency shift relative to the carrier wave:

\begin{aligned} 0 &= \nabla^2 E + \frac{\Omega^2 \varepsilon_r}{c^2} E \end{aligned}

This is a so-called Helmholtz equation in 3D, which we will solve using separation of variables, by assuming that its solution can be written as:

\begin{aligned} E(\vb{r}, \Omega) &= F(x, y) \: A(z, \Omega) \: e^{i \beta_0 z} \end{aligned}

Where $\beta_0$ is the wavenumber of the carrier, which will be determined later. Inserting this ansatz into the Helmholtz equation yields:

\begin{aligned} 0 &= \Big( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \Big) A e^{i \beta_0 z} + \pdvn{2}{}{z} \Big( A e^{i \beta_0 z} \Big) F + \frac{\Omega^2 \varepsilon_r}{c^2} F A e^{i \beta_0 z} \\ &= \bigg( \Big( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \Big) A + \Big( \pdvn{2}{A}{z} + 2 i \beta_0 \pdv{A}{z} - \beta_0^2 A \Big) F + \frac{\Omega^2 \varepsilon_r}{c^2} F A \bigg) e^{i \beta_0 z} \end{aligned}

We divide by $F A \: e^{i \beta_0 z}$ and rearrange the terms in a specific way:

\begin{aligned} \Big( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \Big) \frac{1}{F} + \frac{\Omega^2 \varepsilon_r}{c^2} &= - 2 i \beta_0 \pdv{A}{z} \frac{1}{A} + \beta_0^2 \end{aligned}

Now all the $x$ - and $y$ -dependence is on the left, and the $z$ -dependence is on the right. We have placed the $\varepsilon_r$ -term on the left too because it depends relatively strongly on $(x, y)$ to describe the fiber’s internal structure, and weakly on $z$ due to nonlinear effects. Meanwhile, $\beta_0$ is on the right because that will lead to a nicer equation for $A$ later.

Note that both sides are functions of $\Omega$ . Based on the aforementioned dependences, in order for this equation to have a solution for all $(x, y, z)$ , there must exist a quantity $\beta(\Omega)$ that is constant in space, such that we obtain two separated equations for $F$ and $A$ :

\begin{aligned} \beta(\omega) &= \bigg( \pdvn{2}{F}{x} + \pdvn{2}{F}{y} \bigg) \frac{1}{F} + \frac{\omega^2 \varepsilon_r}{c^2} \\ \beta(\Omega) &= - 2 i \beta_0 \pdv{A}{z} \frac{1}{A} + \beta_0^2 \end{aligned}

Note that we replaced $\Omega$ with $\omega$ in $F$ ’s equation (and redefined $\beta$ and $\varepsilon_r$ accordingly). This is not an innocent detail: the idea is that $\omega \sqrt{\varepsilon_r} / c$ would be the light’s wavenumber if it had not been trapped in a waveguide, and that $\beta$ is the confined wavenumber, also known as the propagation constant. If we had kept $\Omega$ , the meaning of $\beta$ would not be so straightforward.

The difference between $\beta(\omega)$ and $\beta_0$ is simply that $\beta_0 \equiv \beta(\omega_0)$ . Our ansatz for separating the variables contained $\beta_0$ , such that the full carrier wave $e^{i \beta_0 z - i \omega_0 t}$ was represented (with $e^{- i \omega_0 t}$ now hidden inside the Fourier transform). But later, to properly describe how light behaves inside the fiber, the full dispersion relation $\beta(\omega)$ will be needed.

Multiplying by $F$ and $A$ , we get the following set of equations, implicitly coupled via $\beta$ :

\begin{aligned} \boxed{ \begin{aligned} 0 &= \pdvn{2}{F}{x} + \pdvn{2}{F}{y} + \bigg( \frac{\omega^2 \varepsilon_r}{c^2} - \beta^2 \bigg) F \\ 0 &= 2 i \beta_0 \pdv{A}{z} + \big( \beta^2 - \beta_0^2 \big) A \end{aligned} } \end{aligned}

The equation for $F$ must be solved first. To do so, we treat the nonlinearity as a perturbation to be neglected initially. In other words, we first solve the following eigenvalue problem for $\beta^2$ , where $n(x, y)$ is the linear refractive index, with $n^2 = 1 + \Real\{\chi^{(1)}_{xx}\} \approx \varepsilon_r$ :

\begin{aligned} \pdvn{2}{F}{x} + \pdvn{2}{F}{y} + \bigg( \frac{\omega^2 n^2}{c^2} - \beta^2 \bigg) F = 0 \end{aligned}

This gives us the allowed values of $\beta$ ; see step-index fiber for an example solution. Now we add the small index change $\Delta{n}(x, y)$ due to nonlinear effects:

\begin{aligned} \varepsilon_r = (n + \Delta{n})^2 \approx n^2 + 2 n \: \Delta{n} \end{aligned}

Then it can be shown using first-order perturbation theory that the eigenfunction $F$ is not really affected, and the eigenvalue $\beta^2$ is shifted by $\Delta(\beta^2)$ , given by:

\begin{aligned} \Delta(\beta^2) = \frac{2 \omega^2}{c^2} \frac{\displaystyle \iint_{-\infty}^\infty n \: \Delta{n} \: |F|^2 \dd{x} \dd{y}} {\displaystyle \iint_{-\infty}^\infty |F|^2 \dd{x} \dd{y}} \end{aligned}

But we are more interested in the wavenumber shift $\Delta{\beta}$ than the eigenvalue shift $\Delta(\beta^2)$ . They are related to one another as follows:

\begin{aligned} \beta^2 + \Delta(\beta^2) = (\beta + \Delta{\beta})^2 \approx \beta^2 + 2 \beta \Delta{\beta} \end{aligned}

Furthermore, we assume that the fiber only consists of materials with similar refractive indices, or in other words, that it confines the light using only a small index difference, in which case we can treat $n$ as a constant and move it outside the integral. Then $\Delta{\beta}$ becomes:

\begin{aligned} \Delta{\beta} = \frac{\omega^2 n}{\beta c^2} \frac{\displaystyle \iint_{-\infty}^\infty \Delta{n} \: |F|^2 \dd{x} \dd{y}} {\displaystyle \iint_{-\infty}^\infty |F|^2 \dd{x} \dd{y}} \end{aligned}

Recall that $\beta$ is the wavenumber of the confined mode: by solving the unperturbed $F$ -equation, it can be shown that $\beta$ ’s value is somewhere between the bulk wavenumbers of the fiber materials. Since we just approximated $n$ as a constant, this means that $\omega n / c \approx \beta$ , leading us to the general “final” form of $\Delta{\beta}$ , with all the arguments shown for clarity:

\begin{aligned} \boxed{ \Delta{\beta}(\omega) = \frac{\omega}{c \mathcal{A}_\mathrm{mode}} \iint_{-\infty}^\infty \Delta{n}(x, y, \omega) \: |F(x, y)|^2 \dd{x} \dd{y} } \end{aligned}

Where we have defined the mode area $\mathcal{A}_\mathrm{mode}$ as shown below. In order for $\mathcal{A}_\mathrm{mode}$ to be in units of area, $F$ must be dimensionless, and consequently $A$ has (SI) units of an electric field.

\begin{aligned} \mathcal{A}_\mathrm{mode} \equiv \iint_{-\infty}^\infty |F|^2 \dd{x} \dd{y} \end{aligned}

Now we finally turn our attention to the equation for $A$ . Before perturbation, it was:

\begin{aligned} 0 &= 2 i \beta_0 \pdv{A}{z} + \big( \beta^2 - \beta_0^2 \big) A \end{aligned}

Where $\beta \approx \beta_0$ , so we can replace $\beta^2 - \beta_0^2$ with $2 \beta_0 (\beta - \beta_0)$ . Also including $\Delta{\beta}$ , we get:

\begin{aligned} 0 &= i \pdv{A}{z} + \big( \beta + \Delta{\beta} - \beta_0 \big) A \end{aligned}

Usually, we do not know a full expression for $\beta(\omega)$ , so it makes sense to expand it around the carrier frequency $\omega_0$ as follows, where $\beta_n = \idvn{n}{\beta}{\omega} |_{\omega = \omega_0}$ :

\begin{aligned} \beta(\omega) &= \beta_0 + (\omega - \omega_0) \beta_1 + (\omega - \omega_0)^2 \frac{\beta_2}{2} + (\omega - \omega_0)^3 \frac{\beta_3}{6} + \: ... \end{aligned}

Spectrally, the broader the light pulse, the more terms must be included. Recall that earlier, in order to treat $\chi^{(3)}$ as instantaneous, we already assumed a temporally broad (spectrally narrow) pulse. Hence, for simplicity, we can cut off this Taylor series at $\beta_2$ , which is good enough in many cases. Inserting the expansion into $A$ ’s equation:

\begin{aligned} 0 &= i \pdv{A}{z} + i \frac{\beta_1}{s} (-i s \Omega) A - \frac{\beta_2}{2 s^2} (- i s \Omega)^2 A + \Delta{\beta}_0 A \end{aligned}

Which we have rewritten in preparation for taking the inverse Fourier transform, by introducing $s$ and by replacing $\Delta{\beta}(\omega)$ with $\Delta{\beta_0} \equiv \Delta{\beta}(\omega_0)$ in order to remove all explicit dependence on $\omega$ , i.e. we only keep the first term of $\Delta{\beta}$ ’s Taylor expansion. After transforming and using $s^2 = 1$ , we get the following equation for $A(z, t)$ :

\begin{aligned} 0 &= i \pdv{A}{z} + i s \beta_1 \pdv{A}{t} - \frac{\beta_2}{2} \pdvn{2}{A}{t} + \Delta{\beta}_0 A \end{aligned}

The next step is to insert our expression for $\Delta{\beta}_0$ , for which we must first choose a specific form for $\Delta{n}$ according to which effects we want to include. Earlier, we approximated $\varepsilon_r \approx n^2$ , so if we instead say that $\varepsilon_r = (n \!+\! \Delta{n})^2$ , then $\Delta{n}$ should include absorption and nonlinearity. The most commonly used form for $\Delta{n}$ is therefore:

\begin{aligned} \Delta{n}(x, y, \omega) = n_2(\omega) \: I(x, y, \omega) + i \frac{c \alpha(\omega)}{2 \omega} \end{aligned}

Where $I$ is the intensity (i.e. power per unit area) of the light, $n_2$ is the material’s Kerr coefficient in units of inverse intensity, and $\alpha$ is the attenuation coefficient consisting of linear and nonlinear contributions (see multi-photon absorption). Specifically, they are given by:

\begin{aligned} n_2 = \frac{3 \Real\{\chi^{(3)}_{xxxx}\}}{4 \varepsilon_0 c n^2} \qquad \alpha = \frac{\omega \Imag\{\chi^{(1)}_{xx}\}}{c n} + \frac{3 \omega \Imag\{\chi^{(3)}_{xxxx}\}}{2 \varepsilon_0 c^2 n^2} I \qquad I = \frac{\varepsilon_0 c n}{2} |F|^2 |A|^2 \end{aligned}

For simplicity we set $\Imag\{\chi^{(3)}_{xxxx}\} = 0$ , which is a good approximation for silica fibers. Inserting this form of $\Delta{n}$ into $\Delta{\beta_0}$ and neglecting the $(x, y)$ -dependence of $\Delta{n}$ yields:

\begin{aligned} \Delta{\beta}_0 &= i \frac{\alpha}{2} \frac{\mathcal{A}_\mathrm{mode}}{\mathcal{A}_\mathrm{mode}} + \frac{\omega_0 \varepsilon_0 c n n_2}{2 c \mathcal{A}_\mathrm{mode}} |A|^2 \iint_{-\infty}^\infty |F|^4 \dd{x} \dd{y} \\ &= i \frac{\alpha}{2} + \gamma_0 \frac{\varepsilon_0 c n}{2} \mathcal{A}_\mathrm{mode} |A|^2 \end{aligned}

Where we have defined the parameter $\gamma_0 \equiv \gamma(\omega_0)$ like so, involving the effective mode area $\mathcal{A}_\mathrm{eff}$ , which contains all information about $F$ needed for solving $A$ ’s equation:

\begin{aligned} \boxed{ \gamma(\omega) \equiv \frac{\omega n_2(\omega)}{c \mathcal{A}_\mathrm{eff}(\omega)} } \qquad \qquad \boxed{ \mathcal{A}_\mathrm{eff}(\omega) \equiv \frac{\displaystyle \bigg( \iint_{-\infty}^\infty |F|^2 \dd{x} \dd{y} \bigg)^2} {\displaystyle \iint_{-\infty}^\infty |F|^4 \dd{x} \dd{y}} } \end{aligned}

Note the $\omega$ -dependence of $A_\mathrm{eff}$ : so far we have conveniently ignored that $F$ also depends on $\omega$ , because it is a parameter in its eigenvalue equation. This is valid for spectrally narrow pulses, so we will stick with it. Just beware that some people make the ad-hoc generalization $\gamma_0 \to \gamma(\omega)$ , which is not correct in general (this is an advanced topic, see Lægsgaard).

Substituting $\Delta{\beta_0}$ into the main problem yields a prototype of the NLS equation:

\begin{aligned} 0 &= i \pdv{A}{z} + i s \beta_1 \pdv{A}{t} - \frac{\beta_2}{2} \pdvn{2}{A}{t} + i \frac{\alpha}{2} A + \gamma_0 \frac{\varepsilon_0 c n}{2} \mathcal{A}_\mathrm{mode} |A|^2 A \end{aligned}

The factor $\varepsilon_0 c n / 2$ looks familiar from the intensity $I$ . This, combined with $\mathcal{A}_\mathrm{mode}$ and the fact that $A$ is an electric field, suggests that we can redefine $A \to A'$ such that $|A'|^2$ is the optical power in watts. Hence we make the following transformation:

\begin{aligned} \frac{\varepsilon_0 c n}{2} \mathcal{A}_\mathrm{mode} |A|^2 \:\:\to\:\: |A|^2 \end{aligned}

We can divide away the transformation factors from all other terms in the equation, since they are linear, leading to the full nonlinear Schrödinger equation:

\begin{aligned} \boxed{ 0 = i \pdv{A}{z} + i s \beta_1 \pdv{A}{t} - \frac{\beta_2}{2} \pdvn{2}{A}{t} + i \frac{\alpha}{2} A + \gamma_0 |A|^2 A } \end{aligned}

This can be reduced by switching to a coordinate system where the time axis slides along the propagation axis at a speed $s v$ , so we define $Z \equiv z$ and $T \equiv t - s z / v$ such that:

\begin{aligned} \pdv{A}{z} &= \pdv{A}{Z} \pdv{Z}{z} + \pdv{A}{T} \pdv{T}{z} = \pdv{A}{Z} - \frac{s}{v} \pdv{A}{T} \\ \pdv{A}{t} &= \pdv{A}{Z} \pdv{Z}{t} + \pdv{A}{T} \pdv{T}{t} = \pdv{A}{T} \end{aligned}

We insert this and set $v = v_g$ , where $v_g = 1 / \beta_1$ is the light’s group velocity:

\begin{aligned} \boxed{ 0 = i \pdv{A}{Z} - \frac{\beta_2}{2} \pdvn{2}{A}{T} + i \frac{\alpha}{2} A + \gamma_0 |A|^2 A } \end{aligned}

The NLS equation’s name is due to its similarity to the Schrödinger equation of quantum physics, if you set $\alpha = 0$ and treat $\gamma_0 |A|^2$ as a potential. In fiber optics, the equation is usually rearranged to highlight that $Z$ (or $z$ ) is the propagation direction:

\begin{aligned} \pdv{A}{Z} = - i \frac{\beta_2}{2} \pdvn{2}{A}{T} - \frac{\alpha}{2} A + i \gamma_0 |A|^2 A \end{aligned}

Next, we want to reduce the equation to its dimensionless form. To do so, we make the following coordinate transformation, where $\tilde{A}$ , $\tilde{Z}$ and $\tilde{T}$ are unitless, and $A_c$ , $Z_c$ and $T_c$ are dimensioned scale parameters to be determined later:

\begin{aligned} \tilde{A}(\tilde{Z}, \tilde{T}) = \frac{A(Z, T)}{A_c} \qquad\qquad \tilde{Z} = \frac{Z}{Z_c} \qquad\qquad \tilde{T} = \frac{T}{T_c} \end{aligned}

We insert this into the NLS equation, after setting $\alpha = 0$ according to convention:

\begin{aligned} 0 = i \frac{A_c}{Z_c} \pdv{\tilde{A}}{\tilde{Z}} - \frac{\beta_2}{2} \frac{A_c}{T_c^2} \pdvn{2}{\tilde{A}}{\tilde{T}} + \gamma_0 A_c^3 \big|\tilde{A}\big|^2 \tilde{A} \end{aligned}

Multiplying by $Z_c / A_c$ to make all terms dimensionless leads us to:

\begin{aligned} 0 = i \pdv{\tilde{A}}{\tilde{Z}} - \frac{\beta_2 Z_c}{2 T_c^2} \pdvn{2}{\tilde{A}}{\tilde{T}} + \gamma_0 A_c^2 Z_c \big|\tilde{A}\big|^2 \tilde{A} \end{aligned}

The goal is to remove those constant factors. In other words, we demand:

\begin{aligned} \frac{\beta_2 Z_c}{2 T_c^2} = -1 \qquad\qquad \gamma_0 A_c^2 Z_c = r \end{aligned}

Where $r \equiv \pm 1$ , whose sign choice will be explained shortly. Note that we have two equations for three unknowns ( $A_c$ , $Z_c$ and $T_c$ ), so one of the parameters needs to fixed manually. For example, we could choose our “input power” $A_c \equiv \sqrt{1\:\mathrm{W}}$ , and then:

\begin{aligned} Z_c = - \frac{2 T_c^2}{\beta_2} \qquad T_c^2 = -\frac{r \beta_2}{2 \gamma_0 A_c^2} \qquad\implies\qquad Z_c = \frac{r}{\gamma_0 A_c^2} \qquad T_c = \sqrt{ -\frac{r \beta_2}{2 \gamma_0 A_c^2} } \end{aligned}

Because $T_c$ must be real, we should choose $r \equiv - \sgn(\gamma_0 \beta_2)$ . We thus arrive at:

\begin{aligned} \boxed{ 0 = i \pdv{\tilde{A}}{\tilde{Z}} + \pdvn{2}{\tilde{A}}{\tilde{T}} + r \big|\tilde{A}\big|^2 \tilde{A} } \end{aligned}

In fiber optics, $\gamma_0 > 0$ for all materials, meaning $r$ represents the dispersion regime, so $r = 1$ is called anomalous dispersion and $r = -1$ normal dispersion. In some other fields, where $\beta_2 < 0$ always, $r = 1$ is called a focusing nonlinearity and $r = -1$ a defocusing nonlinearity. The famous bright solitons only exist for $r = 1$ , so many authors only show that case.

References

G.P. Agrawal, Nonlinear fiber optics, 6th edition, Elsevier.
O. Bang, Nonlinear mathematical physics: lecture notes, 2020, unpublished.
J. Lægsgaard, Mode profile dispersion in the generalized nonlinear Schrödinger equation, 2007, Optica.