Enhanced bi-LSTM for Modeling Nonlinear Amplification Dynamics of Ultra-Short Optical Pulses

January 2024
Photonics

DOI:10.3390/photonics11020126

License
CC BY 4.0

Authors:

Anastasia Bednyakova

Novosibirsk State University

Karina Saraeva

Novosibirsk State University

Citation: Saraeva, K.; Bednyakova, A. Enhanced bi-LSTM for Modeling Nonlinear Amplification Dynamics of Ultra-Short Optical Pulses. Photonics 2024, 11, 126. https://doi.org/10.3390/photonics11020126

Received: 18 December 2023

Revised: 9 January 2024

Accepted: 23 January 2024

Published: 29 January 2024

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

photonics

Article

Enhanced bi-LSTM for Modeling Nonlinear Amplification Dynamics of Ultra-Short Optical Pulses

Karina Saraeva † and Anastasia Bednyakova *,†

Physics Department, Novosibirsk State University, Pirogova Str. 2, Novosibirsk 630090, Russia; k.saraeva@g.nsu.ru

* Correspondence: anastasia.bednyakova@gmail.com

† These authors contributed equally to this work.

Abstract: Fiber amplifiers are essential devices for optical communication and laser physics, yet the intricate nonlinear dynamics they exhibit pose significant challenges for numerical modeling. In this study, we propose using a bi-LSTM neural network to predict the evolution of optical pulses along a fiber amplifier, accounting for the dynamically changing gain profile and the Raman scattering. The neural network can learn information from both past and future data, adhering to the fundamental principles of physics governing pulse evolution over time. We conducted experiments with a diverse range of initial pulse parameters, covering the variation in the ratio between dispersion and nonlinear length, ranging from 0.25 to 250. This deliberate choice has resulted in a wide variety of propagation regimes, ranging from smooth attractor-like to noise-like behaviors. Through a comprehensive evaluation of the neural network performance, we demonstrated its ability to generalize across the various propagation regimes. Notably, our results showcase a relative speedup of 2000 times for evaluating the intensity evolution map using our proposed neural network compared to the NLSE numerical solution employing the split-step Fourier method.

Keywords: long short-term memory; recurrent neural network; Raman scattering; gain-guiding nonlinearity; fiber amplifier; numerical simulation

1. Introduction

A fiber amplifier is a crucial component of a laser system. The main challenges in active fibers are managing significant nonlinear phase accumulation without wave breaking and amplifying ultrashort pulses that are affected by strong gain shaping. Recently, there has been growing interest in a new regime for amplifying linearly chirped asymmetric pulses with gain-guiding nonlinearity (GGN), which was demonstrated by a group from Cornell University [ ]. It is worth noting that this regime governs the pulse evolution in the symmetric arms of the Mamyshev oscillator [ ], making it possible to achieve record-breaking peak power levels. The GGN regime is a nonlinear amplification process that occurs when high-power picosecond pulses propagate with a spectral width comparable to or exceeding the width of the amplification profile. This amplification process results in intricate nonlinear dynamics that lead to pulse asymmetry and the formation of a nonlinear attractor [ ]. The dynamically changing amplification profile plays a crucial role in shaping the nonlinear attractor. Therefore, numerical simulations must use a complex model that considers the evolution of amplification along the fiber and its wavelength dependence.

Conventional numerical modeling presents substantial challenges for practical applications, primarily due to the time-consuming computations required for each new set of system parameters. Real experimental conditions are often difficult to fully parameterize, which makes it necessary to adopt assumptions and simplify the physical model.

Given that a fiber amplifier is the most computationally demanding component of a laser system in numerical modeling tasks, applying such modeling in real-time experimental scenarios becomes challenging. One potential solution involves employing neural networks to predict the evolution of intensity profiles along the fiber [ – ]. Neural networks accelerate the modeling process by reducing the number of computational operations and overcoming the limitations associated with numerical simulations that rely on approximations and discretizations. Additionally, they possess the ability to generalize information, enabling the derivation of solutions from imperfect and noisy experimental data in cases in which a precise consideration of all factors influencing the experiment proves unfeasible.

In most studies employing deep learning techniques for fiber optics applications, modern architectures are rarely utilized. Instead, linear perceptrons are widely used; these do not account for temporal context and are suitable only for classification and for predicting the output pulse profile. Since the task of modeling pulse propagation through the fiber is entirely equivalent to forecasting a time series, a more effective solution can be achieved by employing recurrent neural networks [ ]. These networks, equipped with internal memory, efficiently leverage the preceding stages of pulse evolution to predict the subsequent steps. Importantly, when trained with a dataset generated through comprehensive numerical modeling, the PI-RNN becomes attuned to the physical principles governing pulse evolution in optical fibers. Other deep learning algorithms may not be able to incorporate such physics-informed features.

Our study presents the results of employing a physics-informed recurrent neural network (PI-RNN) for forecasting the nonlinear evolution of the spectral and temporal pulse intensity along an active optical fiber. We have chosen a range of initial pulse parameters that covers the variation in the ratio between the dispersion and nonlinear lengths from 0.25 to 250. This choice has led to a wide variety of propagation regimes, from smooth attractor-like modes to noise-like ones. Training the RNN within this range of parameters requires generalization across various propagation modes, which is a challenging task. We demonstrate that a single PI-RNN, trained on numerical simulation results, can accurately and rapidly reproduce the intricate dynamics of a nonlinear attractor within a fiber amplifier across a wide range of initial parameters. Building upon the findings presented in [ ], in which the focus was on substituting the nonlinear Schrödinger equation (NLSE)-based numerical modeling of a passive fiber with LSTM predictions, we developed an architecture capable of simulating the propagation through an active fiber with a more complicated physical model involved. The novelty of our study lies in the fact that, in contrast to the majority of existing works that employ deep learning methods to explore dynamics in passive fibers, we have trained the PI-RNN to predict nonlinear amplification, considering a dynamically changing gain profile and the Raman scattering. Most works applying RNNs to dynamic prediction typically present only a few evolution heatmaps, hindering an accurate assessment of real predictive ability. To address this limitation, we provide comprehensive error maps that illustrate predictive performance across the parameter domain. Additionally, we examine the capabilities and adaptability of the technique for constructing autoregressive predictions employing a cold-start initialization.

2. Numerical Model of the Amplifier

We consider pulse evolution in a typical, highly doped ytterbium fiber amplifier. A Gaussian pulse at 1028 nm is launched into an Yb-doped fiber amplifier with a 6 µm core diameter, which is co-pumped at 976 nm. As the pulse propagates, it accumulates a pronounced nonlinear phase, resulting in a significant broadening of the spectrum. When the spectrum is broadened to match the width of the gain spectrum, suitable parameters of the input pulse facilitate an evolution towards a nonlinear attractor in the GGN amplification regime. Apart from this, there exists a wide diversity of different pulse propagation regimes, often accompanied by the formation of noisy Raman pulses. Modeling the highly nonlinear propagation of ultrashort pulses inside the amplifier therefore requires a complex numerical model that considers a dynamically changing amplification profile and Raman scattering.

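The numerical model employed in simulations comprises a system of coupled equations governing pulsed signal generation and continuous-wave pump [10–13]:

$$\frac{\partial A_s(z,t)}{\partial z} = -i\beta_2\frac{\partial^2 A_s(z,t)}{\partial t^2} + \int_{-\infty}^{\infty}\frac{g_s(\omega,z)}{2}\,\tilde{A}_s(z,\omega)\exp(-i\omega t)\,d\omega + i\gamma\left(1+\frac{i}{\omega_0}\frac{\partial}{\partial t}\right)\left(A_s(z,t)\int_{-\infty}^{\infty}R(t')\,|A_s(z,t-t')|^2\,dt'\right) \tag{1}$$

$$\frac{\partial P_p(z)}{\partial z} = g_p(z)\,P_p(z), \tag{2}$$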

where $A_s(z,t)$ is the slowly varying envelope associated with the signal, $P_p(z)$ is the average power of the continuous-wave pump, $\beta_2$ is the group velocity dispersion, $\gamma$ is the Kerr nonlinearity, and $g_s$ and $g_p$ are the signal and pump gain/loss coefficients, respectively. The response function $R(t) = (1 - f_R)\delta(t) + f_R h_R(t)$ includes both instantaneous electronic and delayed Raman contributions [ ]. We used the Hollenbeck vibrational model [ ] to describe the Raman response function $h_R(t)$. The spectral window considered in the model extended from 865 to 1260 nm with the central wavelength at 1028 nm. The temporal window was equal to 150 ps.

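The wavelength dependence of the gain is considered in the frequency domain, where the optical field $\tilde{A}_s(z,\omega)$ is multiplied by the gain profile $g_s(\omega,z)$. Each spectral component of the gain $g_s(\lambda_i,z)$ ($i = 1, \ldots, N_\omega$, where $N_\omega$ is the number of discrete frequencies in the simulations) and the pump gain/loss coefficients at each step along the fiber were found based on the rate equations in the stationary case $dN_2/dt = 0$:

$$g_s(\lambda_i,z) = \sigma^s_{21}(\lambda_i)\rho_s(\lambda_i)N_2(z) - \sigma^s_{12}(\lambda_i)\rho_s(\lambda_i)N_1(z), \quad i = 1, \ldots, N_\omega \tag{3}$$

$$g_p(z) = \sigma^p_{21}\rho_p N_2(z) - \sigma^p_{12}\rho_p N_1(z), \tag{4}$$

$$\frac{dN_2(z)}{dt} = \left[\frac{\sigma^p_{12}\rho_p P_p(z)}{h\nu_p} + \sum_{k=1}^{N_\omega}\frac{\sigma^s_{12}(\lambda_k)\rho_s(\lambda_k)P_s(\lambda_k,z)}{h\nu_k}\right]N_1(z) - \left[\frac{\sigma^p_{21}\rho_p P_p(z)}{h\nu_p} + \sum_{k=1}^{N_\omega}\frac{\sigma^s_{21}(\lambda_k)\rho_s(\lambda_k)P_s(\lambda_k,z)}{h\nu_k} + \frac{1}{T}\right]N_2(z), \qquad N_1(z) = N - N_2(z), \tag{5}$$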

Here, $N_{1,2}$ are the population densities of the ground and excited energy levels, respectively, $N = 4.8 \cdot 10^{15}$ m$^{-1}$ is the total number of Yb ions integrated over the fiber mode cross-section, $P_s(\omega_k,z) = |\tilde{A}_s(z,\omega_k)|^2$ is the signal power at the frequency $\omega_k$ and position $z$ along the fiber, and $T = 850$ µs is the fluorescence lifetime. The effective pump absorption and emission cross sections at the pump wavelength of 976 nm are $\sigma^p_{12} = 2.5 \cdot 10^{-25}$ and $\sigma^p_{21} = 2.44 \cdot 10^{-27}$. The absorption and emission cross-section spectra in the considered spectral window are described by $\sigma^s_{12}(\lambda_i)$ and $\sigma^s_{21}(\lambda_i)$. The normalized pump and signal power distributions through the fiber cross-section are denoted $\rho_{p,s} = \Gamma_{p,s}/\pi a^2$, where $a = 3$ µm is the core radius of a single-mode fiber and $\Gamma_p$ ($\Gamma_s$) corresponds to the modal overlap factor between the pump (signal) mode and the ion distribution. $\Gamma_p = 1$ for core pumping, and $\Gamma_s = 1 - \exp(-2a^2/w^2)$, where $w$ is the $1/e$ electric field radius of the equivalent Gaussian spot.
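The stationary condition $dN_2/dt = 0$ with $N_1 = N - N_2$ can be solved for $N_2$ in closed form, which then gives the gain coefficients of Equations (3) and (4). The following is a minimal sketch of that step, not the authors' implementation. It assumes cross sections in m², a core radius of 3 µm, and purely hypothetical values for the pump power, the signal power, the signal cross sections at 1028 nm, and the signal overlap factor; only `N_TOTAL`, `LIFETIME`, and the pump cross sections are taken from the text.

```python
import math

H = 6.62607015e-34   # Planck constant, J*s
C = 299_792_458.0    # speed of light, m/s


def steady_state_n2(n_total, lifetime, pump, signals):
    """Solve dN2/dt = 0 for N2, using N1 = N - N2.

    `pump` and every entry of `signals` are tuples
    (sigma12, sigma21, rho, power, wavelength).
    """
    up = 0.0                 # total absorption rate (multiplies N1), 1/s
    down = 1.0 / lifetime    # emission rate incl. spontaneous decay (multiplies N2), 1/s
    for s12, s21, rho, power, lam in [pump, *signals]:
        photon_flux = rho * power / (H * C / lam)   # photons per m^2 per s
        up += s12 * photon_flux
        down += s21 * photon_flux
    # up * (N - N2) - down * N2 = 0  =>  N2 = N * up / (up + down)
    return n_total * up / (up + down)


N_TOTAL = 4.8e15             # Yb ions per metre (from the text)
LIFETIME = 850e-6            # fluorescence lifetime, s (from the text)
CORE_RADIUS = 3e-6           # m, assumed
rho_p = 1.0 / (math.pi * CORE_RADIUS ** 2)   # Gamma_p = 1 for core pumping
rho_s = 0.8 / (math.pi * CORE_RADIUS ** 2)   # Gamma_s = 0.8 is hypothetical

pump = (2.5e-25, 2.44e-27, rho_p, 1.0, 976e-9)       # 1 W pump is hypothetical
signal = (5e-27, 6e-25, rho_s, 0.1, 1028e-9)         # hypothetical signal line

n2 = steady_state_n2(N_TOTAL, LIFETIME, pump, [signal])
n1 = N_TOTAL - n2
g_p = pump[1] * rho_p * n2 - pump[0] * rho_p * n1    # Equation (4), 1/m
print(f"N2/N = {n2 / N_TOTAL:.3f}, g_p = {g_p:.2f} 1/m")
```

In a full simulation the `signals` list would hold one entry per spectral component, and the calculation would be repeated at every step along the fiber.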

We used the open-source Pyofss library [ ] for numerical modeling; it has a newly added module that enables parallel computing with a Raman influence [ ]. We also added our own modules for parallel amplification computing based on the Yb-coupled equations described above.

3. The Architecture of a Recurrent Neural Network

The neural network should not only predict the dynamics in a particular propagation regime but should also identify which regime is to be predicted. Predicting the spectral intensity is complicated by the substantial broadening during propagation and its strong modulation. In terms of the temporal intensity prediction, the primary challenge lies in accurately forecasting the gain variation throughout the evolution. To our knowledge, there have been no previous attempts in fiber optics to simulate a fiber amplifier with a neural network using a physically accurate gain model.

Similarly to the study [ ], we utilized a single-layer LSTM neural network architecture as a baseline. Since the baseline architecture proved to be insufficient for the dataset used, we improved it by incorporating stacked LSTM layers and a bidirectional cell structure. All the outputs from the LSTM cells, instead of just the last cell output, were then fed through several dense layers to refine the results further. We implemented our neural network using the PyTorch Python library [18].

The recurrent neural network's architecture is depicted in Figure 1.

Figure 1. Scheme of the recurrent neural network used, consisting of a recurrent (biLSTM) part and a fully connected part.

Here, we employ two neural networks with similar structures to predict the spectral and temporal intensity evolution independently.
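As a rough illustration of the structure described in this section, the sketch below stacks bidirectional LSTM layers and passes the outputs of all time steps, rather than only the last one, through dense layers. It is not the authors' network: the class name, hidden size, number of layers, dense-layer width, and activation are hypothetical, while the 10 input profiles and 500 points per profile follow the values quoted elsewhere in the paper.

```python
import torch
from torch import nn


class BiLSTMForecaster(nn.Module):
    """Stacked bi-LSTM followed by a fully connected head (illustrative sizes)."""

    def __init__(self, n_points=500, n_inputs=10, hidden=64, layers=2):
        super().__init__()
        self.lstm = nn.LSTM(
            input_size=n_points,
            hidden_size=hidden,
            num_layers=layers,      # stacked LSTM layers
            batch_first=True,
            bidirectional=True,     # forward and backward passes over the window
        )
        self.head = nn.Sequential(
            nn.Flatten(),           # keep the outputs of all LSTM cells
            nn.Linear(n_inputs * 2 * hidden, 256),
            nn.ReLU(),
            nn.Linear(256, n_points),
        )

    def forward(self, x):
        # x: (batch, n_inputs, n_points)
        out, _ = self.lstm(x)       # out: (batch, n_inputs, 2 * hidden)
        return self.head(out)       # (batch, n_points): the next profile


model = BiLSTMForecaster()
y = model(torch.zeros(4, 10, 500))
print(y.shape)
```

Two such models, one for the temporal and one for the spectral intensity, would be trained independently.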

4. Data Preparation and Training Process

The RNN was trained with synthetic data generated using the model described in Section 2. The chosen range of initial pulse parameters spans the variation in the ratio between the dispersion and nonlinear lengths from 0.25 to 250. The initial peak power varies uniformly from 100 W to 1000 W, while the pulse width varies logarithmically from 0.1 to 10 ps. The explored parameter space encompasses a wide diversity of pulse propagation regimes, ranging from GGN amplification to pulse amplification accompanied by the formation of noisy Raman pulses in high-intensity cases [ ]. We selected a length of 7 m for the optical fiber, a choice deemed sufficient for stabilizing a nonlinear attractor, as suggested by Sidorenko et al. [ ]. The training dataset comprises 879 examples of pulse evolution within the specified range of initial parameters. The RNN is trained to forecast the evolution profile at fiber intervals of 46 mm, requiring 150 steps to predict the evolution along the entire length of the fiber. No preprocessing was applied to the data, except for reducing the resolution using linear interpolation along the temporal (spectral) coordinate, from 16,384 to 500 points. The choice of this dimensionality reduction was based on the resolution needed to display fine spectral modulation dynamics and the computational resources available for training the neural network, and it can be varied for different problems.
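The resolution reduction by linear interpolation can be sketched as follows. This is a minimal pure-Python illustration under the assumption that the coarse grid is evenly spaced and shares its end points with the original grid; the function name `resample_linear` is hypothetical, and the paper does not specify how the grid is aligned.

```python
def resample_linear(profile, n_out):
    """Linearly interpolate `profile` onto `n_out` evenly spaced points.

    The first and last output points coincide with the first and last
    input points (an assumption about grid alignment).
    """
    n_in = len(profile)
    if n_in < 2 or n_out < 2:
        raise ValueError("need at least two input and two output points")
    scale = (n_in - 1) / (n_out - 1)
    out = []
    for j in range(n_out):
        pos = j * scale                  # fractional index on the fine grid
        i = min(int(pos), n_in - 2)      # left neighbour, clamped at the edge
        frac = pos - i
        out.append((1.0 - frac) * profile[i] + frac * profile[i + 1])
    return out


fine = [float(i) for i in range(16384)]   # stand-in for one intensity profile
coarse = resample_linear(fine, 500)
print(len(coarse))
```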

The dataset utilized for testing the model consisted of displaced grid points within the same initial pulse parameter interval. To ensure the construction of a reliable model capable of accurately predicting the evolution of different propagation regimes on a uniform grid, it is essential to employ test and training datasets of equal size and distribution. We therefore maintain a 1:1 test-to-train ratio to assess the prediction performance of the neural network across the chosen parameter range.

The data for training and testing the neural network are prepared using the sliding window method illustrated in Figure 2a. This technique divides each evolution into smaller segments. The first ten intensity profiles are interpreted as the neural network input, with the subsequent one serving as its target output. The window then slides by one point over the fiber length, and the process repeats until the end of the training evolution. After creating these 10 + 1 samples, it is essential to shuffle them across all the training evolutions to ensure a stable learning process. The prepared data are then fed into the neural network during the training process.
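A minimal sketch of the sliding window preparation is given below. The function names are hypothetical, and an evolution is represented simply as a list of profiles ordered along the fiber. With 150 profiles and a window of 10 inputs, each evolution yields 140 input/target pairs.

```python
import random


def sliding_windows(evolution, n_inputs=10):
    """Split one evolution (profiles ordered along z) into (X, Y) pairs."""
    pairs = []
    for start in range(len(evolution) - n_inputs):
        x = evolution[start:start + n_inputs]   # ten consecutive profiles
        y = evolution[start + n_inputs]         # the profile that follows them
        pairs.append((x, y))
    return pairs


def build_training_set(evolutions, n_inputs=10, seed=0):
    """Collect the pairs from all evolutions and shuffle them together."""
    pairs = []
    for evolution in evolutions:
        pairs.extend(sliding_windows(evolution, n_inputs))
    random.Random(seed).shuffle(pairs)
    return pairs


# Toy evolution: 150 one-point "profiles" whose value equals the step index.
toy = [[float(k)] for k in range(150)]
print(len(sliding_windows(toy)))
```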

Figure 2. Training data preparation process. (a) Illustration of the sliding window approach for data preparation: synthetic data were subdivided into packs of 10 input (X) profiles and one output (Y). (b) Preparation of the cold-start data scheme.

To steer the optimization toward a good minimum of the loss function and ensure effective training, we employed several optimization and learning stabilization techniques. These included the Adam optimizer, hyperparameter tuning, and a learning rate scheduler. Model training was performed on a local server using an NVIDIA RTX 4090 graphics processing unit (GPU).

5. Results

We used an autoregressive approach to reconstruct the prediction. This method allows the neural network to forecast the data evolution for any number of steps forward by sequentially feeding its output back into its input.
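The autoregressive loop itself is independent of the network and can be sketched with any one-step predictor. In the example below, `rollout` and the toy predictor are hypothetical stand-ins; a trained network would take the place of `linear_extrapolation`.

```python
def rollout(predict_next, seed_profiles, n_steps):
    """Autoregressively extend `seed_profiles` by `n_steps` predicted profiles.

    `predict_next` maps a window of profiles to the next profile; each
    prediction is appended to the window and the oldest profile is dropped.
    """
    window = list(seed_profiles)
    evolution = list(seed_profiles)
    for _ in range(n_steps):
        nxt = predict_next(window)
        evolution.append(nxt)
        window = window[1:] + [nxt]   # feed the output back into the input
    return evolution


def linear_extrapolation(window):
    """Toy predictor: continue the trend of the last two profiles."""
    last, prev = window[-1], window[-2]
    return [2.0 * a - b for a, b in zip(last, prev)]


seed = [[float(k)] for k in range(10)]           # 10 known profiles
full = rollout(linear_extrapolation, seed, 140)  # 140 predicted steps
print(len(full), full[-1])
```

Because every prediction is built on earlier predictions, errors made early in the loop propagate to all later steps.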

5.1. Metrics Used for Tracking RNN Performance

Here, we outline the metrics used to assess the final predictive performance of the trained neural network. The network is trained to predict a single pulse profile by leveraging the evolution dynamics extracted from a sequence of profiles using the mean squared error (MSE) loss:

$$\mathrm{MSE}(I,\hat{I}) = \frac{1}{N}\sum_{i=1}^{N}(I_i - \hat{I}_i)^2, \tag{6}$$

where $I$ represents the temporal (spectral) intensity array at a fixed $z$ coordinate along the fiber, and $N$ denotes the size of the temporal (spectral) domain.

In autoregressive prediction, the model effectively handles newly self-generated data that were not part of the training sample. To evaluate the final result, a different metric was employed. The normalized root mean square error (NRMSE) metric, widely utilized in similar tasks, provides a robust estimate of the prediction error for individual intensity profiles:

$$\mathrm{NRMSE}(I,\hat{I}) = \sqrt{\frac{\sum_{i=1}^{N_{\mathrm{domain}}}(I_i - \hat{I}_i)^2}{\sum_{i=1}^{N_{\mathrm{domain}}}(\hat{I}_i)^2}} \tag{7}$$
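For a single intensity profile, Equations (6) and (7) translate directly into code. The sketch below uses plain Python lists and hypothetical function names; the NRMSE is normalized by the predicted profile, as written in Equation (7).

```python
import math


def mse(target, pred):
    """Mean squared error between two profiles, Equation (6)."""
    return sum((t - p) ** 2 for t, p in zip(target, pred)) / len(target)


def nrmse(target, pred):
    """Normalized root mean square error, Equation (7)."""
    num = sum((t - p) ** 2 for t, p in zip(target, pred))
    den = sum(p ** 2 for p in pred)   # normalization by the predicted profile
    return math.sqrt(num / den)


print(mse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0]))
print(nrmse([0.0, 0.0], [3.0, 4.0]))
```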

However, when applied to the entire evolution map, the NRMSE tends to overestimate errors in the case of low-energy pulses with narrow spectra and underestimate errors in high-energy propagation regimes. Additionally, interpreting the quality of the prediction is not straightforward. In an attempt to address these issues, we propose using another version of the normalized MSE, the peak signal-to-noise ratio (PSNR) metric [20]:

$$\mathrm{PSNR}(I_{\mathrm{map}},\hat{I}_{\mathrm{map}}) = 20\cdot\log_{10}\left(\frac{\max(\hat{I}_{\mathrm{map}})}{\sqrt{\mathrm{MSE}(I_{\mathrm{map}},\hat{I}_{\mathrm{map}})}}\right), \tag{8}$$

where $I_{\mathrm{map}}$ is a 2D array that represents the evolution of a temporal (spectral) intensity array along the spatial coordinate $z$.

The PSNR is frequently employed for assessing the quality of reconstructed images and offers several advantages over the NRMSE metric, including a decibel scale and normalization based on the maximum intensity.
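Equation (8) can be evaluated on a whole evolution map as in the sketch below, where a map is a list of profiles. The function name is hypothetical, and returning infinity for a perfect prediction is a convention chosen here, not something stated in the paper.

```python
import math


def psnr(target_map, pred_map):
    """Peak signal-to-noise ratio of a 2D evolution map in dB, Equation (8)."""
    flat_t = [v for row in target_map for v in row]
    flat_p = [v for row in pred_map for v in row]
    err = sum((t - p) ** 2 for t, p in zip(flat_t, flat_p)) / len(flat_t)
    if err == 0.0:
        return math.inf               # identical maps: no error to compare against
    peak = max(flat_p)                # maximum of the predicted map
    return 20.0 * math.log10(peak / math.sqrt(err))


print(round(psnr([[0.0, 10.0]], [[1.0, 10.0]]), 2))
```

Higher values indicate better agreement, and the decibel scale makes maps with very different peak intensities comparable.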

5.2. Forecasting of Temporal and Spectral Intensity Evolution

We assessed the prediction error of the PI-RNN model using Equation (7) and depicted the resulting temporal and spectral error maps in Figures 3a and 4a, respectively. The plots illustrate the smooth evolution of errors along the fiber for all initial pulse parameters, with no discernible discontinuities or outliers within the prediction maps.

Figure 3. Temporal error maps for the test dataset, showing a 140-step prediction using 10 inputs for initiation. (a) Illustration of the NRMSE evolution in predicting temporal intensity depending on the distance along the fiber; (b) dependency of the PSNR metric on various initial parameters; points labeled in red correspond to examples of propagation regimes shown in Figure 5.

Figures 3b and 4b, calculated using Equation (8), illustrate the comprehensive error maps across the plane of initial pulse parameters. These maps reveal the impact of the initial pulse parameters on the overall PSNR along the fiber. The error diagrams can be interpreted as follows: the most significant errors correspond to situations with numerous high-intensity modulation peaks or noise-like generation. Specifically, in the prediction of the temporal evolution, the highest accumulated error is associated with the Raman generation. When predicting the spectral evolution, the highest error is associated with regions exhibiting significant nonlinear phase accumulation before transitioning to a smooth attractor-like regime or regions characterized by the emergence of a noisy Raman pulse.

Figure 4. Spectral error maps for the test dataset, showing a 140-step prediction using 10 inputs for initiation. (a) Illustration of the NRMSE evolution in predicting spectral intensity depending on the distance along the fiber; (b) dependency of the PSNR metric on various initial parameters; points labeled in red correspond to examples of propagation regimes shown in Figure 6.

We conducted tests on various data preprocessing methods, including normalization and logarithmic transformation, which have demonstrated effectiveness for passive fibers, as reported in previous works [ ]. However, we observed that these methods diminished the information content of the spectral intensity evolution within the active fiber. Because the spectral intensity amplitude is an additional feature that helps the neural network make precise predictions and determine the evolution stage during autoregression, we opted to use the spectral intensity data in the original linear scale without any preprocessing.

To compare pulse propagation maps predicted by the PI-RNN with numerical modeling using the NLSE, we picked three distinctive regimes from the data. The three main types of behavior are a smooth GGN regime resulting in the formation of a nonlinear attractor, a transient regime, and a regime showing a pronounced influence of stimulated Raman scattering on pulse amplification. Figure 5 shows examples of the temporal evolutions, with their locations in the error map marked by red labeled points in Figure 3b. Figure 6 shows typical spectral intensity propagation regimes, with their locations in the error map in Figure 4b. The Raman scattering manifests itself as the generation of a noise-like pulse with energy comparable to the main pulse, downshifted by about 13.2 THz in frequency (Figure 6c) [21].

The PI-RNN demonstrates its capability to model the temporal and spectral evolution for any point within the training parameter area.

Autoregressive reconstruction of the evolution map along the 7-meter-long fiber, as presented in Figures 5 and 6, takes about 0.05 s with the PI-RNN. This is approximately 700 times faster than the fastest parallelized numerical NLSE model when using the NVIDIA RTX 4090 GPU and 2000 times faster than the conventional CPU-based numerical model. This notable difference is attributed to the reduction in the number of numerical operations required for each step along the fiber, a decrease in the total number of steps needed to obtain a solution, and the lower temporal (spectral) resolution required for the computations. The final PI-RNN model has an estimated tens of millions of trainable parameters.

Figure 5. Comparison between the temporal intensity evolution calculated using the NLSE and the prediction made with the PI-RNN. The prediction is built 140 steps ahead along the fiber using 10 consecutive pulses as input. (a) $P_0 = 432$ W, $T_0 = 0.16$ ps; (b) $P_0 = 810$ W, $T_0 = 2.19$ ps; (c) $P_0 = 460$ W, $T_0 = 8.9$ ps.

5.3. Resistance to Noise

The neural network, trained on undisturbed synthetic data, was found to be robust to external noise in the test dataset. The PI-RNN can capture the pulse propagation regime for noise levels corresponding to signal-to-noise ratios as low as 20 dB. Specifically, when given an initial pulse with added 'white' Gaussian noise, the neural network predicts the output pulse with noise suppression. This feature adds to the practical significance of this study, as the network can handle realistic experimental data that may contain noise or other imperfections.
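A noisy test input at a prescribed signal-to-noise ratio can be generated as in the sketch below. The paper does not state how the SNR was defined, so this example assumes the ratio of the mean squared signal to the noise variance, expressed in decibels; the function name and the Gaussian test pulse are hypothetical.

```python
import math
import random


def add_white_noise(profile, snr_db, rng):
    """Add zero-mean white Gaussian noise to `profile` at the given SNR (dB).

    SNR is taken as mean(signal^2) / noise variance, which is an assumption.
    """
    signal_power = sum(v * v for v in profile) / len(profile)
    noise_power = signal_power / 10.0 ** (snr_db / 10.0)
    sigma = math.sqrt(noise_power)
    return [v + rng.gauss(0.0, sigma) for v in profile]


rng = random.Random(0)
clean = [math.exp(-((i - 250) / 50.0) ** 2) for i in range(500)]  # toy pulse
noisy = add_white_noise(clean, 20.0, rng)

# Check the SNR actually realized by this noise sample.
sig = sum(v * v for v in clean) / len(clean)
err = sum((a - b) ** 2 for a, b in zip(noisy, clean)) / len(clean)
measured_snr_db = 10.0 * math.log10(sig / err)
print(round(measured_snr_db, 1))
```

Additive noise of this kind can drive an intensity profile slightly negative in the pulse wings; whether such values should be clipped depends on the measurement being imitated.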

5.4. Autoregression Problems

Overfitting poses a significant challenge in autoregressive prediction problems. In autoregressive predictions, the neural network establishes a feedback loop, using slightly perturbed input data influenced by its own prediction errors to make subsequent predictions. This feedback loop complicates the monitoring and prevention of overfitting, since the neural network is trained to predict only one step and is not explicitly trained for autoregressive prediction. Even though the model has high accuracy for one-step prediction on a validation dataset, it may face issues when making autoregressive predictions on a test dataset. A slightly undertrained model seems to be more robust to overfitting the numerical modeling data when predicting the evolution autoregressively.

Figure 6. Comparison between the spectral intensity evolution calculated using the NLSE and the prediction made with the PI-RNN. The prediction is built 140 steps ahead along the fiber using 10 consecutive pulses as input. (a) $P_0 = 432$ W, $T_0 = 0.16$ ps; (b) $P_0 = 810$ W, $T_0 = 2.19$ ps; (c) $P_0 = 460$ W, $T_0 = 8.9$ ps.

5.5. Cold Start Problems

Cold start is a method of evolution reconstruction that starts with a single initial pulse profile, which is then fed to all RNN inputs. This method facilitates the reconstruction of the autoregressive evolution map using only a single pulse profile, thereby simplifying its application for various tasks.

To improve the model's performance in cold start prediction scenarios, we used a specific approach to prepare the training data. The main concept involves incorporating "cold start data" into the training sample, as shown in Figure 2b. "Cold start data" are a synthetic type of data that mimic the model's task of reconstructing the evolution from one tiled initial profile. The basic idea is to replace the first evolution profiles, as many as the recurrent neural network has inputs, with the initial profile. This approach helps the model improve its predictions by learning from these artificial examples.
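The construction of such cold start data can be sketched as follows, assuming 10 network inputs as elsewhere in the paper. The function names are hypothetical, and the exact way the authors mix these samples into the training set is not specified in the text.

```python
def cold_start_evolution(evolution, n_inputs=10):
    """Replace the first `n_inputs` profiles with copies of the initial one."""
    first = evolution[0]
    tiled = [list(first) for _ in range(n_inputs)]
    return tiled + [list(p) for p in evolution[n_inputs:]]


def cold_start_window(initial_profile, n_inputs=10):
    """Network input for a cold start: the initial profile fed to all inputs."""
    return [list(initial_profile) for _ in range(n_inputs)]


# Toy evolution: 150 one-point "profiles" whose value equals the step index.
toy = [[float(k)] for k in range(150)]
cold = cold_start_evolution(toy)
print(cold[9], cold[10])
```

Applying the usual sliding window to `cold` yields a first training pair whose input is the tiled initial profile and whose target is the true profile that follows the replaced segment.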

The cold start predictions of the temporal and spectral evolutions reached an accuracy comparable to that of predictions based on the starting pulse sequence.

6. Conclusions

In this paper, we explored the potential of using a PI-RNN to predict the evolution of spectral and temporal pulse intensity along a fiber amplifier, a computationally challenging task in nonlinear optics. We introduced an updated PI-RNN architecture designed to learn the complex dynamics of the optical field from a large dataset generated via numerical simulations and conducted a thorough evaluation of its performance. We found that the PI-RNN can accurately estimate the evolution map while running 2000 times faster than the conventional split-step numerical solution of the NLSE. The speed advantage persists even when the numerical solution is parallelized on a GPU, in which case the PI-RNN is 660 times faster.

Furthermore, the PI-RNN performs interpolation and extrapolation of the field evolution along the fiber with reasonable accuracy. It exhibits adaptability to various grid sizes along the z-coordinate. Notably, a single neural network proved capable of capturing diverse propagation regimes, making it a universal tool for investigating dynamics within the chosen parameter space. The PI-RNN approach could be enhanced further by integrating experimental data into the training set. This could not only improve the time performance but also extend its descriptive capabilities beyond those of numerical modeling.

We have also detailed the challenges encountered during the PI-RNN training process for autoregressive prediction tasks. One significant drawback identified in reconstructing the entire evolution map using the PI-RNN is its cold start prediction performance, which depends heavily on the data sample provided. To address this issue, we introduced a novel approach to preparing training data, resulting in improved cold start execution. This modification leads to notable performance enhancements, particularly in predicting the temporal intensity map using a single input profile.

In conclusion, we assert that the PI-RNN has proven to be a promising technique for predicting the evolution of pulse intensity along an active fiber, demonstrating substantial advantages over traditional numerical methods.

Author Contributions: Conceptualization, A.B.; methodology, K.S. and A.B.; software, K.S.; validation, A.B. and K.S.; investigation, K.S. and A.B.; writing (original draft preparation), K.S. and A.B.; visualization, K.S. All authors have read and agreed to the published version of the manuscript.

Funding: This research was funded by the Ministry of Science and Higher Education of the Russian Federation (Project No. FSUS-2021-0015).

Institutional Review Board Statement: Not applicable.

Informed Consent Statement: Not applicable.

Data Availability Statement: The data presented in this study are available on request from the corresponding author.

Conflicts of Interest: The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PI-RNN    Physics-informed recurrent neural network
NLSE      Nonlinear Schrödinger equation
LSTM      Long short-term memory
MSE       Mean square error
PSNR      Peak signal-to-noise ratio
GPU       Graphics processing unit



Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual

author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to

people or property resulting from any ideas, methods, instructions or products referred to in the content.
