-
Memory Capacity Analysis of Time-delay Reservoir Computing Based on Silicon Microring Resonator Nonlinearities
Authors:
Bernard J. Giron Castro,
Christophe Peucheret,
Francesco Da Ros
Abstract:
Silicon microring resonators (MRRs) have shown strong potential in acting as the nonlinear nodes of photonic reservoir computing (RC) schemes. By using nonlinearities within a silicon MRR, such as the ones caused by free-carrier dispersion (FCD) and thermo-optic (TO) effects, it is possible to map the input data of the RC to a higher dimensional space. Furthermore, by adding an external waveguide…
▽ More
Silicon microring resonators (MRRs) have shown strong potential in acting as the nonlinear nodes of photonic reservoir computing (RC) schemes. By using nonlinearities within a silicon MRR, such as the ones caused by free-carrier dispersion (FCD) and thermo-optic (TO) effects, it is possible to map the input data of the RC to a higher dimensional space. Furthermore, by adding an external waveguide between the through and add ports of the MRR, it is possible to implement a time-delay RC (TDRC) with enhanced memory. The input from the through port is fed back into the add port of the ring with the delay applied by the external waveguide effectively adding memory. In a TDRC, the nodes are multiplexed in time, and their respective time evolutions are detected at the drop port. The performance of MRR-based TDRC is highly dependent on the amount of nonlinearity in the MRR. The nonlinear effects, in turn, are dependent on the physical properties of the MRR as they determine the lifetime of the effects. Another factor to take into account is the stability of the MRR response, as strong time-domain discontinuities at the drop port are known to emerge from FCD nonlinearities due to self-pulsing (high nonlinear behaviour). However, quantifying the right amount of nonlinearity that RC needs for a certain task in order to achieve optimum performance is challenging. Therefore, further analysis is required to fully understand the nonlinear dynamics of this TDRC setup. Here, we quantify the nonlinear and linear memory capacity of the previously described microring-based TDRC scheme, as a function of the time constants of the generated carriers and the thermal of the TO effects. We analyze the properties of the TDRC dynamics that generate the parameter space, in terms of input signal power and frequency detuning range, over which conventional RC tasks can be satisfactorily performed by the TDRC scheme.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Thermal Crosstalk Modelling and Compensation Methods for Programmable Photonic Integrated Circuits
Authors:
Isidora Teofilovic,
Ali Cem,
David Sanchez-Jacome,
Daniel Perez-Lopez,
Francesco Da Ros
Abstract:
Photonic integrated circuits play an important role in the field of optical computing, promising faster and more energy-efficient operations compared to their digital counterparts. This advantage stems from the inherent suitability of optical signals to carry out matrix multiplication. However, even deterministic phenomena such as thermal crosstalk make precise programming of photonic chips a chal…
▽ More
Photonic integrated circuits play an important role in the field of optical computing, promising faster and more energy-efficient operations compared to their digital counterparts. This advantage stems from the inherent suitability of optical signals to carry out matrix multiplication. However, even deterministic phenomena such as thermal crosstalk make precise programming of photonic chips a challenging task. Here, we train and experimentally evaluate three models incorporating varying degrees of physics intuition to predict the effect of thermal crosstalk in different locations of an integrated programmable photonic mesh. We quantify the effect of thermal crosstalk by the resonance wavelength shift in the power spectrum of a microring resonator implemented in the chip, achieving modelling errors <0.5 pm. We experimentally validate the models through compensation of the crosstalk-induced wavelength shift. Finally, we evaluate the generalization capabilities of one of the models by employing it to predict and compensate for the effect of thermal crosstalk for parts of the chip it was not trained on, revealing root-mean-square-errors of <2.0 pm.
△ Less
Submitted 19 March, 2024;
originally announced April 2024.
-
BICM-compatible Rate Adaptive Geometric Constellation Sha** Using Optimized Many-to-one Labeling
Authors:
Metodi Plamenov Yankov,
Smaranika Swain,
Ognjen Jovanovic,
Darko Zibar,
Francesco Da Ros
Abstract:
In this paper, a rate adaptive geometric constellation sha** (GCS) scheme which is fully backward-compatible with existing state of the art bit-interleaved coded modulation (BICM) systems is proposed and experimentally demonstrated. The system relies on optimization of the positions of the quadrature amplitude modulation (QAM) points on the I/Q plane for maximized achievable information rate, wh…
▽ More
In this paper, a rate adaptive geometric constellation sha** (GCS) scheme which is fully backward-compatible with existing state of the art bit-interleaved coded modulation (BICM) systems is proposed and experimentally demonstrated. The system relies on optimization of the positions of the quadrature amplitude modulation (QAM) points on the I/Q plane for maximized achievable information rate, while maintaining quantization and fiber nonlinear noise robustness. Furthermore, `dummy' bits are multiplexed with coded bits before map** to symbols. Rate adaptivity is achieved by tuning the ratio of coded and `dummy' bits, while maintaining a fixed forward error-correction block and a fixed modulation format size. The points' positions and their labeling are optimized using automatic differentiation. The proposed GCS scheme is compared to a time-sharing hybrid (TH) QAM modulation and the now mainstream probabilistic amplitude sha** (PAS) scheme. The TH without sha** is outperformed for all studied data rates in a simulated linear channel by up to 0.7 dB. In a linear channel, PAS is shown to outperform the proposed GCS scheme, while similar performances are reported for PAS and the proposed GCS in a simulated nonlinear fiber channel. The GCS scheme is experimentally demonstrated in a multi-span recirculating loop coherent optical fiber transmission system with a total distance of up to 3000 km. Near-continuous zero-error flexible throughput is reported as a function of the transmission distance. Up to 1-2 spans of increased reach gains are achieved at the same net data rate w.r.t. conventional QAM. At a given distance, up to 0.79 bits/2D symbol of gain w.r.t. conventional QAM is achieved. In the experiment, similar performance to PAS is demonstrated.
△ Less
Submitted 13 March, 2024; v1 submitted 10 November, 2023;
originally announced December 2023.
-
Wavelength-multiplexed Delayed Inputs for Memory Enhancement of Microring-based Reservoir Computing
Authors:
Bernard J. Giron Castro,
Christophe Peucheret,
Francesco Da Ros
Abstract:
We numerically demonstrate a silicon add-drop microring-based reservoir computing scheme that combines parallel delayed inputs and wavelength division multiplexing. The scheme solves memory-demanding tasks like time-series prediction with good performance without requiring external optical feedback.
We numerically demonstrate a silicon add-drop microring-based reservoir computing scheme that combines parallel delayed inputs and wavelength division multiplexing. The scheme solves memory-demanding tasks like time-series prediction with good performance without requiring external optical feedback.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Multi-Task Wavelength-Multiplexed Reservoir Computing Using a Silicon Microring Resonator
Authors:
Bernard J. Giron Castro,
Christophe Peucheret,
Darko Zibar,
Francesco Da Ros
Abstract:
Among the promising advantages of photonic computing over conventional computing architectures is the potential to increase computing efficiency through massive parallelism by using the many degrees of freedom provided by photonics. Here, we numerically demonstrate the simultaneous use of time and frequency (equivalently wavelength) multiplexing to solve three independent tasks at the same time on…
▽ More
Among the promising advantages of photonic computing over conventional computing architectures is the potential to increase computing efficiency through massive parallelism by using the many degrees of freedom provided by photonics. Here, we numerically demonstrate the simultaneous use of time and frequency (equivalently wavelength) multiplexing to solve three independent tasks at the same time on the same photonic circuit. In particular, we consider a microring-based time-delay reservoir computing (TDRC) scheme that simultaneously solves three tasks: Time-series prediction, classification, and wireless channel equalization. The scheme relies on time-division multiplexing to avoid the necessity of multiple physical nonlinear nodes, while the tasks are parallelized using wavelength division multiplexing (WDM). The input data modulated on each optical channel is mapped to a higher dimensional space by the nonlinear dynamics of the silicon microring cavity. The carrier wavelength and input power assigned to each optical channel have a high influence on the performance of its respective task. When all tasks operate under the same wavelength/power conditions, our results show that the computing nature of each task is the deciding factor of the level of performance achievable. However, it is possible to achieve good performance for all tasks simultaneously by optimizing the parameters of each optical channel. The variety of applications covered by the tasks shows the versatility of the proposed photonic TDRC scheme. Overall, this work provides insight into the potential of WDM-based schemes for improving the computing capabilities of reservoir computing schemes.
△ Less
Submitted 27 April, 2024; v1 submitted 25 October, 2023;
originally announced October 2023.
-
Effects of cavity nonlinearities and linear losses on silicon microring-based reservoir computing
Authors:
Bernard J. Giron Castro,
Christophe Peucheret,
Darko Zibar,
Francesco Da Ros
Abstract:
Microring resonators (MRRs) are promising devices for time-delay photonic reservoir computing, but the impact of the different physical effects taking place in the MRRs on the reservoir computing performance is yet to be fully understood. We numerically analyze the impact of linear losses as well as thermo-optic and free-carrier effects relaxation times on the prediction error of the time-series t…
▽ More
Microring resonators (MRRs) are promising devices for time-delay photonic reservoir computing, but the impact of the different physical effects taking place in the MRRs on the reservoir computing performance is yet to be fully understood. We numerically analyze the impact of linear losses as well as thermo-optic and free-carrier effects relaxation times on the prediction error of the time-series task NARMA-10. We demonstrate the existence of three regions, defined by the input power and the frequency detuning between the optical source and the microring resonance, that reveal the cavity transition from linear to nonlinear regimes. One of these regions offers very low error in time-series prediction under relatively low input power and number of nodes while the other regions either lack nonlinearity or become unstable. This study provides insight into the design of the MRR and the optimization of its physical properties for improving the prediction performance of time-delay reservoir computing.
△ Less
Submitted 22 December, 2023; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Optimization of Raman amplifiers: a comparison between black-, grey- and white-box modeling
Authors:
Metodi P. Yankov,
Mehran Soltani,
Andrea Carena,
Darko Zibar,
Francesco Da Ros
Abstract:
Designing and optimizing optical amplifiers to maximize system performance is becoming increasingly important as optical communication systems strive to increase throughput. Offline optimization of optical amplifiers relies on models ranging from white-box models deeply rooted in physics to black-box data-driven physics-agnostic models. Here, we compare the capabilities of white-, grey- and black-…
▽ More
Designing and optimizing optical amplifiers to maximize system performance is becoming increasingly important as optical communication systems strive to increase throughput. Offline optimization of optical amplifiers relies on models ranging from white-box models deeply rooted in physics to black-box data-driven physics-agnostic models. Here, we compare the capabilities of white-, grey- and black-box models to achieve a target frequency-distance amplification in a bidirectional Raman amplifier. We show that any of the studied methods can achieve down to 1 dB of frequency-distance flatness over the C-band in a 100-km span. Then, we discuss the models' applicability, advantages, and drawbacks based on the target application scenario, in particular in terms of optimization speed and access to training data.
△ Less
Submitted 11 September, 2023;
originally announced October 2023.
-
Differentiable Machine Learning-Based Modeling for Directly-Modulated Lasers
Authors:
Sergio Hernandez,
Ognjen Jovanovic,
Christophe Peucheret,
Francesco Da Ros,
Darko Zibar
Abstract:
End-to-end learning has become a popular method for joint transmitter and receiver optimization in optical communication systems. Such approach may require a differentiable channel model, thus hindering the optimization of links based on directly modulated lasers (DMLs). This is due to the DML behavior in the large-signal regime, for which no analytical solution is available. In this paper, this p…
▽ More
End-to-end learning has become a popular method for joint transmitter and receiver optimization in optical communication systems. Such approach may require a differentiable channel model, thus hindering the optimization of links based on directly modulated lasers (DMLs). This is due to the DML behavior in the large-signal regime, for which no analytical solution is available. In this paper, this problem is addressed by develo** and comparing differentiable machine learning-based surrogate models. The models are quantitatively assessed in terms of root mean square error and training/testing time. Once the models are trained, the surrogates are then tested in a numerical equalization setup, resembling a practical end-to-end scenario. Based on the numerical investigation conducted, the convolutional attention transformer is shown to outperform the other models considered.
△ Less
Submitted 4 January, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Addressing Data Scarcity in Optical Matrix Multiplier Modeling Using Transfer Learning
Authors:
Ali Cem,
Ognjen Jovanovic,
Siqi Yan,
Yunhong Ding,
Darko Zibar,
Francesco Da Ros
Abstract:
We present and experimentally evaluate using transfer learning to address experimental data scarcity when training neural network (NN) models for Mach-Zehnder interferometer mesh-based optical matrix multipliers. Our approach involves pre-training the model using synthetic data generated from a less accurate analytical model and fine-tuning with experimental data. Our investigation demonstrates th…
▽ More
We present and experimentally evaluate using transfer learning to address experimental data scarcity when training neural network (NN) models for Mach-Zehnder interferometer mesh-based optical matrix multipliers. Our approach involves pre-training the model using synthetic data generated from a less accurate analytical model and fine-tuning with experimental data. Our investigation demonstrates that this method yields significant reductions in modeling errors compared to using an analytical model, or a standalone NN model when training data is limited. Utilizing regularization techniques and ensemble averaging, we achieve < 1 dB root-mean-square error on the matrix weights implemented by a 3x3 photonic chip while using only 25% of the available data.
△ Less
Submitted 13 November, 2023; v1 submitted 10 August, 2023;
originally announced August 2023.
-
Impact of Free-carrier Nonlinearities on Silicon Microring-based Reservoir Computing
Authors:
Bernard J. Giron Castro,
Christophe Peucheret,
Darko Zibar,
Francesco Da Ros
Abstract:
We quantify the impact of thermo-optic and free-carrier effects on time-delay reservoir computing using a silicon microring resonator. We identify pump power and frequency detuning ranges with NMSE less than 0.05 for the NARMA-10 task depending on the time constants of the two considered effects.
We quantify the impact of thermo-optic and free-carrier effects on time-delay reservoir computing using a silicon microring resonator. We identify pump power and frequency detuning ranges with NMSE less than 0.05 for the NARMA-10 task depending on the time constants of the two considered effects.
△ Less
Submitted 13 July, 2023;
originally announced July 2023.
-
Data-Driven Modeling of Directly-Modulated Lasers
Authors:
Sergio Hernandez Fernandez,
Christophe Peucheret,
Ognjen Jovanovic,
Francesco Da Ros,
Darko Zibar
Abstract:
The end-to-end optimization of links based on directly-modulated lasers may require an analytically differentiable channel. We overcome this problem by develo** and comparing differentiable laser models based on machine learning techniques.
The end-to-end optimization of links based on directly-modulated lasers may require an analytically differentiable channel. We overcome this problem by develo** and comparing differentiable laser models based on machine learning techniques.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Reservoir Computing-based Multi-Symbol Equalization for PAM 4 Short-reach Transmission
Authors:
Yevhenii Osadchuk,
Ognjen Jovanovic,
Darko Zibar,
Francesco Da Ros
Abstract:
We propose spectrum-sliced reservoir computer-based (RC) multi-symbol equalization for 32-GBd PAM4 transmission. RC with 17 symbols at the output achieves an order of magnitude reduction in multiplications/symbol versus single output case while maintaining simple training.
We propose spectrum-sliced reservoir computer-based (RC) multi-symbol equalization for 32-GBd PAM4 transmission. RC with 17 symbols at the output achieves an order of magnitude reduction in multiplications/symbol versus single output case while maintaining simple training.
△ Less
Submitted 29 November, 2022;
originally announced December 2022.
-
Data-efficient Modeling of Optical Matrix Multipliers Using Transfer Learning
Authors:
Ali Cem,
Ognjen Jovanovic,
Siqi Yan,
Yunhong Ding,
Darko Zibar,
Francesco Da Ros
Abstract:
We demonstrate transfer learning-assisted neural network models for optical matrix multipliers with scarce measurement data. Our approach uses <10\% of experimental data needed for best performance and outperforms analytical models for a Mach-Zehnder interferometer mesh.
We demonstrate transfer learning-assisted neural network models for optical matrix multipliers with scarce measurement data. Our approach uses <10\% of experimental data needed for best performance and outperforms analytical models for a Mach-Zehnder interferometer mesh.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Geometric Constellation Sha** for Fiber-Optic Channels via End-to-End Learning
Authors:
Ognjen Jovanovic,
Francesco Da Ros,
Darko Zibar,
Metodi P. Yankov
Abstract:
End-to-end learning has become a popular method to optimize a constellation shape of a communication system. When the channel model is differentiable, end-to-end learning can be applied with conventional backpropagation algorithm for optimization of the shape. A variety of optimization algorithms have also been developed for end-to-end learning over a non-differentiable channel model. In this pape…
▽ More
End-to-end learning has become a popular method to optimize a constellation shape of a communication system. When the channel model is differentiable, end-to-end learning can be applied with conventional backpropagation algorithm for optimization of the shape. A variety of optimization algorithms have also been developed for end-to-end learning over a non-differentiable channel model. In this paper, we compare gradient-free optimization method based on the cubature Kalman filter, model-free optimization and backpropagation for end-to-end learning on a fiber-optic channel modeled by the split-step Fourier method. The results indicate that the gradient-free optimization algorithms provide a decent replacement to backpropagation in terms of performance at the expense of computational complexity. Furthermore, the quantization problem of finite bit resolution of the digital-to-analog and analog-to-digital converters is addressed and its impact on geometrically shaped constellations is analysed. Here, the results show that when optimizing a constellation with respect to mutual information, a minimum number of quantization levels is required to achieve sha** gain. For generalized mutual information, the gain is maintained throughout all of the considered quantization levels. Also, the results implied that the autoencoder can adapt the constellation size to the given channel conditions.
△ Less
Submitted 17 May, 2023; v1 submitted 8 November, 2022;
originally announced November 2022.
-
Data-driven Modeling of Mach-Zehnder Interferometer-based Optical Matrix Multipliers
Authors:
Ali Cem,
Siqi Yan,
Yunhong Ding,
Darko Zibar,
Francesco Da Ros
Abstract:
Photonic integrated circuits are facilitating the development of optical neural networks, which have the potential to be both faster and more energy efficient than their electronic counterparts since optical signals are especially well-suited for implementing matrix multiplications. However, accurate programming of photonic chips for optical matrix multiplication remains a difficult challenge. Her…
▽ More
Photonic integrated circuits are facilitating the development of optical neural networks, which have the potential to be both faster and more energy efficient than their electronic counterparts since optical signals are especially well-suited for implementing matrix multiplications. However, accurate programming of photonic chips for optical matrix multiplication remains a difficult challenge. Here, we describe both simple analytical models and data-driven models for offline training of optical matrix multipliers. We train and evaluate the models using experimental data obtained from a fabricated chip featuring a Mach-Zehnder interferometer mesh implementing 3-by-3 matrix multiplication. The neural network-based models outperform the simple physics-based models in terms of prediction error. Furthermore, the neural network models are also able to predict the spectral variations in the matrix weights for up to 100 frequency channels covering the C-band. The use of neural network models for programming the chip for optical matrix multiplication yields increased performance on multiple machine learning tasks.
△ Less
Submitted 6 March, 2023; v1 submitted 17 October, 2022;
originally announced October 2022.
-
Experimental validation of machine-learning based spectral-spatial power evolution sha** using Raman amplifiers
Authors:
Mehran Soltani,
Francesco Da Ros,
Andrea Carena,
Darko Zibar
Abstract:
We experimentally validate a real-time machine learning framework, capable of controlling the pump power values of Raman amplifiers to shape the signal power evolution in two-dimensions (2D): frequency and fiber distance. In our setup, power values of four first-order counter-propagating pumps are optimized to achieve the desired 2D power profile. The pump power optimization framework includes a c…
▽ More
We experimentally validate a real-time machine learning framework, capable of controlling the pump power values of Raman amplifiers to shape the signal power evolution in two-dimensions (2D): frequency and fiber distance. In our setup, power values of four first-order counter-propagating pumps are optimized to achieve the desired 2D power profile. The pump power optimization framework includes a convolutional neural network (CNN) followed by differential evolution (DE) technique, applied online to the amplifier setup to automatically achieve the target 2D power profiles. The results on achievable 2D profiles show that the framework is able to guarantee very low maximum absolute error (MAE) (<0.5 dB) between the obtained and the target 2D profiles. Moreover, the framework is tested in a multi-objective design scenario where the goal is to achieve the 2D profiles with flat gain levels at the end of the span, jointly with minimum spectral excursion over the entire fiber length. In this case, the experimental results assert that for 2D profiles with the target flat gain levels, the DE obtains less than 1 dB maximum gain deviation, when the setup is not physically limited in the pump power values. The simulation results also prove that with enough pump power available, better gain deviation (less than 0.6 dB) for higher target gain levels is achievable.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
Experimental Validation of Spectral-Spatial Power Evolution Design Using Raman Amplifiers
Authors:
Mehran Soltani,
Francesco Da Ros,
Andrea Carena,
Darko Zibar
Abstract:
We experimentally validate a machine learning-enabled Raman amplification framework, capable of jointly sha** the signal power evolution in two domains: frequency and fiber distance. The proposed experiment addresses the amplification in the whole C-band, by optimizing four first-order counter-propagating Raman pumps.
We experimentally validate a machine learning-enabled Raman amplification framework, capable of jointly sha** the signal power evolution in two domains: frequency and fiber distance. The proposed experiment addresses the amplification in the whole C-band, by optimizing four first-order counter-propagating Raman pumps.
△ Less
Submitted 16 May, 2022;
originally announced June 2022.
-
Flexible Raman Amplifier Optimization Based on Machine Learning-aided Physical Stimulated Raman Scattering Model
Authors:
Metodi Plamenov Yankov,
Francesco Da Ros,
Uiara Celine de Moura,
Andrea Carena,
Darko Zibar
Abstract:
The problem of Raman amplifier optimization is studied. A differentiable interpolation function is obtained for the Raman gain coefficient using machine learning (ML), which allows for the gradient descent optimization of forward-propagating Raman pumps. Both the frequency and power of an arbitrary number of pumps in a forward pum** configuration are then optimized for an arbitrary data channel…
▽ More
The problem of Raman amplifier optimization is studied. A differentiable interpolation function is obtained for the Raman gain coefficient using machine learning (ML), which allows for the gradient descent optimization of forward-propagating Raman pumps. Both the frequency and power of an arbitrary number of pumps in a forward pum** configuration are then optimized for an arbitrary data channel load and span length. The forward propagation model is combined with an experimentally-trained ML model of a backward-pum** Raman amplifier to jointly optimize the frequency and power of the forward amplifier's pumps and the powers of the backward amplifier's pumps. The joint forward and backward amplifier optimization is demonstrated for an unrepeatered transmission of 250 km. A gain flatness of $<$ 1~dB over 4 THz is achieved. The optimized amplifiers are validated using a numerical simulator.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Capacity and Achievable Rates of Fading Few-mode MIMO IM/DD Optical Fiber Channels
Authors:
Metodi P. Yankov,
Francesco Da Ros,
Søren Forchhammer,
Lars Gruner-Nielsen
Abstract:
The optical fiber multiple-input multiple-output (MIMO) channel with intensity modulation and direct detection (IM/DD) per spatial path is treated. The spatial dimensions represent the multiple modes employed for transmission and the cross-talk between them originates in the multiplexers and demultiplexers, which are polarization dependent and thus timevarying. The upper bounds from free-space IM/…
▽ More
The optical fiber multiple-input multiple-output (MIMO) channel with intensity modulation and direct detection (IM/DD) per spatial path is treated. The spatial dimensions represent the multiple modes employed for transmission and the cross-talk between them originates in the multiplexers and demultiplexers, which are polarization dependent and thus timevarying. The upper bounds from free-space IM/DD MIMO channels are adapted to the fiber case, and the constellation constrained capacity is constructively estimated using the Blahut-Arimoto algorithm. An autoencoder is then proposed to optimize a practical MIMO transmission in terms of pre-coder and detector assuming channel distribution knowledge at the transmitter. The pre-coders are shown to be robust to changes in the channel.
△ Less
Submitted 27 January, 2022;
originally announced January 2022.
-
Comparison of Models for Training Optical Matrix Multipliers in Neuromorphic PICs
Authors:
Ali Cem,
Siqi Yan,
Uiara Celine de Moura,
Yunhong Ding,
Darko Zibar,
Francesco Da Ros
Abstract:
We experimentally compare simple physics-based vs. data-driven neural-network-based models for offline training of programmable photonic chips using Mach-Zehnder interferometer meshes. The neural-network model outperforms physics-based models for a chip with thermal crosstalk, yielding increased testing accuracy.
We experimentally compare simple physics-based vs. data-driven neural-network-based models for offline training of programmable photonic chips using Mach-Zehnder interferometer meshes. The neural-network model outperforms physics-based models for a chip with thermal crosstalk, yielding increased testing accuracy.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
SNR optimization of multi-span fiber optic communication systems employing EDFAs with non-flat gain and noise figure
Authors:
Metodi Plamenov Yankov,
Pawel Marcin Kaminski,
Henrik Enggaard Hansen,
Francesco Da Ros
Abstract:
Throughput optimization of optical communication systems is a key challenge for current optical networks. The use of gain-flattening filters (GFFs) simplifies the problem at the cost of insertion loss, higher power consumption and potentially poorer performance. In this work, we propose a component wise model of a multi-span transmission system for signal-to-noise (SNR) optimization. A machine-lea…
▽ More
Throughput optimization of optical communication systems is a key challenge for current optical networks. The use of gain-flattening filters (GFFs) simplifies the problem at the cost of insertion loss, higher power consumption and potentially poorer performance. In this work, we propose a component wise model of a multi-span transmission system for signal-to-noise (SNR) optimization. A machine-learning based model is trained for the gain and noise figure spectral profile of a C-band amplifier without a GFF. The model is combined with the Gaussian noise model for nonlinearities in optical fibers including stimulated Raman scattering and the implementation penalty spectral profile measured in back-to-back in order to predict the SNR in each channel of a multi-span wavelength division multiplexed system. All basic components in the system model are differentiable and allow for the gradient descent-based optimization of a system of arbitrary configuration in terms of number of spans and length per span. When the input power profile is optimized for flat and maximized received SNR per channel, the minimum performance in an arbitrary 3-span experimental system is improved by up to 8 dB w.r.t. a system with flat input power profile. An SNR flatness down to 1.2 dB is simultaneously achieved. The model and optimization methods are used to optimize the performance of an example core network, and 0.2 dB of gain is shown w.r.t. solutions that do not take into account nonlinearities. The method is also shown to be beneficial for systems with ideal gain flattening, achieving up to 0.3 dB of gain w.r.t. a flat input power profile.
△ Less
Submitted 7 June, 2021;
originally announced June 2021.
-
A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization
Authors:
Filip de Roos,
Carl Jidling,
Adrian Wills,
Thomas Schön,
Philipp Hennig
Abstract:
Machine learning practitioners invest significant manual and computational resources in finding suitable learning rates for optimization algorithms. We provide a probabilistic motivation, in terms of Gaussian inference, for popular stochastic first-order methods. As an important special case, it recovers the Polyak step with a general metric. The inference allows us to relate the learning rate to…
▽ More
Machine learning practitioners invest significant manual and computational resources in finding suitable learning rates for optimization algorithms. We provide a probabilistic motivation, in terms of Gaussian inference, for popular stochastic first-order methods. As an important special case, it recovers the Polyak step with a general metric. The inference allows us to relate the learning rate to a dimensionless quantity that can be automatically adapted during training by a control algorithm. The resulting meta-algorithm is shown to adapt learning rates in a robust manner across a large range of initial values when applied to deep learning benchmark problems.
△ Less
Submitted 22 February, 2021;
originally announced February 2021.
-
High-Dimensional Gaussian Process Inference with Derivatives
Authors:
Filip de Roos,
Alexandra Gessner,
Philipp Hennig
Abstract:
Although it is widely known that Gaussian processes can be conditioned on observations of the gradient, this functionality is of limited use due to the prohibitive computational cost of $\mathcal{O}(N^3 D^3)$ in data points $N$ and dimension $D$. The dilemma of gradient observations is that a single one of them comes at the same cost as $D$ independent function evaluations, so the latter are often…
▽ More
Although it is widely known that Gaussian processes can be conditioned on observations of the gradient, this functionality is of limited use due to the prohibitive computational cost of $\mathcal{O}(N^3 D^3)$ in data points $N$ and dimension $D$. The dilemma of gradient observations is that a single one of them comes at the same cost as $D$ independent function evaluations, so the latter are often preferred. Careful scrutiny reveals, however, that derivative observations give rise to highly structured kernel Gram matrices for very general classes of kernels (inter alia, stationary kernels). We show that in the low-data regime $N<D$, the Gram matrix can be decomposed in a manner that reduces the cost of inference to $\mathcal{O}(N^2D + (N^2)^3)$ (i.e., linear in the number of dimensions) and, in special cases, to $\mathcal{O}(N^2D + N^3)$. This reduction in complexity opens up new use-cases for inference with gradients especially in the high-dimensional regime, where the information-to-cost ratio of gradient observations significantly increases. We demonstrate this potential in a variety of tasks relevant for machine learning, such as optimization and Hamiltonian Monte Carlo with predictive gradients.
△ Less
Submitted 15 February, 2021;
originally announced February 2021.
-
Experimental Demonstration of Optoelectronic Equalization for Short-reach Transmission with Reservoir Computing
Authors:
Stenio M. Ranzini,
Roman Dischler,
Francesco da Ros,
Henning Buelow,
Darko Zibar
Abstract:
A receiver with shared complexity between optical and digital domains is experimentally demonstrated. Reservoir computing is used to equalize up to 4 directly-detected optically filtered spectral slices of a 32 GBd OOK signal over up to 80 km of SMF.
A receiver with shared complexity between optical and digital domains is experimentally demonstrated. Reservoir computing is used to equalize up to 4 directly-detected optically filtered spectral slices of a 32 GBd OOK signal over up to 80 km of SMF.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Power Evolution Prediction and Optimization in a Multi-span System Based on Component-wise System Modeling
Authors:
Metodi P. Yankov,
Uiara Celine de Moura,
Francesco Da Ros
Abstract:
Cascades of a machine learning-based EDFA gain model trained on a single physical device and a fully differentiable stimulated Raman scattering fiber model are used to predict and optimize the power profile at the output of an experimental multi-span fully-loaded C-band optical communication system.
Cascades of a machine learning-based EDFA gain model trained on a single physical device and a fully differentiable stimulated Raman scattering fiber model are used to predict and optimize the power profile at the output of an experimental multi-span fully-loaded C-band optical communication system.
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
Machine learning-based EDFA Gain Model Generalizable to Multiple Physical Devices
Authors:
Francesco Da Ros,
Uiara Celine de Moura,
Metodi P. Yankov
Abstract:
We report a neural-network based erbium-doped fiber amplifier (EDFA) gain model built from experimental measurements. The model shows low gain-prediction error for both the same device used for training (MSE $\leq$ 0.04 dB$^2$) and different physical units of the same make (generalization MSE $\leq$ 0.06 dB$^2$).
We report a neural-network based erbium-doped fiber amplifier (EDFA) gain model built from experimental measurements. The model shows low gain-prediction error for both the same device used for training (MSE $\leq$ 0.04 dB$^2$) and different physical units of the same make (generalization MSE $\leq$ 0.06 dB$^2$).
△ Less
Submitted 11 September, 2020;
originally announced September 2020.
-
Mind the GAP: Security & Privacy Risks of Contact Tracing Apps
Authors:
Lars Baumgärtner,
Alexandra Dmitrienko,
Bernd Freisleben,
Alexander Gruler,
Jonas Höchst,
Joshua Kühlberg,
Mira Mezini,
Richard Mitev,
Markus Miettinen,
Anel Muhamedagic,
Thien Duc Nguyen,
Alvar Penning,
Dermot Frederik Pustelnik,
Filipp Roos,
Ahmad-Reza Sadeghi,
Michael Schwarz,
Christian Uhl
Abstract:
Google and Apple have jointly provided an API for exposure notification in order to implement decentralized contract tracing apps using Bluetooth Low Energy, the so-called "Google/Apple Proposal", which we abbreviate by "GAP". We demonstrate that in real-world scenarios the current GAP design is vulnerable to (i) profiling and possibly de-anonymizing infected persons, and (ii) relay-based wormhole…
▽ More
Google and Apple have jointly provided an API for exposure notification in order to implement decentralized contract tracing apps using Bluetooth Low Energy, the so-called "Google/Apple Proposal", which we abbreviate by "GAP". We demonstrate that in real-world scenarios the current GAP design is vulnerable to (i) profiling and possibly de-anonymizing infected persons, and (ii) relay-based wormhole attacks that basically can generate fake contacts with the potential of affecting the accuracy of an app-based contact tracing system. For both types of attack, we have built tools that can easily be used on mobile phones or Raspberry Pis (e.g., Bluetooth sniffers). The goal of our work is to perform a reality check towards possibly providing empirical real-world evidence for these two privacy and security risks. We hope that our findings provide valuable input for develo** secure and privacy-preserving digital contact tracing systems.
△ Less
Submitted 6 November, 2020; v1 submitted 10 June, 2020;
originally announced June 2020.
-
Probabilistically Shaped 4-PAM for Short-Reach IM/DD Links with a Peak Power Constraint
Authors:
Thomas Wiegart,
Francesco Da Ros,
Metodi Plamenov Yankov,
Fabian Steiner,
Simone Gaiarin,
Richard Wesel
Abstract:
Probabilistic sha** for intensity modulation and direct detection (IM/DD) links is discussed and a peak power constraint determined by the limited modulation extinction ratio (ER) of optical modulators is introduced. The input distribution of 4-ary unipolar pulse amplitude modulation (PAM) symbols is optimized for short-reach transmission links without optical amplification nor in-line dispersio…
▽ More
Probabilistic sha** for intensity modulation and direct detection (IM/DD) links is discussed and a peak power constraint determined by the limited modulation extinction ratio (ER) of optical modulators is introduced. The input distribution of 4-ary unipolar pulse amplitude modulation (PAM) symbols is optimized for short-reach transmission links without optical amplification nor in-line dispersion compensation. The resulting distribution is symmetric around its mean allowing to use probabilistic amplitude sha** (PAS) to generate symbols that are protected by forward error correction (FEC) and that have the optimal input distribution. The numerical analysis is confirmed experimentally for both an additive white Gaussian noise (AWGN) channel and a fiber channel, showing gains in transmission reach and transmission rate, as well as rate adaptability.
△ Less
Submitted 8 October, 2020; v1 submitted 25 May, 2020;
originally announced May 2020.
-
Active Probabilistic Inference on Matrices for Pre-Conditioning in Stochastic Optimization
Authors:
Filip de Roos,
Philipp Hennig
Abstract:
Pre-conditioning is a well-known concept that can significantly improve the convergence of optimization algorithms. For noise-free problems, where good pre-conditioners are not known a priori, iterative linear algebra methods offer one way to efficiently construct them. For the stochastic optimization problems that dominate contemporary machine learning, however, this approach is not readily avail…
▽ More
Pre-conditioning is a well-known concept that can significantly improve the convergence of optimization algorithms. For noise-free problems, where good pre-conditioners are not known a priori, iterative linear algebra methods offer one way to efficiently construct them. For the stochastic optimization problems that dominate contemporary machine learning, however, this approach is not readily available. We propose an iterative algorithm inspired by classic iterative linear solvers that uses a probabilistic model to actively infer a pre-conditioner in situations where Hessian-projections can only be constructed with strong Gaussian noise. The algorithm is empirically demonstrated to efficiently construct effective pre-conditioners for stochastic gradient descent and its variants. Experiments on problems of comparably low dimensionality show improved convergence. In very high-dimensional problems, such as those encountered in deep learning, the pre-conditioner effectively becomes an automatic learning-rate adaptation scheme, which we also empirically show to work well.
△ Less
Submitted 20 February, 2019;
originally announced February 2019.
-
Experimental Verification of Rate Flexibility and Probabilistic Sha** by 4D Signaling
Authors:
Fabian Steiner,
Francesco Da Ros,
Metodi Plamenov Yankov,
Georg Böcherer,
Patrick Schulte,
Søren Forchhammer,
Gerhard Kramer
Abstract:
The rate flexibility and probabilistic sha** gain of $4$-dimensional signaling is experimentally tested for short-reach, unrepeated transmission. A rate granularity of 0.5 bits/QAM symbol is achieved with a distribution matcher based on a simple look-up table.
The rate flexibility and probabilistic sha** gain of $4$-dimensional signaling is experimentally tested for short-reach, unrepeated transmission. A rate granularity of 0.5 bits/QAM symbol is achieved with a distribution matcher based on a simple look-up table.
△ Less
Submitted 18 March, 2018;
originally announced March 2018.
-
Dual polarization nonlinear Fourier transform-based optical communication system
Authors:
Simone Gaiarin,
Auro Michele Perego,
Edson Porto da Silva,
Francesco Da Ros,
Darko Zibar
Abstract:
New services and applications are causing an exponential increase in internet traffic. In a few years, current fiber optic communication system infrastructure will not be able to meet this demand because fiber nonlinearity dramatically limits the information transmission rate. Eigenvalue communication could potentially overcome these limitations. It relies on a mathematical technique called "nonli…
▽ More
New services and applications are causing an exponential increase in internet traffic. In a few years, current fiber optic communication system infrastructure will not be able to meet this demand because fiber nonlinearity dramatically limits the information transmission rate. Eigenvalue communication could potentially overcome these limitations. It relies on a mathematical technique called "nonlinear Fourier transform (NFT)" to exploit the "hidden" linearity of the nonlinear Schrödinger equation as the master model for signal propagation in an optical fiber. We present here the theoretical tools describing the NFT for the Manakov system and report on experimental transmission results for dual polarization in fiber optic eigenvalue communications. A transmission of up to 373.5 km with bit error rate less than the hard-decision forward error correction threshold has been achieved. Our results demonstrate that dual-polarization NFT can work in practice and enable an increased spectral efficiency in NFT-based communication systems, which are currently based on single polarization channels.
△ Less
Submitted 5 March, 2018; v1 submitted 27 February, 2018;
originally announced February 2018.
-
Experimental Demonstration of Dual Polarization Nonlinear Frequency Division Multiplexed Optical Transmission System
Authors:
Simone Gaiarin,
Auro Michele Perego,
Edson Porto da Silva,
Francesco Da Ros,
Darko Zibar
Abstract:
Multi-eigenvalues transmission with information encoded simultaneously in both orthogonal polarizations is experimentally demonstrated. Performance below the HD-FEC limit is demonstrated for 8-bits/symbol 1-GBd signals after transmission up to 207 km of SSMF.
Multi-eigenvalues transmission with information encoded simultaneously in both orthogonal polarizations is experimentally demonstrated. Performance below the HD-FEC limit is demonstrated for 8-bits/symbol 1-GBd signals after transmission up to 207 km of SSMF.
△ Less
Submitted 1 August, 2017;
originally announced August 2017.
-
Krylov Subspace Recycling for Fast Iterative Least-Squares in Machine Learning
Authors:
Filip de Roos,
Philipp Hennig
Abstract:
Solving symmetric positive definite linear problems is a fundamental computational task in machine learning. The exact solution, famously, is cubicly expensive in the size of the matrix. To alleviate this problem, several linear-time approximations, such as spectral and inducing-point methods, have been suggested and are now in wide use. These are low-rank approximations that choose the low-rank s…
▽ More
Solving symmetric positive definite linear problems is a fundamental computational task in machine learning. The exact solution, famously, is cubicly expensive in the size of the matrix. To alleviate this problem, several linear-time approximations, such as spectral and inducing-point methods, have been suggested and are now in wide use. These are low-rank approximations that choose the low-rank space a priori and do not refine it over time. While this allows linear cost in the data-set size, it also causes a finite, uncorrected approximation error. Authors from numerical linear algebra have explored ways to iteratively refine such low-rank approximations, at a cost of a small number of matrix-vector multiplications. This idea is particularly interesting in the many situations in machine learning where one has to solve a sequence of related symmetric positive definite linear problems. From the machine learning perspective, such deflation methods can be interpreted as transfer learning of a low-rank approximation across a time-series of numerical tasks. We study the use of such methods for our field. Our empirical results show that, on regression and classification problems of intermediate size, this approach can interpolate between low computational cost and numerical precision.
△ Less
Submitted 1 June, 2017;
originally announced June 2017.
-
Experimental Comparison of Probabilistic Sha** Methods for Unrepeated Fiber Transmission
Authors:
Julian Renner,
Tobias Fehenberger,
Metodi P. Yankov,
Francesco Da Ros,
Søren Forchhammer,
Georg Böcherer,
Norbert Hanik
Abstract:
This paper studies the impact of probabilistic sha** on effective signal-to-noise ratios (SNRs) and achievable information rates (AIRs) in a back-to-back configuration and in unrepeated nonlinear fiber transmissions. For back-to-back, various shaped quadrature amplitude modulation (QAM) distributions are found to have the same implementation penalty as uniform input. By demonstrating in transmis…
▽ More
This paper studies the impact of probabilistic sha** on effective signal-to-noise ratios (SNRs) and achievable information rates (AIRs) in a back-to-back configuration and in unrepeated nonlinear fiber transmissions. For back-to-back, various shaped quadrature amplitude modulation (QAM) distributions are found to have the same implementation penalty as uniform input. By demonstrating in transmission experiments that shaped QAM input leads to lower effective SNR than uniform input at a fixed average launch power, we experimentally confirm that sha** enhances the fiber nonlinearities. However, sha** is ultimately found to increase the AIR, which is the most relevant figure of merit as it is directly related to spectral efficiency. In a detailed study of these sha** gains for the nonlinear fiber channel, four strategies for optimizing QAM input distributions are evaluated and experimentally compared in wavelength division multiplexing (WDM) systems. The first sha** scheme generates a Maxwell-Boltzmann (MB) distribution based on a linear additive white Gaussian noise channel. The second strategy uses the Blahut-Arimoto algorithm to optimize an unconstrained QAM distribution for a split-step Fourier method based channel model. In the third and fourth approach, MB-shaped QAM and unconstrained QAM are optimized via the enhanced Gaussian noise (EGN) model. Although the absolute sha** gains are found to be relatively small, the relative improvements by EGN-optimized unconstrained distributions over linear AWGN optimized MB distributions are up to 59%. This general behavior is observed in 9-channel and fully loaded WDM experiments.
△ Less
Submitted 8 November, 2017; v1 submitted 3 May, 2017;
originally announced May 2017.
-
Constellation Sha** for WDM systems using 256QAM/1024QAM with Probabilistic Optimization
Authors:
Metodi P. Yankov,
Francesco Da Ros,
Edson P. da Silva,
Søren Forchhammer,
Knud J. Larsen,
Leif K. Oxenløwe,
Michael Galili,
Darko Zibar
Abstract:
In this paper, probabilistic sha** is numerically and experimentally investigated for increasing the transmission reach of wavelength division multiplexed (WDM) optical communication system employing quadrature amplitude modulation (QAM). An optimized probability mass function (PMF) of the QAM symbols is first found from a modified Blahut-Arimoto algorithm for the optical channel. A turbo coded…
▽ More
In this paper, probabilistic sha** is numerically and experimentally investigated for increasing the transmission reach of wavelength division multiplexed (WDM) optical communication system employing quadrature amplitude modulation (QAM). An optimized probability mass function (PMF) of the QAM symbols is first found from a modified Blahut-Arimoto algorithm for the optical channel. A turbo coded bit interleaved coded modulation system is then applied, which relies on many-to-one labeling to achieve the desired PMF, thereby achieving sha** gain. Pilot symbols at rate at most 2% are used for synchronization and equalization, making it possible to receive input constellations as large as 1024QAM. The system is evaluated experimentally on a 10 GBaud, 5 channels WDM setup. The maximum system reach is increased w.r.t. standard 1024QAM by 20% at input data rate of 4.65 bits/symbol and up to 75% at 5.46 bits/symbol. It is shown that rate adaptation does not require changing of the modulation format. The performance of the proposed 1024QAM shaped system is validated on all 5 channels of the WDM signal for selected distances and rates. Finally, it was shown via EXIT charts and BER analysis that iterative demap**, while generally beneficial to the system, is not a requirement for achieving the sha** gain.
△ Less
Submitted 11 September, 2016; v1 submitted 23 March, 2016;
originally announced March 2016.