-
L3 Ensembles: Lifelong Learning Approach for Ensemble of Foundational Language Models
Authors:
Aidin Shiri,
Kaushik Roy,
Amit Sheth,
Manas Gaur
Abstract:
Fine-tuning pre-trained foundational language models (FLMs) for specific tasks is often impractical, especially for resource-constrained devices. This necessitates the development of a Lifelong Learning (L3) framework that continuously and efficiently adapts to a stream of Natural Language Processing (NLP) tasks. We propose an approach that focuses on extracting meaningful representations from unseen data, constructing a structured knowledge base, and improving task performance incrementally. We conducted experiments on various NLP tasks to validate its effectiveness, including benchmarks like GLUE and SuperGLUE. We measured good performance across the accuracy, training efficiency, and knowledge transfer metrics. Initial experimental results show that the proposed L3 ensemble method increases model accuracy by 4% to 36% compared to the fine-tuned FLM. Furthermore, the L3 model outperforms naive fine-tuning approaches while maintaining competitive or superior performance (up to a 15.4% increase in accuracy) compared to the state-of-the-art language model (T5) on the given task, the STS benchmark.
Submitted 11 November, 2023;
originally announced November 2023.
-
Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models
Authors:
Nancy Tyagi,
Aidin Shiri,
Surjodeep Sarkar,
Abhishek Kumar Umrawal,
Manas Gaur
Abstract:
Foundational Language Models (FLMs) have advanced natural language processing (NLP) research. Current researchers are developing larger FLMs (e.g., XLNet, T5) to enable contextualized language representation, classification, and generation. While developing larger FLMs has been of significant advantage, it is also a liability concerning hallucination and predictive uncertainty. Fundamentally, larger FLMs are built on the same foundations as smaller FLMs (e.g., BERT); hence, one must recognize the potential of smaller FLMs, which can be realized through an ensemble. In the current research, we perform a reality check on FLMs and their ensemble on benchmark and real-world datasets. We hypothesize that the ensembling of FLMs can influence the individualistic attention of FLMs and unravel the strength of coordination and cooperation of different FLMs. We utilize BERT and define three other ensemble techniques: {Shallow, Semi, and Deep}, wherein the Deep-Ensemble introduces a knowledge-guided reinforcement learning approach. We discovered that the suggested Deep-Ensemble BERT outperforms its large variant, i.e., BERTlarge, many times over on datasets that show the usefulness of NLP in sensitive fields, such as mental health.
Submitted 23 August, 2023;
originally announced August 2023.
-
Polarization coverage and self-healing characteristics of Poincaré-Bessel beam
Authors:
Subith Kumar,
Anupam Pal,
Arash Shiri,
G. K. Samanta,
Greg Gbur
Abstract:
As a vector version of scalar Bessel beams, Poincaré-Bessel beams (PBBs) have attracted a great deal of attention due to the presence of polarization singularities and their nondiffraction and self-healing properties. Previous studies of PBBs have been restricted primarily to understanding the disclination patterns in the spatially variable polarization, and many of the properties of PBBs remain unexplored. Here, we present a theoretical and experimental study of the polarization characteristics of PBBs, investigating a variety of their features. Using a mode transformation of a full Poincaré (FP) beam in a rectangular basis, ideally carrying 100% polarization coverage of polarization states represented on the surface of the Poincaré sphere, we observe the PBB as the superposition of an infinite number of FP beams, as each ring of the PBB has polarization coverage >75%. We also observe the resilience of a PBB's degree of polarization to perturbation. The polarization-ellipse orientation map of PBBs shows the presence of an infinite series of C-point singularity pairs. The number of such series pairs is decided by the number of C-point singularity pairs of the FP beam. The dynamics of C-point singularity pairs in the self-healing process show a non-trivial creation of new singularities and recombination of existing singularities. Such dynamics provide insight into "Hilbert Hotel" style evolution of singularities in light beams. The present study can be useful for imaging in the presence of depolarizing surroundings, studying turbulent atmospheric channels, and exploring the rich mathematical concepts of transfinite numbers.
Submitted 10 June, 2023;
originally announced June 2023.
-
Simple experimental realization of optical Hilbert Hotel using scalar and vector fractional vortex beams
Authors:
Subith Kumar,
Anirban Ghosh,
Chahat Kaushik,
Arash Shiri,
Greg Gbur,
Sudhir Sharma,
G. K. Samanta
Abstract:
Historically, infinity was long considered a vague concept - boundless, endless, larger than the largest - without any quantifiable mathematical foundation. This view changed in the 1800s through the pioneering work of Georg Cantor showing that infinite sets follow their own seemingly paradoxical mathematical rules. In 1924, David Hilbert highlighted the strangeness of infinity through a thought experiment now referred to as the Hilbert Hotel paradox, or simply Hilbert's Hotel. The paradox describes a "fully" occupied imaginary hotel with an infinite number of single-occupancy rooms, in which the manager can always find a room for a new guest by simply shifting each current guest to the next-highest room, leaving the first room vacant. The investigation of wavefield singularities has uncovered the existence of a direct optical analogy to Hilbert's thought experiment. Since then, efforts have been made to investigate the properties of Hilbert's Hotel by controlling the dynamics of phase singularities in "fractional"-order optical vortex beams. Here, we have taken such proposals to the next level and experimentally demonstrated Hilbert's Hotel using both phase and polarization singularities of optical fields. Using a multi-ramped spiral-phase-plate and a supercontinuum source, we generated and controlled fractional-order vortex beams for the practical implementation of Hilbert's Hotel in scalar and vector vortex beams. Using a multi-ramped spiral-phase-plate, we show the possibility for complicated transitions of the generalized Hilbert's Hotel. The generic experimental scheme illustrates the usefulness of structured beams in visualizing unusual mathematical concepts and also for fundamental and applied research driven by fractional vector beams.
Submitted 20 March, 2023;
originally announced March 2023.
-
Omni-resonant imaging across the visible
Authors:
Layton A. Hall,
Abbas Shiri,
Ayman F. Abouraddy
Abstract:
Resonant field enhancement in optical cavities is provided over only narrow linewidths and for specific spatial modes. Consequently, spectrally restrictive planar Fabry-Pérot cavities have not contributed to date to white-light imaging, which necessitates a highly multimoded broadband field to satisfy the resonance condition. Here we show that introducing judicious angular-dispersion circumvents the fundamental trade-off between cavity linewidth and finesse in a Fabry-Pérot cavity by exciting a 130-nm-bandwidth achromatic resonance across the visible spectrum, which far exceeds the finesse-limited linewidth (0.5 nm), and even exceeds the free spectral range (45 nm). This omni-resonant configuration enables broadband color-imaging over a 100-nm-bandwidth in the visible with minimal spherical and chromatic aberrations. We demonstrate omni-resonant imaging using coherent and incoherent light, and spatially extended and localized fields comprising stationary and moving objects. This work paves the way to harnessing broadband resonant enhancements for spatially structured fields, as needed for example in solar windows.
Submitted 16 September, 2023; v1 submitted 26 December, 2022;
originally announced December 2022.
-
Theory of space-time supermodes in planar multimode waveguides
Authors:
Abbas Shiri,
Kenneth L. Schepler,
Ayman F. Abouraddy
Abstract:
When an optical pulse is focused into a multimode waveguide or fiber, the energy is divided among the available guided modes. Consequently, the initially localized intensity spreads transversely, the spatial profile undergoes rapid variations with axial propagation, and the pulse disperses temporally. Space-time (ST) supermodes are pulsed guided field configurations that propagate invariantly in multimode waveguides by assigning each mode to a prescribed wavelength. ST supermodes can thus be viewed as the spectrally discrete, guided-wave counterpart of the recently demonstrated propagation-invariant ST wave packets in free space. The group velocity of an ST supermode is tunable independently -- in principle -- of the waveguide structure, group-velocity dispersion is eliminated or dramatically curtailed, and the time-averaged intensity profile is axially invariant along the waveguide in the absence of mode-coupling. We establish here a theoretical framework for studying ST supermodes in planar waveguides. Modal engineering allows sculpting this axially invariant transverse intensity profile from an on-axis peak or dip (dark beam), to a multi-peak or flat distribution. Moreover, ST supermodes can be synthesized using spectrally incoherent light, thus paving the way to potential applications in optical beam delivery for lighting.
Submitted 24 December, 2022;
originally announced December 2022.
-
Spatial resolution of omni-resonant imaging
Authors:
Abbas Shiri,
Ayman F. Abouraddy
Abstract:
Omni-resonance refers to the broadening of the spectral transmission through a planar cavity, not by changing the cavity structure, but by judiciously preconditioning the incident optical field. As such, broadband imaging can be performed through such a cavity with all the wavelengths simultaneously resonating. We examine here the spatial resolution of omni-resonant imaging and find that the spectral linewidth of the cavity resonance determines the spatial resolution. Surprisingly, the spatial resolution improves at longer wavelengths because of the negative angular dispersion intrinsic to Fabry-Perot resonances, in contrast to conventional diffraction-limited optical imaging systems where the spatial resolution improves at shorter wavelengths. These results are important for applications ranging from transparent solar windows to nonlinear resonant image processing.
Submitted 29 June, 2022; v1 submitted 18 May, 2022;
originally announced May 2022.
-
Propagation-invariant space-time supermodes in a multimode waveguide
Authors:
Abbas Shiri,
Scott Webster,
Kenneth L. Schepler,
Ayman F. Abouraddy
Abstract:
When an optical pulse is spatially localized in a highly multimoded waveguide, its energy is typically distributed among a multiplicity of modes, thus giving rise to a speckled transverse spatial profile that undergoes erratic changes with propagation. It has been suggested theoretically that pulsed multimode fields in which each wavelength is locked to an individual mode at a prescribed axial wave number will propagate invariantly along the waveguide at a tunable group velocity. In this conception, an initially localized field remains localized along the waveguide. Here, we provide proof-of-principle experimental confirmation for the existence of this new class of pulsed guided fields, which we denote space-time supermodes, and verify their propagation invariance in a planar waveguide. By superposing up to 21 modes, each assigned to a prescribed wavelength, we construct space-time supermodes in a 170-micron-thick planar glass waveguide with group indices extending from 1 to 2. The initial transverse width of the field is 6 microns, and the waveguide length is 9.1 mm, which is 257x the associated Rayleigh range. A variety of axially invariant transverse spatial profiles are produced by judicious selection of the modes contributing to the ST supermode, including single-peak and multi-peak fields, dark fields (containing a spatial dip), and even flat uniform intensity profiles.
Submitted 4 April, 2022;
originally announced April 2022.
-
Severing the link between modal order and group index using hybrid guided space-time modes
Authors:
Abbas Shiri,
Ayman F. Abouraddy
Abstract:
The structure of an optical waveguide determines the characteristics of its guided modes, such as their spatial profile and group index. General features are shared by modes regardless of the waveguiding structure; for example, modal dispersion is inevitable in multimode waveguides, every mode experiences group-velocity dispersion, and higher-order modes are usually slower than their lower-order counterparts. We show here that such trends can be fundamentally altered -- altogether severing the link between modal order and group index and eliminating dispersion -- by exploiting hybrid guided space-time modes in a planar multimode waveguide. Such modes are confined in one dimension by the waveguide and in the other by the spatio-temporal spectral structure of the field itself. Direct measurements of the modal group delays confirm that the group index for low-loss, dispersion-free, hybrid space-time modes can each be tuned away from the group index of the conventional mode of the same order, and that the transverse size of these hybrid modes can be varied independently of the modal order and group index. These findings are verified in a few-mode planar waveguide consisting of a 25.5-mm-long, 4-μm-thick silica film deposited on a MgF₂ substrate.
Submitted 4 November, 2021;
originally announced November 2021.
-
Roadmap on multimode light shaping
Authors:
Marco Piccardo,
Vincent Ginis,
Andrew Forbes,
Simon Mahler,
Asher A. Friesem,
Nir Davidson,
Haoran Ren,
Ahmed H. Dorrah,
Federico Capasso,
Firehun T. Dullo,
Balpreet S. Ahluwalia,
Antonio Ambrosio,
Sylvain Gigan,
Nicolas Treps,
Markus Hiekkamäki,
Robert Fickler,
Michael Kues,
David Moss,
Roberto Morandotti,
Johann Riemensberger,
Tobias J. Kippenberg,
Jérôme Faist,
Giacomo Scalari,
Nathalie Picqué,
Theodor W. Hänsch,
et al. (13 additional authors not shown)
Abstract:
Our ability to generate new distributions of light has been remarkably enhanced in recent years. At the most fundamental level, these light patterns are obtained by ingeniously combining different electromagnetic modes. Interestingly, the modal superposition occurs in the spatial, temporal as well as spatio-temporal domain. This generalized concept of structured light is being applied across the entire spectrum of optics: generating classical and quantum states of light, harnessing linear and nonlinear light-matter interactions, and advancing applications in microscopy, spectroscopy, holography, communication, and synchronization. This Roadmap highlights the common roots of these different techniques and thus establishes links between research areas that complement each other seamlessly. We provide an overview of all these areas, their backgrounds, current research, and future developments. We highlight the power of multimodal light manipulation and want to inspire new eclectic approaches in this vibrant research community.
Submitted 8 April, 2021;
originally announced April 2021.
-
Hybrid guided space-time optical modes in unpatterned films
Authors:
Abbas Shiri,
Murat Yessenov,
Scott Webster,
Kenneth L. Schepler,
Ayman F. Abouraddy
Abstract:
Light can be confined transversely and delivered axially in a waveguide. However, waveguides are lossy static structures whose modal characteristics are fundamentally determined by the boundary conditions, and thus cannot be readily changed post-fabrication. Here we show that unpatterned planar optical films can be exploited for low-loss two-dimensional waveguiding by using "space-time" wave packets, which are the unique family of one-dimensional propagation-invariant pulsed optical beams. We observe "hybrid guided" space-time modes that are index-guided in one transverse dimension in the film and localized along the unbounded transverse dimension via the intrinsic spatio-temporal structure of the field. We demonstrate that these field configurations enable overriding the boundary conditions by varying post-fabrication the group index of the fundamental mode in a 2-μm-thick, 25-mm-long silica film, which is achieved by modifying the field's spatio-temporal structure along the unbounded dimension. Tunability of the group index over an unprecedented range from 1.26 to 1.77 around the planar-waveguide value of 1.47 is verified, while maintaining a spectrally flat zero-dispersion profile. Our work paves the way to the utilization of space-time wave packets in on-chip photonic platforms, and may enable new phase-matching strategies that circumvent the restrictions due to intrinsic material properties.
Submitted 7 January, 2020;
originally announced January 2020.
-
Omni-resonant space-time wave packets
Authors:
Abbas Shiri,
Murat Yessenov,
Rohinraj Aravindakshan,
Ayman F. Abouraddy
Abstract:
We describe theoretically and verify experimentally a novel class of diffraction-free pulsed optical beams that are "omni-resonant": they have the remarkable property of transmission through planar Fabry-Perot resonators without spectral filtering even if their bandwidth far exceeds the cavity resonant linewidth. Ultrashort wave packets endowed with a specific spatio-temporal structure couple to a single resonant mode independently of its linewidth. We confirm that such "space-time" omni-resonant wave packets retain their bandwidth (1.6 nm), spatio-temporal profile (1.3-ps pulse width, 4-μm beam width), and diffraction-free behavior upon transmission through cavities with resonant linewidths of 0.3 nm and 0.15 nm.
Submitted 29 November, 2019;
originally announced December 2019.
-
Doubling the near-infrared photocurrent in a solar cell via omni-resonant coherent perfect absorption
Authors:
Massimo L. Villinger,
Abbas Shiri,
Soroush Shabahang,
Ali K. Jahromi,
Magued B. Nasr,
Christopher H. Villinger,
Ayman F. Abouraddy
Abstract:
Minimizing the material usage in thin-film solar cells can reduce manufacturing costs and enable mechanically flexible implementations, but concomitantly diminishes optical absorption. Coherent optical effects can help alleviate this inevitable drawback at discrete frequencies. For example, coherent perfect absorption guarantees that light is fully absorbed in a thin layer regardless of material or thickness, but only on resonance. Here we show that omni-resonance delivers such coherent enhancement over a broad bandwidth by structuring the optical field to nullify the angular dispersion intrinsic to resonant structures. After embedding an amorphous-silicon thin-film photovoltaic cell in a planar cavity, preconditioning the incident light using an alignment-free optical arrangement severs the link between the resonant bandwidth and the cavity photon lifetime, thereby rendering the cavity omni-resonant. Coherently enhanced near-infrared absorption doubles the photocurrent over the targeted spectral range of 660 to 740 nm, where every wavelength resonates. These results may pave the way to transparent solar cells that optimally harvest near-infrared light.
Submitted 13 January, 2020; v1 submitted 20 November, 2019;
originally announced November 2019.
-
The left-greedy Lie algebra basis and star graphs
Authors:
Benjamin Walter,
Aminreza Shiri
Abstract:
We construct a basis for free Lie algebras via a "left-greedy" bracketing algorithm on Lyndon-Shirshov words. We use a new tool -- the configuration pairing between Lie brackets and graphs of Sinha-Walter -- to show that the left-greedy brackets form a basis. Our constructions further equip the left-greedy brackets with a dual monomial Lie coalgebra basis of "star" graphs. We end with a brief example using the dual basis of star graphs in a Lie algebra computation.
Submitted 23 October, 2015;
originally announced October 2015.