-
LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts
Authors:
Yijia Xiao,
Edward Sun,
Tianyu Liu,
Wei Wang
Abstract:
We propose LogicVista, an evaluation benchmark that assesses the integrated logical reasoning capabilities of multimodal large language models (MLLMs) in visual contexts. Recent advancements in MLLMs have demonstrated various fascinating abilities, from crafting poetry based on an image to performing mathematical reasoning. However, there is still a lack of systematic evaluation of MLLMs' proficiency in logical reasoning tasks, which are essential for activities like navigation and puzzle-solving. Thus we evaluate general logical cognition abilities across 5 logical reasoning tasks encompassing 9 different capabilities, using a sample of 448 multiple-choice questions. Each question is annotated with the correct answer and the human-written reasoning behind the selection, enabling both open-ended and multiple-choice evaluation. A total of 8 MLLMs are comprehensively evaluated using LogicVista. Code and data are available at https://github.com/Yijia-Xiao/LogicVista.
Submitted 6 July, 2024;
originally announced July 2024.
-
Towards Improving Learning from Demonstration Algorithms via MCMC Methods
Authors:
Carl Qi,
Edward Sun,
Harry Zhang
Abstract:
Behavioral cloning, or more broadly, learning from demonstrations (LfD), is a promising direction for robot policy learning in complex scenarios. Albeit being straightforward to implement and data-efficient, behavioral cloning has its own drawbacks, limiting its efficacy in real robot setups. In this work, we take one step towards improving learning from demonstration algorithms by leveraging implicit energy-based policy models. Results suggest that in selected complex robot policy learning scenarios, treating supervised policy learning with an implicit model generally performs better, on average, than commonly used neural network-based explicit models, especially in the cases of approximating potentially discontinuous and multimodal functions.
Submitted 23 May, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Authors:
Julien Piet,
Maha Alrashed,
Chawin Sitawarin,
Sizhe Chen,
Zeming Wei,
Elizabeth Sun,
Basel Alomair,
David Wagner
Abstract:
Large Language Models (LLMs) are attracting significant research attention due to their instruction-following abilities, allowing users and developers to leverage LLMs for a variety of tasks. However, LLMs are vulnerable to prompt-injection attacks: a class of attacks that hijack the model's instruction-following abilities, changing responses to prompts to undesired, possibly malicious ones. In this work, we introduce Jatmo, a method for generating task-specific models resilient to prompt-injection attacks. Jatmo leverages the fact that LLMs can only follow instructions once they have undergone instruction tuning. It harnesses a teacher instruction-tuned model to generate a task-specific dataset, which is then used to fine-tune a base model (i.e., a non-instruction-tuned model). Jatmo only needs a task prompt and a dataset of inputs for the task: it uses the teacher model to generate outputs. For situations with no pre-existing datasets, Jatmo can use a single example, or in some cases none at all, to produce a fully synthetic dataset. Our experiments on seven tasks show that Jatmo models provide similar quality of outputs on their specific task as standard LLMs, while being resilient to prompt injections. The best attacks succeeded in less than 0.5% of cases against our models, versus 87% success rate against GPT-3.5-Turbo. We release Jatmo at https://github.com/wagner-group/prompt-injection-defense.
Submitted 8 January, 2024; v1 submitted 29 December, 2023;
originally announced December 2023.
-
The 4m International Liquid Mirror Telescope: a brief history and some preliminary scientific results
Authors:
Jean Surdej,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Anna Pospieszalska-Surdej,
Kumar Pranshu,
Ethen Sun
Abstract:
The present article is based upon an invited talk delivered at the occasion of the inauguration of the 4m International Liquid Mirror Telescope (ILMT) which took place in Devasthal (ARIES, Uttarakhand, India) on 21st of March 2023. We present hereafter a short history of the liquid mirror telescopes and in particular of the 4m ILMT which is the first liquid mirror telescope entirely dedicated to astrophysical observations. We discuss a few preliminary scientific results and illustrate some direct CCD images taken during the first commissioning phase of the telescope. We invite the reader to refer to the series of ILMT poster papers published in these same proceedings of the BINA3 workshop for more details about the instrument, operation, first observations, performance and scientific results.
Submitted 8 November, 2023;
originally announced November 2023.
-
SunPhot: Preparations for an upcoming quasar variability survey with the International Liquid Mirror Telescope
Authors:
Ethen Sun,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Kumar Pranshu,
Jean Surdej
Abstract:
Recent research suggests a correlation between the variability and intrinsic brightness of quasars. If calibrated, this could lead to the use of quasars on the cosmic distance ladder, but this work is currently limited by lack of quasar light curve data with high cadence and precision. The Python photometric data pipeline SunPhot is being developed as part of preparations for an upcoming quasar variability survey with the International Liquid Mirror Telescope (ILMT). SunPhot uses aperture photometry to directly extract light curves for a catalogue of sources from calibrated ILMT images. SunPhot v.2.1 is operational, but the project is awaiting completion of ILMT commissioning.
Submitted 8 November, 2023;
originally announced November 2023.
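The SunPhot abstract above mentions extracting light curves by aperture photometry. As a rough illustration of that general technique only (this is not SunPhot's code, and the function names, radii and zero point below are assumptions made for the example), a background-subtracted aperture sum can be sketched with NumPy:

```python
import numpy as np


def aperture_flux(image, x0, y0, r_ap, r_in, r_out):
    """Sum the counts inside a circular aperture and subtract the sky.

    Hypothetical helper: the sky level per pixel is estimated as the
    median of an annulus between r_in and r_out around the source.
    """
    yy, xx = np.indices(image.shape)
    dist = np.hypot(xx - x0, yy - y0)
    aperture = dist <= r_ap
    annulus = (dist >= r_in) & (dist <= r_out)
    sky_per_pixel = np.median(image[annulus])
    return image[aperture].sum() - sky_per_pixel * aperture.sum()


def instrumental_magnitude(flux, zero_point=0.0):
    """Convert a positive flux to a magnitude (zero_point is an assumed constant)."""
    return zero_point - 2.5 * np.log10(flux)


# Synthetic frame: flat sky of 10 counts plus a single-pixel source of 1000 counts.
frame = np.full((41, 41), 10.0)
frame[20, 20] += 1000.0
flux = aperture_flux(frame, x0=20, y0=20, r_ap=5, r_in=8, r_out=12)
```

Repeating such a measurement for the same catalogue position on successive nights would give one point per night of a light curve; a real pipeline would also need calibration against reference stars.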
-
Surface Brightness Properties of LSB Galaxies with the International Liquid Mirror Telescope
Authors:
Jiuyang Fu,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
Low surface brightness (LSB) galaxies make up a significant fraction of the luminosity density of the local universe. Their low surface brightness suggests a different formation and evolution process compared to more-typical high-surface-brightness galaxies. This study presents an analysis of LSB galaxies found in images obtained by the International Liquid Mirror Telescope during the observation period from October 24 to November 1, 2022. 3,092 LSB galaxies were measured and separated into blue and red LSB categories based on their $g'-i'$ colours. In these samples, the median effective radius is 4.7 arcsec, and the median value of the mean surface brightness within the effective radius is 26.1 mag arcsec$^{-2}$. The blue LSB galaxies are slightly brighter than the red LSB galaxies. No significant difference of ellipticity was found between the blue and the red LSB galaxies.
Submitted 8 November, 2023;
originally announced November 2023.
-
Survey of Variables with the ILMT
Authors:
Baldeep Grewal,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Paul Hickson,
Kuntal Misra,
Brajesh Kumar,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
Nestled in the mountains of Northern India is a 4-metre rotating dish of liquid mercury. Over a 10-year period, the International Liquid Mirror Telescope (ILMT) will survey 117 square degrees of sky to study the astrometric and photometric variability of all detected objects. One of the scientific programs will be a survey of variable stars. The data gathered will be used to construct a comprehensive catalog of light curves. This will be an essential resource for astronomers studying the formation and evolution of stars, the structure and dynamics of our Milky Way galaxy, and the properties of the Universe as a whole. This catalog will advance our understanding of the cosmos and provide deeper insights into the fundamental processes that shape our Universe. In this work, we describe the survey and give some examples of variable stars found in the early commissioning data from the ILMT.
Submitted 8 November, 2023;
originally announced November 2023.
-
Observation of multiply imaged quasars with the 4-m ILMT
Authors:
Talat Akhunov,
Bhavya Ailawadhi,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Anna Pospieszalska-Surdej,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
Gravitationally lensed quasars (GLQs) are known to potentially provide an independent way of determining the value of the Hubble-Lemaître parameter $H_{0}$, to probe the dark matter content of lensing galaxies and to resolve tiny structures in distant active galactic nuclei. That is why multiply imaged quasars are one of the main drivers for photometric monitoring with the 4-m International Liquid Mirror Telescope (ILMT). We would like to answer the following questions: how many multiply imaged quasars should we be able to detect with the ILMT, and how can we derive accurate magnitudes of the GLQ images? Our estimate of the possible number of multiply imaged quasars is $15$, although optimistic forecasts predict up to $50$ of them. We propose to use the adaptive PSF fitting method for accurate flux measurements of the lensed images. During preliminary observations in spring 2022 we were able to detect the quadruply imaged quasar SDSS J1251+2935 in the $i$ and $r$ spectral bands.
Submitted 8 November, 2023;
originally announced November 2023.
-
Follow-up strategy of ILMT discovered supernovae
Authors:
Brajesh Kumar,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Kuntal Misra,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
The 4m International Liquid Mirror Telescope (ILMT) facility continuously scans the same sky strip ($\sim$22$^\prime$ wide) on each night with a fixed pointing towards the zenith direction. It is possible to detect hundreds of supernovae (SNe) each year by implementing an optimal image subtraction technique on consecutive night images. Prompt monitoring of ILMT-detected SNe is planned under the secured target of opportunity mode using ARIES telescopes (1.3m DFOT and 3.6m DOT). Spectroscopy with the DOT facility will be useful for the classification and detailed investigation of SNe. During the commissioning phase of the ILMT, supernova (SN) 2023af was identified in the ILMT field of view. The SN was further monitored with the ILMT and DOT facilities. Preliminary results based on the light curve and spectral features of SN 2023af are presented.
Submitted 8 November, 2023;
originally announced November 2023.
-
Astrometric and photometric calibrators for the 4-m International Liquid Mirror Telescope
Authors:
Naveen Dukiya,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
The International Liquid Mirror Telescope (ILMT) is a 4-meter class survey telescope. It achieved its first light on 29 April 2022 and is now undergoing the commissioning phase. It scans the sky in a fixed $22'$ wide strip centred at the declination of $+29^{\circ}21'41.4''$ and works in Time Delay Integration (TDI) mode. We present a full catalog of sources in the ILMT strip derived by crossmatching Gaia DR3 with SDSS DR17 and PanSTARRS-1 (PS1) to supplement the catalog with apparent magnitudes of these sources in $g, r$, and $i$ filters. These sources can serve as astrometric calibrators. The release of Gaia DR3 provides synthetic photometry in popular broadband photometric systems, including the SDSS $g, r$, and $i$ bands for $\sim$220 million sources across the sky. We have used this synthetic photometry to verify our crossmatching performance and, in turn, create a subset of the catalog with accurate photometric measurements from two reliable sources.
Submitted 8 November, 2023;
originally announced November 2023.
-
A year-long representation of the ILMT observations in different coordinate systems
Authors:
Monalisa Dubey,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Kuntal Misra,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
The 4m International Liquid Mirror Telescope (ILMT) is the first optical survey telescope in India that performs zenithal observations of a 22$'$ wide strip of the sky. To determine the portion of the sky covered by the ILMT during the entire year, we represent the ILMT Field of View (FoV) in three different coordinate systems: galactic, ecliptic, and equatorial. We adopt a constant declination of $+29^{\circ}21'41.4''$ and varying right ascension (RA) ranges corresponding to the Local Sidereal Time (LST). The observations from June to September are hampered by the monsoon season. Such representations will allow us to locate a transient event in the ILMT FoV. This will enable prompt follow-up observations with other facilities.
Submitted 8 November, 2023;
originally announced November 2023.
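The abstract above adopts a constant declination and an RA range that follows the Local Sidereal Time. For a telescope fixed at the zenith, the RA on the meridian equals the LST, so the covered strip can be sketched as below. This is only an illustration under stated assumptions: the function names are hypothetical, the strip is taken to be centred on the quoted declination, and precession, refraction and the finite detector size in RA are ignored.

```python
def dms_to_degrees(degrees, arcmin, arcsec):
    """Convert a positive degrees/arcminutes/arcseconds triple to decimal degrees."""
    return degrees + arcmin / 60.0 + arcsec / 3600.0


def zenith_ra_degrees(lst_hours):
    """RA (degrees) crossing the zenith at a given LST; 1 hour of LST = 15 degrees."""
    return (lst_hours % 24.0) * 15.0


def strip_bounds(lst_start_hours, lst_end_hours, dec_center_deg, width_arcmin=22.0):
    """Approximate sky strip swept between two sidereal times (assumed geometry)."""
    half_width_deg = width_arcmin / 60.0 / 2.0
    return {
        "ra_start_deg": zenith_ra_degrees(lst_start_hours),
        "ra_end_deg": zenith_ra_degrees(lst_end_hours),  # may wrap through 0
        "dec_min_deg": dec_center_deg - half_width_deg,
        "dec_max_deg": dec_center_deg + half_width_deg,
    }


# Declination quoted in the abstract: +29 deg 21' 41.4''.
ilmt_dec = dms_to_degrees(29, 21, 41.4)
# Hypothetical night running from LST 22h to 4h (the RA range wraps through 0).
night = strip_bounds(22.0, 4.0, ilmt_dec)
```

Converting the resulting equatorial bounds to galactic or ecliptic coordinates would require a coordinate library and is left out of this sketch.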
-
The 4m International Liquid Mirror Telescope project
Authors:
Jean Surdej,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Anna Pospieszalska-Surdej,
Kumar Pranshu,
Ethen Sun
Abstract:
The International Liquid Mirror Telescope (ILMT) project is a scientific collaboration in observational astrophysics between the Liège Institute of Astrophysics and Geophysics (Liège University, Belgium), the Aryabhatta Research Institute of observational sciencES (ARIES, Nainital, India) and several Canadian universities (British Columbia, Laval, Montréal, Toronto, Victoria and York). Meanwhile, several other institutes have joined the project: the Royal Observatory of Belgium, the National University of Uzbekistan and the Ulugh Beg Astronomical Institute (Uzbekistan) as well as the Poznań Observatory (Poland). The Liège company AMOS (Advanced Mechanical and Optical Systems) has fabricated the telescope structure that has been erected on the ARIES site in Devasthal (Uttarakhand, India). It is the first liquid mirror telescope dedicated to astronomical observations. First light was obtained on 29 April 2022 and commissioning is being conducted at the present time. In this short article, we describe and illustrate the main components of the ILMT. We also highlight the ILMT papers presented during the third BINA workshop, which discuss various aspects of the ILMT science programs.
Submitted 8 November, 2023;
originally announced November 2023.
-
Serendipitous Detection of Orbital Debris by the International Liquid Mirror Telescope: First Results
Authors:
Paul Hickson,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
Orbital debris presents a growing risk to space operations, and is becoming a significant source of contamination of astronomical images. Much of the debris population is uncatalogued, making the impact more difficult to assess. We present initial results from the first ten nights of commissioning observations with the International Liquid Mirror Telescope, in which images were examined for streaks produced by orbiting objects including satellites, rocket bodies and other forms of debris. We detected 83 streaks and performed a correlation analysis to attempt to match these with objects in the public database. 48% of these objects were uncorrelated, indicating substantial incompleteness in the database, even for some relatively bright objects. We were able to detect correlated objects to an estimated magnitude of 14.5 and possibly about two magnitudes greater for the faintest uncorrelated object.
Submitted 8 November, 2023;
originally announced November 2023.
-
Detection and Identification of Asteroids with the 4-m ILMT
Authors:
Anna Pospieszalska-Surdej,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
A unique strength of the Devasthal Observatory is its capability to detect optical transients with the 4-m International Liquid Mirror Telescope (ILMT) and to rapidly follow them up using the 1.3-m Devasthal Fast Optical Telescope (DFOT) and/or the 3.6-m Devasthal Optical Telescope (DOT), installed right next to it. In this context, we have inspected 20 fields observed during 9 consecutive nights in October-November 2022 during the first commissioning phase of the ILMT. Each of these fields has an angular extent of $22^\prime$ in declination by $9 \times 22^\prime$ in right ascension. Combining both a visual search for optical transients and an automatic search for these using an image subtraction technique (see the ILMT poster paper by Pranshu et al.), we report a total of 232 significant transient candidates. After consulting the Minor Planet Center database of asteroids, we could identify among these 219 positions of known asteroids brighter than $V=22$. These correspond to the confirmed positions of 78 distinct known asteroids. Analysis of the remaining CCD frames covering 19 more fields (out of 20) should lead to an impressive number of asteroids observed in only 9 nights. The conclusion is that in order to detect and characterize new supernovae, micro-lensing events, highly variable stars, multiply imaged quasars, etc. among the ILMT optical transients, we shall first have to identify all known and new asteroids. Thanks to its large diameter and short focal length (f/D $\sim$ 2.4), the ILMT turns out to be an excellent asteroid hunter.
Submitted 8 November, 2023;
originally announced November 2023.
-
Accessibility of the ILMT survey data
Authors:
Kuntal Misra,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
The 4m International Liquid Mirror Telescope (ILMT) continuously scans a 22$'$ wide strip of the zenithal sky and records the images in three broadband filters (g', r' and i') using a 4K$\times$4K CCD camera. In about 10--12 hours of observations during a single night, $\sim$15 GB of data volume is generated. The raw images resulting from the observations in October--November 2022 have been pre-processed and astrometrically calibrated. In order to exploit the scientific capabilities of the ILMT survey data by the larger scientific community, we are disseminating the raw data (along with dark and flat fields) and the astrometrically calibrated data. These data sets can be downloaded by the users to conduct the scientific projects of their interest. In future, the data will be processed in near real-time and will be available via the ARIES data archive portal.
Submitted 8 November, 2023;
originally announced November 2023.
-
Automated transient detection in the context of the 4m ILMT
Authors:
Kumar Pranshu,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Ethen Sun,
Jean Surdej
Abstract:
In the era of sky surveys like Palomar Transient Factory (PTF), Zwicky Transient Facility (ZTF) and the upcoming Vera Rubin Observatory (VRO) and ILMT, a plethora of image data will be available. ZTF scans the sky with a field of view of 48 deg$^{2}$ and VRO will have a FoV of 9.6 deg$^{2}$ but with a much larger aperture. The 4m ILMT covers a 22$'$ wide strip of the sky. Being a zenith telescope, ILMT has several advantages like low observation air mass, best image quality, minimum light pollution and no pointing time loss. Transient detection requires all these imaging data to be processed through a Difference Imaging Algorithm (DIA) followed by subsequent identification and classification of transients. The ILMT is also expected to discover several known and unknown astrophysical objects including transients. Here, we propose a pipeline with an image subtraction algorithm and a convolutional neural network (CNN) based automated transient discovery and classification system. The pipeline was tested on ILMT data and the transients as well as variable candidates were recovered and classified.
Submitted 8 November, 2023;
originally announced November 2023.
-
An automated photometric pipeline for the ILMT data
Authors:
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Vibhore Negi,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
The International Liquid Mirror Telescope (ILMT) is a 4-meter survey telescope continuously observing towards the zenith in the SDSS g', r', and i' bands. This survey telescope is designed to detect various astrophysical transients (for example, supernovae) and very faint objects like multiply-imaged quasars and low surface brightness galaxies. A single scan of a 22$'$ strip of sky contains a large amount of photometric information. To process this type of data, it becomes critical to have tools or pipelines that can handle it efficiently and accurately with minimal human biases. We offer a fully automated pipeline written in Python to perform aperture photometry over the ILMT data acquired with the CCD in Time Delayed Integration (TDI) mode. The instrumental magnitudes are calibrated with respect to the Pan-STARRS-1 catalogue. The light curves generated from the calibrated magnitudes will allow us to characterize the objects as variable stars or rapidly decaying transients.
Submitted 8 November, 2023;
originally announced November 2023.
-
Necessity of a TDI optical corrector for ILMT observations
Authors:
Vibhore Negi,
Bhavya Ailawadhi,
Talat Akhunov,
Ermanno Borra,
Monalisa Dubey,
Naveen Dukiya,
Jiuyang Fu,
Baldeep Grewal,
Paul Hickson,
Brajesh Kumar,
Kuntal Misra,
Kumar Pranshu,
Ethen Sun,
Jean Surdej
Abstract:
The International Liquid Mirror Telescope (ILMT) has recently become operational at the Devasthal Observatory of ARIES, Nainital, India. The ILMT observes in the Time Delay Integration (TDI) mode, where the images are formed by electronically stepping the charges over the pixels of the CCD, along a column. Observations near the zenith impose certain constraints dependent on the latitude, such as image deformation due to the star-trail curvature and differential speed. These effects make the stellar trajectories in the focal plane of the ILMT hyperbolic; they are corrected for by the introduction of a TDI optical corrector, designed specifically for the ILMT. Here, we report the first results on the effect of this corrector on the trajectories followed by the stars in the ILMT focal plane. Astrometrically calibrating nine nights of data recorded with the ILMT during its first commissioning phase, we find simple (nearly linear) relations between the CCD-y coordinate and the right ascension (RA) of stars and between the CCD-x coordinate and their declination (DEC), respectively, which confirms that the TDI corrector works well in converting the stellar trajectories into straight lines.
Submitted 8 November, 2023;
originally announced November 2023.
-
Knowledge Distilled Ensemble Model for sEMG-based Silent Speech Interface
Authors:
Wenqiang Lai,
Qihan Yang,
Ye Mao,
Endong Sun,
Jiangnan Ye
Abstract:
Voice disorders affect millions of people worldwide. Surface electromyography-based Silent Speech Interfaces (sEMG-based SSIs) have been explored as a potential solution for decades. However, previous works were limited by small vocabularies and manually extracted features from raw data. To address these limitations, we propose a lightweight deep learning knowledge-distilled ensemble model for sEMG-based SSI (KDE-SSI). Our model can classify a dataset of the 26 NATO phonetic alphabet code words with 3900 data samples, enabling the unambiguous generation of any English word through spelling. Extensive experiments validate the effectiveness of KDE-SSI, achieving a test accuracy of 85.9%. Our findings also shed light on an end-to-end system for portable, practical equipment.
Submitted 6 August, 2023;
originally announced August 2023.
-
Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data
Authors:
Rong Ma,
Eric D. Sun,
David Donoho,
James Zou
Abstract:
Single-cell data integration can provide a comprehensive molecular view of cells, and many algorithms have been developed to remove unwanted technical or biological variations and integrate heterogeneous single-cell datasets. Despite their wide usage, existing methods suffer from several fundamental limitations. In particular, we lack a rigorous statistical test for whether two high-dimensional single-cell datasets are alignable (and therefore should even be aligned). Moreover, popular methods can substantially distort the data during alignment, making the aligned data and downstream analysis difficult to interpret. To overcome these limitations, we present a spectral manifold alignment and inference (SMAI) framework, which enables principled and interpretable alignability testing and structure-preserving integration of single-cell data with the same type of features. SMAI provides a statistical test to robustly assess the alignability between datasets to avoid misleading inference, and is justified by high-dimensional statistical theory. On a diverse range of real and simulated benchmark datasets, it outperforms commonly used alignment methods. Moreover, we show that SMAI improves various downstream analyses such as identification of differentially expressed genes and imputation of single-cell spatial transcriptomics, providing further biological insights. SMAI's interpretability also enables quantification and a deeper understanding of the sources of technical confounders in single-cell data.
Submitted 29 February, 2024; v1 submitted 3 August, 2023;
originally announced August 2023.
-
Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text
Authors:
Eric Sun,
Jinyu Li,
Jian Xue,
Yifan Gong
Abstract:
In end-to-end automatic speech recognition systems, one of the difficulties for language expansion is the limited paired speech and text training data. In this paper, we propose a novel method to generate augmented samples with unpaired speech feature segments and text data for model pre-training, which has the advantage of low cost without using additional speech data. When mixing 20,000 hours of augmented speech data generated by our method with 12,500 hours of original transcribed speech data for Italian Transformer transducer model pre-training, we achieve an 8.7% relative word error rate reduction. The pre-trained model achieves performance similar to that of the model pre-trained with 75,000 hours of multilingual transcribed raw speech data. When merging the augmented speech data with the multilingual data to pre-train a new model, we achieve an even larger relative word error rate reduction of 12.2% over the baseline, which further verifies the effectiveness of our method for speech data augmentation.
Submitted 30 July, 2023;
originally announced July 2023.
-
Data Cross-Segmentation for Improved Generalization in Reinforcement Learning Based Algorithmic Trading
Authors:
Vikram Duvvur,
Aashay Mehta,
Edward Sun,
Bo Wu,
Ken Yew Chan,
Jeff Schneider
Abstract:
The use of machine learning in algorithmic trading systems is increasingly common. In a typical set-up, supervised learning is used to predict the future prices of assets, and those predictions drive a simple trading and execution strategy. This is quite effective when the predictions have sufficient signal, markets are liquid, and transaction costs are low. However, those conditions often do not hold in thinly traded financial markets and markets for differentiated assets such as real estate or vehicles. In these markets, the trading strategy must consider the long-term effects of taking positions that are relatively more difficult to change. In this work, we propose a Reinforcement Learning (RL) algorithm that trades based on signals from a learned predictive model and addresses these challenges. We test our algorithm on 20+ years of equity data from Bursa Malaysia.
Submitted 18 July, 2023;
originally announced July 2023.
-
Improved Algorithms for Online Rent Minimization Problem Under Unit-Size Jobs
Authors:
Enze Sun,
Zonghan Yang,
Yuhao Zhang
Abstract:
We consider the Online Rent Minimization problem, where online jobs with release times, deadlines, and processing times must be scheduled on machines that can be rented for a fixed length period of $T$. The objective is to minimize the number of machine rents. This problem generalizes the Online Machine Minimization problem where machines can be rented for an infinite period, and both problems have an asymptotically optimal competitive ratio of $O(\log(p_{\max}/p_{\min}))$ for general processing times, where $p_{\max}$ and $p_{\min}$ are the maximum and minimum processing times respectively. However, for small values of $p_{\max}/p_{\min}$, a better competitive ratio can be achieved by assuming unit-size jobs. Under this assumption, Devanur et al. (2014) gave an optimal $e$-competitive algorithm for Online Machine Minimization, and Chen and Zhang (2022) gave a $(3e+7)\approx 15.16$-competitive algorithm for Online Rent Minimization. In this paper, we significantly improve the competitive ratio of the Online Rent Minimization problem under unit size to $6$, by using a clean oracle-based online algorithm framework.
Submitted 29 June, 2023;
originally announced June 2023.
-
Motion robust MR fingerprinting scan to image neonates with prenatal opioid exposure
Authors:
Dan Ma,
Chaitra Badve,
Jessie EP Sun,
Siyuan Hu,
Xiaofeng Wang,
Yong Chen,
Ameya Nayate,
Michael Wien,
Douglas Martin,
Lynn T Singer,
Jared C. Durieux,
Chris Flask,
Deanne Wilson Costello
Abstract:
Background: A noninvasive and sensitive imaging tool is needed to assess the fast-evolving baby brain. However, using MRI to study non-sedated babies faces roadblocks, including high scan failure rates due to subject motion and the lack of quantitative measures for assessing potential developmental delays. This feasibility study explores whether MR Fingerprinting scans can provide motion-robust and quantitative brain tissue measurements for non-sedated infants with prenatal opioid exposure, presenting a viable alternative to clinical MR scans. Assessment: MRF image quality was compared to pediatric MRI scans using a fully crossed, multiple-reader, multiple-case study. The quantitative T1 and T2 values were used to assess brain tissue changes between babies younger than one month and babies between one and two months. Statistical Tests: A generalized estimating equations (GEE) model was used to test for significant differences in the T1 and T2 values from eight white matter regions between babies under one month and those who are older. MRI and MRF image quality were assessed using Gwet's second-order auto-correlation coefficient (AC2) with its confidence levels. We used the Cochran-Mantel-Haenszel test to assess the difference in proportions between MRF and MRI for all features and stratified by the type of features. Results: In infants under one month of age, the T1 and T2 values are significantly higher (p<0.005) compared to those between one and two months. A multiple-reader, multiple-case study showed superior image quality ratings in anatomical features for the MRF images compared with the MRI images. Conclusions: This study suggests that MR Fingerprinting scans offer a motion-robust and efficient method for non-sedated infants, delivering image quality superior to that of clinical MRI scans and additionally providing quantitative measures to assess brain development.
Submitted 28 June, 2023;
originally announced June 2023.
-
Strong Interaction Physics at the Luminosity Frontier with 22 GeV Electrons at Jefferson Lab
Authors:
A. Accardi,
P. Achenbach,
D. Adhikari,
A. Afanasev,
C. S. Akondi,
N. Akopov,
M. Albaladejo,
H. Albataineh,
M. Albrecht,
B. Almeida-Zamora,
M. Amaryan,
D. Androić,
W. Armstrong,
D. S. Armstrong,
M. Arratia,
J. Arrington,
A. Asaturyan,
A. Austregesilo,
H. Avagyan,
T. Averett,
C. Ayerbe Gayoso,
A. Bacchetta,
A. B. Balantekin,
N. Baltzell,
L. Barion
, et al. (419 additional authors not shown)
Abstract:
This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron beams, CEBAF's potential for a higher energy upgrade presents a unique opportunity for an innovative nuclear physics program, which seamlessly integrates a rich historical background with a promising future. The proposed physics program encompasses a diverse range of investigations centered around the nonperturbative dynamics inherent in hadron structure and the exploration of strongly interacting systems. It builds upon the exceptional capabilities of CEBAF in high-luminosity operations, the availability of existing or planned Hall equipment, and recent advancements in accelerator technology. The proposed program covers various scientific topics, including Hadron Spectroscopy, Partonic Structure and Spin, Hadronization and Transverse Momentum, Spatial Structure, Mechanical Properties, Form Factors and Emergent Hadron Mass, Hadron-Quark Transition, and Nuclear Dynamics at Extreme Conditions, as well as QCD Confinement and Fundamental Symmetries. Each topic highlights the key measurements achievable at a 22 GeV CEBAF accelerator. Furthermore, this document outlines the significant physics outcomes and unique aspects of these programs that distinguish them from other existing or planned facilities. In summary, this document provides an exciting rationale for the energy upgrade of CEBAF to 22 GeV, outlining the transformative scientific potential that lies within reach, and the remarkable opportunities it offers for advancing our understanding of hadron physics and related fundamental phenomena.
Submitted 24 August, 2023; v1 submitted 13 June, 2023;
originally announced June 2023.
-
Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training
Authors:
Eric Sun,
Jinyu Li,
Yuxuan Hu,
Yimeng Zhu,
Long Zhou,
Jian Xue,
Peidong Wang,
Linquan Liu,
Shujie Liu,
Edward Lin,
Yifan Gong
Abstract:
We propose gated language experts and curriculum training to enhance multilingual transformer transducer models without requiring language identification (LID) input from users during inference. Our method incorporates a gating mechanism and LID loss, enabling transformer experts to learn language-specific information. By combining gated transformer experts with shared transformer layers, we construct multilingual transformer blocks and utilize linear experts to effectively regularize the joint network. The curriculum training scheme leverages LID to guide the gated experts in improving their respective language performance. Experimental results on a bilingual task involving English and Spanish demonstrate significant improvements, with average relative word error reductions of 12.5% and 7.3% compared to the baseline bilingual and monolingual models, respectively. Notably, our method achieves performance comparable to the upper-bound model trained and inferred with oracle LID. Extending our approach to trilingual, quadrilingual, and pentalingual models reveals similar advantages to those observed in the bilingual models, highlighting its ease of extension to multiple languages.
Submitted 7 July, 2023; v1 submitted 1 March, 2023;
originally announced March 2023.
-
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers
Authors:
Peidong Wang,
Eric Sun,
Jian Xue,
Yu Wu,
Long Zhou,
Yashesh Gaur,
Shujie Liu,
Jinyu Li
Abstract:
Automatic speech recognition (ASR) and speech translation (ST) can both use neural transducers as the model structure. It is thus possible to use a single transducer model to perform both tasks. In real-world applications, such joint ASR and ST models may need to be streaming and to not require source language identification (i.e., to be language-agnostic). In this paper, we propose LAMASSU, a streaming language-agnostic multilingual speech recognition and translation model using neural transducers. Based on the transducer model structure, we propose four methods: a unified joint and prediction network for multilingual output, a clustered multilingual encoder, target language identification for the encoder, and connectionist temporal classification regularization. Experimental results show that LAMASSU not only drastically reduces the model size but also reaches the performance of monolingual ASR and bilingual ST models.
Submitted 19 October, 2023; v1 submitted 5 November, 2022;
originally announced November 2022.
-
A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability
Authors:
Jian Xue,
Peidong Wang,
Jinyu Li,
Eric Sun
Abstract:
In this paper, we introduce our work on building a Streaming Multilingual Speech Model (SM2), which can transcribe or translate multiple spoken languages into texts of the target language. The backbone of SM2 is the Transformer Transducer, which has high streaming capability. Instead of human-labeled speech translation (ST) data, SM2 models are trained using weakly supervised data generated by converting the transcriptions in speech recognition corpora with a machine translation service. With 351 thousand hours of anonymized speech training data from 25 languages, SM2 models achieve comparable or even better ST quality than some recent popular large-scale non-streaming speech models. More importantly, we show that SM2 has truly zero-shot capability when expanding to new target languages, yielding high-quality ST results for {source-speech, target-text} pairs that are not seen during training.
Submitted 5 July, 2023; v1 submitted 4 November, 2022;
originally announced November 2022.
-
A Spectral Method for Assessing and Combining Multiple Data Visualizations
Authors:
Rong Ma,
Eric D. Sun,
James Zou
Abstract:
Dimension reduction and data visualization aim to project a high-dimensional dataset to a low-dimensional space while capturing the intrinsic structures in the data. It is an indispensable part of modern data science, and many dimensional reduction and visualization algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it critically important to evaluate their relative performance for a given dataset, and to leverage and combine their individual strengths. In this paper, we propose an efficient spectral method for assessing and combining multiple visualizations of a given dataset produced by diverse algorithms. The proposed method provides a quantitative measure -- the visualization eigenscore -- of the relative performance of the visualizations for preserving the structure around each data point. Then it leverages the eigenscores to obtain a consensus visualization, which has much improved quality over the individual visualizations in capturing the underlying true data structure. Our approach is flexible and works as a wrapper around any visualizations. We analyze multiple simulated and real-world datasets from diverse applications to demonstrate the effectiveness of the eigenscores for evaluating visualizations and the superiority of the proposed consensus visualization. Furthermore, we establish rigorous theoretical justification of our method based on a general statistical framework, yielding fundamental principles behind the empirical success of consensus visualization along with practical guidance.
Submitted 24 October, 2022;
originally announced October 2022.
-
Better Approximation for Interdependent SOS Valuations
Authors:
Pinyan Lu,
Enze Sun,
Chenghan Zhou
Abstract:
Submodular over signal (SOS) defines a family of interesting functions for which there exist truthful mechanisms with constant approximation to the social welfare for agents with interdependent valuations. The best-known truthful auction is of $4$-approximation and a lower bound of 2 was proved. We propose a new and simple truthful mechanism to achieve an approximation ratio of 3.315.
Submitted 12 October, 2022;
originally announced October 2022.
-
Online Ordinal Problems: Optimality of Comparison-based Algorithms and their Cardinal Complexity
Authors:
Nick Gravin,
Enze Sun,
Zhihao Gavin Tang
Abstract:
We consider ordinal online problems, i.e., tasks that only require pairwise comparisons between elements of the input. A classic example is the secretary problem and the game of googol, as well as its multiple combinatorial extensions such as $(J,K)$-secretary, $2$-sided game of googol, ordinal-competitive matroid secretary. A natural approach to these tasks is to use ordinal algorithms that at each step only consider relative ranking among the arrived elements, without looking at the numerical values of the input. We formally study the question of how cardinal algorithms can improve upon ordinal algorithms.
We first give a universal construction of the input distribution for any ordinal online problem, such that the advantage of any cardinal algorithm over the ordinal algorithms is at most $1+\varepsilon$ for arbitrarily small $\varepsilon> 0$. As an implication, previous lower bounds for the aforementioned variants of secretary problems hold not only against ordinal algorithms, but also against any online algorithm. However, the value range of the input elements in our construction is huge: $N=O\left(\frac{n^3\cdot n!\cdot n!}{\varepsilon}\right)\uparrow\uparrow(n-1)$ (tower of exponents) for an input sequence of length $n$. As a second result, we identify a class of natural ordinal problems and find a cardinal algorithm with a matching advantage of $1+ \Omega\left(\frac{1}{\log^{(c)}N}\right),$ where $\log^{(c)}N=\log\ldots\log N$ with $c$ iterative logs and $c$ is an arbitrary constant. Further, we introduce the cardinal complexity for any given ordinal online task: the minimum size $N(\varepsilon)$ of different numerical values in the input such that the advantage of cardinal over ordinal algorithms is at most $1+\varepsilon$. As a third result, we show that the game of googol has much lower cardinal complexity of $N=O\left(\left(\frac{n}{\varepsilon}\right)^n\right)$.
Submitted 11 October, 2023; v1 submitted 4 April, 2022;
originally announced April 2022.
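As a concrete picture of what "ordinal algorithm" means in the entry above, here is a minimal sketch of the textbook 1/e stopping rule for the classic secretary problem. It is not taken from the paper; the function name `secretary_pick` is hypothetical. The point is that the rule only ever compares arrived elements with each other and never uses their numerical magnitudes.

```python
import math
from typing import Optional, Sequence


def secretary_pick(values: Sequence[float]) -> Optional[int]:
    """Return the index chosen by the classic 1/e rule (illustrative sketch).

    Only pairwise comparisons between arrived elements are used, so the
    rule is ordinal in the sense described in the abstract.
    """
    n = len(values)
    if n == 0:
        return None
    k = int(n / math.e)  # length of the observation-only phase
    best_seen: Optional[int] = None
    # Observation phase: remember the best element, accept nothing.
    for i in range(k):
        if best_seen is None or values[i] > values[best_seen]:
            best_seen = i
    # Selection phase: accept the first element that beats the benchmark.
    for i in range(k, n):
        if best_seen is None or values[i] > values[best_seen]:
            return i
    # Nothing beat the benchmark, so the last element is taken by default.
    return n - 1
```

Applying any strictly increasing transformation to `values` leaves the chosen index unchanged, which is exactly the property a cardinal algorithm is free to give up.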
-
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition
Authors:
Kenichi Kumatani,
Robert Gmyr,
Felipe Cruz Salinas,
Linquan Liu,
Wei Zuo,
Devang Patel,
Eric Sun,
Yu Shi
Abstract:
The sparsely-gated Mixture of Experts (MoE) can increase network capacity with little additional computational complexity. In this work, we investigate how multi-lingual Automatic Speech Recognition (ASR) networks can be scaled up with a simple routing algorithm in order to achieve better accuracy. More specifically, we apply the sparsely-gated MoE technique to two types of networks: Sequence-to-Sequence Transformer (S2S-T) and Transformer Transducer (T-T). We demonstrate through a set of ASR experiments on multiple language data that the MoE networks can reduce the relative word error rates by 16.3% and 4.6% with the S2S-T and T-T, respectively. Moreover, we thoroughly investigate the effect of the MoE on the T-T architecture in various conditions: streaming mode, non-streaming mode, the use of language ID and the label decoder with the MoE.
Submitted 4 January, 2022; v1 submitted 10 December, 2021;
originally announced December 2021.
-
Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Authors:
Rimita Lahiri,
Kenichi Kumatani,
Eric Sun,
Yao Qian
Abstract:
Multilingual end-to-end (E2E) models have shown great potential for expanding language coverage in automatic speech recognition (ASR). In this paper, we aim to enhance multilingual ASR performance in two ways: 1) studying the impact of feeding a one-hot vector identifying the language, and 2) formulating the task with a meta-learning objective combined with self-supervised learning (SSL). We associate every language with a distinct task manifold and attempt to improve the performance by transferring knowledge across the learning processes themselves, as compared to transferring through final model parameters. We employ this strategy on a dataset comprising 6 languages for an in-domain ASR task, by minimizing an objective related to expected gradient path length. Experimental results reveal the best pre-training strategy resulting in 3.55% relative reduction in overall WER. A combination of LEAP and SSL yields 3.51% relative reduction in overall WER when using language ID.
Submitted 15 October, 2021;
originally announced October 2021.
-
Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian
Authors:
Baiyu Peng,
Jingliang Duan,
Jianyu Chen,
Shengbo Eben Li,
Genjin Xie,
Congsheng Zhang,
Yang Guan,
Yao Mu,
Enxin Sun
Abstract:
Safety is essential for reinforcement learning (RL) applied in the real world. Adding chance constraints (or probabilistic constraints) is a suitable way to enhance RL safety under uncertainty. Existing chance-constrained RL methods like the penalty methods and the Lagrangian methods either exhibit periodic oscillations or learn an over-conservative or unsafe policy. In this paper, we address these shortcomings by proposing a separated proportional-integral Lagrangian (SPIL) algorithm. We first review the constrained policy optimization process from a feedback control perspective, which regards the penalty weight as the control input and the safe probability as the control output. Based on this, the penalty method is formulated as a proportional controller, and the Lagrangian method is formulated as an integral controller. We then unify them and present a proportional-integral Lagrangian method to get both their merits, with an integral separation technique to limit the integral value in a reasonable range. To accelerate training, the gradient of safe probability is computed in a model-based manner. We demonstrate our method can reduce the oscillations and conservatism of RL policy in a car-following simulation. To prove its practicality, we also apply our method to a real-world mobile robot navigation task, where our robot successfully avoids a moving obstacle with highly uncertain or even aggressive behaviors.
Submitted 26 August, 2021;
originally announced August 2021.
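The feedback-control view in the SPIL entry above (penalty weight as control input, safe probability as control output) can be sketched as a small proportional-integral update. This is an illustrative sketch only, not the authors' implementation: the class name, the gains, the separation threshold and the clamp are all assumed values, and the integral-separation rule shown (freeze the integral while the error is large) is one simple form that may differ from the paper's.

```python
from dataclasses import dataclass


@dataclass
class PIPenaltyController:
    """Illustrative PI update of a penalty weight (all names and gains assumed)."""

    kp: float = 1.0              # proportional gain (assumed value)
    ki: float = 0.1              # integral gain (assumed value)
    sep_threshold: float = 0.2   # integrate only while |error| is below this (assumed)
    integral_max: float = 10.0   # upper clamp on the accumulated integral (assumed)
    integral: float = 0.0

    def update(self, safe_prob: float, target_prob: float) -> float:
        # Error is positive when the policy is less safe than required.
        error = target_prob - safe_prob
        # Integral separation: skip accumulation while the error is large,
        # so the integral term does not wind up far from the constraint.
        if abs(error) < self.sep_threshold:
            self.integral = min(max(self.integral + error, 0.0), self.integral_max)
        # Proportional term plus integral term; the weight stays non-negative.
        return max(self.kp * error + self.ki * self.integral, 0.0)
```

Setting `ki = 0` leaves a purely proportional (penalty-like) update, and setting `kp = 0` leaves a purely integral (Lagrangian-like) one, mirroring the correspondence the abstract describes.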
-
A Configurable Multilingual Model is All You Need to Recognize All Languages
Authors:
Long Zhou,
Jinyu Li,
Eric Sun,
Shujie Liu
Abstract:
Multilingual automatic speech recognition (ASR) models have shown great promise in recent years because of the simplified model training and deployment process. Conventional methods either train a universal multilingual model without taking any language information or with a 1-hot language ID (LID) vector to guide the recognition of the target language. In practice, the user can be prompted to pre-select several languages he/she can speak. The multilingual model without LID cannot make good use of the language information set by the user, while the multilingual model with LID can only handle one pre-selected language. In this paper, we propose a novel configurable multilingual model (CMM) which is trained only once but can be configured as different models based on users' choices by extracting language-specific modules together with a universal model from the trained CMM. In particular, a single CMM can be deployed to any user scenario where the users can pre-select any combination of languages. Trained with 75K hours of transcribed anonymized Microsoft multilingual data and evaluated with 10-language test sets, the proposed CMM improves over the universal multilingual model by 26.0%, 16.9%, and 10.4% relative word error reduction when the user selects 1, 2, or 3 languages, respectively. CMM also performs significantly better on code-switching test sets.
Submitted 13 July, 2021;
originally announced July 2021.
-
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Authors:
Zhong Meng,
Yu Wu,
Naoyuki Kanda,
Liang Lu,
Xie Chen,
Guoli Ye,
Eric Sun,
Jinyu Li,
Yifan Gong
Abstract:
Integrating external language models (LMs) into end-to-end (E2E) models remains a challenging task for domain-adaptive speech recognition. Recently, internal language model estimation (ILME)-based LM fusion has shown significant word error rate (WER) reduction from Shallow Fusion by subtracting a weighted internal LM score from an interpolation of E2E model and external LM scores during beam search. However, on different test sets, the optimal LM interpolation weights vary over a wide range and have to be tuned extensively on well-matched validation sets. In this work, we perform LM fusion in the minimum WER (MWER) training of an E2E model to obviate the need for LM weight tuning during inference. Besides MWER training with Shallow Fusion (MWER-SF), we propose a novel MWER training with ILME (MWER-ILME) where the ILME-based fusion is conducted to generate N-best hypotheses and their posteriors. An additional gradient is induced when the internal LM is engaged in MWER-ILME loss computation. During inference, LM weights pre-determined in MWER training enable robust LM integrations on test sets from different domains. Experimented with 30K-hour trained transformer transducers, MWER-ILME achieves on average 8.8% and 5.8% relative WER reductions from MWER and MWER-SF training, respectively, on 6 different test sets.
Submitted 4 June, 2021;
originally announced June 2021.
-
Classifying variety of customer's online engagement for churn prediction with mixed-penalty logistic regression
Authors:
Petra Posedel Šimović,
Davor Horvatic,
Edward W. Sun
Abstract:
Using big data to analyze consumer behavior can provide effective decision-making tools for preventing customer attrition (churn) in customer relationship management (CRM). Focusing on a CRM dataset with several different categories of factors that impact customer heterogeneity (i.e., usage of self-care service channels, duration of service, and responsiveness to marketing actions), we provide new predictive analytics of customer churn rate based on a machine learning method that enhances the classification of logistic regression by adding a mixed penalty term. The proposed penalized logistic regression can prevent overfitting when dealing with big data and minimize the loss function when balancing the cost from the median (absolute value) and mean (squared value) regularization. We show the analytical properties of the proposed method and its computational advantage in this research. In addition, we investigate the performance of the proposed method with a CRM data set (that has a large number of features) under different settings by efficiently eliminating the disturbance of (1) least important features and (2) sensitivity from the minority (churn) class. Our empirical results confirm the expected performance of the proposed method in full compliance with the common classification criteria (i.e., accuracy, precision, and recall) for evaluating machine learning methods.
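The mixed penalty described above, balancing an absolute-value term against a squared term, can be sketched as follows. This is a minimal illustration only, not the authors' formulation: the function name, the parameters `lam` (overall penalty strength) and `alpha` (mixing weight), and the elastic-net-style parametrization are all assumptions made for the example.

```python
import math


def mixed_penalty_logistic_loss(weights, features, labels, lam=0.1, alpha=0.5):
    """Mean logistic loss plus a mixed absolute/squared penalty.

    Hypothetical parametrization: lam scales the whole penalty and
    alpha in [0, 1] trades the absolute-value (L1) term against the
    squared (L2) term. Labels are expected to be 0 or 1.
    """
    total = 0.0
    for x, y in zip(features, labels):
        # Linear score for this example.
        z = sum(w * xi for w, xi in zip(weights, x))
        # Signed margin: positive when the score agrees with the label.
        margin = z if y == 1 else -z
        # Per-example logistic loss log(1 + exp(-margin)).
        # Note: not guarded against overflow for very negative margins.
        total += math.log1p(math.exp(-margin))
    data_loss = total / len(labels)

    l1 = sum(abs(w) for w in weights)   # absolute-value term
    l2 = sum(w * w for w in weights)    # squared term
    return data_loss + lam * (alpha * l1 + (1.0 - alpha) * l2)
```

Setting `alpha=1.0` leaves only the absolute-value term and `alpha=0.0` leaves only the squared term, so intermediate values interpolate between the two kinds of regularization.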
Submitted 13 July, 2021; v1 submitted 17 May, 2021;
originally announced May 2021.
-
D2S: Document-to-Slide Generation Via Query-Based Text Summarization
Authors:
Edward Sun,
Yufang Hou,
Dakuo Wang,
Yunfeng Zhang,
Nancy X. R. Wang
Abstract:
Presentations are critical for communication in all areas of our lives, yet the creation of slide decks is often tedious and time-consuming. There has been limited research aiming to automate the document-to-slides generation process, and all of it faces a critical challenge: no publicly available dataset for training and benchmarking. In this work, we first contribute a new dataset, SciDuet, consisting of pairs of papers and their corresponding slide decks from recent years' NLP and ML conferences (e.g., ACL). Secondly, we present D2S, a novel system that tackles the document-to-slides task with a two-step approach: 1) Use slide titles to retrieve relevant and engaging text, figures, and tables; 2) Summarize the retrieved context into bullet points with long-form question answering. Our evaluation suggests that long-form QA outperforms state-of-the-art summarization baselines on both automated ROUGE metrics and qualitative human evaluation.
Submitted 8 May, 2021;
originally announced May 2021.
-
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Authors:
Zhong Meng,
Naoyuki Kanda,
Yashesh Gaur,
Sarangarajan Parthasarathy,
Eric Sun,
Liang Lu,
Xie Chen,
Jinyu Li,
Yifan Gong
Abstract:
The efficacy of external language model (LM) integration with existing end-to-end (E2E) automatic speech recognition (ASR) systems can be improved significantly using the internal language model estimation (ILME) method. In this method, the internal LM score is subtracted from the score obtained by interpolating the E2E score with the external LM score, during inference. To improve the ILME-based inference, we propose an internal LM training (ILMT) method to minimize an additional internal LM loss by updating only the E2E model components that affect the internal LM estimation. ILMT encourages the E2E model to form a standalone LM inside its existing components, without sacrificing ASR accuracy. After ILMT, the more modular E2E model with matched training and inference criteria enables a more thorough elimination of the source-domain internal LM, and therefore leads to a more effective integration of the target-domain external LM. Experimented with 30K-hour trained recurrent neural network transducer and attention-based encoder-decoder models, ILMT with ILME-based inference achieves up to 31.5% and 11.4% relative word error rate reductions from standard E2E training with Shallow Fusion on out-of-domain LibriSpeech and in-domain Microsoft production test sets, respectively.
Submitted 22 April, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Authors:
Zhong Meng,
Sarangarajan Parthasarathy,
Eric Sun,
Yashesh Gaur,
Naoyuki Kanda,
Liang Lu,
Xie Chen,
Rui Zhao,
Jinyu Li,
Yifan Gong
Abstract:
The external language models (LM) integration remains a challenging task for end-to-end (E2E) automatic speech recognition (ASR) which has no clear division between acoustic and language models. In this work, we propose an internal LM estimation (ILME) method to facilitate a more effective integration of the external LM with all pre-existing E2E models with no additional model training, including the most popular recurrent neural network transducer (RNN-T) and attention-based encoder-decoder (AED) models. Trained with audio-transcript pairs, an E2E model implicitly learns an internal LM that characterizes the training data in the source domain. With ILME, the internal LM scores of an E2E model are estimated and subtracted from the log-linear interpolation between the scores of the E2E model and the external LM. The internal LM scores are approximated as the output of an E2E model when eliminating its acoustic components. ILME can alleviate the domain mismatch between training and testing, or improve the multi-domain E2E ASR. Experimented with 30K-hour trained RNN-T and AED models, ILME achieves up to 15.5% and 6.8% relative word error rate reductions from Shallow Fusion on out-of-domain LibriSpeech and in-domain Microsoft production test sets, respectively.
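As a rough illustration of the scoring rule described above, the ILME-based decoding criterion can be written as a log-linear combination in which the estimated internal LM score is subtracted. The symbols here are assumptions chosen for this sketch and do not come from the abstract: $X$ is the input audio, $Y$ a candidate transcript, $\theta_{\text{E2E}}$ and $\theta_{\text{LM}}$ the E2E and external LM parameters, and $\lambda_T$, $\lambda_I$ the external and internal LM weights.

$$
\hat{Y} = \arg\max_{Y} \Big[ \log P(Y \mid X; \theta_{\text{E2E}}) + \lambda_T \log P(Y; \theta_{\text{LM}}) - \lambda_I \log P(Y; \theta_{\text{E2E}}) \Big]
$$

Here $P(Y; \theta_{\text{E2E}})$ stands for the internal LM estimate, i.e. the output of the E2E model with its acoustic components eliminated. Setting $\lambda_I = 0$ reduces the expression to Shallow Fusion, the baseline the abstract compares against.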
Submitted 3 November, 2020;
originally announced November 2020.
-
Generalized Sorting with Predictions
Authors:
Pinyan Lu,
Xuandi Ren,
Enze Sun,
Yubo Zhang
Abstract:
The generalized sorting problem, also known as sorting with forbidden comparisons, was first introduced by Huang et al. together with a randomized algorithm which requires $\tilde O(n^{3/2})$ probes. We study this problem with additional predictions for all pairs of allowed comparisons as input. We propose a randomized algorithm which uses $O(n \log n+w)$ probes with high probability and a deterministic algorithm which uses $O(nw)$ probes, where $w$ is the number of mistakes made by the predictions.
Submitted 30 October, 2020;
originally announced November 2020.
-
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model
Authors:
Jinyu Li,
Rui Zhao,
Eric Sun,
Jeremy H. M. Wong,
Amit Das,
Zhong Meng,
Yifan Gong
Abstract:
While the community keeps promoting end-to-end models over conventional hybrid models, which usually are long short-term memory (LSTM) models trained with a cross entropy criterion followed by a sequence discriminative training criterion, we argue that such conventional hybrid models can still be significantly improved. In this paper, we detail our recent efforts to improve conventional hybrid LSTM acoustic models for high-accuracy and low-latency automatic speech recognition. To achieve high accuracy, we use a contextual layer trajectory LSTM (cltLSTM), which decouples the temporal modeling and target classification tasks, and incorporates future context frames to get more information for accurate acoustic modeling. We further improve the training strategy with sequence-level teacher-student learning. To obtain low latency, we design a two-head cltLSTM, in which one head has zero latency and the other head has a small latency, compared to an LSTM. When trained with Microsoft's 65 thousand hours of anonymized training data and evaluated with test sets with 1.8 million words, the proposed two-head cltLSTM model with the proposed training strategy yields a 28.2% relative WER reduction over the conventional LSTM acoustic model, with a similar perceived latency.
Submitted 16 March, 2020;
originally announced March 2020.
-
Optimal control of aging in complex networks
Authors:
Eric D. Sun,
Thomas C. T. Michaels,
L. Mahadevan
Abstract:
Many complex systems experience damage accumulation which leads to aging, manifest as an increasing probability of system collapse with time. This naturally raises the question of how to maximize health and longevity in an aging system at minimal cost of maintenance and intervention. Here, we pose this question in the context of a simple interdependent network model of aging in complex systems, and use both optimal control theory and reinforcement learning alongside a combination of analysis and simulation to determine optimal maintenance protocols. These protocols may motivate the rational design of strategies for promoting longevity in aging complex systems with potential applications in therapeutic schedules and engineered system maintenance.
Submitted 22 October, 2019;
originally announced October 2019.
-
Self-Teaching Networks
Authors:
Liang Lu,
Eric Sun,
Yifan Gong
Abstract:
We propose self-teaching networks to improve the generalization capacity of deep neural networks. The idea is to generate soft supervision labels using the output layer for training the lower layers of the network. During the network training, we seek an auxiliary loss that drives the lower layer to mimic the behavior of the output layer. The connection between the two network layers through the auxiliary loss can help the gradient flow, which works similar to the residual networks. Furthermore, the auxiliary loss also works as a regularizer, which improves the generalization capacity of the network. We evaluated the self-teaching network with deep recurrent neural networks on speech recognition tasks, where we trained the acoustic model using 30 thousand hours of data. We tested the acoustic model using data collected from 4 scenarios. We show that the self-teaching network can achieve consistent improvements and outperform existing methods such as label smoothing and confidence penalization.
Submitted 9 September, 2019;
originally announced September 2019.
-
ImageNet-trained deep neural network exhibits illusion-like response to the Scintillating Grid
Authors:
Eric D. Sun,
Ron Dekel
Abstract:
Deep neural network (DNN) models for computer vision are now capable of human-level object recognition. Consequently, similarities in the performance and vulnerabilities of DNN and human vision are of great interest. Here we characterize the response of the VGG-19 DNN to images of the Scintillating Grid visual illusion, in which white dots are perceived to be partially black. We observed a significant deviation from the expected monotonic relation between VGG-19 representational dissimilarity and dot whiteness in the Scintillating Grid. That is, a linear increase in dot whiteness leads to a non-linear increase and then, remarkably, a decrease (non-monotonicity) in representational dissimilarity. In control images, mostly monotonic relations between representational dissimilarity and dot whiteness were observed. Furthermore, the dot whiteness level corresponding to the maximal representational dissimilarity (i.e. onset of non-monotonic dissimilarity) matched closely with that corresponding to the onset of illusion perception in human observers. As such, the non-monotonic response in the DNN is a potential model correlate for human illusion perception.
Submitted 4 August, 2019; v1 submitted 21 July, 2019;
originally announced July 2019.
-
Temperature dependence of normalized sensitivity of Love wave sensor with unidirectional carbon fiber epoxy composite/Mn-doped 0.24PIN-0.46PMN-0.30PT ternary single crystal configuration
Authors:
Ziqing Luo,
Yujiao Ma,
Xiaopeng Wang,
Naixing Huang,
Xudong Qi,
Enwei Sun,
Rui Zhang,
Bin Yang,
Tianquan Lü,
Jian Liu,
Wenwu Cao
Abstract:
We have derived a general formula for sensitivity optimization of gravimetric sensors and use it to design a high precision and high sensitivity gravimetric sensor using unidirectional carbon fiber epoxy composite (CFEC) guiding layer on single crystal Mn-doped yPb(In1/2Nb1/2)O3-(1-x-y)Pb(Mg1/3Nb2/3)O3-xPbTiO3 (Mn: PIN-PMN-PT) piezoelectric substrate. The normalized maximum sensitivity exhibits a decreasing tendency with temperature up to 55 degrees Celsius. For the CFEC-on-Mn: PIN-PMN-PT sensor configuration with wavelength 24 μm at 25 degrees Celsius, the maximum sensitivity can reach as high as 760.88 cm²/g, which is nearly twice that of traditional SiO2/ST quartz configuration gravimetric sensor.
Submitted 24 March, 2019;
originally announced June 2019.
-
Machine-to-Machine (M2M) Communications in Software-defined and Virtualized Cellular Networks
Authors:
Meng Li,
F. Richard Yu,
Pengbo Si,
Enchang Sun,
Yanhua Zhang,
Haipeng Yao
Abstract:
Machine-to-machine (M2M) communications have attracted great attention from both academia and industry. In this paper, with recent advances in wireless network virtualization and software-defined networking (SDN), we propose a novel framework for M2M communications in software-defined cellular networks with wireless network virtualization. In the proposed framework, according to different functions and quality of service (QoS) requirements of machine-type communication devices (MTCDs), a hypervisor enables the virtualization of the physical M2M network, which is abstracted and sliced into multiple virtual M2M networks. In addition, we develop a decision-theoretic approach to optimize the random access process of M2M communications. Furthermore, we develop a feedback and control loop to dynamically adjust the number of resource blocks (RBs) that are used in the random access phase in a virtual M2M network by the SDN controller. Extensive simulation results with different system parameters are presented to show the performance of the proposed scheme.
Submitted 26 November, 2016;
originally announced November 2016.
-
Software-defined and Virtualized Cellular Networks with M2M Communications
Authors:
Meng Li,
F. Richard Yu,
Pengbo Si,
Enchang Sun,
Yanhua Zhang
Abstract:
Machine-to-machine (M2M) communications have attracted great attention from both academia and industry. In this paper, with recent advances in wireless network virtualization and software-defined networking (SDN), we propose a novel framework for M2M communications in software-defined cellular networks with wireless network virtualization. In the proposed framework, according to different functions and quality of service (QoS) requirements of machine-type communication devices (MTCDs), a hypervisor enables the virtualization of the physical M2M network, which is abstracted and sliced into multiple virtual M2M networks. Moreover, we formulate a decision-theoretic approach to optimize the random access process of M2M communications. In addition, we develop a feedback and control loop to dynamically adjust the number of resource blocks (RBs) that are used in the random access phase in a virtual M2M network by the SDN controller. Extensive simulation results with different system parameters are presented to show the performance of the proposed scheme.
Submitted 15 November, 2016;
originally announced November 2016.
-
Machine to Machine (M2M) Communications in Virtualized Vehicular Ad Hoc Networks
Authors:
Meng Li,
F. Richard Yu,
Pengbo Si,
Enchang Sun,
Yanhua Zhang
Abstract:
With the growing interest in the use of internet of things (IoT), machine-to-machine (M2M) communications have become an important networking paradigm. In this paper, with recent advances in wireless network virtualization (WNV), we propose a novel framework for M2M communications in vehicular ad-hoc networks (VANETs) with WNV. In the proposed framework, according to different applications and quality of service (QoS) requirements of vehicles, a hypervisor enables the virtualization of the physical vehicular network, which is abstracted and sliced into multiple virtual networks. Moreover, the process of resource blocks (RBs) selection and random access in each virtual vehicular network is formulated as a partially observable Markov decision process (POMDP), which can achieve the maximum reward about transmission capacity. The optimal policy for RBs selection is derived by virtue of a dynamic programming approach. Extensive simulation results with different system parameters are presented to show the performance improvement of the proposed scheme.
Submitted 12 November, 2016;
originally announced November 2016.
-
Experimental study of coherent synchrotron radiation in the emittance exchange line at the A0-photoinjector
Authors:
Jayakar C. T. Thangaraj,
R. Thurman-Keup,
A. Johnson,
A. H. Lumpkin,
H. Edwards,
J. Ruan,
J. Santucci,
Y.-E. Sun,
M. Church,
P. Piot
Abstract:
Next generation accelerators will require a high current, low emittance beam with a low energy spread. Such accelerators will employ advanced beam conditioning systems such as emittance exchangers to manipulate high brightness beams. One of the goals of the Fermilab A0 photoinjector is to investigate the transverse to longitudinal emittance exchange principle. Coherent synchrotron radiation could limit high current operation of the emittance exchanger. In this paper, we report on the preliminary experimental and simulation study of the coherent synchrotron radiation (CSR) in the emittance exchange line at the A0 photoinjector.
Submitted 9 February, 2012;
originally announced February 2012.