Skip to main content

Showing 1–50 of 50 results for author: Sun, E

.
  1. arXiv:2407.04973  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts

    Authors: Yijia Xiao, Edward Sun, Tianyu Liu, Wei Wang

    Abstract: We propose LogicVista, an evaluation benchmark that assesses the integrated logical reasoning capabilities of multimodal large language models (MLLMs) in Visual contexts. Recent advancements in MLLMs have demonstrated various fascinating abilities, from crafting poetry based on an image to performing mathematical reasoning. However, there is still a lack of systematic evaluation of MLLMs' proficie… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: LogicVista benchmarks the logical reasoning of multimodal large language models in visual tasks

  2. arXiv:2405.02243  [pdf, other

    cs.RO

    Towards Improving Learning from Demonstration Algorithms via MCMC Methods

    Authors: Carl Qi, Edward Sun, Harry Zhang

    Abstract: Behavioral cloning, or more broadly, learning from demonstrations (LfD) is a priomising direction for robot policy learning in complex scenarios. Albeit being straightforward to implement and data-efficient, behavioral cloning has its own drawbacks, limiting its efficacy in real robot setups. In this work, we take one step towards improving learning from demonstration algorithms by leveraging impl… ▽ More

    Submitted 23 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2207.04638, arXiv:2204.03597 by other authors

  3. arXiv:2312.17673  [pdf, other

    cs.CR cs.AI cs.CL

    Jatmo: Prompt Injection Defense by Task-Specific Finetuning

    Authors: Julien Piet, Maha Alrashed, Chawin Sitawarin, Sizhe Chen, Zeming Wei, Elizabeth Sun, Basel Alomair, David Wagner

    Abstract: Large Language Models (LLMs) are attracting significant research attention due to their instruction-following abilities, allowing users and developers to leverage LLMs for a variety of tasks. However, LLMs are vulnerable to prompt-injection attacks: a class of attacks that hijack the model's instruction-following abilities, changing responses to prompts to undesired, possibly malicious ones. In th… ▽ More

    Submitted 8 January, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 24 pages, 6 figures

  4. arXiv:2311.05623  [pdf, other

    astro-ph.IM

    The 4m International Liquid Mirror Telescope: a brief history and some preliminary scientific results

    Authors: Jean Surdej, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Anna Pospieszalska-Surdej, Kumar Pranshu, Ethen Sun

    Abstract: The present article is based upon an invited talk delivered at the occasion of the inauguration of the 4m International Liquid Mirror Telescope (ILMT) which took place in Devasthal (ARIES, Uttarakhand, India) on 21st of March 2023. We present hereafter a short history of the liquid mirror telescopes and in particular of the 4m ILMT which is the first liquid mirror telescope entirely dedicated to a… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 14 pages, 21 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  5. arXiv:2311.05622  [pdf, other

    astro-ph.IM astro-ph.GA

    SunPhot: Preparations for an upcoming quasar variability survey with the International Liquid Mirror Telescope

    Authors: Ethen Sun, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Jean Surdej

    Abstract: Recent research suggests a correlation between the variability and intrinsic brightness of quasars. If calibrated, this could lead to the use of quasars on the cosmic distance ladder, but this work is currently limited by lack of quasar light curve data with high cadence and precision. The Python photometric data pipeline SunPhot is being developed as part of preparations for an upcoming quasar va… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  6. arXiv:2311.05621  [pdf, other

    astro-ph.GA astro-ph.IM

    Surface Brightness Properties of LSB Galaxies with the International Liquid Mirror Telescope

    Authors: Jiuyang Fu, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Low surface brightness (LSB) galaxies make up a significant fraction of the luminosity density of the local universe. Their low surface brightness suggests a different formation and evolution process compared to more-typical high-surface-brightness galaxies. This study presents an analysis of LSB galaxies found in images obtained by the International Liquid Mirror Telescope during the observation… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 6 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  7. arXiv:2311.05620  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR

    Survey of Variables with the ILMT

    Authors: Baldeep Grewal, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Paul Hickson, Kuntal Misra, Brajesh Kumar, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Nestled in the mountains of Northern India, is a 4-metre rotating dish of liquid mercury. Over a 10-year period, the International Liquid Mirror Telescope (ILMT) will survey 117 square degrees of sky, to study the astrometric and photometric variability of all detected objects. One of the scientific programs will be a survey of variable stars. The data gathered will be used to construct a comprehe… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  8. arXiv:2311.05619  [pdf, other

    astro-ph.IM astro-ph.CO

    Observation of mulitply imaged quasars with the 4-m ILMT

    Authors: Talat Akhunov, Bhavya Ailawadhi, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Anna Pospieszalska-Surdej, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Gravitationally lensed quasars (GLQs) are known to potentially provide an independent way of determining the value of the Hubble-Lemaître parameter $H_{0}$, to probe the dark matter content of lensing galaxies and to resolve tiny structures in distant active galactic nuclei. That is why multiply imaged quasars are one of the main drivers for a photometric monitoring with the 4-m International Liqu… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages, 3 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  9. arXiv:2311.05618  [pdf, other

    astro-ph.IM astro-ph.GA

    Follow-up strategy of ILMT discovered supernovae

    Authors: Brajesh Kumar, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The 4m International Liquid Mirror Telescope (ILMT) facility continuously scans the same sky strip ($\sim$22$^\prime$ wide) on each night with a fixed pointing towards the zenith direction. It is possible to detect hundreds of supernovae (SNe) each year by implementing an optimal image subtraction technique on consecutive night images. Prompt monitoring of ILMT-detected SNe is planned under the se… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  10. arXiv:2311.05617  [pdf, other

    astro-ph.IM

    Astrometric and photometric calibrators for the 4-m International Liquid Mirror Telescope

    Authors: Naveen Dukiya, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The International Liquid Mirror Telescope (ILMT) is a 4-meter class survey telescope. It achieved its first light on 29$^{\rm th}$ April 2022 and is now undergoing the commissioning phase. It scans the sky in a fixed \ang{;22;} wide strip centred at the declination of $+$\ang{29;21;41.4} and works in \emph{Time Delay Integration (TDI)} mode. We present a full catalog of sources in the ILMT strip d… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  11. arXiv:2311.05616  [pdf, other

    astro-ph.IM

    A year-long representation of the ILMT observations in different coordinate systems

    Authors: Monalisa Dubey, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Kuntal Misra, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The 4m International Liquid Mirror Telescope (ILMT) is the first optical survey telescope in India that performs zenithal observations of a 22$'$ wide strip of the sky. To determine the portion of the sky covered by the ILMT during the entire year, we represent the ILMT Field of View (FoV) in three different coordinate systems - galactic, ecliptic, and equatorial. We adopt a constant declination o… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 6 pages, 1 figure, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  12. arXiv:2311.05615  [pdf, other

    astro-ph.IM

    The 4m International Liquid Mirror Telescope project

    Authors: Jean Surdej, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Anna Pospieszalska-Surdej, Kumar Pranshu, Ethen Sun

    Abstract: The International Liquid Mirror Telescope (ILMT) project is a scientific collaboration in observational astrophysics between the Li{è}ge Institute of Astrophysics and Geophysics (Li{è}ge University, Belgium), the Aryabatta Research Institute of observational sciencES (ARIES, Nainital, India) and several Canadian universities (British Columbia, Laval, Montr{é}al, Toronto, Victoria and York). Meanwh… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  13. arXiv:2311.05614  [pdf, other

    astro-ph.IM

    Serendipitous Detection of Orbital Debris by the International Liquid Mirror Telescope: First Results

    Authors: Paul Hickson, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: Orbital debris presents a growing risk to space operations, and is becoming a significant source of contamination of astronomical images. Much of the debris population is uncatalogued, making the impact more difficult to assess. We present initial results from the first ten nights of commissioning observations with the International Liquid Mirror Telescope, in which images were examined for streak… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 6 pages, 1 figure, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  14. arXiv:2311.04718  [pdf, other

    astro-ph.IM astro-ph.EP

    Detection and Identification of Asteroids with the 4-m ILMT

    Authors: Anna Pospieszalska-Surdej, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: A very unique strength of the Devasthal Observatory is its capability of detecting optical transients with the 4-m International Liquid Mirror Telescope (ILMT) and to rapidly follow them up using the 1.3-m Devasthal Fast Optical Telescope (DFOT) and/or the 3.6-m Devasthal Optical Telescope (DOT), installed right next to it. In this context, we have inspected 20 fields observed during 9 consecutive… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  15. arXiv:2311.04717  [pdf, other

    astro-ph.IM

    Accessibility of the ILMT survey data

    Authors: Kuntal Misra, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The 4m International Liquid Mirror Telescope (ILMT) continuously scans a 22$'$ wide strip of the zenithal sky and records the images in three broadband filters (g', r' and i') using a 4K$\times$4K CCD camera. In about 10--12 hours of observations during a single night, $\sim$15 GB of data volume is generated. The raw images resulting from the observations in October--November 2022 have been pre-pr… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  16. arXiv:2311.04716  [pdf, other

    astro-ph.IM

    Automated transient detection in the context of the 4m ILMT

    Authors: Kumar Pranshu, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Ethen Sun, Jean Surdej

    Abstract: In the era of sky surveys like Palomar Transient Factory (PTF), Zwicky Transient Facility (ZTF) and the upcoming Vera Rubin Observatory (VRO) and ILMT, a plethora of image data will be available. ZTF scans the sky with a field of view of 48 deg$^{2}$ and VRO will have a FoV of 9.6 deg$^{2}$ but with a much larger aperture. The 4m ILMT covers a 22$'$ wide strip of the sky. Being a zenith telescope,… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 9 pages, 3 figures, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  17. arXiv:2311.04713  [pdf, other

    astro-ph.IM

    An automated photometric pipeline for the ILMT data

    Authors: Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Vibhore Negi, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The International Liquid Mirror Telescope (ILMT) is a 4-meter survey telescope continuously observing towards the zenith in the SDSS g', r', and i' bands. This survey telescope is designed to detect various astrophysical transients (for example, supernovae) and very faint objects like multiply-imaged quasars and low surface brightness galaxies. A single scan of a 22$'$ strip of sky contains a larg… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  18. arXiv:2311.04712  [pdf, other

    astro-ph.IM

    Necessity of a TDI optical corrector for ILMT observations

    Authors: Vibhore Negi, Bhavya Ailawadhi, Talat Akhunov, Ermanno Borra, Monalisa Dubey, Naveen Dukiya, Jiuyang Fu, Baldeep Grewal, Paul Hickson, Brajesh Kumar, Kuntal Misra, Kumar Pranshu, Ethen Sun, Jean Surdej

    Abstract: The International Liquid Mirror Telescope (ILMT) has recently become operational at the Devasthal Observatory of ARIES, Nainital, India. The ILMT observes in the Time delay integration (TDI) mode where the images are formed by electronically step** the charges over the pixels of the CCD, along a column. Observations near the zenith impose certain constraints dependent on the latitude such as ima… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures, 1 table, accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

  19. arXiv:2308.06533  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Knowledge Distilled Ensemble Model for sEMG-based Silent Speech Interface

    Authors: Wenqiang Lai, Qihan Yang, Ye Mao, Endong Sun, Jiangnan Ye

    Abstract: Voice disorders affect millions of people worldwide. Surface electromyography-based Silent Speech Interfaces (sEMG-based SSIs) have been explored as a potential solution for decades. However, previous works were limited by small vocabularies and manually extracted features from raw data. To address these limitations, we propose a lightweight deep learning knowledge-distilled ensemble model for sEM… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 6 pages, 5 figures

  20. arXiv:2308.01839  [pdf, other

    q-bio.QM cs.CV q-bio.GN stat.AP stat.ML

    Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data

    Authors: Rong Ma, Eric D. Sun, David Donoho, James Zou

    Abstract: Single-cell data integration can provide a comprehensive molecular view of cells, and many algorithms have been developed to remove unwanted technical or biological variations and integrate heterogeneous single-cell datasets. Despite their wide usage, existing methods suffer from several fundamental limitations. In particular, we lack a rigorous statistical test for whether two high-dimensional si… ▽ More

    Submitted 29 February, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the National Academy of Sciences, 2024, 121(10) e2313719121

  21. arXiv:2307.16332  [pdf

    eess.AS

    Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text

    Authors: Eric Sun, **yu Li, Jian Xue, Yifan Gong

    Abstract: In end-to-end automatic speech recognition system, one of the difficulties for language expansion is the limited paired speech and text training data. In this paper, we propose a novel method to generate augmented samples with unpaired speech feature segments and text data for model pre-training, which has the advantage of low cost without using additional speech data. When mixing 20,000 hours aug… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  22. arXiv:2307.09377  [pdf, other

    cs.LG

    Data Cross-Segmentation for Improved Generalization in Reinforcement Learning Based Algorithmic Trading

    Authors: Vikram Duvvur, Aashay Mehta, Edward Sun, Bo Wu, Ken Yew Chan, Jeff Schneider

    Abstract: The use of machine learning in algorithmic trading systems is increasingly common. In a typical set-up, supervised learning is used to predict the future prices of assets, and those predictions drive a simple trading and execution strategy. This is quite effective when the predictions have sufficient signal, markets are liquid, and transaction costs are low. However, those conditions often do not… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  23. arXiv:2306.17241  [pdf, other

    cs.DS

    Improved Algorithms for Online Rent Minimization Problem Under Unit-Size Jobs

    Authors: Enze Sun, Zonghan Yang, Yuhao Zhang

    Abstract: We consider the Online Rent Minimization problem, where online jobs with release times, deadlines, and processing times must be scheduled on machines that can be rented for a fixed length period of $T$. The objective is to minimize the number of machine rents. This problem generalizes the Online Machine Minimization problem where machines can be rented for an infinite period, and both problems hav… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: To appear in the 31st Annual European Symposium on Algorithms (ESA 2023)

  24. arXiv:2306.16656  [pdf

    physics.med-ph

    Motion robust MR fingerprinting scan to image neonates with prenatal opioid exposure

    Authors: Dan Ma, Chaitra Badve, Jessie EP Sun, Siyuan Hu, Xiaofeng Wang, Yong Chen, Ameya Nayate, Michael Wien, Douglas Martin, Lynn T Singer, Jared C. Durieux, Chris Flask, Deanne Wilson Costello

    Abstract: Background: A noninvasive and sensitive imaging tool is needed to assess the fast-evolving baby brain. However, using MRI to study non-sedated babies faces roadblocks, including high scan failure rates due to subjects motion and the lack of quantitative measures for assessing potential developmental delays. This feasibility study explores whether MR Fingerprinting scans can provide motion-robust a… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  25. arXiv:2306.09360  [pdf, other

    nucl-ex hep-ex hep-ph nucl-th

    Strong Interaction Physics at the Luminosity Frontier with 22 GeV Electrons at Jefferson Lab

    Authors: A. Accardi, P. Achenbach, D. Adhikari, A. Afanasev, C. S. Akondi, N. Akopov, M. Albaladejo, H. Albataineh, M. Albrecht, B. Almeida-Zamora, M. Amaryan, D. Androić, W. Armstrong, D. S. Armstrong, M. Arratia, J. Arrington, A. Asaturyan, A. Austregesilo, H. Avagyan, T. Averett, C. Ayerbe Gayoso, A. Bacchetta, A. B. Balantekin, N. Baltzell, L. Barion , et al. (419 additional authors not shown)

    Abstract: This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron… ▽ More

    Submitted 24 August, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Updates to the list of authors; Preprint number changed from theory to experiment; Updates to sections 4 and 6, including additional figures

    Report number: JLAB-PHY-23-3840

  26. arXiv:2303.00786  [pdf

    cs.CL eess.AS

    Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

    Authors: Eric Sun, **yu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong

    Abstract: We propose gated language experts and curriculum training to enhance multilingual transformer transducer models without requiring language identification (LID) input from users during inference. Our method incorporates a gating mechanism and LID loss, enabling transformer experts to learn language-specific information. By combining gated transformer experts with shared transformer layers, we const… ▽ More

    Submitted 7 July, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  27. arXiv:2211.02809  [pdf, other

    cs.CL cs.SD eess.AS

    LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

    Authors: Peidong Wang, Eric Sun, Jian Xue, Yu Wu, Long Zhou, Yashesh Gaur, Shujie Liu, **yu Li

    Abstract: Automatic speech recognition (ASR) and speech translation (ST) can both use neural transducers as the model structure. It is thus possible to use a single transducer model to perform both tasks. In real-world applications, such joint ASR and ST models may need to be streaming and do not require source language identification (i.e. language-agnostic). In this paper, we propose LAMASSU, a streaming… ▽ More

    Submitted 19 October, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

    Comments: INTERSPEECH 2023

  28. arXiv:2211.02499  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

    Authors: Jian Xue, Peidong Wang, **yu Li, Eric Sun

    Abstract: In this paper, we introduce our work of building a Streaming Multilingual Speech Model (SM2), which can transcribe or translate multiple spoken languages into texts of the target language. The backbone of SM2 is Transformer Transducer, which has high streaming capability. Instead of human labeled speech translation (ST) data, SM2 models are trained using weakly supervised data generated by convert… ▽ More

    Submitted 5 July, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

  29. arXiv:2210.13711  [pdf, other

    stat.ML cs.LG q-bio.QM stat.AP stat.ME

    A Spectral Method for Assessing and Combining Multiple Data Visualizations

    Authors: Rong Ma, Eric D. Sun, James Zou

    Abstract: Dimension reduction and data visualization aim to project a high-dimensional dataset to a low-dimensional space while capturing the intrinsic structures in the data. It is an indispensable part of modern data science, and many dimensional reduction and visualization algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it critically important… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Under revision of Nature Communications

  30. arXiv:2210.06507  [pdf, ps, other

    cs.GT

    Better Approximation for Interdependent SOS Valuations

    Authors: Pinyan Lu, Enze Sun, Chenghan Zhou

    Abstract: Submodular over signal (SOS) defines a family of interesting functions for which there exist truthful mechanisms with constant approximation to the social welfare for agents with interdependent valuations. The best-known truthful auction is of $4$-approximation and a lower bound of 2 was proved. We propose a new and simple truthful mechanism to achieve an approximation ratio of 3.315.

    Submitted 12 October, 2022; originally announced October 2022.

  31. arXiv:2204.01418  [pdf, ps, other

    cs.DS

    Online Ordinal Problems: Optimality of Comparison-based Algorithms and their Cardinal Complexity

    Authors: Nick Gravin, Enze Sun, Zhihao Gavin Tang

    Abstract: We consider ordinal online problems, i.e., tasks that only require pairwise comparisons between elements of the input. A classic example is the secretary problem and the game of googol, as well as its multiple combinatorial extensions such as $(J,K)$-secretary, $2$-sided game of googol, ordinal-competitive matroid secretary. A natural approach to these tasks is to use ordinal algorithms that at ea… ▽ More

    Submitted 11 October, 2023; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: To appear at FOCS 2023. Abstract shortened to meet arXiv requirements

  32. arXiv:2112.05820  [pdf, other

    cs.CL cs.AI cs.LG eess.AS

    Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition

    Authors: Kenichi Kumatani, Robert Gmyr, Felipe Cruz Salinas, Linquan Liu, Wei Zuo, Devang Patel, Eric Sun, Yu Shi

    Abstract: The sparsely-gated Mixture of Experts (MoE) can magnify a network capacity with a little computational complexity. In this work, we investigate how multi-lingual Automatic Speech Recognition (ASR) networks can be scaled up with a simple routing algorithm in order to achieve better accuracy. More specifically, we apply the sparsely-gated MoE technique to two types of networks: Sequence-to-Sequence… ▽ More

    Submitted 4 January, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

  33. arXiv:2110.07909  [pdf, other

    cs.CL eess.AS

    Multilingual Speech Recognition using Knowledge Transfer across Learning Processes

    Authors: Rimita Lahiri, Kenichi Kumatani, Eric Sun, Yao Qian

    Abstract: Multilingual end-to-end(E2E) models have shown a great potential in the expansion of the language coverage in the realm of automatic speech recognition(ASR). In this paper, we aim to enhance the multilingual ASR performance in two ways, 1)studying the impact of feeding a one-hot vector identifying the language, 2)formulating the task with a meta-learning objective combined with self-supervised lea… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: 5 pages

  34. arXiv:2108.11623  [pdf, other

    cs.LG cs.RO eess.SY

    Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

    Authors: Baiyu Peng, **gliang Duan, Jianyu Chen, Shengbo Eben Li, Gen** Xie, Congsheng Zhang, Yang Guan, Yao Mu, Enxin Sun

    Abstract: Safety is essential for reinforcement learning (RL) applied in the real world. Adding chance constraints (or probabilistic constraints) is a suitable way to enhance RL safety under uncertainty. Existing chance-constrained RL methods like the penalty methods and the Lagrangian methods either exhibit periodic oscillations or learn an over-conservative or unsafe policy. In this paper, we address thes… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

  35. arXiv:2107.05876  [pdf, other

    eess.AS cs.CL cs.SD

    A Configurable Multilingual Model is All You Need to Recognize All Languages

    Authors: Long Zhou, **yu Li, Eric Sun, Shujie Liu

    Abstract: Multilingual automatic speech recognition (ASR) models have shown great promise in recent years because of the simplified model training and deployment process. Conventional methods either train a universal multilingual model without taking any language information or with a 1-hot language ID (LID) vector to guide the recognition of the target language. In practice, the user can be prompted to pre… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

  36. arXiv:2106.02302  [pdf, ps, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

    Authors: Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, **yu Li, Yifan Gong

    Abstract: Integrating external language models (LMs) into end-to-end (E2E) models remains a challenging task for domain-adaptive speech recognition. Recently, internal language model estimation (ILME)-based LM fusion has shown significant word error rate (WER) reduction from Shallow Fusion by subtracting a weighted internal LM score from an interpolation of E2E model and external LM scores during beam searc… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 5 pages, Interspeech 2021

    Journal ref: Interspeech 2021, Brno, Czech Republic

  37. arXiv:2105.07671   

    stat.ML cs.LG econ.EM

    Classifying variety of customer's online engagement for churn prediction with mixed-penalty logistic regression

    Authors: Petra Posedel Šimović, Davor Horvatic, Edward W. Sun

    Abstract: Using big data to analyze consumer behavior can provide effective decision-making tools for preventing customer attrition (churn) in customer relationship management (CRM). Focusing on a CRM dataset with several different categories of factors that impact customer heterogeneity (i.e., usage of self-care service channels, duration of service, and responsiveness to marketing actions), we provide new… ▽ More

    Submitted 13 July, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: This version is not sufficiently exhaustive; a wrong version of validation results has been released (using a wrong part of a dataset for validation)

  38. arXiv:2105.03664  [pdf, other

    cs.CL

    D2S: Document-to-Slide Generation Via Query-Based Text Summarization

    Authors: Edward Sun, Yufang Hou, Dakuo Wang, Yunfeng Zhang, Nancy X. R. Wang

    Abstract: Presentations are critical for communication in all areas of our lives, yet the creation of slide decks is often tedious and time-consuming. There has been limited research aiming to automate the document-to-slides generation process and all face a critical challenge: no publicly available dataset for training and benchmarking. In this work, we first contribute a new dataset, SciDuet, consisting o… ▽ More

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: accepted at NAACL 2021

  39. arXiv:2102.01380  [pdf, ps, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

    Authors: Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, **yu Li, Yifan Gong

    Abstract: The efficacy of external language model (LM) integration with existing end-to-end (E2E) automatic speech recognition (ASR) systems can be improved significantly using the internal language model estimation (ILME) method. In this method, the internal LM score is subtracted from the score obtained by interpolating the E2E score with the external LM score, during inference. To improve the ILME-based… ▽ More

    Submitted 22 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: 5 pages, ICASSP 2021

    Journal ref: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Canada

  40. arXiv:2011.01991  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

    Authors: Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, **yu Li, Yifan Gong

    Abstract: The external language models (LM) integration remains a challenging task for end-to-end (E2E) automatic speech recognition (ASR) which has no clear division between acoustic and language models. In this work, we propose an internal LM estimation (ILME) method to facilitate a more effective integration of the external LM with all pre-existing E2E models with no additional model training, including… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 8 pages, 2 figures, SLT 2021

    Journal ref: 2021 IEEE Spoken Language Technology Workshop (SLT)

  41. arXiv:2011.00172  [pdf, other

    cs.DS

    Generalized Sorting with Predictions

    Authors: Pinyan Lu, Xuandi Ren, Enze Sun, Yubo Zhang

    Abstract: Generalized sorting problem, also known as sorting with forbidden comparisons, was first introduced by Huang et al. together with a randomized algorithm which requires $\tilde O(n^{3/2})$ probes. We study this problem with additional predictions for all pairs of allowed comparisons as input. We propose a randomized algorithm which uses $O(n \log n+w)$ probes with high probability and a determinist… ▽ More

    Submitted 30 October, 2020; originally announced November 2020.

    Comments: To appear in SOSA 2021

    ACM Class: F.2.2

  42. arXiv:2003.07482  [pdf, other

    eess.AS cs.CL cs.SD

    High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model

    Authors: **yu Li, Rui Zhao, Eric Sun, Jeremy H. M. Wong, Amit Das, Zhong Meng, Yifan Gong

    Abstract: While the community keeps promoting end-to-end models over conventional hybrid models, which usually are long short-term memory (LSTM) models trained with a cross entropy criterion followed by a sequence discriminative training criterion, we argue that such conventional hybrid models can still be significantly improved. In this paper, we detail our recent efforts to improve conventional hybrid LST… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: Accepted by ICASSP 2020

  43. arXiv:1910.10002  [pdf, other

    q-bio.PE cond-mat.stat-mech physics.soc-ph

    Optimal control of aging in complex networks

    Authors: Eric D. Sun, Thomas C. T. Michaels, L. Mahadevan

    Abstract: Many complex systems experience damage accumulation which leads to aging, manifest as an increasing probability of system collapse with time. This naturally raises the question of how to maximize health and longevity in an aging system at minimal cost of maintenance and intervention. Here, we pose this question in the context of a simple interdependent network model of aging in complex systems, an… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

  44. arXiv:1909.04157  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Self-Teaching Networks

    Authors: Liang Lu, Eric Sun, Yifan Gong

    Abstract: We propose self-teaching networks to improve the generalization capacity of deep neural networks. The idea is to generate soft supervision labels using the output layer for training the lower layers of the network. During the network training, we seek an auxiliary loss that drives the lower layer to mimic the behavior of the output layer. The connection between the two network layers through the a… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 5 pages, Interspeech 2019

  45. arXiv:1907.09019  [pdf, other

    cs.CV cs.LG eess.IV q-bio.NC

    ImageNet-trained deep neural network exhibits illusion-like response to the Scintillating Grid

    Authors: Eric D. Sun, Ron Dekel

    Abstract: Deep neural network (DNN) models for computer vision are now capable of human-level object recognition. Consequently, similarities in the performance and vulnerabilities of DNN and human vision are of great interest. Here we characterize the response of the VGG-19 DNN to images of the Scintillating Grid visual illusion, in which white dots are perceived to be partially black. We observed a signifi… ▽ More

    Submitted 4 August, 2019; v1 submitted 21 July, 2019; originally announced July 2019.

    Comments: Supplementary material at end of document

  46. arXiv:1906.10653  [pdf

    physics.app-ph physics.ins-det

    Temperature dependence of normalized sensitivity of Love wave sensor with unidirectional carbon fiber epoxy composite/Mn-doped 0.24PIN-0.46PMN-0.30PT ternary single crystal configuration

    Authors: Ziqing Luo, Yujiao Ma, Xiaopeng Wang, Naixing Huang, Xudong Qi, Enwei Sun, Rui Zhang, Bin Yang, Tianquan Lü, Jian Liu, Wenwu Cao

    Abstract: We have derived a general formula for sensitivity optimization of gravimetric sensors and use it to design a high precision and high sensitivity gravimetric sensor using unidirectional carbon fiber epoxy composite (CFEC) guiding layer on single crystal Mn-doped yPb(In1/2Nb1/2)O3-(1-x-y)Pb(Mg1/3Nb2/3)O3-xPbTiO3 (Mn: PIN-PMN-PT) piezoelectric substrate. The normalized maximum sensitivity exhibits a… ▽ More

    Submitted 24 March, 2019; originally announced June 2019.

  47. arXiv:1611.08725  [pdf, ps, other

    cs.NI

    Machine-to-Machine (M2M) Communications in Software-defined and Virtualized Cellular Networks

    Authors: Meng Li, F. Richard Yu, Pengbo Si, Enchang Sun, Yanhua Zhang, Haipeng Yao

    Abstract: Machine-to-machine (M2M) communications have attracted great attention from both academia and industry. In this paper, with recent advances in wireless network virtualization and software-defined networking (SDN), we propose a novel framework for M2M communications in software-defined cellular networks with wireless network virtualization. In the proposed framework, according to different function… ▽ More

    Submitted 26 November, 2016; originally announced November 2016.

    Comments: arXiv admin note: text overlap with arXiv:1611.05087

  48. arXiv:1611.05087  [pdf, ps, other

    cs.NI

    Software-defined and Virtualized Cellular Networks with M2M Communications

    Authors: Meng Li, F. RichardYu, Pengbo Si, Enchang Sun, Yanhua Zhang

    Abstract: Machine-to-machine (M2M) communications have attracted great attention from both academia and industry. In this paper, with recent advances in wireless network virtualization and software-defined networking (SDN), we propose a novel framework for M2M communications in software-defined cellular networks with wireless network virtualization. In the proposed framework, according to different function… ▽ More

    Submitted 15 November, 2016; originally announced November 2016.

    Comments: arXiv admin note: text overlap with arXiv:1611.04017

  49. arXiv:1611.04017  [pdf, ps, other

    cs.NI

    Machine to Machine (M2M) Communications in Virtualized Vehicular Ad Hoc Networks

    Authors: Meng Li, F. Richard Yu, Pengbo Si, Enchang Sun, Yanhua Zhang

    Abstract: With the growing interest in the use of internet of things (IoT), machine-to-machine (M2M) communications have become an important networking paradigm. In this paper, with recent advances in wireless network virtualization (WNV), we propose a novel framework for M2M communications in vehicular ad-hoc networks (VANETs) with WNV. In the proposed framework, according to different applications and qua… ▽ More

    Submitted 12 November, 2016; originally announced November 2016.

  50. arXiv:1202.2107  [pdf

    physics.acc-ph

    Experimental study of coherent synchrotron radiation in the emittance exchange line at the A0-photoinjector

    Authors: Jayakar C. T. Thangaraj, R. Thurman-Keup, A. Johnson, A. H. Lumpkin, H. Edwards, J. Ruan, J. Santucci, Y. E. - Sun, M. Church, P. Piot

    Abstract: Next generation accelerators will require a high current, low emittance beam with a low energy spread. Such accelerators will employ advanced beam conditioning systems such as emittance exchangers to manipulate high brightness beams. One of the goals of the Fermilab A0 photoinjector is to investigate the transverse to longitudinal emittance exchange principle. Coherent synchrotron radiation could… ▽ More

    Submitted 9 February, 2012; originally announced February 2012.

    Comments: 4 pp. 14th Advanced Accelerator Concepts Workshop, 13-19 Jun 2010: Annapolis, Maryland

    Report number: FERMILAB-CONF-10-317-APC

    Journal ref: AIP Conf.Proc. 1299 (2010) 643-646