Showing 1–9 of 9 results for author: Bhandari, N

Searching in archive cs.
  1. arXiv:2402.07827  [pdf, other]

    cs.CL

    Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

    Authors: Ahmet Üstün, Viraat Aryabumi, Zheng-Xin Yong, Wei-Yin Ko, Daniel D'souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker

    Abstract: Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages. What does it take to broaden access to breakthroughs beyond first-class citizen languages? Our work introduces Aya, a massively multilingual generative language model that follows instructions in 101 languages of which over 50% are considered as lower-resourced. Aya outperforms mT0 and BLOOM…

    Submitted 12 February, 2024; originally announced February 2024.

  2. arXiv:2401.04144  [pdf, other]

    cs.LG cs.AI

    Robust Calibration For Improved Weather Prediction Under Distributional Shift

    Authors: Sankalp Gilda, Neel Bhandari, Wendy Mak, Andrea Panizza

    Abstract: In this paper, we present results on improving out-of-domain weather prediction and uncertainty estimation as part of the Shifts Challenge on Robustness and Uncertainty under Real-World Distributional Shift. We find that by leveraging a mixture of experts in conjunction with an advanced data augmentation technique borrowed from the computer vision domain, in conjunction with rob…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: Presented at the Bayesian Deep Learning workshop at NeurIPS 2021

  3. Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation

    Authors: Neel Bhandari, Pin-Yu Chen

    Abstract: Language models today provide high accuracy across a large number of downstream tasks. However, they remain susceptible to adversarial attacks, particularly against those where the adversarial examples maintain considerable similarity to the original text. Given the multilingual nature of text, the effectiveness of adversarial examples across translations and how machine translations can improve…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Published at International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023

  4. arXiv:2202.02958  [pdf]

    q-bio.GN cs.AI cs.LG

    A comprehensive survey on computational learning methods for analysis of gene expression data

    Authors: Nikita Bhandari, Rahee Walambe, Ketan Kotecha, Satyajeet Khare

    Abstract: Computational analysis methods including machine learning have a significant impact in the fields of genomics and medicine. High-throughput gene expression analysis methods such as microarray technology and RNA sequencing produce enormous amounts of data. Traditionally, statistical methods are used for comparative analysis of gene expression data. However, more complex analysis for classification…

    Submitted 27 September, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 43 pages, 8 figures, 5 tables

  5. arXiv:2105.07659  [pdf]

    q-bio.GN cs.LG

    Comparison of machine learning and deep learning techniques in promoter prediction across diverse species

    Authors: Nikita Bhandari, Satyajeet Khare, Rahee Walambe, Ketan Kotecha

    Abstract: Gene promoters are the key DNA regulatory elements positioned around the transcription start sites and are responsible for regulating the gene transcription process. Various alignment-based, signal-based and content-based approaches are reported for the prediction of promoters. However, since not all promoter sequences show explicit features, the prediction performance of these techniques is poor.…

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: 17 pages, 4 figures, 4 tables

    Journal ref: PeerJ Comput. Sci. 7:e365 (2021)

  6. Earnings-21: A Practical Benchmark for ASR in the Wild

    Authors: Miguel Del Rio, Natalie Delworth, Ryan Westerman, Michelle Huang, Nishchal Bhandari, Joseph Palakapilly, Quinten McNamara, Joshua Dong, Piotr Zelasko, Miguel Jette

    Abstract: Commonly used speech corpora inadequately challenge academic and commercial ASR systems. In particular, speech corpora lack metadata needed for detailed analysis and WER measurement. In response, we present Earnings-21, a 39-hour corpus of earnings calls containing entity-dense speech from nine different financial sectors. This corpus is intended to benchmark ASR systems in the wild with special a…

    Submitted 15 June, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to INTERSPEECH 2021. June 15 2021: Addressing the comments of reviewers and updating the results of our internal ESPNet model. The results do not change our conclusions. April 28th, 2021: We found and resolved an issue in our experimental evaluation that scored the LibriSpeech model at ~20% worse relative WER than the actual WER. The updated results do not affect our conclusions

  7. arXiv:2104.10747  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Accented Speech Recognition: A Survey

    Authors: Arthur Hinsvark, Natalie Delworth, Miguel Del Rio, Quinten McNamara, Joshua Dong, Ryan Westerman, Michelle Huang, Joseph Palakapilly, Jennifer Drexler, Ilya Pirkin, Nishchal Bhandari, Miguel Jette

    Abstract: Automatic Speech Recognition (ASR) systems generalize poorly on accented speech. The phonetic and linguistic variability of accents presents hard challenges for ASR systems today in both data collection and modeling strategies. The resulting bias in ASR performance across accents comes at a cost to both users and providers of ASR. We present a survey of current promising approaches to accented sp…

    Submitted 2 June, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  8. arXiv:2007.08032  [pdf, other]

    cs.CV cs.LG

    When and how CNNs generalize to out-of-distribution category-viewpoint combinations

    Authors: Spandan Madan, Timothy Henry, Jamell Dozier, Helen Ho, Nishchal Bhandari, Tomotake Sasaki, Frédo Durand, Hanspeter Pfister, Xavier Boix

    Abstract: Object recognition and viewpoint estimation lie at the heart of visual understanding. Recent works suggest that convolutional neural networks (CNNs) fail to generalize to out-of-distribution (OOD) category-viewpoint combinations, i.e., combinations not seen during training. In this paper, we investigate when and how such OOD generalization may be possible by evaluating CNNs trained to classify both…

    Submitted 17 November, 2021; v1 submitted 15 July, 2020; originally announced July 2020.

  9. arXiv:1204.3874  [pdf]

    cs.NI

    Overview of MC CDMA PAPR Reduction Techniques

    Authors: B. Sarala, D. S. Venkateswarulu, B. N. Bhandari

    Abstract: High Peak to Average Power Ratio (PAPR) of the transmitted signal is a critical problem in multicarrier modulation systems (MCM) such as Orthogonal Frequency Division Multiplexing (OFDM) and Multi-Carrier Code Division Multiple Access (MC CDMA) systems, due to the large number of subcarriers. High PAPR leads to reduced resolution and battery life. It also deteriorates system performance. This paper…

    Submitted 10 April, 2012; originally announced April 2012.

    Comments: 14 pages, 7 figures, IJDPS March 2012