Skip to main content

Showing 1–9 of 9 results for author: Gaur, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.13491  [pdf, other

    physics.plasm-ph cs.LG physics.comp-ph

    Grad-Shafranov equilibria via data-free physics informed neural networks

    Authors: Byoungchan Jang, Alan A. Kaptanoglu, Rahul Gaur, Shaowu Pan, Matt Landreman, William Dorland

    Abstract: A large number of magnetohydrodynamic (MHD) equilibrium calculations are often required for uncertainty quantification, optimization, and real-time diagnostic information, making MHD equilibrium codes vital to the field of plasma physics. In this paper, we explore a method for solving the Grad-Shafranov equation by using Physics-Informed Neural Networks (PINNs). For PINNs, we optimize neural netwo… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  2. arXiv:2205.02475  [pdf, other

    cs.SD cs.CL eess.AS

    Speaker Recognition in the Wild

    Authors: Neeraj Chhimwal, Anirudh Gupta, Rishabh Gaur, Harveen Singh Chadha, Priyanshi Shah, Ankur Dhuriya, Vivek Raghavan

    Abstract: In this paper, we propose a pipeline to find the number of speakers, as well as audios belonging to each of these now identified speakers in a source of audio data where number of speakers or speaker labels are not known a priori. We used this approach as a part of our Data Preparation pipeline for Speech Recognition in Indic Languages (https://github.com/Open-Speech-EkStep/vakyansh-wav2vec2-exper… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: This paper was submitted to Interspeech 2022

  3. arXiv:2203.16825  [pdf, other

    cs.CL

    indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages

    Authors: Anirudh Gupta, Neeraj Chhimwal, Ankur Dhuriya, Rishabh Gaur, Priyanshi Shah, Harveen Singh Chadha, Vivek Raghavan

    Abstract: Automatic Speech Recognition (ASR) generates text which is most of the times devoid of any punctuation. Absence of punctuation is text can affect readability. Also, down stream NLP tasks such as sentiment analysis, machine translation, greatly benefit by having punctuation and sentence boundary information. We present an approach for automatic punctuation of text using a pretrained IndicBERT model… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: Submitted to InterSpeech 2022. arXiv admin note: text overlap with arXiv:2104.05055 by other authors

  4. arXiv:2203.16823  [pdf, other

    cs.CL cs.SD eess.AS

    Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition

    Authors: Anirudh Gupta, Rishabh Gaur, Ankur Dhuriya, Harveen Singh Chadha, Neeraj Chhimwal, Priyanshi Shah, Vivek Raghavan

    Abstract: In the recent years end to end (E2E) automatic speech recognition (ASR) systems have achieved promising results given sufficient resources. Even for languages where not a lot of labelled data is available, state of the art E2E ASR systems can be developed by pretraining on huge amounts of high resource languages and finetune on low resource languages. For a lot of low resource languages the curren… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: Submitted to InterSpeech 2022

  5. arXiv:2203.16601   

    cs.CL eess.AS

    Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?

    Authors: Priyanshi Shah, Harveen Singh Chadha, Anirudh Gupta, Ankur Dhuriya, Neeraj Chhimwal, Rishabh Gaur, Vivek Raghavan

    Abstract: We propose a new method for the calculation of error rates in Automatic Speech Recognition (ASR). This new metric is for languages that contain half characters and where the same character can be written in different forms. We implement our methodology in Hindi which is one of the main languages from Indic context and we think this approach is scalable to other similar languages containing a large… ▽ More

    Submitted 15 June, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Need to upgrade the content completely

  6. arXiv:2203.16595   

    cs.CL eess.AS

    Improving Speech Recognition for Indic Languages using Language Model

    Authors: Ankur Dhuriya, Harveen Singh Chadha, Anirudh Gupta, Priyanshi Shah, Neeraj Chhimwal, Rishabh Gaur, Vivek Raghavan

    Abstract: We study the effect of applying a language model (LM) on the output of Automatic Speech Recognition (ASR) systems for Indic languages. We fine-tune wav2vec $2.0$ models for $18$ Indic languages and adjust the results with language models trained on text derived from a variety of sources. Our findings demonstrate that the average Character Error Rate (CER) decreases by over $28$ \% and the average… ▽ More

    Submitted 15 June, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Need to upgrade the content completely

  7. arXiv:2203.16512  [pdf, other

    cs.CL eess.AS

    Vakyansh: ASR Toolkit for Low Resource Indic languages

    Authors: Harveen Singh Chadha, Anirudh Gupta, Priyanshi Shah, Neeraj Chhimwal, Ankur Dhuriya, Rishabh Gaur, Vivek Raghavan

    Abstract: We present Vakyansh, an end to end toolkit for Speech Recognition in Indic languages. India is home to almost 121 languages and around 125 crore speakers. Yet most of the languages are low resource in terms of data and pretrained models. Through Vakyansh, we introduce automatic data pipelines for data creation, model training, model evaluation and deployment. We create 14,000 hours of speech data… ▽ More

    Submitted 15 June, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  8. arXiv:2110.00878  [pdf, other

    quant-ph cs.CR

    Conditions for Advantageous Quantum Bitcoin Mining

    Authors: Robert R. Nerem, Daya R. Gaur

    Abstract: Our aim is to determine conditions for quantum computing technology to give rise to security risks associated with quantum Bitcoin mining. Specifically, we determine the speed and energy efficiency a quantum computer needs to offer an advantage over classical mining. We analyze the setting in which the Bitcoin network is entirely classical except for a single quantum miner who has small hash rate… ▽ More

    Submitted 2 October, 2021; originally announced October 2021.

    Comments: 16 pages, 2 figures

  9. arXiv:2107.07402  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    CLSRIL-23: Cross Lingual Speech Representations for Indic Languages

    Authors: Anirudh Gupta, Harveen Singh Chadha, Priyanshi Shah, Neeraj Chhimwal, Ankur Dhuriya, Rishabh Gaur, Vivek Raghavan

    Abstract: We present a CLSRIL-23, a self supervised learning based audio pre-trained model which learns cross lingual speech representations from raw audio across 23 Indic languages. It is built on top of wav2vec 2.0 which is solved by training a contrastive task over masked latent speech representations and jointly learns the quantization of latents shared across all languages. We compare the language wise… ▽ More

    Submitted 13 January, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: 7 pages, 2 figures