Skip to main content

Showing 1–5 of 5 results for author: Grover, M S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2111.15156  [pdf, other

    cs.CL cs.SD eess.AS

    Automated Speech Scoring System Under The Lens: Evaluating and interpreting the linguistic cues for language proficiency

    Authors: Pakhi Bamdev, Manraj Singh Grover, Yaman Kumar Singla, Payman Vafaee, Mika Hama, Rajiv Ratn Shah

    Abstract: English proficiency assessments have become a necessary metric for filtering and selecting prospective candidates for both academia and industry. With the rise in demand for such assessments, it has become increasingly necessary to have the automated human-interpretable results to prevent inconsistencies and ensure meaningful feedback to the second language learners. Feature-based classical approa… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: Accepted for publication in the International Journal of Artificial Intelligence in Education (IJAIED)

  2. arXiv:2006.05236  [pdf, other

    cs.SD cs.CL eess.AS

    audino: A Modern Annotation Tool for Audio and Speech

    Authors: Manraj Singh Grover, Pakhi Bamdev, Ratin Kumar Brala, Yaman Kumar, Mika Hama, Rajiv Ratn Shah

    Abstract: In this paper, we introduce a collaborative and modern annotation tool for audio and speech: audino. The tool allows annotators to define and describe temporal segmentation in audios. These segments can be labelled and transcribed easily using a dynamically generated form. An admin can centrally control user roles and project assignment through the admin dashboard. The dashboard also enables descr… ▽ More

    Submitted 28 November, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  3. arXiv:2005.08182  [pdf, other

    cs.CL cs.SD eess.AS

    Multi-modal Automated Speech Scoring using Attention Fusion

    Authors: Manraj Singh Grover, Yaman Kumar, Sumit Sarin, Payman Vafaee, Mika Hama, Rajiv Ratn Shah

    Abstract: In this study, we propose a novel multi-modal end-to-end neural approach for automated assessment of non-native English speakers' spontaneous speech using attention fusion. The pipeline employs Bi-directional Recurrent Convolutional Neural Networks and Bi-directional Long Short-Term Memory Neural Networks to encode acoustic and lexical cues from spectrograms and transcriptions, respectively. Atten… ▽ More

    Submitted 28 November, 2021; v1 submitted 17 May, 2020; originally announced May 2020.

  4. arXiv:1911.12152  [pdf, other

    eess.SP cs.LG

    Universal EEG Encoder for Learning Diverse Intelligent Tasks

    Authors: Baani Leen Kaur Jolly, Palash Aggrawal, Surabhi S Nath, Viresh Gupta, Manraj Singh Grover, Rajiv Ratn Shah

    Abstract: Brain Computer Interfaces (BCI) have become very popular with Electroencephalography (EEG) being one of the most commonly used signal acquisition techniques. A major challenge in BCI studies is the individualistic analysis required for each task. Thus, task-specific feature extraction and classification are performed, which fails to generalize to other tasks with similar time-series EEG input data… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

  5. arXiv:1911.11378  [pdf, other

    cs.LG cs.CV cs.MM eess.IV stat.ML

    Text2FaceGAN: Face Generation from Fine Grained Textual Descriptions

    Authors: Osaid Rehman Nasir, Shailesh Kumar Jha, Manraj Singh Grover, Yi Yu, Ajit Kumar, Rajiv Ratn Shah

    Abstract: Powerful generative adversarial networks (GAN) have been developed to automatically synthesize realistic images from text. However, most existing tasks are limited to generating simple images such as flowers from captions. In this work, we extend this problem to the less addressed domain of face generation from fine-grained textual descriptions of face, e.g., "A person has curly hair, oval face, a… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.