Showing 1–2 of 2 results for author: Chhetri, A

  1. Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet

    Authors: Manish Dhakal, Arman Chhetri, Aman Kumar Gupta, Prabin Lamichhane, Suraj Pandey, Subarna Shakya

    Abstract: This paper presents an end-to-end deep learning model for Automatic Speech Recognition (ASR) that transcribes Nepali speech to text. The model was trained and tested on the OpenSLR (audio, text) dataset. Most of the audio in the dataset has silent gaps at both ends, which are clipped during dataset preprocessing for a more uniform mapping of audio frames to their corresponding texts. Mel Frequen…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted at 2022 International Conference on Inventive Computation Technologies (ICICT), IEEE

    Journal ref: 2022 International Conference on Inventive Computation Technologies (ICICT), pp. 515-521

  2. arXiv:1904.08971

    cs.SD cs.MM eess.AS

    On Acoustic Modeling for Broadband Beamforming

    Authors: Amit Chhetri, Mohamed Mansour, Wontak Kim, Guangdong Pan

    Abstract: In this work, we describe limitations of the free-field propagation model for designing broadband beamformers for microphone arrays on a rigid surface. Towards this goal, we describe a general framework for quantifying the microphone array performance in a general wave-field by directly solving the acoustic wave equation. The model utilizes Finite-Element-Method (FEM) for evaluating the response o…

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: 5 pages, conference

    MSC Class: 94A12; 94A40; 94A15

    ACM Class: H.1.2; H.5.1

    Journal ref: European Signal Processing Conference (EUSIPCO 2019)