Showing 1–4 of 4 results for author: Shah, N J

  1. arXiv:2403.15469  [pdf, other]

    cs.CL cs.LG eess.AS

    Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning

    Authors: Shivam Ratnakant Mhaskar, Nirmesh J. Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah

    Abstract: Traditional Automatic Video Dubbing (AVD) pipeline consists of three key modules, namely, Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), and Text-to-Speech (TTS). Within AVD pipelines, isometric-NMT algorithms are employed to regulate the length of the synthesized output text. This is done to guarantee synchronization with respect to the alignment of video and audio subseque…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted in NAACL 2024 Findings

  2. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  3. arXiv:1807.10831  [pdf]

    eess.IV cs.CV cs.LG

    MoCoNet: Motion Correction in 3D MPRAGE images using a Convolutional Neural Network approach

    Authors: Kamlesh Pawar, Zhaolin Chen, N. Jon Shah, Gary F. Egan

    Abstract: Purpose: The suppression of motion artefacts from MR images is a challenging task. The purpose of this paper is to develop a standalone novel technique to suppress motion artefacts from MR images using a data-driven deep learning approach. Methods: A deep learning convolutional neural network (CNN) was developed to remove motion artefacts in brain MR images. A CNN was trained on simulated motion c…

    Submitted 29 July, 2018; originally announced July 2018.

  4. arXiv:1511.04867   

    cs.SD

    Quality assessment of voice converted speech using articulatory features

    Authors: Avni Rajpal, Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil

    Abstract: We propose a novel application based on acoustic-to-articulatory inversion towards quality assessment of voice converted speech. The ability of humans to speak effortlessly requires coordinated movements of various articulators, muscles, etc. This effortless movement contributes towards naturalness, intelligibility and speaker's identity which is partially present in voice converted speech. Hence,…

    Submitted 23 November, 2015; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: The paper is withdrawn from arXiv. The author does not want circulation of unpublished, unverified results.