Showing 1–8 of 8 results for author: Mohan, S

  1. arXiv:2111.10734  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Deep Probability Estimation

    Authors: Sheng Liu, Aakash Kaku, Weicheng Zhu, Matan Leibovich, Sreyas Mohan, Boyang Yu, Haoxiang Huang, Laure Zanna, Narges Razavian, Jonathan Niles-Weed, Carlos Fernandez-Granda

    Abstract: Reliable probability estimation is of crucial importance in many real-world applications where there is inherent (aleatoric) uncertainty. Probability-estimation models are trained on observed outcomes (e.g. whether it has rained or not, or whether a patient has died or not), because the ground-truth probabilities of the events of interest are typically unknown. The problem is therefore analogous t…

    Submitted 11 October, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: SL, AK, WZ, ML, SM contributed equally to this work; 36 pages, 17 figures, 12 tables

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:13746-13781, 2022

  2. arXiv:2011.15045  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Unsupervised Deep Video Denoising

    Authors: Dev Yashpal Sheth, Sreyas Mohan, Joshua L. Vincent, Ramon Manzorro, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli, Carlos Fernandez-Granda

    Abstract: Deep convolutional neural networks (CNNs) for video denoising are typically trained with supervision, assuming the availability of clean videos. However, in many applications, such as microscopy, noiseless videos are not available. To address this, we propose an Unsupervised Deep Video Denoiser (UDVD), a CNN architecture designed to be trained exclusively with noisy data. The performance of UDVD i…

    Submitted 19 August, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: Dev and Sreyas contributed equally. To appear at 2021 IEEE/CVF International Conference on Computer Vision (ICCV). See https://sreyas-mohan.github.io/udvd/ for code and more results

  3. arXiv:2008.03096  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning

    Authors: Devang S Ram Mohan, Raphael Lenain, Lorenzo Foglianti, Tian Huey Teh, Marlene Staib, Alexandra Torresquintero, Jiameng Gao

    Abstract: Modern approaches to text to speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation. Interleaving the action of reading a character with that of synthesising audio reduces this latency. However, the order of this sequence of interleaved actions v…

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: To be published in Interspeech 2020. 5 pages, 4 figures

  4. arXiv:2002.04019  [pdf, other]

    cs.LG stat.ML

    Be Like Water: Robustness to Extraneous Variables Via Adaptive Feature Normalization

    Authors: Aakash Kaku, Sreyas Mohan, Avinash Parnandi, Heidi Schambra, Carlos Fernandez-Granda

    Abstract: Extraneous variables are variables that are irrelevant for a certain task, but heavily affect the distribution of the available data. In this work, we show that the presence of such variables can degrade the performance of deep-learning models. We study three datasets where there is a strong influence of known extraneous variables: classification of upper-body movements in stroke patients, annotat…

    Submitted 25 February, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: Aakash and Sreyas contributed equally

  5. arXiv:1911.07335  [pdf, other]

    cs.CL cs.LG stat.ML

    Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

    Authors: Haw-Shiuan Chang, Shankar Vembu, Sunil Mohan, Rheeya Uppaal, Andrew McCallum

    Abstract: Existing deep active learning algorithms achieve impressive sampling efficiency on natural language processing tasks. However, they exhibit several weaknesses in practice, including (a) inability to use uncertainty sampling with black-box models, (b) lack of robustness to labeling noise, and (c) lack of transparency. In response, we propose a transparent batch active sampling framework by estimati…

    Submitted 20 July, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: This is a pre-print of an article published in Springer Machine Learning journal. The final authenticated version is available online at: https://doi.org/10.1007/s10994-020-05897-1

  6. Real-time Person Re-identification at the Edge: A Mixed Precision Approach

    Authors: Mohammadreza Baharani, Shrey Mohan, Hamed Tabkhi

    Abstract: A critical part of multi-person multi-camera tracking is the person re-identification (re-ID) algorithm, which recognizes and retains identities of all detected unknown people throughout the video stream. Many re-ID algorithms today exemplify state-of-the-art results, but not much work has been done to explore the deployment of such algorithms for computation- and power-constrained real-time scenarios.…

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: This is a pre-print of an article published in International Conference on Image Analysis and Recognition (ICIAR 2019), Lecture Notes in Computer Science. The final authenticated version is available online at https://doi.org/10.1007/978-3-030-27272-2_3

    Journal ref: International Conference on Image Analysis and Recognition (ICIAR 2019), Lecture Notes in Computer Science

  7. arXiv:1906.05478  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Robust and interpretable blind image denoising via bias-free convolutional neural networks

    Authors: Sreyas Mohan, Zahra Kadkhodaie, Eero P. Simoncelli, Carlos Fernandez-Granda

    Abstract: Deep convolutional networks often append additive constant ("bias") terms to their convolution operations, enabling a richer repertoire of functional mappings. Biases are also used to facilitate training, by subtracting mean response over batches of training images (a component of "batch normalization"). Recent state-of-the-art blind denoising methods (e.g., DnCNN) seem to require these terms for…

    Submitted 8 February, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Published as conference paper in ICLR 2020

  8. arXiv:1906.00823  [pdf, other]

    cs.LG eess.SP stat.ML

    Data-driven Estimation of Sinusoid Frequencies

    Authors: Gautier Izacard, Sreyas Mohan, Carlos Fernandez-Granda

    Abstract: Frequency estimation is a fundamental problem in signal processing, with applications in radar imaging, underwater acoustics, seismic imaging, and spectroscopy. The goal is to estimate the frequency of each component in a multisinusoidal signal from a finite number of noisy samples. A recent machine-learning approach uses a neural network to output a learned representation with local maxima at the…

    Submitted 3 February, 2021; v1 submitted 3 June, 2019; originally announced June 2019.