Skip to main content

Showing 1–16 of 16 results for author: Shah, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.13008  [pdf, other

    cs.SD eess.AS

    Enhancing Generalization in Audio Deepfake Detection: A Neural Collapse based Sampling and Training Approach

    Authors: Mohammed Yousif, Jonat John Mathew, Huzaifa Pallan, Agamjeet Singh Padda, Syed Daniyal Shah, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

    Abstract: Generalization in audio deepfake detection presents a significant challenge, with models trained on specific datasets often struggling to detect deepfakes generated under varying conditions and unknown algorithms. While collectively training a model using diverse datasets can enhance its generalization ability, it comes with high computational costs. To address this, we propose a neural collapse-b… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  2. arXiv:2402.05983  [pdf, other

    eess.IV cs.LG physics.app-ph physics.ins-det

    Capability enhancement of the X-ray micro-tomography system via ML-assisted approaches

    Authors: Dhruvi Shah, Shruti Mehta, Ashish Agrawal, Shishir Purohit, Bhaskar Chaudhury

    Abstract: Ring artifacts in X-ray micro-CT images are one of the primary causes of concern in their accurate visual interpretation and quantitative analysis. The geometry of X-ray micro-CT scanners is similar to the medical CT machines, except the sample is rotated with a stationary source and detector. The ring artifacts are caused by a defect or non-linear responses in detector pixels during the MicroCT d… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  3. arXiv:2402.03390  [pdf, other

    eess.IV cs.AI cs.CV cs.NI

    PixelGen: Rethinking Embedded Camera Systems

    Authors: Kunjun Li, Manoj Gulati, Steven Waskito, Dhairya Shah, Shantanu Chakrabarty, Ambuj Varshney

    Abstract: Embedded camera systems are ubiquitous, representing the most widely deployed example of a wireless embedded system. They capture a representation of the world - the surroundings illuminated by visible or infrared light. Despite their widespread usage, the architecture of embedded camera systems has remained unchanged, which leads to limitations. They visualize only a tiny portion of the world. Ad… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  4. arXiv:2305.16491  [pdf, other

    cs.LG eess.SY stat.ML

    SAMoSSA: Multivariate Singular Spectrum Analysis with Stochastic Autoregressive Noise

    Authors: Abdullah Alomar, Munther Dahleh, Sean Mann, Devavrat Shah

    Abstract: The well-established practice of time series analysis involves estimating deterministic, non-stationary trend and seasonality components followed by learning the residual stochastic, stationary components. Recently, it has been shown that one can learn the deterministic non-stationary components accurately using multivariate Singular Spectrum Analysis (mSSA) in the absence of a correlated stationa… ▽ More

    Submitted 26 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  5. arXiv:2303.13243  [pdf, other

    eess.AS cs.SD

    Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition

    Authors: Kai Liu, Hailiang Xiong, Gangqiang Yang, Zhengfeng Du, Yewen Cao, Danyal Shah

    Abstract: As one of the major branches of automatic speech recognition, attention-based models greatly improves the feature representation ability of the model. In particular, the multi-head mechanism is employed in the attention, ho** to learn speech features of more aspects in different attention subspaces. For speech recognition of complex languages, on the one hand, a small head size will lead to an o… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  6. arXiv:2303.00983  [pdf, other

    cs.CV cs.GR eess.IV

    Using simulation to quantify the performance of automotive perception systems

    Authors: Zhenyi Liu, Devesh Shah, Alireza Rahimpour, Devesh Upadhyay, Joyce Farrell, Brian A Wandell

    Abstract: The design and evaluation of complex systems can benefit from a software simulation - sometimes called a digital twin. The simulation can be used to characterize system performance or to test its performance under conditions that are difficult to measure (e.g., nighttime for automotive perception systems). We describe the image system simulation software tools that we use to evaluate the performan… ▽ More

    Submitted 10 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  7. arXiv:2302.11768  [pdf, other

    eess.AS cs.SD

    A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

    Authors: Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis

    Abstract: In this study, we present an approach to train a single speech enhancement network that can perform both personalized and non-personalized speech enhancement. This is achieved by incorporating a frame-wise conditioning input that specifies the type of enhancement output. To improve the quality of the enhanced output and mitigate oversuppression, we experiment with re-weighting frames by the presen… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  8. arXiv:2211.09727  [pdf, other

    cond-mat.mtrl-sci cs.CV cs.LG eess.IV

    A Survey on Evaluation Metrics for Synthetic Material Micro-Structure Images from Generative Models

    Authors: Devesh Shah, Anirudh Suresh, Alemayehu Admasu, Devesh Upadhyay, Kalyanmoy Deb

    Abstract: The evaluation of synthetic micro-structure images is an emerging problem as machine learning and materials science research have evolved together. Typical state of the art methods in evaluating synthetic images from generative models have relied on the Fréchet Inception Distance. However, this and other similar methods, are limited in the materials domain due to both the unique features that char… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: Accepted in Neural Information Processing Systems (NeurIPS) 2022 Workshop on AI for Accelerated Materials Design (AI4Mat). Selected as spotlight paper for workshop

    ACM Class: I.2.m; J.2

  9. arXiv:2203.15916  [pdf, other

    q-bio.PE eess.SY math.OC stat.AP

    Current Implicit Policies May Not Eradicate COVID-19

    Authors: Ali Jadbabaie, Arnab Sarker, Devavrat Shah

    Abstract: Successful predictive modeling of epidemics requires an understanding of the implicit feedback control strategies which are implemented by populations to modulate the spread of contagion. While this task of capturing endogenous behavior can be achieved through intricate modeling assumptions, we find that a population's reaction to case counts can be described through a second order affine dynamica… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  10. arXiv:2202.11271  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints

    Authors: Dhruv Shah, Sergey Levine

    Abstract: Robotic navigation has been approached as a problem of 3D reconstruction and planning, as well as an end-to-end learning problem. However, long-range navigation requires both planning and reasoning about local traversability, as well as being able to utilize general knowledge about global geography, in the form of a roadmap, GPS, or other side information providing important cues. In this work, we… ▽ More

    Submitted 9 January, 2023; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: Best Systems Paper Finalist at XVII Robotics: Science and Systems (RSS 2022), New York City, USA. Project page https://sites.google.com/view/viking-release

  11. arXiv:2010.02400  [pdf, ps, other

    math.NA eess.IV physics.med-ph

    A Generalized Framework for Analytic Regularization of Uniform Cubic B-spline Displacement Fields

    Authors: Keyur D. Shah, James A. Shackleford, Nagarajan Kandasamy, Gregory C. Sharp

    Abstract: Image registration is an inherently ill-posed problem that lacks the constraints needed for a unique map** between voxels of the two images being registered. As such, one must regularize the registration to achieve physically meaningful transforms. The regularization penalty is usually a function of derivatives of the displacement-vector field, and can be calculated either analytically or numeri… ▽ More

    Submitted 5 April, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: 17 pages, 5 figures

    Journal ref: https://iopscience.iop.org/article/10.1088/2057-1976/abf9e6

  12. arXiv:2009.08645  [pdf

    cs.IT eess.SP

    Low Density Parity Check Code (LDPC Codes) Overview

    Authors: Saumya Borwankar, Dhruv Shah

    Abstract: This paper basically expresses the core fundamentals and brief overview of the research of R. G. GALLAGER [1] on Low-Density Parity-Check (LDPC) codes and various parameters related to LDPC codes like, encoding and decoding of LDPC codes, code rate, parity check matrix, tanner graph. We also discuss advantages and applications as well as the usage of LDPC codes in 5G technology. We have simulated… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

  13. arXiv:2009.08317  [pdf

    eess.SP

    Effect Of Weather Conditions On FSO Link

    Authors: Saumya Borwankar, Dhruv Shah

    Abstract: Free Space Optics (FSO) is a develo** technology for Line of Sight communication that uses light propagation in free space that provides various advantages like high bandwidth, high data rate, ease of installation, free licensing and secure communication. Thus, FSO is a develo** technology that can be used in numerous applications for Line of Sight Communication. But the diverse effects like a… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

  14. arXiv:1911.09645  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features

    Authors: Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto

    Abstract: This paper presents a simple yet effective method to achieve prosody transfer from a reference speech signal to synthesized speech. The main idea is to incorporate well-known acoustic correlates of prosody such as pitch and loudness contours of the reference speech into a modern neural text-to-speech (TTS) synthesizer such as Tacotron2 (TC2). More specifically, a small set of acoustic features are… ▽ More

    Submitted 15 May, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: 5 pages, in review for conference publication

  15. arXiv:1911.00344  [pdf, other

    cs.NE eess.SY physics.soc-ph

    Short and Wide Network Paths

    Authors: Lavanya Marla, Lav R. Varshney, Devavrat Shah, Nirmal A. Prakash, Michael E. Gale

    Abstract: Network flow is a powerful mathematical framework to systematically explore the relationship between structure and function in biological, social, and technological networks. We introduce a new pipelining model of flow through networks where commodities must be transported over single paths rather than split over several paths and recombined. We show this notion of pipelined network flow is optimi… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

  16. arXiv:1402.3654  [pdf

    eess.SY

    Temperature Control using Fuzzy Logic

    Authors: Piyush Singhala, Dhrumil Shah, Bhavikkumar Patel

    Abstract: The aim of the temperature control is to heat the system up todelimitated temperature, afterwardhold it at that temperature in insured manner. Fuzzy Logic Controller (FLC) is best way in which this type of precision control can be accomplished by controller. During past twenty yearssignificant amount of research using fuzzy logichas done in this field of control of non-linear dynamical system. Her… ▽ More

    Submitted 15 February, 2014; originally announced February 2014.

    Comments: 10 pages