Skip to main content

Showing 1–29 of 29 results for author: Jha, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14875  [pdf, other

    cs.SD eess.AS

    GLOBE: A High-quality English Corpus with Global Accents for Zero-shot Speaker Adaptive Text-to-Speech

    Authors: Wenbin Wang, Yang Song, Sanjay Jha

    Abstract: This paper introduces GLOBE, a high-quality English corpus with worldwide accents, specifically designed to address the limitations of current zero-shot speaker adaptive Text-to-Speech (TTS) systems that exhibit poor generalizability in adapting to speakers with accents. Compared to commonly used English corpora, such as LibriTTS and VCTK, GLOBE is unique in its inclusion of utterances from 23,519… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024, 4 pages, 3 figures

  2. arXiv:2406.10893  [pdf, other

    eess.IV cs.AI cs.CV q-bio.QM q-bio.TO

    Development and Validation of Fully Automatic Deep Learning-Based Algorithms for Immunohistochemistry Reporting of Invasive Breast Ductal Carcinoma

    Authors: Sumit Kumar Jha, Purnendu Mishra, Shubham Mathur, Gursewak Singh, Rajiv Kumar, Kiran Aatre, Suraj Rengarajan

    Abstract: Immunohistochemistry (IHC) analysis is a well-accepted and widely used method for molecular subty**, a procedure for prognosis and targeted therapy of breast carcinoma, the most common type of tumor affecting women. There are four molecular biomarkers namely progesterone receptor (PR), estrogen receptor (ER), antigen Ki67, and human epidermal growth factor receptor 2 (HER2) whose assessment is n… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  3. arXiv:2406.05828  [pdf, other

    cs.CV cs.AI eess.IV

    Multi-Stain Multi-Level Convolutional Network for Multi-Tissue Breast Cancer Image Segmentation

    Authors: Akash Modi, Sumit Kumar Jha, Purnendu Mishra, Rajiv Kumar, Kiran Aatre, Gursewak Singh, Shubham Mathur

    Abstract: Digital pathology and microscopy image analysis are widely employed in the segmentation of digitally scanned IHC slides, primarily to identify cancer and pinpoint regions of interest (ROI) indicative of tumor presence. However, current ROI segmentation models are either stain-specific or suffer from the issues of stain and scanner variance due to different staining protocols or modalities across m… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  4. arXiv:2404.18094  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    USAT: A Universal Speaker-Adaptive Text-to-Speech Approach

    Authors: Wenbin Wang, Yang Song, Sanjay Jha

    Abstract: Conventional text-to-speech (TTS) research has predominantly focused on enhancing the quality of synthesized speech for speakers in the training dataset. The challenge of synthesizing lifelike speech for unseen, out-of-dataset speakers, especially those with limited reference data, remains a significant and unresolved problem. While zero-shot or few-shot speaker-adaptive TTS approaches have been e… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 15 pages, 13 figures. Copyright has been transferred to IEEE

    Journal ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2024

  5. arXiv:2404.15293  [pdf, other

    eess.IV cs.GR q-bio.NC

    Interactive Manipulation and Visualization of 3D Brain MRI for Surgical Training

    Authors: Siddharth Jha, Zichen Gui, Benjamin Delbos, Richard Moreau, Arnaud Leleve, Irene Cheng

    Abstract: In modern medical diagnostics, magnetic resonance imaging (MRI) is an important technique that provides detailed insights into anatomical structures. In this paper, we present a comprehensive methodology focusing on streamlining the segmentation, reconstruction, and visualization process of 3D MRI data. Segmentation involves the extraction of anatomical regions with the help of state-of-the-art de… ▽ More

    Submitted 24 March, 2024; originally announced April 2024.

  6. arXiv:2311.00429  [pdf, other

    eess.IV cs.LG

    Crop Disease Classification using Support Vector Machines with Green Chromatic Coordinate (GCC) and Attention based feature extraction for IoT based Smart Agricultural Applications

    Authors: Shashwat Jha, Vishvaditya Luhach, Gauri Shanker Gupta, Beependra Singh

    Abstract: Crops hold paramount significance as they serve as the primary provider of energy, nutrition, and medicinal benefits for the human population. Plant diseases, however, can negatively affect leaves during agricultural cultivation, resulting in significant losses in crop output and economic value. Therefore, it is crucial for farmers to identify crop diseases. However, this method frequently necessi… ▽ More

    Submitted 6 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

  7. arXiv:2309.05070  [pdf, other

    cs.RO cs.AI eess.SY

    Chasing the Intruder: A Reinforcement Learning Approach for Tracking Intruder Drones

    Authors: Shivam Kainth, Subham Sahoo, Rajtilak Pal, Shashi Shekhar Jha

    Abstract: Drones are becoming versatile in a myriad of applications. This has led to the use of drones for spying and intruding into the restricted or private air spaces. Such foul use of drone technology is dangerous for the safety and security of many critical infrastructures. In addition, due to the varied low-cost design and agility of the drones, it is a challenging task to identify and track them usin… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  8. arXiv:2308.13007  [pdf, other

    cs.SD cs.AI eess.AS

    Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations

    Authors: Wenbin Wang, Yang Song, Sanjay Jha

    Abstract: While most research into speech synthesis has focused on synthesizing high-quality speech for in-dataset speakers, an equally essential yet unsolved problem is synthesizing speech for unseen speakers who are out-of-dataset with limited reference data, i.e., speaker adaptive speech synthesis. Many studies have proposed zero-shot speaker adaptive text-to-speech and voice conversion approaches aimed… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 5 pages, 3 figures. Accepted by Interspeech 2023, Oral

  9. arXiv:2303.01702  [pdf, other

    eess.SP

    Hardware Software Co-Design Based Reconfigurable Radar Signal Processing Accelerator for Joint Radar-Communication System

    Authors: Shragvi Sidharth Jha, Aakanksha Tewari, Sumit J Darak, Akanksha Sneh, Shobha Sundar Ram

    Abstract: Millimeter wave (mmW) codesigned 802.11ad-based joint radar communication (JRC) systems have been identified as a potential solution for realizing high bandwidth connected vehicles for next-generation intelligent transportation systems. The radar functionality within the JRC enables accurate detection and localization of mobile targets, which can significantly speed up the selection of the optimal… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  10. arXiv:2302.05494  [pdf

    eess.SP

    Data-Driven Web-Based Patching Management Tool Using Multi-Sensor Pavement Structure Measurements

    Authors: Sneha Jha, Yaguang Zhang, Bongsuk Park, Seonghwan Cho, James V. Krogmeier, Tandra Bagchi, John E. Haddock

    Abstract: Automating pavement maintenance suggestions is challenging,especially for actionable recommendations such as patching location,depth and priority.It is common practice among State agencies to manually inspect road segments of interest and decide maintenance requirements based on the pavement condition index (PCI).However,standalone PCI only evaluates the pavement surface condition and coupled with… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Presented at: Transportation Research Board Annual Meeting 2023

    Report number: PaperID-TRBAM-23-04483

  11. arXiv:2209.08795  [pdf, other

    cs.MM cs.CV cs.SD eess.AS

    AutoLV: Automatic Lecture Video Generator

    Authors: Wenbin Wang, Yang Song, Sanjay Jha

    Abstract: We propose an end-to-end lecture video generation system that can generate realistic and complete lecture videos directly from annotated slides, instructor's reference voice and instructor's reference portrait video. Our system is primarily composed of a speech synthesis module with few-shot speaker adaptation and an adversarial learning-based talking-head generation module. It is capable of not o… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 4 pages, 4 figures, ICIP 2022

  12. arXiv:2109.05222  [pdf, ps, other

    cs.IT cs.LG eess.SP stat.ML

    Fundamental limits of over-the-air optimization: Are analog schemes optimal?

    Authors: Shubham K Jha, Prathamesh Mayekar, Himanshu Tyagi

    Abstract: We consider over-the-air convex optimization on a $d-$dimensional space where coded gradients are sent over an additive Gaussian noise channel with variance $σ^2$. The codewords satisfy an average power constraint $P$, resulting in the signal-to-noise ratio (SNR) of $P/σ^2$. We derive bounds for the convergence rates for over-the-air optimization. Our first result is a lower bound for the converge… ▽ More

    Submitted 15 September, 2021; v1 submitted 11 September, 2021; originally announced September 2021.

    Comments: Few typos fixed and one reference added. An abridged version of this paper will appear in the proceedings of IEEE Global Communications Conference (GLOBECOM), Spain, 2021

  13. arXiv:2108.06884  [pdf, other

    eess.SP cs.NI

    Seirios: Leveraging Multiple Channels for LoRaWAN Indoor and Outdoor Localization

    Authors: Jun Liu, Jiayao Gao, Sanjay Jha, Wen Hu

    Abstract: Localization is important for a large number of Internet of Things (IoT) endpoint devices connected by LoRaWAN. Due to the bandwidth limitations of LoRaWAN, existing localization methods without specialized hardware (e.g., GPS) produce poor performance. To increase the localization accuracy, we propose a super-resolution localization method, called Seirios, which features a novel algorithm to sync… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: MOBICOM 2021

  14. arXiv:2108.00874  [pdf, other

    cs.LG cs.IT eess.SP stat.ML

    Few-Shot Domain Adaptation For End-to-End Communication

    Authors: Jayaram Raghuram, Yi**g Zeng, Dolores GarcĂ­a MartĂ­, Rafael Ruiz Ortiz, Somesh Jha, Joerg Widmer, Suman Banerjee

    Abstract: The problem of end-to-end learning of a communication system using an autoencoder -- consisting of an encoder, channel, and decoder modeled using neural networks -- has recently been shown to be a promising approach. A challenge faced in the practical adoption of this learning approach is that under changing channel conditions (e.g. a wireless link), it requires frequent retraining of the autoenco… ▽ More

    Submitted 25 July, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: 32 pages, 11 figures

  15. arXiv:2108.00640  [pdf, ps, other

    cs.LG eess.SP

    Few-shot calibration of low-cost air pollution (PM2.5) sensors using meta-learning

    Authors: Kalpit Yadav, Vipul Arora, Sonu Kumar Jha, Mohit Kumar, Sachchida Nand Tripathi

    Abstract: Low-cost particulate matter sensors are transforming air quality monitoring because they have lower costs and greater mobility as compared to reference monitors. Calibration of these low-cost sensors requires training data from co-deployed reference monitors. Machine Learning based calibration gives better performance than conventional techniques, but requires a large amount of training data from… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 3+1 pages, submitted to IEEE sensors conference 2021

  16. arXiv:2106.07972  [pdf

    eess.AS cs.SD

    SRIB Submission to Interspeech 2021 DiCOVA Challenge

    Authors: Vishwanath Pratap Singh, Shashi Kumar, Ravi Shekhar Jha, Abhishek Pandey

    Abstract: The COVID-19 pandemic has resulted in more than 125 million infections and more than 2.7 million casualties. In this paper, we attempt to classify covid vs non-covid cough sounds using signal processing and deep learning methods. Air turbulence, the vibration of tissues, movement of fluid through airways, opening, and closure of glottis are some of the causes for the production of the acoustic sou… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: 5 pages, 5 figures

  17. arXiv:2104.09396  [pdf, other

    eess.SP cs.HC cs.LG

    Continual Learning in Sensor-based Human Activity Recognition: an Empirical Benchmark Analysis

    Authors: Saurav Jha, Martin Schiemer, Franco Zambonelli, Juan Ye

    Abstract: Sensor-based human activity recognition (HAR), i.e., the ability to discover human daily activity patterns from wearable or embedded sensors, is a key enabler for many real-world applications in smart homes, personal healthcare, and urban planning. However, with an increasing number of applications being deployed, an important question arises: how can a HAR system autonomously learn new activities… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted in the Information Sciences journal

  18. arXiv:2011.14674  [pdf, other

    eess.SY

    Thermal Analysis of PEM Fuel Cell and Lithium Ion Battery Pack in Confined Space

    Authors: Ashita Victor, Abhay Shankar Jha, Janamejaya Channegowda, Sumukh Surya, Kali vara prasad Naraharisetti

    Abstract: Hybrid energy storage systems (HESS) have carved a niche in the industry. HESS improve the system efficiency, reduce the overall cost and increase the lifespan of the system. The proton exchange membrane (PEM) fuel cell is hybridized with Li-ion batteries (LIB) for vehicular applications, robotic applications etc. In applications which have geometrical space constraints, the temperature of the ene… ▽ More

    Submitted 30 November, 2020; originally announced November 2020.

  19. arXiv:2011.12569  [pdf, other

    cs.RO eess.SY

    Learning Certified Control using Contraction Metric

    Authors: Dawei Sun, Susmit Jha, Chuchu Fan

    Abstract: In this paper, we solve the problem of finding a certified control policy that drives a robot from any given initial state and under any bounded disturbance to the desired reference trajectory, with guarantees on the convergence or bounds on the tracking error. Such a controller is crucial in safe motion planning. We leverage the advanced theory in Control Contraction Metric and design a learning… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: Accepted to Conference on Robot Learning (CoRL) 2020

  20. arXiv:2006.04570  [pdf

    cs.CV cs.LG eess.IV

    Incorporating Image Gradients as Secondary Input Associated with Input Image to Improve the Performance of the CNN Model

    Authors: Vijay Pandey, Shashi Bhushan Jha

    Abstract: CNN is very popular neural network architecture in modern days. It is primarily most used tool for vision related task to extract the important features from the given image. Moreover, CNN works as a filter to extract the important features using convolutional operation in distinct layers. In existing CNN architectures, to train the network on given input, only single form of given input is fed to… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

  21. arXiv:1911.11378  [pdf, other

    cs.LG cs.CV cs.MM eess.IV stat.ML

    Text2FaceGAN: Face Generation from Fine Grained Textual Descriptions

    Authors: Osaid Rehman Nasir, Shailesh Kumar Jha, Manraj Singh Grover, Yi Yu, Ajit Kumar, Rajiv Ratn Shah

    Abstract: Powerful generative adversarial networks (GAN) have been developed to automatically synthesize realistic images from text. However, most existing tasks are limited to generating simple images such as flowers from captions. In this work, we extend this problem to the less addressed domain of face generation from fine-grained textual descriptions of face, e.g., "A person has curly hair, oval face, a… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

  22. arXiv:1909.03900  [pdf, other

    cs.NI eess.SP

    Measurement, Characterization and Modeling of LoRa Technology in Multi-floor Buildings

    Authors: Weitao Xu, Jun Young Kim, Walter Huang, Salil Kanhere, Sanjay Jha, Wen Hu

    Abstract: In recent years, we have witnessed the rapid development of LoRa technology, together with extensive studies trying to understand its performance in various application settings. In contrast to measurements performed in large outdoor areas, limited number of attempts have been made to understand the characterization and performance of LoRa technology in indoor environments. In this paper, we prese… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 10 pages, 9 figures

  23. AVFI: Fault Injection for Autonomous Vehicles

    Authors: Saurabh Jha, Subho S. Banerjee, James Cyriac, Zbigniew T. Kalbarczyk, Ravishankar K. Iyer

    Abstract: Autonomous vehicle (AV) technology is rapidly becoming a reality on U.S. roads, offering the promise of improvements in traffic management, safety, and the comfort and efficiency of vehicular travel. With this increasing popularity and ubiquitous deployment, resilience has become a critical requirement for public acceptance and adoption. Recent studies into the resilience of AVs have shown that th… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: Published in: 2018 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)

  24. arXiv:1902.11277  [pdf, other

    eess.SY

    A Risk-Sensitive Finite-Time Reachability Approach for Safety of Stochastic Dynamic Systems

    Authors: Margaret P. Chapman, Jonathan Lacotte, Aviv Tamar, Donggun Lee, Kevin M. Smith, Victoria Cheng, Jaime F. Fisac, Susmit Jha, Marco Pavone, Claire J. Tomlin

    Abstract: A classic reachability problem for safety of dynamic systems is to compute the set of initial states from which the state trajectory is guaranteed to stay inside a given constraint set over a given time horizon. In this paper, we leverage existing theory of reachability analysis and risk measures to devise a risk-sensitive reachability approach for safety of stochastic dynamic systems under non-ad… ▽ More

    Submitted 30 April, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

  25. arXiv:1809.06472  [pdf, other

    eess.SY

    Ground vehicle odometry using a non-intrusive inertial speed sensor

    Authors: Het Shah, Siddhant Haldar, Rohit Ner, Siddharth Jha, Debashish Chakravarty

    Abstract: This paper describes the design and development of a non-intrusive inertial speed sensor that can be reliably used to replace a conventional optical or hall effect-based speedometer on any kind of ground vehicle. The design allows for simple assembly-disassembly from tyre rims. The sensor design and data flow are explained. Algorithms and filters for pre-processing and processing the data are deta… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

  26. arXiv:1612.01476  [pdf, other

    eess.SY cs.RO

    Modeling and Control of an Autonomous Three Wheeled Mobile Robot with Front Steer

    Authors: Ayush Pandey, Siddharth Jha, Debashish Chakravarty

    Abstract: Modeling and control strategies for a design of an autonomous three wheeled mobile robot with front wheel steer is presented. Although, the three-wheel vehicle design with front wheel steer is common in automotive vehicles used often in public transport, but its advantages in navigation and localization of autonomous vehicles is seldom utilized. We present the system model for such a robotic vehic… ▽ More

    Submitted 5 December, 2016; originally announced December 2016.

    Comments: IEEE International Conference on Robotic Computing 2017. (under review)

  27. arXiv:1407.4937   

    cs.LO cs.FL cs.SE eess.SY

    Proceedings 3rd Workshop on Synthesis

    Authors: Krishnendu Chatterjee, RĂ¼diger Ehlers, Susmit Jha

    Abstract: The idea of synthesis, i.e., the process of automatically computing implementations from their specifications, has recently gained a lot of momentum in the contexts of software engineering and reactive system design. While it is widely believed that, due to complexity/undecidability issues, synthesis cannot completely replace manual engineering, it can assist the process of designing the intricate… ▽ More

    Submitted 18 July, 2014; originally announced July 2014.

    ACM Class: B.1.2; D.2.2; F.1.1; F.1.2; I.2.2

    Journal ref: EPTCS 157, 2014

  28. arXiv:1302.1920  [pdf, ps, other

    eess.SY

    SWATI: Synthesizing Wordlengths Automatically Using Testing and Induction

    Authors: Susmit Jha, Sanjit A. Seshia

    Abstract: In this paper, we present an automated technique SWATI: Synthesizing Wordlengths Automatically Using Testing and Induction, which uses a combination of Nelder-Mead optimization based testing, and induction from examples to automatically synthesize optimal fixedpoint implementation of numerical routines. The design of numerical software is commonly done using floating-point arithmetic in design-env… ▽ More

    Submitted 7 February, 2013; originally announced February 2013.

    ACM Class: F.2.1; D.2.4; I.2.2

  29. arXiv:1103.0800  [pdf, ps, other

    eess.SY math.OC

    Synthesizing Switching Logic to Minimize Long-Run Cost

    Authors: Susmit Jha, Sanjit A. Seshia, Ashish Tiwari

    Abstract: Given a multi-modal dynamical system, optimal switching logic synthesis involves generating the conditions for switching between the system modes such that the resulting hybrid system satisfies a quantitative specification. We formalize and solve the problem of optimal switching logic synthesis for quantitative specifications over long run behavior. Each trajectory of the system, and each state of… ▽ More

    Submitted 5 May, 2011; v1 submitted 3 March, 2011; originally announced March 2011.

    Comments: UC Berkeley Technical Report