Skip to main content

Showing 1–35 of 35 results for author: Jain, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.06798  [pdf, other

    eess.AS cs.SD

    The Reasonable Effectiveness of Speaker Embeddings for Violence Detection

    Authors: Sarthak Jain, Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma

    Abstract: In this paper, we focus on audio violence detection (AVD). AVD is necessary for several reasons, especially in the context of maintaining safety, preventing harm, and ensuring security in various environments. This calls for accurate AVD systems. Like many related applications in audio processing, the most common approach for improving the performance, would be by leveraging self-supervised (SSL)… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 24 Show & Tell Demonstrations

  2. arXiv:2406.06781  [pdf, other

    eess.AS cs.SD

    PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation

    Authors: Devyani Koshal, Orchid Chetia Phukan, Sarthak Jain, Arun Balaji Buduru, Rajesh Sharma

    Abstract: Emotion Recognition (ER), Gender Recognition (GR), and Age Estimation (AE) constitute paralinguistic tasks that rely not on the spoken content but primarily on speech characteristics such as pitch and tone. While previous research has made significant strides in develo** models for each task individually, there has been comparatively less emphasis on concurrently learning these tasks, despite th… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024 Show & Tell Demonstrations

  3. arXiv:2406.06774  [pdf, other

    eess.AS cs.SD

    ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

    Authors: Orchid Chetia Phukan, Sarthak Jain, Shubham Singh, Muskaan Singh, Arun Balaji Buduru, Rajesh Sharma

    Abstract: In this work, we focus on the detection of depression through speech analysis. Previous research has widely explored features extracted from pre-trained models (PTMs) primarily trained for paralinguistic tasks. Although these features have led to sufficient advances in speech-based depression detection, their performance declines in real-world settings. To address this, in this paper, we introduce… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024 Show & Tell Demonstrations

  4. arXiv:2403.15966  [pdf, other

    eess.SY

    Fisher Information Approach for Masking the Sensing Plan: Applications in Multifunction Radars

    Authors: Shashwat Jain, Vikram Krishnamurthy, Muralidhar Rangaswamy, Bosung Kang, Sandeep Gogineni

    Abstract: How to design a Markov Decision Process (MDP) based radar controller that makes small sacrifices in performance to mask its sensing plan from an adversary? The radar controller purposefully minimizes the Fisher information of its emissions so that an adversary cannot identify the controller's model parameters accurately. Unlike classical open loop statistical inference, where the Fisher informatio… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  5. arXiv:2312.05187  [pdf, other

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek , et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  6. arXiv:2311.12564  [pdf

    eess.AS cs.LG eess.SP

    Summary of the DISPLACE Challenge 2023 - DIarization of SPeaker and LAnguage in Conversational Environments

    Authors: Shikha Baghel, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, Sriram Ganapathy

    Abstract: In multi-lingual societies, where multiple languages are spoken in a small geographic vicinity, informal conversations often involve mix of languages. Existing speech technologies may be inefficient in extracting information from such conversations, where the speech data is rich in diversity with multiple languages and speakers. The DISPLACE (DIarization of SPeaker and LAnguage in Conversational E… ▽ More

    Submitted 3 January, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  7. arXiv:2305.16333  [pdf, ps, other

    cs.CL cs.AI cs.LG eess.AS

    Text Generation with Speech Synthesis for ASR Data Augmentation

    Authors: Zhuangqun Huang, Gil Keren, Ziran Jiang, Shashank Jain, David Goss-Grubbs, Nelson Cheng, Farnaz Abtahi, Duc Le, David Zhang, Antony D'Avirro, Ethan Campbell-Taylor, Jessie Salas, Irina-Elena Veliche, Xi Chen

    Abstract: Aiming at reducing the reliance on expensive human annotations, data synthesis for Automatic Speech Recognition (ASR) has remained an active area of research. While prior work mainly focuses on synthetic speech generation for ASR data augmentation, its combination with text generation methods is considerably less explored. In this work, we explore text augmentation for ASR using large-scale pre-tr… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  8. arXiv:2303.00830  [pdf, other

    eess.AS cs.SD eess.SP

    DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments

    Authors: Shikha Baghel, Shreyas Ramoji, Sidharth, Ranjana H, Prachi Singh, Somil Jain, Pratik Roy Chowdhuri, Kaustubh Kulkarni, Swapnil Padhi, Deepu Vijayasenan, Sriram Ganapathy

    Abstract: In multilingual societies, social conversations often involve code-mixed speech. The current speech technology may not be well equipped to extract information from multi-lingual multi-speaker conversations. The DISPLACE challenge entails a first-of-kind task to benchmark speaker and language diarization on the same data, as the data contains multi-speaker conversations in multilingual code-mixed s… ▽ More

    Submitted 5 June, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  9. arXiv:2302.12520  [pdf, other

    cs.LG eess.SY

    A Novel Demand Response Model and Method for Peak Reduction in Smart Grids -- PowerTAC

    Authors: Sanjay Chandlekar, Arthik Boroju, Shweta Jain, Sujit Gujar

    Abstract: One of the widely used peak reduction methods in smart grids is demand response, where one analyzes the shift in customers' (agents') usage patterns in response to the signal from the distribution company. Often, these signals are in the form of incentives offered to agents. This work studies the effect of incentives on the probabilities of accepting such offers in a real-world smart grid simulato… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 11 pages, 5 figures, 2 tables, Accepted as an Extended Abstract in AAMAS'23

  10. arXiv:2302.02045  [pdf, ps, other

    eess.SP

    Radar Clutter Covariance Estimation: A Nonlinear Spectral Shrinkage Approach

    Authors: Shashwat Jain, Vikram Krishnamurthy, Muralidhar Rangaswamy, Bosung Kang, Sandeep Gogineni

    Abstract: In this paper, we exploit the spiked covariance structure of the clutter plus noise covariance matrix for radar signal processing. Using state-of-the-art techniques high dimensional statistics, we propose a nonlinear shrinkage-based rotation invariant spiked covariance matrix estimator. We state the convergence of the estimated spiked eigenvalues. We use a dataset generated from the high-fidelity,… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

  11. arXiv:2212.02002  [pdf, other

    eess.SP cs.LG

    Adaptive ECCM for Mitigating Smart Jammers

    Authors: Kunal Pattanayak, Shashwat Jain, Vikram Krishnamurthy, Chris Berry

    Abstract: This paper considers adaptive radar electronic counter-counter measures (ECCM) to mitigate ECM by an adversarial jammer. Our ECCM approach models the jammer-radar interaction as a Principal Agent Problem (PAP), a popular economics framework for interaction between two entities with an information imbalance. In our setup, the radar does not know the jammer's utility. Instead, the radar learns the j… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  12. arXiv:2210.11302  [pdf, other

    physics.soc-ph eess.SY physics.ao-ph

    Fleet-Level Environmental Assessments for Feasibility of Aviation Emission Reduction Goals

    Authors: Kolawole Ogunsina, Hsun Chao, Nithin Jojo Kolencherry, Samarth Jain, Kushal Moolchandani, Daniel DeLaurentis, William Crossley

    Abstract: The International Air Transport Association (IATA) is one of several organizations that have presented goals for future CO2 emissions from commercial aviation with the intent of alleviating the associated environmental impacts. These goals include attaining carbon-neutral growth in the year 2020 and total aviation CO2 emissions in 2050 equal to 50% of 2005 aviation CO2 emissions. This paper presen… ▽ More

    Submitted 16 September, 2022; originally announced October 2022.

    Comments: Presented at the Council of Engineering Systems Universities (CESUN) conference in 2018

  13. arXiv:2209.06573  [pdf, other

    math.OC cs.RO eess.SY

    Using Spectral Submanifolds for Nonlinear Periodic Control

    Authors: Florian Mahlknecht, John Irvin Alora, Shobhit Jain, Edward Schmerling, Riccardo Bonalli, George Haller, Marco Pavone

    Abstract: Very high dimensional nonlinear systems arise in many engineering problems due to semi-discretization of the governing partial differential equations, e.g. through finite element methods. The complexity of these systems present computational challenges for direct application to automatic control. While model reduction has seen ubiquitous applications in control, the use of nonlinear model reductio… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: 8 pages, 6 figures, conference on decision and control 2022

  14. arXiv:2209.04235  [pdf, other

    eess.SP

    IEEE 802.11ad Based Joint Radar Communication Transceiver: Design, Prototype and Performance Analysis

    Authors: Akanksha Sneh, Soumya Jain, V Sri Sindhu, Shobha Sundar Ram, Sumit Darak

    Abstract: Rapid beam alignment is required to support high gain millimeter wave (mmW) communication links between a base station (BS) and mobile users (MU). The standard IEEE 802.11ad protocol enables beam alignment at the BS and MU through a lengthy beam training procedure accomplished through additional packet overhead. However, this results in reduced latency and throughput. Auxiliary radar functionality… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 14 pages, 13 figures

  15. arXiv:2205.12378  [pdf, ps, other

    eess.SY

    Lyapunov based Stochastic Stability of a Quantum Decision System for Human-Machine Interaction

    Authors: Luke Snow, Shashwat Jain, Vikram Krishnamurthy

    Abstract: In mathematical psychology, decision makers are modeled using the Lindbladian equations from quantum mechanics to capture important human-centric features such as order effects and violation of the sure thing principle. We consider human-machine interaction involving a quantum decision maker (human) and a controller (machine). Given a sequence of human decisions over time, how can the controller d… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.00059

  16. arXiv:2204.00059  [pdf, ps, other

    eess.SY econ.GN

    Lyapunov based Stochastic Stability of Human-Machine Interaction: A Quantum Decision System Approach

    Authors: Luke Snow, Shashwat Jain, Vikram Krishnamurthy

    Abstract: In mathematical psychology, decision makers are modeled using the Lindbladian equations from quantum mechanics to capture important human-centric features such as order effects and violation of the sure thing principle. We consider human-machine interaction involving a quantum decision maker (human) and a controller (machine). Given a sequence of human decisions over time, how can the controller d… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

  17. Leveraging Clinically Relevant Biometric Constraints To Supervise A Deep Learning Model For The Accurate Caliper Placement To Obtain Sonographic Measurements Of The Fetal Brain

    Authors: Hari Shankar, Adithya Narayan, Shefali Jain, Divya Singh, Pooja Vyas, Nivedita Hegde, Purbayan Kar, Abhi Lad, Jens Thang, Jagruthi Atada, Duy Nguyen, PS Roopa, Akhila Vasudeva, Prathima Radhakrishnan, Sripad Krishna Devalla

    Abstract: Multiple studies have demonstrated that obtaining standardized fetal brain biometry from mid-trimester ultrasonography (USG) examination is key for the reliable assessment of fetal neurodevelopment and the screening of central nervous system (CNS) anomalies. Obtaining these measurements is highly subjective, expertise-driven, and requires years of training experience, limiting quality prenatal car… ▽ More

    Submitted 31 July, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted for presentation at 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI)

  18. arXiv:2202.13553  [pdf, other

    eess.IV cs.CV cs.LG

    Towards A Device-Independent Deep Learning Approach for the Automated Segmentation of Sonographic Fetal Brain Structures: A Multi-Center and Multi-Device Validation

    Authors: Abhi Lad, Adithya Narayan, Hari Shankar, Shefali Jain, Pooja Punjani Vyas, Divya Singh, Nivedita Hegde, Jagruthi Atada, Jens Thang, Saw Shier Nee, Arunkumar Govindarajan, Roopa PS, Muralidhar V Pai, Akhila Vasudeva, Prathima Radhakrishnan, Sripad Krishna Devalla

    Abstract: Quality assessment of prenatal ultrasonography is essential for the screening of fetal central nervous system (CNS) anomalies. The interpretation of fetal brain structures is highly subjective, expertise-driven, and requires years of training experience, limiting quality prenatal care for all pregnant mothers. With recent advancement in Artificial Intelligence (AI), specifically deep learning (DL)… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: SPIE Medical Imaging 2022: Computer Aided Diagnosis (12033-75), 11 pages, 7 figures

  19. arXiv:2110.06123  [pdf, other

    cs.SD eess.AS

    COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation

    Authors: Saranga Kingkor Mahanta, Darsh Kaushik, Shubham Jain, Hoang Van Truong, Koushik Guha

    Abstract: With the periodic rise and fall of COVID-19 and countries being inflicted by its waves, an efficient, economic, and effortless diagnosis procedure for the virus has been the utmost need of the hour. COVID-19 positive individuals may even be asymptomatic making the diagnosis difficult, but amongst the infected subjects, the asymptomatic ones need not be entirely free of symptoms caused by the virus… ▽ More

    Submitted 3 May, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: DiCOVA, top 1st, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. arXiv:2109.14546  [pdf

    eess.SP cs.LG cs.NI

    An Energy Efficient Health Monitoring Approach with Wireless Body Area Networks

    Authors: Seemandhar Jain, Prarthi Jain, Prabhat K. Upadhyay, Jules M. Moualeu, Abhishek Srivastava

    Abstract: Wireless Body Area Networks (WBANs) comprise a network of sensors subcutaneously implanted or placed near the body surface and facilitate continuous monitoring of health parameters of a patient. Research endeavours involving WBAN are directed towards effective transmission of detected parameters to a Local Processing Unit (LPU, usually a mobile device) and analysis of the parameters at the LPU or… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 23 pages, 18 figures. (Full Abstract : https://seemandhar.herokuapp.com/wban)

  21. arXiv:2105.11241  [pdf

    eess.IV cs.CV cs.LG

    Generation of COVID-19 Chest CT Scan Images using Generative Adversarial Networks

    Authors: Prerak Mann, Sahaj Jain, Saurabh Mittal, Aruna Bhat

    Abstract: SARS-CoV-2, also known as COVID-19 or Coronavirus, is a viral contagious disease that is infected by a novel coronavirus, and has been rapidly spreading across the globe. It is very important to test and isolate people to reduce spread, and from here comes the need to do this quickly and efficiently. According to some studies, Chest-CT outperforms RT-PCR lab testing, which is the current standard,… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

  22. arXiv:2104.00793  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Effect of Radiology Report Labeler Quality on Deep Learning Models for Chest X-Ray Interpretation

    Authors: Saahil Jain, Akshay Smit, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Although deep learning models for chest X-ray interpretation are commonly trained on labels generated by automatic radiology report labelers, the impact of improvements in report labeling on the performance of chest X-ray classification models has not been systematically investigated. We first compare the CheXpert, CheXbert, and VisualCheXbert labelers on the task of extracting accurate chest X-ra… ▽ More

    Submitted 27 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: In Neural Information Processing Systems (NeurIPS) Workshop on Data-Centric AI (DCAI)

  23. Contrastive Learning of Single-Cell Phenotypic Representations for Treatment Classification

    Authors: Alexis Perakis, Ali Gorji, Samriddhi Jain, Krishna Chaitanya, Simone Rizza, Ender Konukoglu

    Abstract: Learning robust representations to discriminate cell phenotypes based on microscopy images is important for drug discovery. Drug development efforts typically analyse thousands of cell images to screen for potential treatments. Early works focus on creating hand-engineered features from these images or learn such features with deep neural networks in a fully or weakly-supervised framework. Both re… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: 12 pages, 2 figures, 7 tables. This article is a pre-print and is currently under review at a conference

    Journal ref: In: Lian C., Cao X., Rekik I., Xu X., Yan P. (eds) Machine Learning in Medical Imaging. MLMI 2021. Lecture Notes in Computer Science, vol 12966. Springer, Cham

  24. arXiv:2103.00383  [pdf, other

    cs.SD cs.LG eess.AS q-bio.QM

    Brain Signals to Rescue Aphasia, Apraxia and Dysarthria Speech Recognition

    Authors: Gautam Krishna, Mason Carnahan, Shilpa Shamapant, Yashitha Surendranath, Saumya Jain, Arundhati Ghosh, Co Tran, Jose del R Millan, Ahmed H Tewfik

    Abstract: In this paper, we propose a deep learning-based algorithm to improve the performance of automatic speech recognition (ASR) systems for aphasia, apraxia, and dysarthria speech by utilizing electroencephalography (EEG) features recorded synchronously with aphasia, apraxia, and dysarthria speech. We demonstrate a significant decoding performance improvement by more than 50\% during test time for isol… ▽ More

    Submitted 17 July, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: Accepted to IEEE EMBC 2021

  25. arXiv:2102.11467  [pdf, other

    eess.IV cs.CV cs.LG

    VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels

    Authors: Saahil Jain, Akshay Smit, Steven QH Truong, Chanh DT Nguyen, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P. Lungren, Pranav Rajpurkar

    Abstract: Automatic extraction of medical conditions from free-text radiology reports is critical for supervising computer vision models to interpret medical images. In this work, we show that radiologists labeling reports significantly disagree with radiologists labeling corresponding chest X-ray images, which reduces the quality of report labels as proxies for image labels. We develop and evaluate methods… ▽ More

    Submitted 15 March, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted to ACM Conference on Health, Inference, and Learning (ACM-CHIL) 2021

  26. arXiv:2010.06200  [pdf, other

    cs.SD eess.AS

    End-to-end Triplet Loss based Emotion Embedding System for Speech Emotion Recognition

    Authors: Puneet Kumar, Sidharth Jain, Balasubramanian Raman, Partha Pratim Roy, Masakazu Iwamura

    Abstract: In this paper, an end-to-end neural embedding system based on triplet loss and residual learning has been proposed for speech emotion recognition. The proposed system learns the embeddings from the emotional information of the speech utterances. The learned embeddings are used to recognize the emotions portrayed by given speech samples of various lengths. The proposed system implements Residual Ne… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted in ICPR 2020

  27. arXiv:2008.02344  [pdf, ps, other

    eess.IV cs.CV

    Exploiting Temporal Attention Features for Effective Denoising in Videos

    Authors: Aryansh Omray, Samyak Jain, Utsav Krishnan, Pratik Chattopadhyay

    Abstract: Video Denoising is one of the fundamental tasks of any videoprocessing pipeline. It is different from image denoising due to the tem-poral aspects of video frames, and any image denoising approach appliedto videos will result in flickering. The proposed method makes use oftemporal as well as spatial dimensions of video frames as part of a two-stage pipeline. Each stage in the architecture named as… ▽ More

    Submitted 27 August, 2020; v1 submitted 5 August, 2020; originally announced August 2020.

  28. arXiv:2006.13817  [pdf, other

    eess.IV cs.CV cs.LG

    Stacked Convolutional Neural Network for Diagnosis of COVID-19 Disease from X-ray Images

    Authors: Mahesh Gour, Sweta Jain

    Abstract: Automatic and rapid screening of COVID-19 from the chest X-ray images has become an urgent need in this pandemic situation of SARS-CoV-2 worldwide in 2020. However, accurate and reliable screening of patients is a massive challenge due to the discrepancy between COVID-19 and other viral pneumonia in X-ray images. In this paper, we design a new stacked convolutional neural network model for the aut… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: 6 tables, 4 figures

  29. arXiv:2005.08834  [pdf, other

    cs.HC eess.SP

    Designing Just-in-Time Detection for Gamified Fitness Frameworks

    Authors: Slobodan Milanko, Alexander Launi, Shubham Jain

    Abstract: This paper presents our findings from a multi-year effort to detect motion events early using inertial sensors in real-world settings. We believe early event detection is the next step in advancing motion tracking, and can enable just-in-time interventions, particularly for mHealth applications. Our system targets strength training workouts in the fitness domain, where users perform well-defined m… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

  30. arXiv:2004.04736  [pdf, other

    eess.IV cs.CV cs.LG

    Capsules for Biomedical Image Segmentation

    Authors: Rodney LaLonde, Ziyue Xu, Ismail Irmakci, Sanjay Jain, Ulas Bagci

    Abstract: Our work expands the use of capsule networks to the task of object segmentation for the first time in the literature. This is made possible via the introduction of locally-constrained routing and transformation matrix sharing, which reduces the parameter/memory burden and allows for the segmentation of objects at large resolutions. To compensate for the loss of global information in constraining t… ▽ More

    Submitted 10 December, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: Extension of the non-archival Capsules of Object Segmentation with experiments on both clinical and pre-clinical pathological lung segmentation from CT scans and muscular and adipose tissue segmentation from MR images. Accepted for publication in Medical Image Analysis. DOI: https://doi.org/10.1016/j.media.2020.101889. arXiv admin note: text overlap with arXiv:1804.04241

  31. arXiv:2003.08809  [pdf, other

    eess.IV q-bio.QM

    Morphological Reconstruction of Detached Dendritic Spines via Geodesic Path Prediction

    Authors: Sammit Jain, Suvadip Mukherjee, Lydia Danglot, Jean-Christophe Olivo-Marin

    Abstract: Morphological reconstruction of dendritic spines from fluorescent microscopy is a critical open problem in neuro-image analysis. Existing segmentation tools are ill-equipped to handle thin spines with long, poorly illuminated neck membranes. We address this issue, and introduce an unsupervised path prediction technique based on a stochastic framework which seeks the optimal solution from a path-sp… ▽ More

    Submitted 21 September, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: S. Jain and S. Mukherjee contributed equally to this work. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  32. arXiv:2002.12868  [pdf

    q-bio.TO cs.CV eess.IV

    Neural Network Segmentation of Interstitial Fibrosis, Tubular Atrophy, and Glomerulosclerosis in Renal Biopsies

    Authors: Brandon Ginley, Kuang-Yu Jen, Avi Rosenberg, Felicia Yen, Sanjay Jain, Agnes Fogo, Pinaki Sarder

    Abstract: Glomerulosclerosis, interstitial fibrosis, and tubular atrophy (IFTA) are histologic indicators of irrecoverable kidney injury. In standard clinical practice, the renal pathologist visually assesses, under the microscope, the percentage of sclerotic glomeruli and the percentage of renal cortical involvement by IFTA. Estimation of IFTA is a subjective process due to a varied spectrum and definition… ▽ More

    Submitted 28 February, 2020; originally announced February 2020.

  33. arXiv:2002.11151  [pdf, other

    cs.LG eess.SP stat.ML

    TxSim:Modeling Training of Deep Neural Networks on Resistive Crossbar Systems

    Authors: Sourjya Roy, Shrihari Sridharan, Shubham Jain, Anand Raghunathan

    Abstract: Resistive crossbars have attracted significant interest in the design of Deep Neural Network (DNN) accelerators due to their ability to natively execute massively parallel vector-matrix multiplications within dense memory arrays. However, crossbar-based computations face a major challenge due to a variety of device and circuit-level non-idealities, which manifest as errors in the vector-matrix mul… ▽ More

    Submitted 7 January, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

  34. arXiv:1908.01134  [pdf, ps, other

    eess.IV math.AP math.NA

    A Fuzzy Edge Detector Driven Telegraph Total Variation Model For Image Despeckling

    Authors: Sudeb Majee, Subit K Jain, Rajendra K Ray, Ananta K Majee

    Abstract: Speckle noise suppression is a challenging and crucial pre-processing stage for higher-level image analysis. In this work, a new attempt has been made using telegraph total variation equation and fuzzy set theory for speckle noise suppression. The intuitionistic fuzzy divergence (IFD) function has been used to distinguish between edges and noise. To the best of the author's knowledge, most of the… ▽ More

    Submitted 5 August, 2019; v1 submitted 3 August, 2019; originally announced August 2019.

    Comments: 19 pages, 4 figures, 3 tables

  35. arXiv:1812.07509  [pdf

    eess.IV cs.CV cs.HC cs.LG stat.ML

    Iterative annotation to ease neural network training: Specialized machine learning in medical image analysis

    Authors: Brendon Lutnick, Brandon Ginley, Darshana Govind, Sean D. McGarry, Peter S. LaViolette, Rabi Yacoub, Sanjay Jain, John E. Tomaszewski, Kuang-Yu Jen, Pinaki Sarder

    Abstract: Neural networks promise to bring robust, quantitative analysis to medical fields, but adoption is limited by the technicalities of training these networks. To address this translation gap between medical researchers and neural networks in the field of pathology, we have created an intuitive interface which utilizes the commonly used whole slide image (WSI) viewer, Aperio ImageScope (Leica Biosyste… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: 15 pages, 7 figures, 2 supplemental figures (on the last page)

    Journal ref: Nature Machine Intelligence 1.2 (2019): 112