-
Robust Broadband Beamforming using Bilinear Programming
Authors:
Nakul Singh,
Coleman DeLude,
Mark A. Davenport,
Justin Romberg
Abstract:
We introduce a new method for robust beamforming, where the goal is to estimate a signal from array samples when there is uncertainty in the angle of arrival. Our method offers state-of-the-art performance on narrowband signals and is naturally applied to broadband signals. Our beamformer operates by treating the forward model for the array samples as unknown. We show that the "true" forward model lies in the linear span of a small number of fixed linear systems. As a result, we can estimate the forward operator and the signal simultaneously by solving a bilinear inverse problem using least squares. Our numerical experiments show that if the angle of arrival is known to only be within an interval of reasonable size, there is very little loss in estimation performance compared to the case where the angle is known exactly.
Submitted 24 June, 2024;
originally announced June 2024.
-
Contrastive Learning from Synthetic Audio Doppelgangers
Authors:
Manuel Cherep,
Nikhil Singh
Abstract:
Learning robust audio representations currently demands extensive datasets of real-world sound recordings. By applying artificial transformations to these recordings, models can learn to recognize similarities despite subtle variations through techniques like contrastive learning. However, these transformations are only approximations of the true diversity found in real-world sounds, which are generated by complex interactions of physical processes, from vocal cord vibrations to the resonance of musical instruments. We propose a solution to both the data scale and transformation limitations, leveraging synthetic audio. By randomly perturbing the parameters of a sound synthesizer, we generate audio doppelgängers: synthetic positive pairs with causally manipulated variations in timbre, pitch, and temporal envelopes. These variations, difficult to achieve through transformations of existing audio, provide a rich source of contrastive information. Despite the shift to randomly generated synthetic data, our method produces strong representations, competitive with real data on standard audio classification benchmarks. Notably, our approach is lightweight, requires no data storage, and has only a single hyperparameter, which we extensively analyze. We offer this method as a complement to existing strategies for contrastive learning in audio, using synthesized sounds to reduce the data burden on practitioners.
Submitted 9 June, 2024;
originally announced June 2024.
-
Creative Text-to-Audio Generation via Synthesizer Programming
Authors:
Manuel Cherep,
Nikhil Singh,
Jessica Shand
Abstract:
Neural audio synthesis methods now allow specifying ideas in natural language. However, these methods produce results that cannot be easily tweaked, as they are based on large latent spaces and up to billions of uninterpretable parameters. We propose a text-to-audio generation method that leverages a virtual modular sound synthesizer with only 78 parameters. Synthesizers have long been used by skilled sound designers for media like music and film due to their flexibility and intuitive controls. Our method, CTAG, iteratively updates a synthesizer's parameters to produce high-quality audio renderings of text prompts that can be easily inspected and tweaked. Sounds produced this way are also more abstract, capturing essential conceptual features over fine-grained acoustic details, akin to how simple sketches can vividly convey visual concepts. Our results show how CTAG produces sounds that are distinctive, perceived as artistic, and yet similarly identifiable to recent neural audio synthesis models, positioning it as a valuable and complementary tool.
Submitted 1 June, 2024;
originally announced June 2024.
-
Adaptive Reinforcement Learning for Robot Control
Authors:
Yu Tang Liu,
Nilaksh Singh,
Aamir Ahmad
Abstract:
Deep reinforcement learning (DRL) has shown remarkable success in simulation domains, yet its application in designing robot controllers remains limited, due to its single-task orientation and insufficient adaptability to environmental changes. To overcome these limitations, we present a novel adaptive agent that leverages transfer learning techniques to dynamically adapt policy in response to different tasks and environmental conditions. The approach is validated through the blimp control challenge, where multitasking capabilities and environmental adaptability are essential. The agent is trained using a custom, highly parallelized simulator built on IsaacGym. We perform zero-shot transfer to fly the blimp in the real world to solve various tasks. We share our code at https://github.com/robot-perception-group/adaptive_agent/.
Submitted 29 April, 2024;
originally announced April 2024.
-
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Authors:
Nikhil Kumar Singh,
Indranil Saha
Abstract:
Efficient utilization of the replay buffer plays a significant role in the off-policy actor-critic reinforcement learning (RL) algorithms used for model-free control policy synthesis for complex dynamical systems. We propose a method for achieving sample efficiency, which focuses on selecting unique samples and adding them to the replay buffer during exploration, with the goal of reducing the buffer size and maintaining the independent and identically distributed (IID) nature of the samples. Our method is based on selecting an important subset of the set of state variables from the experiences encountered during the initial phase of random exploration, partitioning the state space into a set of abstract states based on the selected important state variables, and finally selecting the experiences with unique state-reward combinations by using a kernel density estimator. We formally prove that the off-policy actor-critic algorithm incorporating the proposed method for unique experience accumulation converges faster than the vanilla off-policy actor-critic algorithm. Furthermore, we evaluate our method by comparing it with two state-of-the-art actor-critic RL algorithms on several continuous control benchmarks available in the Gym environment. Experimental results demonstrate that our method achieves a significant reduction in the size of the replay buffer for all the benchmarks while achieving either faster convergence or better reward accumulation compared to the baseline algorithms.
Submitted 5 February, 2024;
originally announced February 2024.
-
Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning
Authors:
Nikhil Singh,
Chih-Wei Wu,
Iroro Orife,
Mahdi Kalayeh
Abstract:
Audiovisual representation learning typically relies on the correspondence between sight and sound. However, there are often multiple audio tracks that can correspond with a visual scene. Consider, for example, different conversations on the same crowded street. The effect of such counterfactual pairs on audiovisual representation learning has not been previously explored. To investigate this, we use dubbed versions of movies and television shows to augment cross-modal contrastive learning. Our approach learns to represent alternate audio tracks, differing only in speech, similarly to the same video. Our results, from a comprehensive set of experiments investigating different training strategies, show this general approach improves performance on a range of downstream auditory and audiovisual tasks, without majorly affecting linguistic task performance overall. These findings highlight the importance of considering speech variation when learning scene-level audiovisual correspondences and suggest that dubbed audio can be a useful augmentation technique for training audiovisual models toward more robust performance on diverse downstream tasks.
Submitted 8 June, 2024; v1 submitted 12 April, 2023;
originally announced April 2023.
-
Data Consistent Deep Rigid MRI Motion Correction
Authors:
Nalini M. Singh,
Neel Dey,
Malte Hoffmann,
Bruce Fischl,
Elfar Adalsteinsson,
Robert Frost,
Adrian V. Dalca,
Polina Golland
Abstract:
Motion artifacts are a pervasive problem in MRI, leading to misdiagnosis or mischaracterization in population-level imaging studies. Current retrospective rigid intra-slice motion correction techniques jointly optimize estimates of the image and the motion parameters. In this paper, we use a deep network to reduce the joint image-motion parameter search to a search over rigid motion parameters alone. Our network produces a reconstruction as a function of two inputs: corrupted k-space data and motion parameters. We train the network using simulated, motion-corrupted k-space data generated with known motion parameters. At test-time, we estimate unknown motion parameters by minimizing a data consistency loss between the motion parameters, the network-based image reconstruction given those parameters, and the acquired measurements. Intra-slice motion correction experiments on simulated and realistic 2D fast spin echo brain MRI achieve high reconstruction fidelity while providing the benefits of explicit data consistency optimization. Our code is publicly available at https://www.github.com/nalinimsingh/neuroMoCo.
Submitted 16 November, 2023; v1 submitted 24 January, 2023;
originally announced January 2023.
-
Passivity and Immersion based-modified gradient estimator: A control perspective in parameter estimation
Authors:
Syed Shadab Nayyer,
G. Revati,
S. R. Wagh,
N. M. Singh
Abstract:
In this paper, a constructive and systematic strategy with more apparent degrees of freedom to achieve the accurate estimation of unknown parameters via a control perspective is proposed. By adding a virtual control in the final equation of the gradient dynamics, the Gradient Estimator (GE) and Memory Regressor and Extension (MRE) approaches are extended. The solution of the virtual control law is identified by the P&I approach. The P&I approach is based on the choice of an appropriate implicit manifold and the generation of a suitable passive output and a related storage function. This facilitates the virtual control law being obtained in a way that the parametric error converges asymptotically to zero. Because the above ideas connect with the P&I approach and GE, the developed methodology is labeled the passivity and immersion-based modified gradient estimator (MGE). The proposed P&I-based modified gradient estimator is extended via the MRE approach. This modification provides improved transient response and fast convergence. Based on certain PE and non-PE examples, a comparative analysis is carried out to show the efficacy of the proposed approaches.
Submitted 19 November, 2022;
originally announced November 2022.
-
Granger Causality for Predictability in Dynamic Mode Decomposition
Authors:
G. Revati,
Syed Shadab,
K. Sonam,
S. R. Wagh,
N. M. Singh
Abstract:
The dynamic mode decomposition (DMD) technique extracts the dominant modes characterizing the innate dynamical behavior of the system within the measurement data. For appropriate identification of dominant modes from the measurement data, the DMD algorithm requires input measurement data sequences of sufficient quality. To validate the usability of a dataset for the DMD algorithm, the paper proposes two conditions: persistence of excitation (PE) and the Granger Causality Test (GCT). The virtual data sequences are designed with the Hankel matrix representation such that the dimension of the subspace spanning the essential system modes increases with the addition of new state variables. The PE condition provides the lower bound for the trajectory length, and the GCT provides the order of the model. Satisfying the PE condition enables estimating an approximate linear model, but predictability with the identified model is only assured when temporal causation among the data is established with the GCT. The proposed methodology is validated with an application to coherency identification (CI) in a multi-machine power system (MMPS), an essential phenomenon in transient stability analysis. The significance of the PE condition and the GCT is demonstrated through various case studies implemented on a 22-bus, six-generator system.
Submitted 23 October, 2022;
originally announced October 2022.
-
Automated detection of Alzheimer disease using MRI images and deep neural networks- A review
Authors:
Narotam Singh,
Patteshwari. D,
Neha Soni,
Amita Kapoor
Abstract:
Early detection of Alzheimer disease is crucial for deploying interventions and slowing the disease progression. A lot of machine learning and deep learning algorithms have been explored in the past decade with the aim of building an automated detection for Alzheimer. Advancements in data augmentation techniques and advanced deep learning architectures have opened up new frontiers in this field, and research is moving at a rapid speed. Hence, the purpose of this survey is to provide an overview of recent research on deep learning models for Alzheimer disease diagnosis. In addition to categorizing the numerous data sources, neural network architectures, and commonly used assessment measures, we also classify implementation and reproducibility. Our objective is to assist interested researchers in keeping up with the newest developments and in reproducing earlier investigations as benchmarks. In addition, we also indicate future research directions for this topic.
Submitted 22 September, 2022;
originally announced September 2022.
-
Towards a Constructive Framework for Stabilization and Control of Nonlinear Systems: Passivity and Immersion (P&I) Approach
Authors:
Syed Shadab Nayyer,
Sushama R. Wagh,
Navdeep M. Singh
Abstract:
The varied and complex dynamics of real-world systems challenge the formulation of a systematic strategy for designing a stabilizing feedback law. Rather than taking a universal approach, the control strategies developed thus far to handle this problem are specific to the inherent structure of the system under consideration. Therefore, this paper attempts to develop a generalized theory for the design of the stabilizing feedback law wherever possible for a general class of systems, including the systems in standard structured and unstructured forms discussed in the existing literature. This general controller design theory uses the idea of an invariant target manifold giving rise to a non-degenerate two-form, through which the definition of certain passive outputs and storage functions leads to a control law for stabilizing the system. Because the above concepts are linked with the Immersion and Invariance (I&I) design policy and the passivity theory of controller design, the proposed methodology is labeled the "Passivity and Immersion (P&I) approach". Furthermore, because the methodology is constructive, the worked examples in this paper demonstrate that various design paradigms, such as Backstepping, Incremental Backstepping, Forwarding, I&I, and techniques based on the generation of Control Contraction Metrics (CCM), can be unified in the P&I methodology.
Submitted 11 December, 2022; v1 submitted 22 August, 2022;
originally announced August 2022.
-
UniPreCIS : A data pre-processing solution for collocated services on shared IoT
Authors:
Anirban Das,
Navlika Singh,
Suchetana Chakraborty
Abstract:
Next-generation smart city applications, enabled by the power of the Internet of Things (IoT) and Cyber-Physical Systems (CPS), rely significantly on the quality of sensing data. With an exponential increase in intelligent applications for urban development and enterprises offering sensing-as-a-service these days, it is imperative to provision for a shared sensing infrastructure for better utilization of resources. However, a shared sensing infrastructure that leverages low-cost sensing devices for a cost-effective solution still remains an unexplored territory. A significant research effort is still needed to make edge-based data shaping solutions more reliable, feature-rich, and cost-effective while addressing the associated challenges in sharing the sensing infrastructure among multiple collocated services with diverse Quality of Service (QoS) requirements. Towards this, we propose a novel edge-based data pre-processing solution, named UniPreCIS, that accounts for the inherent characteristics of low-cost ambient sensors and the exhibited measurement dynamics with respect to application-specific QoS. UniPreCIS aims to identify and select quality data sources by performing sensor ranking and selection followed by multimodal data pre-processing, in order to meet heterogeneous application QoS while reducing the resource consumption footprint for the resource-constrained network edge. As observed, the processing time and memory utilization have been reduced in the proposed approach while achieving up to 90% accuracy, which is arguably significant compared to state-of-the-art techniques for sensing. The effectiveness of UniPreCIS has been evaluated on a testbed for the specific use case of indoor occupancy estimation.
Submitted 1 August, 2022;
originally announced August 2022.
-
A Model-Based Reinforcement Learning Approach for PID Design
Authors:
Hozefa Jesawada,
Amol Yerudkar,
Carmen Del Vecchio,
Navdeep Singh
Abstract:
The proportional-integral-derivative (PID) controller is widely used across various industrial process control applications because of its straightforward implementation. However, it can be challenging to fine-tune the PID parameters in practice to achieve robust performance. The paper proposes a model-based reinforcement learning (RL) framework to design PID controllers leveraging the probabilistic inference for learning control (PILCO) method and Kullback-Leibler divergence (KLD). Since PID controllers have a much more interpretable control structure than a network basis function, an optimal policy given by PILCO is transformed into a set of robust PID tuning parameters for underactuated mechanical systems. The presented method is general and can blend with several model-based and model-free algorithms. The performance of the devised PID controllers is demonstrated with simulation studies for a benchmark cart-pole system under disturbances and system parameter uncertainties.
Submitted 7 June, 2022;
originally announced June 2022.
-
UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Authors:
Francisca Vasconcelos,
Bobby He,
Nalini Singh,
Yee Whye Teh
Abstract:
Implicit neural representations (INRs) have achieved impressive results for scene reconstruction and computer graphics, where their performance has primarily been assessed on reconstruction accuracy. As INRs make their way into other domains, where model predictions inform high-stakes decision-making, uncertainty quantification of INR inference is becoming critical. To that end, we study a Bayesian reformulation of INRs, UncertaINR, in the context of computed tomography, and evaluate several Bayesian deep learning implementations in terms of accuracy and calibration. We find that they achieve well-calibrated uncertainty, while retaining accuracy competitive with other classical, INR-based, and CNN-based reconstruction techniques. Contrary to common intuition in the Bayesian deep learning literature, we find that INRs obtain the best calibration with computationally efficient Monte Carlo dropout, outperforming Hamiltonian Monte Carlo and deep ensembles. Moreover, in contrast to the best-performing prior approaches, UncertaINR does not require a large training dataset, but only a handful of validation images.
Submitted 2 May, 2023; v1 submitted 22 February, 2022;
originally announced February 2022.
-
Single-shot multispectral quantitative phase imaging using deep neural network
Authors:
Sunil Bhatt,
Ankit Butola,
Anand Kumar,
Pramila Thapa,
Akshay Joshi,
Neetu Singh,
Krishna Agarwal,
Dalip Singh Mehta
Abstract:
Multi-spectral quantitative phase imaging (MS-QPI) is a cutting-edge label-free technique to determine the morphological changes, refractive index variations, and spectroscopic information of specimens. The bottleneck in implementing this technique to extract quantitative information is the need for more than two measurements to generate MS-QPI images. We propose a single-shot MS-QPI technique using a highly spatially sensitive digital holographic microscope assisted by a deep neural network (DNN). Our method first acquires the interferometric datasets corresponding to multiple wavelengths (λ=532, 633 and 808 nm used here). The acquired datasets are used to train a generative adversarial network (GAN) to generate multi-spectral quantitative phase maps from a single input interferogram. The network is trained and validated on two different samples, an optical waveguide and MG63 osteosarcoma cells. Further validation of the framework is performed by comparing the predicted phase maps with experimentally acquired and processed multi-spectral phase maps. The current MS-QPI+DNN framework can further empower spectroscopic QPI to improve chemical specificity without complex instrumentation and color cross-talk.
Submitted 5 January, 2022;
originally announced January 2022.
-
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
Authors:
Nikhil Singh,
Jeff Mentch,
Jerry Ng,
Matthew Beveridge,
Iddo Drori
Abstract:
Measuring the acoustic characteristics of a space is often done by capturing its impulse response (IR), a representation of how a full-range stimulus sound excites it. This work generates an IR from a single image, which can then be applied to other signals using convolution, simulating the reverberant characteristics of the space shown in the image. Recording these IRs is both time-intensive and expensive, and often infeasible for inaccessible locations. We use an end-to-end neural network architecture to generate plausible audio impulse responses from single images of acoustic environments. We evaluate our method both by comparisons to ground truth data and by human expert evaluation. We demonstrate our approach by generating plausible impulse responses from diverse settings and formats including well known places, musical halls, rooms in paintings, images from animations and computer games, synthetic environments generated from text, panoramic images, and video conference backgrounds.
Submitted 13 August, 2021; v1 submitted 25 March, 2021;
originally announced March 2021.
-
Parallelized Instantaneous Velocity and Heading Estimation of Objects using Single Imaging Radar
Authors:
Nihal Singh,
Dibakar Sil,
Ankit Sharma
Abstract:
The development of high-resolution imaging radars introduces a plethora of useful applications, particularly in the automotive sector. With increasing attention on active transport safety and autonomous driving, these imaging radars are set to form the core of an autonomous engine. One of the most important tasks of such high-resolution radars is to estimate the instantaneous velocities and heading angles of the detected objects (vehicles, pedestrians, etc.). Feasible estimation methods should be fast enough for real-time scenarios, bias-free, and robust against micro-Dopplers, noise, and other systemic variations. This work proposes a parallel-computing scheme that achieves a real-time and accurate implementation of vector velocity determination using frequency modulated continuous wave (FMCW) radars. The proposed scheme is tested against traffic data collected using an FMCW radar at a center frequency of 78.6 GHz and a bandwidth of 4 GHz. Experiments show that the parallel algorithm presented performs much faster than its conventional counterparts without any loss in precision.
Submitted 23 December, 2020;
originally announced December 2020.
-
Design and Comparative Analysis of a Two-Stage Ultra-Low-Power Subthreshold Operational Amplifier in 180nm, 90nm, and 45nm technology
Authors:
Sumukh Nitundil,
Nihal Singh,
Rushabha Balaji,
Pankaj Arora
Abstract:
In this paper, a two-stage ultra-low-power operational amplifier is designed, and a comparative analysis of the proposed subthreshold complementary amplifier is presented across 180nm, 90nm, and 45nm CMOS technology. The proposed operational amplifier is compared across several different parameters to determine the optimal design. It achieves a maximum gain of around 75 dB and a phase margin of 76°, dissipating just 140 nW with a supply voltage of 0.5 V, which makes it well suited for biomedical applications that require low power and high gain. The proposed operational amplifier has been designed using a SPICE-based circuit simulator.
Submitted 22 December, 2020;
originally announced December 2020.
-
On-board Electrical, Electronics and Pose Estimation System for Hyperloop Pod Design
Authors:
Nihal Singh,
Jay Karhade,
Ishika Bhattacharya,
Prathamesh Saraf,
Plava Kattamuri,
Alivelu Manga Parimi
Abstract:
Hyperloop is a high-speed ground-based transportation system utilizing sealed tubes, with the aim of ultimately transporting passengers between metropolitan cities in efficiently designed autonomous capsules. In recent years, the design and development of sub-scale prototypes for these Hyperloop pods have set the foundation for realizing more practical and scalable pod architectures. This paper proposes a practical, power- and space-optimized on-board electronics architecture, coupled with an end-to-end computationally efficient pose estimation algorithm. Considering the high energy density and discharge rate of on-board batteries, this work additionally presents a robust system for fault detection, protection and management of batteries, along with the design of the surrounding electrical system. Performance evaluation and verification of the proposed algorithms and circuits have been carried out by software simulations using both Python and Simulink.
Submitted 17 December, 2020;
originally announced December 2020.
-
Efficient Kernel based Matched Filter Approach for Segmentation of Retinal Blood Vessels
Authors:
Sushil Kumar Saroj,
Vikas Ratna,
Rakesh Kumar,
Nagendra Pratap Singh
Abstract:
The structure of retinal blood vessels contains information about diseases such as obesity, diabetes, hypertension and glaucoma. This information is very useful in the identification and treatment of these fatal diseases. To obtain this information, the retinal vessels need to be segmented. Many kernel-based methods have been proposed for segmentation of retinal vessels, but their kernels do not match the vessel profile well, which causes poor performance. To overcome this, a new and efficient kernel-based matched filter approach is proposed. The new matched filter is used to generate the matched filter response (MFR) image. We apply the Otsu thresholding method to the obtained MFR image to extract the vessels. We have conducted extensive experiments to choose the best parameter values for the proposed matched filter kernel. The proposed approach has been examined and validated on two datasets available online, DRIVE and STARE. It achieves specificity of 98.50% and 98.23% and accuracy of 95.77% and 95.13% on the DRIVE and STARE datasets, respectively. The results confirm that the proposed method performs better than others. The increased performance is due to the proposed kernel, which matches the retinal blood vessel profile more accurately.
Submitted 7 December, 2020;
originally announced December 2020.
-
Towards Intelligent Reconfigurable Wireless Physical Layer (PHY)
Authors:
Neelam Singh,
S. V. Sai Santosh,
Sumit J. Darak
Abstract:
Next-generation wireless networks are getting significant attention because they promise a 10-fold enhancement in mobile broadband along with the potential to enable new heterogeneous services. These services include massive machine type communications desired for Industry 4.0 along with ultra-reliable low latency services for remote healthcare and vehicular communications. In this paper, we present the design of an intelligent and reconfigurable physical layer (PHY) to bring these services to reality. First, we design and implement the reconfigurable PHY via a hardware-software co-design approach on a system-on-chip consisting of an ARM processor and a field-programmable gate array (FPGA). The reconfigurable PHY is then made intelligent by augmenting it with an online machine learning (OML) based decision-making algorithm. Such a PHY can learn the environment (for example, the wireless channel), dynamically adapt the transceivers' configuration (i.e., modulation scheme, word-length) and select the wireless channel on-the-fly. Since the environment is unknown and changes with time, we make the OML architecture reconfigurable to enable dynamic switching between various OML algorithms on-the-fly. We have demonstrated the functional correctness of the proposed architecture for different environments and word-lengths. The detailed throughput, latency, and complexity analysis validates the feasibility and importance of the proposed intelligent and reconfigurable PHY in next-generation networks.
Submitted 2 December, 2020;
originally announced December 2020.
-
Joint Frequency and Image Space Learning for MRI Reconstruction and Analysis
Authors:
Nalini M. Singh,
Juan Eugenio Iglesias,
Elfar Adalsteinsson,
Adrian V. Dalca,
Polina Golland
Abstract:
We propose neural network layers that explicitly combine frequency and image feature representations and show that they can be used as a versatile building block for reconstruction from frequency space data. Our work is motivated by the challenges arising in MRI acquisition where the signal is a corrupted Fourier transform of the desired image. The proposed joint learning schemes enable both correction of artifacts native to the frequency space and manipulation of image space representations to reconstruct coherent image structures at every layer of the network. This is in contrast to most current deep learning approaches for image reconstruction that treat frequency and image space features separately and often operate exclusively in one of the two spaces. We demonstrate the advantages of joint convolutional learning for a variety of tasks, including motion correction, denoising, reconstruction from undersampled acquisitions, and combined undersampling and motion correction on simulated and real world multicoil MRI data. The joint models produce consistently high quality output images across all tasks and datasets. When integrated into a state of the art unrolled optimization network with physics-inspired data consistency constraints for undersampled reconstruction, the proposed architectures significantly improve the optimization landscape, which yields an order of magnitude reduction of training time. This result suggests that joint representations are particularly well suited for MRI signals in deep learning networks. Our code and pretrained models are publicly available at https://github.com/nalinimsingh/interlacer.
Submitted 17 June, 2022; v1 submitted 2 July, 2020;
originally announced July 2020.
-
Medical Image Generation using Generative Adversarial Networks
Authors:
Nripendra Kumar Singh,
Khalid Raza
Abstract:
Generative adversarial networks (GANs) are an unsupervised deep learning approach from the computer vision community that has gained significant attention over the last few years for identifying the internal structure of multimodal medical imaging data. The adversarial network simultaneously generates realistic medical images and corresponding annotations, which has proven useful in many cases such as image augmentation, image registration, medical image generation, image reconstruction, and image-to-image translation. These properties have attracted the attention of researchers in the field of medical image analysis, and we are witnessing rapid adoption in many novel and traditional applications. This chapter presents state-of-the-art progress in GAN-based clinical applications in medical image generation and cross-modality synthesis. It discusses the various GAN frameworks that have gained popularity in the interpretation of medical images, such as Deep Convolutional GAN (DCGAN), Laplacian GAN (LAPGAN), pix2pix, CycleGAN, and the unsupervised image-to-image translation model (UNIT), which continue to improve their performance by incorporating additional hybrid architectures. Further, it covers some recent applications of these frameworks to image reconstruction and synthesis, along with future research directions in the area.
Submitted 19 May, 2020;
originally announced May 2020.
-
Spectrum Prediction and Interference Detection for Satellite Communications
Authors:
Lissy Pellaco,
Nirankar Singh,
Joakim Jaldén
Abstract:
Spectrum monitoring and interference detection are crucial for the satellite service performance and the revenue of SatCom operators. Interference is one of the major causes of service degradation and deficient operational efficiency. Moreover, the satellite spectrum is becoming more crowded, as more satellites are being launched for different applications. This increases the risk of interference, which causes anomalies in the received signal, and mandates the adoption of techniques that can enable the automatic and real-time detection of such anomalies as a first step towards interference mitigation and suppression. In this paper, we present a Machine Learning (ML)-based approach able to guarantee a real-time and automatic detection of both short-term and long-term interference in the spectrum of the received signal at the base station. The proposed approach can localize the interference both in time and in frequency and is universally applicable across a discrete set of different signal spectra. We present experimental results obtained by applying our method to real spectrum data from the Swedish Space Corporation. We also compare our ML-based approach to a model-based approach applied to the same spectrum data and used as a realistic baseline. Experimental results show that our method is a more reliable interference detector.
Submitted 10 December, 2019;
originally announced December 2019.
-
Interarea Oscillations & Chimera in Power Systems
Authors:
Pratik K. Bajaria,
Sushama R. Wagh,
Navdeep M. Singh
Abstract:
This paper proposes a novel second-order mathematical model in the Kuramoto framework to simulate and study low-frequency oscillations in power systems. This model facilitates a better understanding of the complex dynamics of a power network. A standard four-generator power system with all-to-all connectivity is considered, and results obtained from the proposed model are verified. It is shown that the model simulates various properties related to low-frequency oscillations in power systems which presently are obtained through small-signal analysis. Further, we provide an analogy to blackouts in a power grid by emulating chimera behavior, and thereby discuss the bifurcation analysis of the proposed model.
Submitted 23 November, 2019;
originally announced November 2019.
-
A deep learning system for differential diagnosis of skin diseases
Authors:
Yuan Liu,
Ayush Jain,
Clara Eng,
David H. Way,
Kang Lee,
Peggy Bui,
Kimberly Kanada,
Guilherme de Oliveira Marinho,
Jessica Gallegos,
Sara Gabriele,
Vishakha Gupta,
Nalini Singh,
Vivek Natarajan,
Rainer Hofmann-Wellenhof,
Greg S. Corrado,
Lily H. Peng,
Dale R. Webster,
Dennis Ai,
Susan Huang,
Yun Liu,
R. Carter Dunn,
David Coz
Abstract:
Skin conditions affect an estimated 1.9 billion people worldwide. A shortage of dermatologists causes long wait times and leads patients to seek dermatologic care from general practitioners. However, the diagnostic accuracy of general practitioners has been reported to be only 0.24-0.70 (compared to 0.77-0.96 for dermatologists), resulting in referral errors, delays in care, and errors in diagnosis and treatment. In this paper, we developed a deep learning system (DLS) to provide a differential diagnosis of skin conditions for clinical cases (skin photographs and associated medical histories). The DLS distinguishes between 26 skin conditions that represent roughly 80% of the volume of skin conditions seen in primary care. The DLS was developed and validated using de-identified cases from a teledermatology practice serving 17 clinical sites via a temporal split: the first 14,021 cases for development and the last 3,756 cases for validation. On the validation set, where a panel of three board-certified dermatologists defined the reference standard for every case, the DLS achieved 0.71 and 0.93 top-1 and top-3 accuracies respectively. For a random subset of the validation set (n=963 cases), 18 clinicians reviewed the cases for comparison. On this subset, the DLS achieved a 0.67 top-1 accuracy, non-inferior to board-certified dermatologists (0.63, p<0.001), and higher than primary care physicians (PCPs, 0.45) and nurse practitioners (NPs, 0.41). The top-3 accuracy showed a similar trend: 0.90 DLS, 0.75 dermatologists, 0.60 PCPs, and 0.55 NPs. These results highlight the potential of the DLS to augment general practitioners to accurately diagnose skin conditions by suggesting differential diagnoses that may not have been considered. Future work will be needed to prospectively assess the clinical impact of using this tool in actual clinical workflows.
Submitted 11 September, 2019;
originally announced September 2019.
-
Disturbance Decoupling and Instantaneous Fault Detection in Boolean Control Networks
Authors:
S Sutavani,
K Sonam,
S Wagh,
N Singh
Abstract:
The literature available on disturbance decoupling (DD) of Boolean control networks (BCNs) is built on a restrictive notion of what constitutes disturbance decoupling. The results available on necessary and sufficient conditions are of limited applicability because of their stringent requirements. This work tries to expand the notion of DD in BCNs to incorporate a larger number of systems deemed unsuitable for DD. The methods available are further restrictive in the sense that the system is forced to follow a trajectory unaffected by the disturbances rather than decoupling disturbances while the system follows its natural course. Some sufficient conditions are provided under which the problem can be addressed. This work tries to establish the notion of disturbance decoupling via feedback control, analogous to classical control theory. This approach, though, is not limited to DD problems and can be extended to general control problems of BCNs. Determination of observability, which is sufficient for fault detection, is proven to be NP-hard for BCNs. Algorithms based on reconstructability of a BCN, a necessary condition, turn out to be of exponential complexity in general. In such cases it makes sense to search for some special structure in the BCN that could be utilized for fault detection with minimal computational effort. An attempt is made to address this problem by introducing instantaneous fault detection (IFD) and providing necessary and sufficient conditions for the same. Later, necessary and sufficient conditions are proposed for solving the problem of instantaneous fault detection along with disturbance decoupling using a single controller.
Submitted 8 August, 2019;
originally announced August 2019.
-
A Tour of Unsupervised Deep Learning for Medical Image Analysis
Authors:
Khalid Raza,
Nripendra Kumar Singh
Abstract:
Interpretation of medical images for diagnosis and treatment of complex disease from high-dimensional and heterogeneous data remains a key challenge in transforming healthcare. In the last few years, both supervised and unsupervised deep learning have achieved promising results in the area of medical imaging and image analysis. Unlike supervised learning, which is biased by how it is supervised and requires manual effort to create class labels for the algorithm, unsupervised learning derives insights directly from the data itself, groups the data and helps to make data-driven decisions without any external bias. This review systematically presents various unsupervised models applied to medical image analysis, including autoencoders and their several variants, restricted Boltzmann machines, deep belief networks, deep Boltzmann machines and generative adversarial networks. Future research opportunities and challenges of unsupervised techniques for medical image analysis are also discussed.
Submitted 19 December, 2018;
originally announced December 2018.
-
Analysis of Average Consensus Algorithm for Asymmetric Regular Networks
Authors:
Sateeshkrishna Dhuli,
Y. N. Singh
Abstract:
Average consensus algorithms compute the global average of sensor data in a distributed fashion using local sensor nodes. Simple execution and a decentralized philosophy make these algorithms suitable for WSN scenarios. Most researchers have studied average consensus algorithms by modeling the network as an undirected graph. However, WSNs in practice contain asymmetric links, which an undirected graph cannot model. Therefore, these studies fail to capture the actual performance of consensus algorithms on WSNs. In this paper, we model the WSN as a directed graph and derive the explicit formulas of the ring, torus, $r$-nearest neighbor ring, and $m$-dimensional torus networks. Numerical results subsequently demonstrate the accuracy of directed graph modeling. Further, we study the effect of asymmetric links, the number of nodes, network dimension, and node overhead on the convergence rate of average consensus algorithms.
Submitted 18 August, 2018; v1 submitted 11 June, 2018;
originally announced June 2018.
-
Differential passivity like properties for a class of nonlinear systems
Authors:
Krishna Chaitanya Kosaraju,
Venkatesh Chinde,
Ramkrishna Pasumarthy,
N M Singh
Abstract:
In this paper, we derive new passive maps, akin to incremental passive maps, for a class of nonlinear systems using dynamic feedback and Krasovskii's method. Further, using the passive maps, we present a control methodology for stabilization to a desired operating point. This work is illustrated by designing a controller for a nonlinear building heating, ventilating and air conditioning (HVAC) subsystem.
Submitted 2 May, 2018;
originally announced May 2018.
-
Analysis of Network Robustness for Finite Sized Wireless Sensor Networks
Authors:
Sateeshkrishna Dhuli,
Chakravarthy Gopi,
Yatindra Nath Singh
Abstract:
Studying network robustness for wireless sensor networks (WSNs) is an exciting topic of research, as sensor nodes often fail due to hardware degradation, resource constraints, and environmental changes. The application of spectral graph theory to networked systems has generated several important results. However, previous research has often failed to consider the network parameters, which are crucial for studying real network applications. Network criticality is one of the effective metrics for quantifying network robustness against such failures and attacks. In this work, we derive exact formulas for the network criticality of WSNs using r-nearest neighbor networks, and we show the effect of nearest neighbors and network dimension on robustness using analytical and numerical evaluations. Furthermore, we show how symmetric and static approximations can wrongly characterize network robustness when applied to WSNs.
Submitted 1 April, 2021; v1 submitted 22 September, 2016;
originally announced September 2016.
-
Complex Laplacian based Distributed Control for Multi-Agent Network
Authors:
Aniket Deshpande,
Pushpak Jagtap,
Prashant Bansode,
Arun Mahindrakar,
Navadeep Singh
Abstract:
This paper proposes a complex Laplacian-based distributed control scheme for convergence in a multi-agent network. The proposed scheme is designated the cascade formulation. The technique exploits the traditional method of organizing large scattered networks into smaller interconnected clusters to optimize information flow within the network. The complex Laplacian-based approach results in a hierarchical structure, with the formation of a meta-cluster leading the other clusters in the network. The proposed formulation provides the flexibility to constrain the eigenspectra of the overall closed-loop dynamics, ensuring a desired convergence rate and control input intensity. Sufficient conditions ensuring globally stable formation for the proposed formulation are also given. Robustness of the proposed formulation to uncertainties such as loss of communication links and actuator failure is also discussed. The effectiveness of the proposed approach is illustrated by simulating a finitely large network of thirty vehicles.
Submitted 12 July, 2018; v1 submitted 18 September, 2016;
originally announced September 2016.
-
Convergence Analysis for Regular Wireless Consensus Networks
Authors:
Sateeshkrishna Dhuli,
Kumar Gaurav,
Y. N. Singh
Abstract:
Average consensus algorithms can be implemented over wireless sensor networks (WSNs), where global statistics can be computed using local communications among sensor nodes. Simple execution, robustness to global topology changes due to frequent node failures, and an underlying distributed philosophy have made consensus algorithms well suited to WSNs. Since these algorithms are iterative in nature, their performance is characterized by convergence speed. We study the convergence of average consensus algorithms for WSNs using regular graphs. We obtain analytical expressions for the optimal consensus and convergence parameters, which determine the convergence time, for r-nearest neighbor cycle and torus networks. We also derive the generalized expression for the optimal consensus and convergence parameters for m-dimensional r-nearest neighbor torus networks. The analytical results agree with the simulation results and show the effect of network dimension, number of nodes and transmission radius on convergence time. This work provides the basic analytical tools for managing and controlling the performance of average consensus algorithms in finite-sized practical networks.
Submitted 22 September, 2016; v1 submitted 4 November, 2014;
originally announced November 2014.