
Showing 1–37 of 37 results for author: Singh, V

Searching in archive eess.
  1. arXiv:2406.09999 [pdf, other]

    eess.AS

    ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR

    Authors: Vishwanath Pratap Singh, Federico Malato, Ville Hautamaki, Md. Sahidullah, Tomi Kinnunen

    Abstract: While automatic speech recognition (ASR) greatly benefits from data augmentation, the augmentation recipes themselves tend to be heuristic. In this paper, we address one of the heuristic approaches associated with balancing the right amount of augmented data in ASR training by introducing a reinforcement learning (RL) based dynamic adjustment of original-to-augmented data ratio (OAR). Unlike the fix…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted: Interspeech 2024

    Journal ref: Interspeech 2024

  2. arXiv:2405.18297 [pdf, other]

    eess.SP

    Artificial Intelligence Satellite Telecommunication Testbed using Commercial Off-The-Shelf Chipsets

    Authors: Luis M. Garces, Amirhossein Nik, Flor Ortiz, Juan A. Vásquez-Peralvo, Jorge L. Gonzalez, Mouhamad Chehailty, Marcele Kuhfuss, Eva Lagunas, Jan Thoemel, Sumit Kumar, Vishal Singh, Juan C. Duncan, Sahar Malmir, Swetha Varadajulu, Jorge Querol, Symeon Chatzinotas

    Abstract: The Artificial Intelligence Satellite Telecommunications Testbed (AISTT), part of the ESA project SPAICE, is focused on the transformation of the satellite payload by using artificial intelligence (AI) and machine learning (ML) methodologies over available commercial off-the-shelf (COTS) AI chips for on-board processing. The objectives include validating artificial intelligence-driven SATCOM scena…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Submitted to SPAICE Conference 2024: AI in and for Space, 5 pages, 3 figures

    Journal ref: SPAICE Conference 2024

  3. arXiv:2402.15214 [pdf, other]

    eess.AS cs.SD

    ChildAugment: Data Augmentation Methods for Zero-Resource Children's Speaker Verification

    Authors: Vishwanath Pratap Singh, Md Sahidullah, Tomi Kinnunen

    Abstract: The accuracy of modern automatic speaker verification (ASV) systems, when trained exclusively on adult data, drops substantially when applied to children's speech. The scarcity of children's speech corpora hinders fine-tuning ASV systems for children's speech. Hence, there is a timely need to explore more effective ways of reusing adults' speech data. One promising approach is to align vocal-tract…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: The following article has been accepted by The Journal of the Acoustical Society of America (JASA). After it is published, it will be found at https://pubs.aip.org/asa/jasa

  4. arXiv:2402.06463 [pdf, other]

    eess.IV cs.CV cs.LG

    Cardiac ultrasound simulation for autonomous ultrasound navigation

    Authors: Abdoul Aziz Amadou, Laura Peralta, Paul Dryburgh, Paul Klein, Kaloian Petkov, Richard James Housden, Vivek Singh, Rui Liao, Young-Ho Kim, Florin Christian Ghesu, Tommaso Mansi, Ronak Rajani, Alistair Young, Kawal Rhode

    Abstract: Ultrasound is well-established as an imaging modality for diagnostic and interventional purposes. However, the image quality varies with operator skills as acquiring and interpreting ultrasound images requires extensive training due to the imaging artefacts, the range of acquisition parameters and the variability of patient anatomies. Automating the image acquisition task could improve acquisition…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 24 pages, 10 figures, 5 tables

    ACM Class: I.6.0; I.5.4; J.3

  5. EEG-Based Reaction Time Prediction with Fuzzy Common Spatial Patterns and Phase Cohesion using Deep Autoencoder Based Data Fusion

    Authors: Vivek Singh, Tharun Kumar Reddy

    Abstract: Drowsiness state of a driver is a topic of extensive discussion due to its significant role in causing traffic accidents. This research presents a novel approach that combines Fuzzy Common Spatial Patterns (CSP) optimised Phase Cohesive Sequence (PCS) representations and fuzzy CSP-optimized signal amplitude representations. The research aims to examine alterations in Electroencephalogram (EEG) syn…

    Submitted 1 December, 2023; originally announced December 2023.

  6. arXiv:2311.08689 [pdf, other]

    eess.SP cs.AR

    Low Complexity High Speed Deep Neural Network Augmented Wireless Channel Estimation

    Authors: Syed Asrar ul haq, Varun Singh, Bhanu Teja Tanaji, Sumit Darak

    Abstract: The channel estimation (CE) in wireless receivers is one of the most critical and computationally complex signal processing operations. Recently, various works have shown that the deep learning (DL) based CE outperforms conventional minimum mean square error (MMSE) based CE, and it is hardware-friendly. However, DL-based CE has higher complexity and latency than popularly used least square (LS) ba…

    Submitted 14 November, 2023; originally announced November 2023.

  7. arXiv:2309.15750 [pdf, other]

    eess.IV cs.CV

    Automated CT Lung Cancer Screening Workflow using 3D Camera

    Authors: Brian Teixeira, Vivek Singh, Birgi Tamersoy, Andreas Prokein, Ankur Kapoor

    Abstract: Despite recent developments in CT planning that enabled automation in patient positioning, time-consuming scout scans are still needed to compute dose profile and ensure the patient is properly positioned. In this paper, we present a novel method which eliminates the need for scout scans in CT lung cancer screening by estimating patient scan range, isocenter, and Water Equivalent Diameter (WED) fr…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted at MICCAI 2023

  8. pyParaOcean: A System for Visual Analysis of Ocean Data

    Authors: Toshit Jain, Varun Singh, Vijay Kumar Boda, Upkar Singh, Ingrid Hotz, P. N. Vinayachandran, Vijay Natarajan

    Abstract: Visual analysis is well adopted within the field of oceanography for the analysis of model simulations, detection of different phenomena and events, and tracking of dynamic processes. With increasing data sizes and the availability of multivariate dynamic data, there is a growing need for scalable and extensible tools for visualization and interactive exploration. We describe pyParaOcean, a visual…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 8 pages, EnvirVis2023

    ACM Class: F.7; I.3.6

    Journal ref: envirvis2023

  9. arXiv:2308.09106 [pdf]

    eess.SY

    Optimal Closed Loop Control of G2V/V2G Action Using Model Predictive Controller

    Authors: Satya Vikram Pratap Singh, Siddharth Kamila, Prashanth Agnihotri

    Abstract: This paper has developed a closed-loop control algorithm to operate the G2V/V2G action, tested under varying battery voltage conditions and load and source power differences. Under V2G action, to maintain total harmonic distortion under minimum level and grid frequency under the standard limit, a Model predictive controller (MPC) has been used to control the gate driver circuit of the inverter. Th…

    Submitted 11 October, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: ©2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  10. arXiv:2308.09046 [pdf]

    eess.SY

    Fault Detection and Classification using Wavelet and ANN in DFIG and TCSC Connected Transmission Line

    Authors: Satya Vikram Pratap Singh, Tanu Prasad, Siddharth Kamila, Prashant Agnihotri

    Abstract: This paper presents fault detection and classification using Wavelet and ANN based methods in a DFIG-based series compensated system. The state-of-the-art methods include Wavelet transform, Fourier transform, and Wavelet-neuro fuzzy methods-based system for fault detection and classification. However, the accuracy of these state-of-the-art methods diminishes during variable conditions such as chan…

    Submitted 17 August, 2023; originally announced August 2023.

  11. arXiv:2306.07501 [pdf, other]

    eess.AS cs.SD

    Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech

    Authors: Vishwanath Pratap Singh, Md Sahidullah, Tomi Kinnunen

    Abstract: In this paper, we study the impact of the ageing on modern deep speaker embedding based automatic speaker verification (ASV) systems. We have selected two different datasets to examine ageing on the state-of-the-art ECAPA-TDNN system. The first dataset, used for addressing short-term ageing (up to 10 years time difference between enrollment and test) under uncontrolled conditions, is VoxCeleb. The…

    Submitted 12 June, 2023; originally announced June 2023.

    Journal ref: Interspeech 2023

  12. arXiv:2305.03546 [pdf, other]

    eess.IV cs.CV

    Breast Cancer Immunohistochemical Image Generation: a Benchmark Dataset and Challenge Review

    Authors: Chuang Zhu, Shengjie Liu, Zekuan Yu, Feng Xu, Arpit Aggarwal, Germán Corredor, Anant Madabhushi, Qixun Qu, Hongwei Fan, Fangda Li, Yueheng Li, Xianchao Guan, Yongbing Zhang, Vivek Kumar Singh, Farhan Akram, Md. Mostafa Kamal Sarker, Zhongyue Shi, Mulan **

    Abstract: For invasive breast cancer, immunohistochemical (IHC) techniques are often used to detect the expression level of human epidermal growth factor receptor-2 (HER2) in breast tissue to formulate a precise treatment plan. From the perspective of saving manpower, material and time costs, directly generating IHC-stained images from Hematoxylin and Eosin (H&E) stained images is a valuable research direct…

    Submitted 22 September, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 12 pages, 12 figures, 2 tables

  13. arXiv:2303.15852 [pdf]

    eess.IV cs.CV cs.LG

    Exploring Deep Learning Methods for Classification of SAR Images: Towards NextGen Convolutions via Transformers

    Authors: Aakash Singh, Vivek Kumar Singh

    Abstract: Images generated by high-resolution SAR have vast areas of application as they can work better in adverse light and weather conditions. One such area of application is in the military systems. This study is an attempt to explore the suitability of current state-of-the-art models introduced in the domain of computer vision for SAR target classification (MSTAR). Since the application of any solution…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 6 pages, 9 figures

    Journal ref: In Advanced Network Technologies and Intelligent Computing: Second International Conference, ANTIC 2022, Varanasi, India, December 22–24, 2022, Proceedings, Part II, pp. 249–260. Cham: Springer Nature Switzerland

  14. arXiv:2303.04584 [pdf, other]

    math.OC cs.IT eess.SP eess.SY math.ST

    Estimating a scalar log-concave random variable, using a silence set based probabilistic sampling

    Authors: Maben Rabi, Junfeng Wu, Vyoma Singh, Karl Henrik Johansson

    Abstract: We study the probabilistic sampling of a random variable, in which the variable is sampled only if it falls outside a given set, which is called the silence set. This helps us to understand optimal event-based sampling for the special case of IID random processes, and also to understand the design of a sub-optimal scheme for other cases. We consider the design of this probabilistic sampling for a…

    Submitted 16 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in the 2023 American Control Conference

  15. Modeling and Analysis of Multiple Electrostatic Actuators on the Response of Vibrotactile Haptic Device

    Authors: Santosh Mohan Rajkumar, Kumar Vikram Singh, Jeong-Hoi Koo

    Abstract: In this research, modeling and analysis of a beam-type touchscreen interface with multiple actuators is considered. As thin beams, a mechanical model of a touch screen system is developed with embedded electrostatic actuators at different spatial locations. This discrete finite element-based model is developed to compute the analytical and numerical vibrotactile response due to multiple actuators…

    Submitted 14 February, 2023; originally announced March 2023.

    Journal ref: ASME International Mechanical Engineering Congress and Exposition 2022

  16. arXiv:2209.11675 [pdf]

    cs.NI eess.SY

    An analysis of the Internet of Things in wireless sensor network technologies

    Authors: Harshit Poddar, Vansh Singh

    Abstract: Information may be accessed from a distance thanks to computer networks. Wireless or wired networks are also possible. Due to recent developments in wireless infrastructure, wireless sensor networks (WSNs) were developed. Activities or events occurring in the environment are monitored, recorded, and managed by WSN. Through a variety of routing techniques, data relaying is done in these systems. Th…

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: 8 pages, 13 figures, 3 tables, preprint

  17. arXiv:2209.11230 [pdf]

    eess.IV cs.CV

    A Trio-Method for Retinal Vessel Segmentation using Image Processing

    Authors: Mahendra Kumar Gourisaria, Vinayak Singh, Manoj Sahni

    Abstract: Inner Retinal neurons are a most essential part of the retina and they are supplied with blood via retinal vessels. This paper primarily focuses on the segmentation of retinal vessels using a triple preprocessing approach. DRIVE database was taken into consideration and preprocessed by Gabor Filtering, Gaussian Blur, and Edge Detection by Sobel and Pruning. Segmentation was driven out by 2 propose…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted at 26th UK Conference on Medical Image Understanding and Analysis (MIUA-2022) (Abstract short paper)

  18. arXiv:2207.10284 [pdf, other]

    cs.LG cs.CL eess.SP

    Multi Resolution Analysis (MRA) for Approximate Self-Attention

    Authors: Zhanpeng Zeng, Sourav Pal, Jeffery Kline, Glenn M Fung, Vikas Singh

    Abstract: Transformers have emerged as a preferred model for many tasks in natural language processing and vision. Recent efforts on training and deploying Transformers more efficiently have identified many strategies to approximate the self-attention matrix, a key module in a Transformer architecture. Effective ideas include various prespecified sparsity patterns, low-rank basis expansions and combination…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: ICML2022

  19. arXiv:2206.11520 [pdf, other]

    cs.CV eess.IV

    ICOS Protein Expression Segmentation: Can Transformer Networks Give Better Results?

    Authors: Vivek Kumar Singh, Paul O Reilly, Jacqueline James, Manuel Salto Tellez, Perry Maxwell

    Abstract: Biomarkers identify a patient's response to treatment. With the recent advances in artificial intelligence based on the Transformer networks, only limited research has been done to measure the performance on challenging histopathology images. In this paper, we investigate the efficacy of the numerous state-of-the-art Transformer networks for immune-checkpoint biomarker, Inducible Tcell COS…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted MIUA conference (Abstract short paper)

  20. arXiv:2205.01640 [pdf]

    physics.soc-ph eess.SY

    Adaptive Traffic Signal Control for Developing Countries Using Fused Parameters Derived from Crowd-Source Data

    Authors: Sumit Mishra, Vishal Singh, Ankit Gupta, Devanjan Bhattacharya, Abhisek Mudgal

    Abstract: Advancement of mobile technologies has enabled economical collection, storage, processing, and sharing of traffic data. These data are made accessible to intended users through various application program interfaces (API) and can be used to recognize and mitigate congestion in real time. In this paper, quantitative (time of arrival) and qualitative (color-coded congestion levels) data were acquire…

    Submitted 11 March, 2022; originally announced May 2022.

    Comments: 15 pages, 11 figures, 7 tables, Accepted by Transportation Letters: the International Journal of Transportation Research

  21. arXiv:2203.06600 [pdf, other]

    eess.AS eess.SP

    Spectral Modification Based Data Augmentation For Improving End-to-End ASR For Children's Speech

    Authors: Vishwanath Pratap Singh, Hardik Sailor, Supratik Bhattacharya, Abhishek Pandey

    Abstract: Training a robust Automatic Speech Recognition (ASR) system for children's speech recognition is a challenging task due to inherent differences in acoustic attributes of adult and child speech and scarcity of publicly available children's speech dataset. In this paper, a novel segmental spectrum warping and perturbations in formant energy are introduced, to generate a children-like speech spectrum…

    Submitted 13 March, 2022; originally announced March 2022.

  22. arXiv:2112.01025 [pdf, other]

    eess.AS cs.CL cs.SD

    A Mixture of Expert Based Deep Neural Network for Improved ASR

    Authors: Vishwanath Pratap Singh, Shakti P. Rath, Abhishek Pandey

    Abstract: This paper presents a novel deep learning architecture for acoustic model in the context of Automatic Speech Recognition (ASR), termed as MixNet. Besides the conventional layers, such as fully connected layers in DNN-HMM and memory cells in LSTM-HMM, the model uses two additional layers based on Mixture of Experts (MoE). The first MoE layer operating at the input is based on pre-defined broad phon…

    Submitted 2 December, 2021; originally announced December 2021.

  23. arXiv:2112.01023 [pdf, other]

    eess.AS cs.SD

    A higher order Minkowski loss for improved prediction ability of acoustic model in ASR

    Authors: Vishwanath Pratap Singh, Shakti P. Rath, Abhishek Pandey

    Abstract: Conventional automatic speech recognition (ASR) systems use second-order Minkowski loss during inference time, which is suboptimal as it incorporates only first order statistics in posterior estimation [2]. In this paper we have proposed higher order Minkowski loss (4th Order and 6th Order) during inference time, without any changes during training time. The main contribution of the paper is to sho…

    Submitted 2 December, 2021; originally announced December 2021.

  24. arXiv:2106.07972 [pdf]

    eess.AS cs.SD

    SRIB Submission to Interspeech 2021 DiCOVA Challenge

    Authors: Vishwanath Pratap Singh, Shashi Kumar, Ravi Shekhar Jha, Abhishek Pandey

    Abstract: The COVID-19 pandemic has resulted in more than 125 million infections and more than 2.7 million casualties. In this paper, we attempt to classify covid vs non-covid cough sounds using signal processing and deep learning methods. Air turbulence, the vibration of tissues, movement of fluid through airways, opening, and closure of glottis are some of the causes for the production of the acoustic sou…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: 5 pages, 5 figures

  25. arXiv:2102.10640 [pdf, other]

    eess.IV cs.CV cs.LG

    Tchebichef Transform Domain-based Deep Learning Architecture for Image Super-resolution

    Authors: Ahlad Kumar, Harsh Vardhan Singh

    Abstract: The recent outbreak of COVID-19 has motivated researchers to contribute in the area of medical imaging using artificial intelligence and deep learning. Super-resolution (SR), in the past few years, has produced remarkable results using deep learning methods. The ability of deep learning methods to learn the non-linear mapping from low-resolution (LR) images to their corresponding high-resolution (…

    Submitted 22 February, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: 11 pages, 12 figures, 53 references

  26. arXiv:2010.03199 [pdf, other]

    eess.IV cs.CV

    WDN: A Wide and Deep Network to Divide-and-Conquer Image Super-resolution

    Authors: Vikram Singh, Anurag Mittal

    Abstract: Divide and conquer is an established algorithm design paradigm that has proven itself to solve a variety of problems efficiently. However, it is yet to be fully explored in solving problems with a neural network, particularly the problem of image super-resolution. In this work, we propose an approach to divide the problem of image super-resolution into multiple sub-problems and then solve/conquer…

    Submitted 7 October, 2020; originally announced October 2020.

    MSC Class: 68T07 (Primary) 68T45; 68U10 (Secondary) ACM Class: I.4.3

  27. arXiv:2008.05060 [pdf, other]

    cs.CV cs.LG eess.SP stat.ML

    Online Graph Completion: Multivariate Signal Recovery in Computer Vision

    Authors: Won Hwa Kim, Mona Jalal, Seongjae Hwang, Sterling C. Johnson, Vikas Singh

    Abstract: The adoption of "human-in-the-loop" paradigms in computer vision and machine learning is leading to various applications where the actual data acquisition (e.g., human supervision) and the underlying inference algorithms are closely intertwined. While classical work in active learning provides effective solutions when the learning module involves classification and regression tasks, many practical…

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 9 pages, 7 figures, CVPR 2017 Conference

  28. arXiv:2006.16848 [pdf]

    eess.SP cs.LG cs.NE stat.ML

    Modeling and Uncertainty Analysis of Groundwater Level Using Six Evolutionary Optimization Algorithms Hybridized with ANFIS, SVM, and ANN

    Authors: Akram Seifi, Mohammad Ehteram, Vijay P. Singh, Amir Mosavi

    Abstract: In the present study, six meta-heuristic schemes are hybridized with artificial neural network (ANN), adaptive neuro-fuzzy interface system (ANFIS), and support vector machine (SVM), to predict monthly groundwater level (GWL), evaluate uncertainty analysis of predictions and spatial variation analysis. The six schemes, including grasshopper optimization algorithm (GOA), cat swarm optimization (CSO…

    Submitted 28 June, 2020; originally announced June 2020.

    Comments: 42 pages, 11 figures

    MSC Class: 68T07

    Journal ref: Sustainability 2020, 12, 4023

  29. arXiv:2005.04258 [pdf, other]

    cs.CV cs.LG eess.IV

    View Invariant Human Body Detection and Pose Estimation from Multiple Depth Sensors

    Authors: Walid Bekhtaoui, Ruhan Sa, Brian Teixeira, Vivek Singh, Klaus Kirchberg, Yao-jen Chang, Ankur Kapoor

    Abstract: Point cloud based methods have produced promising results in areas such as 3D object detection in autonomous driving. However, most of the recent point cloud work focuses on single depth sensor data, whereas less work has been done on indoor monitoring applications, such as operation room monitoring in hospitals or indoor surveillance. In these scenarios multiple cameras are often used to tackle o…

    Submitted 8 May, 2020; originally announced May 2020.

  30. arXiv:2001.01277 [pdf, other]

    eess.IV cs.CV

    Automated Segmentation of Vertebrae on Lateral Chest Radiography Using Deep Learning

    Authors: Sanket Badhe, Varun Singh, Joy Li, Paras Lakhani

    Abstract: The purpose of this study is to develop an automated algorithm for thoracic vertebral segmentation on chest radiography using deep learning. 124 de-identified lateral chest radiographs on unique patients were obtained. Segmentations of visible vertebrae were manually performed by a medical student and verified by a board-certified radiologist. 74 images were used for training, 10 for validation, a…

    Submitted 5 January, 2020; originally announced January 2020.

    Comments: 10 pages, Accepted Poster presentation at Conference on Machine Intelligence in Medical Imaging 2018

  31. arXiv:1911.08616 [pdf, other]

    cs.CV eess.IV

    Attention Guided Anomaly Localization in Images

    Authors: Shashanka Venkataramanan, Kuan-Chuan Peng, Rajat Vikram Singh, Abhijit Mahalanobis

    Abstract: Anomaly localization is an important problem in computer vision which involves localizing anomalous regions within images with applications in industrial inspection, surveillance, and medical imaging. This task is challenging due to the small sample size and pixel coverage of the anomaly in real-world scenarios. Most prior works need to use anomalous training images to compute a class-specific thr…

    Submitted 16 July, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted to ECCV 2020

  32. arXiv:1908.08074 [pdf, other]

    eess.IV cs.CV

    DUAL-GLOW: Conditional Flow-Based Generative Model for Modality Transfer

    Authors: Haoliang Sun, Ronak Mehta, Hao H. Zhou, Zhichun Huang, Sterling C. Johnson, Vivek Prabhakaran, Vikas Singh

    Abstract: Positron emission tomography (PET) imaging is an imaging modality for diagnosing a number of neurological diseases. In contrast to Magnetic Resonance Imaging (MRI), PET is costly and involves injecting a radioactive substance into the patient. Motivated by developments in modality transfer in vision, we study the generation of certain types of PET images from MRI data. We derive new flow-based gen…

    Submitted 21 August, 2019; originally announced August 2019.

    Journal ref: ICCV 2019

  33. arXiv:1907.02742 [pdf, other]

    eess.IV cs.CV

    Adversarial Learning with Multiscale Features and Kernel Factorization for Retinal Blood Vessel Segmentation

    Authors: Farhan Akram, Vivek Kumar Singh, Hatem A. Rashwan, Mohamed Abdel-Nasser, Md. Mostafa Kamal Sarker, Nidhi Pandey, Domenec Puig

    Abstract: In this paper, we propose an efficient blood vessel segmentation method for the eye fundus images using adversarial learning with multiscale features and kernel factorization. In the generator network of the adversarial framework, spatial pyramid pooling, kernel factorization and squeeze excitation block are employed to enhance the feature representation in spatial domain on different scales with…

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: 9 pages, 4 figures

  34. arXiv:1907.00887 [pdf, other]

    eess.IV cs.CV

    An Efficient Solution for Breast Tumor Segmentation and Classification in Ultrasound Images Using Deep Adversarial Learning

    Authors: Vivek Kumar Singh, Hatem A. Rashwan, Mohamed Abdel-Nasser, Md. Mostafa Kamal Sarker, Farhan Akram, Nidhi Pandey, Santiago Romani, Domenec Puig

    Abstract: This paper proposes an efficient solution for tumor segmentation and classification in breast ultrasound (BUS) images. We propose to add an atrous convolution layer to the conditional generative adversarial network (cGAN) segmentation model to learn tumor features at different resolutions of BUS images. To automatically re-balance the relative impact of each of the highest level encoded features,…

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 9 pages

  35. arXiv:1907.00856 [pdf, other]

    eess.IV cs.CV

    SLSNet: Skin lesion segmentation using a lightweight generative adversarial network

    Authors: Md. Mostafa Kamal Sarker, Hatem A. Rashwan, Farhan Akram, Vivek Kumar Singh, Syeda Furruka Banu, Forhad U H Chowdhury, Kabir Ahmed Choudhury, Sylvie Chambon, Petia Radeva, Domenec Puig, Mohamed Abdel-Nasser

    Abstract: The determination of precise skin lesion boundaries in dermoscopic images using automated methods faces many challenges, most importantly, the presence of hair, inconspicuous lesion edges and low contrast in dermoscopic images, and variability in the color, texture and shapes of skin lesions. Existing deep learning-based skin lesion segmentation algorithms are expensive in terms of computational t…

    Submitted 17 June, 2021; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: Accepted in Expert Systems with Applications

  36. arXiv:1811.03343 [pdf, other]

    cs.CV eess.IV

    Repetitive Motion Estimation Network: Recover cardiac and respiratory signal from thoracic imaging

    Authors: Xiaoxiao Li, Vivek Singh, Yifan Wu, Klaus Kirchberg, James Duncan, Ankur Kapoor

    Abstract: Tracking organ motion is important in image-guided interventions, but motion annotations are not always easily available. Thus, we propose Repetitive Motion Estimation Network (RMEN) to recover cardiac and respiratory signals. It learns the spatio-temporal repetition patterns, embedding high dimensional motion manifolds to 1D vectors with partial motion phase boundary annotations. Compared with th…

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted by NIPS workshop MED-NIPS 2018

  37. arXiv:1306.5412 [pdf]

    eess.SY

    Electronically Tunable Voltage-Mode Biquad Filter/Oscillator Based On CCCCTAs

    Authors: Sajai Vir Singh, Gungan Gupta, Rahul Chhabra, Kanika Nagpal, Devansh

    Abstract: In this paper, a circuit employing current controlled current conveyor trans-conductance amplifiers (CCCCTAs) as active element is proposed which can function both as biquad filter and oscillator. It uses two CCCCTAs and two capacitors. As a biquad filter it can realize all the standard filtering functions (low pass, band pass, high pass, band reject and all pass) in voltage-mode and provides the…

    Submitted 23 June, 2013; originally announced June 2013.

    Comments: 5 pages, 7 figures, 1 table, Authors profile