Search | arXiv e-print repository

HemSeg-200: A Voxel-Annotated Dataset for Intracerebral Hemorrhages Segmentation in Brain CT Scans

Authors: Changwei Song, Qing Zhao, Jianqiang Li, Xin Yue, Ruoyun Gao, Zhaoxuan Wang, An Gao, Guanghui Fu

Abstract: Acute intracerebral hemorrhage is a life-threatening condition that demands immediate medical intervention. Intraparenchymal hemorrhage (IPH) and intraventricular hemorrhage (IVH) are critical subtypes of this condition. Clinically, when such hemorrhages are suspected, immediate CT scanning is essential to assess the extent of the bleeding and to facilitate the formulation of a targeted treatment… ▽ More Acute intracerebral hemorrhage is a life-threatening condition that demands immediate medical intervention. Intraparenchymal hemorrhage (IPH) and intraventricular hemorrhage (IVH) are critical subtypes of this condition. Clinically, when such hemorrhages are suspected, immediate CT scanning is essential to assess the extent of the bleeding and to facilitate the formulation of a targeted treatment plan. While current research in deep learning has largely focused on qualitative analyses, such as identifying subtypes of cerebral hemorrhages, there remains a significant gap in quantitative analysis crucial for enhancing clinical treatments. Addressing this gap, our paper introduces a dataset comprising 222 CT annotations, sourced from the RSNA 2019 Brain CT Hemorrhage Challenge and meticulously annotated at the voxel level for precise IPH and IVH segmentation. This dataset was utilized to train and evaluate seven advanced medical image segmentation algorithms, with the goal of refining the accuracy of segmentation for these hemorrhages. Our findings demonstrate that this dataset not only furthers the development of sophisticated segmentation algorithms but also substantially aids scientific research and clinical practice by improving the diagnosis and management of these severe hemorrhages. Our dataset and codes are available at \url{https://github.com/songchangwei/3DCT-SD-IVH-ICH}. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2304.05589 [pdf, other]

Discovering Structure From Corruption for Unsupervised Image Reconstruction

Authors: Oscar Leong, Angela F. Gao, He Sun, Katherine L. Bouman

Abstract: We consider solving ill-posed imaging inverse problems without access to an image prior or ground-truth examples. An overarching challenge in these inverse problems is that an infinite number of images, including many that are implausible, are consistent with the observed measurements. Thus, image priors are required to reduce the space of possible solutions to more desirable reconstructions. Howe… ▽ More We consider solving ill-posed imaging inverse problems without access to an image prior or ground-truth examples. An overarching challenge in these inverse problems is that an infinite number of images, including many that are implausible, are consistent with the observed measurements. Thus, image priors are required to reduce the space of possible solutions to more desirable reconstructions. However, in many applications it is difficult or potentially impossible to obtain example images to construct an image prior. Hence inaccurate priors are often used, which inevitably result in biased solutions. Rather than solving an inverse problem using priors that encode the spatial structure of any one image, we propose to solve a set of inverse problems jointly by incorporating prior constraints on the collective structure of the underlying images. The key assumption of our work is that the underlying images we aim to reconstruct share common, low-dimensional structure. We show that such a set of inverse problems can be solved simultaneously without the use of a spatial image prior by instead inferring a shared image generator with a low-dimensional latent space. The parameters of the generator and latent embeddings are found by maximizing a proxy for the Evidence Lower Bound (ELBO). Once identified, the generator and latent embeddings can be combined to provide reconstructed images for each inverse problem. The framework we propose can handle general forward model corruptions, and we show that measurements derived from only a small number of ground-truth images ($\leqslant 150$) are sufficient for image reconstruction. We demonstrate our approach on a variety of convex and non-convex inverse problems, including denoising, phase retrieval, and black hole video reconstruction. △ Less

Submitted 1 November, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

Comments: Extended version of arXiv:2303.12217

arXiv:2303.12217 [pdf, other]

Image Reconstruction without Explicit Priors

Authors: Angela F. Gao, Oscar Leong, He Sun, Katherine L. Bouman

Abstract: We consider solving ill-posed imaging inverse problems without access to an explicit image prior or ground-truth examples. An overarching challenge in inverse problems is that there are many undesired images that fit to the observed measurements, thus requiring image priors to constrain the space of possible solutions to more plausible reconstructions. However, in many applications it is difficult… ▽ More We consider solving ill-posed imaging inverse problems without access to an explicit image prior or ground-truth examples. An overarching challenge in inverse problems is that there are many undesired images that fit to the observed measurements, thus requiring image priors to constrain the space of possible solutions to more plausible reconstructions. However, in many applications it is difficult or potentially impossible to obtain ground-truth images to learn an image prior. Thus, inaccurate priors are often used, which inevitably result in biased solutions. Rather than solving an inverse problem using priors that encode the explicit structure of any one image, we propose to solve a set of inverse problems jointly by incorporating prior constraints on the collective structure of the underlying images.The key assumption of our work is that the ground-truth images we aim to reconstruct share common, low-dimensional structure. We show that such a set of inverse problems can be solved simultaneously by learning a shared image generator with a low-dimensional latent space. The parameters of the generator and latent embedding are learned by maximizing a proxy for the Evidence Lower Bound (ELBO). Once learned, the generator and latent embeddings can be combined to provide reconstructions for each inverse problem. The framework we propose can handle general forward model corruptions, and we show that measurements derived from only a few ground-truth images (O(10)) are sufficient for image reconstruction without explicit priors. △ Less

Submitted 21 March, 2023; originally announced March 2023.

Comments: ICASSP 2023

arXiv:2112.06388 [pdf]

A Cluster-Based Weighted Feature Similarity Moving Target Tracking Algorithm for Automotive FMCW Radar

Authors: Rongqian Chen, Yingquan Zou, Anyong Gao, Leshi Chen

Abstract: We studied a target tracking algorithm based on millimeter-wave (MMW) radar in an autonomous driving environment. Aiming at the cluster matching in the target tracking stage, a new weighted feature similarity algorithm is proposed, which increases the matching rate of the same target in adjacent frames under strong environmental noise and multiple interference targets. For autonomous driving scena… ▽ More We studied a target tracking algorithm based on millimeter-wave (MMW) radar in an autonomous driving environment. Aiming at the cluster matching in the target tracking stage, a new weighted feature similarity algorithm is proposed, which increases the matching rate of the same target in adjacent frames under strong environmental noise and multiple interference targets. For autonomous driving scenarios, we constructed a method that uses its motion parameters to extract and correct the trajectory of a moving target, which solves the problem of moving target detection and trajectory correction during vehicle movement. Finally, the feasibility of the proposed method was verified by a series of experiments in autonomous driving environments. The results verify the high recognition accuracy and low positional error of the method. △ Less

Submitted 12 December, 2021; originally announced December 2021.

arXiv:2111.04185 [pdf, other]

CoughTrigger: Earbuds IMU Based Cough Detection Activator Using An Energy-efficient Sensitivity-prioritized Time Series Classifier

Authors: Shibo Zhang, Ebrahim Nemati, Minh Dinh, Nathan Folkman, Tousif Ahmed, Mahbubur Rahman, Jilong Kuang, Nabil Alshurafa, Alex Gao

Abstract: Persistent coughs are a major symptom of respiratory-related diseases. Increasing research attention has been paid to detecting coughs using wearables, especially during the COVID-19 pandemic. Among all types of sensors utilized, microphone is most widely used to detect coughs. However, the intense power consumption needed to process audio signals hinders continuous audio-based cough detection on… ▽ More Persistent coughs are a major symptom of respiratory-related diseases. Increasing research attention has been paid to detecting coughs using wearables, especially during the COVID-19 pandemic. Among all types of sensors utilized, microphone is most widely used to detect coughs. However, the intense power consumption needed to process audio signals hinders continuous audio-based cough detection on battery-limited commercial wearable products, such as earbuds. We present CoughTrigger, which utilizes a lower-power sensor, an inertial measurement unit (IMU), in earbuds as a cough detection activator to trigger a higher-power sensor for audio processing and classification. It is able to run all-the-time as a standby service with minimal battery consumption and trigger the audio-based cough detection when a candidate cough is detected from IMU. Besides, the use of IMU brings the benefit of improved specificity of cough detection. Experiments are conducted on 45 subjects and our IMU-based model achieved 0.77 AUC score under leave one subject out evaluation. We also validated its effectiveness on free-living data and through on-device implementation. △ Less

Submitted 7 November, 2021; originally announced November 2021.

arXiv:2109.00630 [pdf, other]

A Novel Multi-Centroid Template Matching Algorithm and Its Application to Cough Detection

Authors: Shibo Zhang, Ebrahim Nemati, Tousif Ahmed, Md Mahbubur Rahman, Jilong Kuang, Alex Gao

Abstract: Cough is a major symptom of respiratory-related diseases. There exists a tremendous amount of work in detecting coughs from audio but there has been no effort to identify coughs from solely inertial measurement unit (IMU). Coughing causes motion across the whole body and especially on the neck and head. Therefore, head motion data during coughing captured by a head-worn IMU sensor could be leverag… ▽ More Cough is a major symptom of respiratory-related diseases. There exists a tremendous amount of work in detecting coughs from audio but there has been no effort to identify coughs from solely inertial measurement unit (IMU). Coughing causes motion across the whole body and especially on the neck and head. Therefore, head motion data during coughing captured by a head-worn IMU sensor could be leveraged to detect coughs using a template matching algorithm. In time series template matching problems, K-Nearest Neighbors (KNN) combined with elastic distance measurement (esp. Dynamic Time War** (DTW)) achieves outstanding performance. However, it is often regarded as prohibitively time-consuming. Nearest Centroid Classifier is thereafter proposed. But the accuracy is comprised of only one centroid obtained for each class. Centroid-based Classifier performs clustering and averaging for each cluster, but requires manually setting the number of clusters. We propose a novel self-tuning multi-centroid template-matching algorithm, which can automatically adjust the number of clusters to balance accuracy and inference time. Through experiments conducted on synthetic datasets and a real-world earbud-based cough dataset, we demonstrate the superiority of our proposed algorithm and present the result of cough detection with a single accelerometer sensor on the earbuds platform. △ Less

Submitted 4 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

ACM Class: I.5.4; I.5.1

arXiv:2107.01682 [pdf]

COVID-VIT: Classification of COVID-19 from CT chest images based on vision transformer models

Authors: Xiaohong Gao, Yu Qian, Alice Gao

Abstract: This paper is responding to the MIA-COV19 challenge to classify COVID from non-COVID based on CT lung images. The COVID-19 virus has devastated the world in the last eighteen months by infecting more than 182 million people and causing over 3.9 million deaths. The overarching aim is to predict the diagnosis of the COVID-19 virus from chest radiographs, through the development of explainable vision… ▽ More This paper is responding to the MIA-COV19 challenge to classify COVID from non-COVID based on CT lung images. The COVID-19 virus has devastated the world in the last eighteen months by infecting more than 182 million people and causing over 3.9 million deaths. The overarching aim is to predict the diagnosis of the COVID-19 virus from chest radiographs, through the development of explainable vision transformer deep learning techniques, leading to population screening in a more rapid, accurate and transparent way. In this competition, there are 5381 three-dimensional (3D) datasets in total, including 1552 for training, 374 for evaluation and 3455 for testing. While most of the data volumes are in axial view, there are a number of subjects' data are in coronal or sagittal views with 1 or 2 slices are in axial view. Hence, while 3D data based classification is investigated, in this competition, 2D images remains the main focus. Two deep learning methods are studied, which are vision transformer (ViT) based on attention models and DenseNet that is built upon conventional convolutional neural network (CNN). Initial evaluation results based on validation datasets whereby the ground truth is known indicate that ViT performs better than DenseNet with F1 scores being 0.76 and 0.72 respectively. Codes are available at GitHub at <https://github/xiaohong1/COVID-ViT>. △ Less

Submitted 4 July, 2021; originally announced July 2021.

arXiv:2106.13315 [pdf, other]

Generalized Unsupervised Clustering of Hyperspectral Images of Geological Targets in the Near Infrared

Authors: Angela F. Gao, Brandon Rasmussen, Peter Kulits, Eva L. Scheller, Rebecca Greenberger, Bethany L. Ehlmann

Abstract: The application of infrared hyperspectral imagery to geological problems is becoming more popular as data become more accessible and cost-effective. Clustering and classifying spectrally similar materials is often a first step in applications ranging from economic mineral exploration on Earth to planetary exploration on Mars. Semi-manual classification guided by expertly developed spectral paramet… ▽ More The application of infrared hyperspectral imagery to geological problems is becoming more popular as data become more accessible and cost-effective. Clustering and classifying spectrally similar materials is often a first step in applications ranging from economic mineral exploration on Earth to planetary exploration on Mars. Semi-manual classification guided by expertly developed spectral parameters can be time consuming and biased, while supervised methods require abundant labeled data and can be difficult to generalize. Here we develop a fully unsupervised workflow for feature extraction and clustering informed by both expert spectral geologist input and quantitative metrics. Our pipeline uses a lightweight autoencoder followed by Gaussian mixture modeling to map the spectral diversity within any image. We validate the performance of our pipeline at submillimeter-scale with expert-labelled data from the Oman ophiolite drill core and evaluate performance at meters-scale with partially classified orbital data of Jezero Crater on Mars (the landing site for the Perseverance rover). We additionally examine the effects of various preprocessing techniques used in traditional analysis of hyperspectral imagery. This pipeline provides a fast and accurate clustering map of similar geological materials and consistently identifies and separates major mineral classes in both laboratory imagery and remote sensing imagery. We refer to our pipeline as "Generalized Pipeline for Spectroscopic Unsupervised clustering of Minerals (GyPSUM)." △ Less

Submitted 24 June, 2021; originally announced June 2021.

Comments: 10 pages, 4 figures. Accepted, CVPR PBVS Workshop 2021

arXiv:1905.07094 [pdf]

doi 10.1109/FCS.2019.8856145

A 300-500 MHz Tunable Oscillator Exploiting Ten Overtones in Single Lithium Niobate Resonator

Authors: Ali Kourani, Ruochen Lu, Anming Gao, Songbin Gong

Abstract: This paper presents the first voltage-controlled MEMS oscillator (VCMO) based on a Lithium Niobate (LiNbO3) lateral overtone bulk acoustic resonator (LOBAR). The VCMO consists of a LOBAR in a closed loop with 2 amplification stages and a varactor-embedded tunable LC tank. By adjusting the bias voltage applied to the varactor, the tank can be tuned to change the closed-loop gain and phase responses… ▽ More This paper presents the first voltage-controlled MEMS oscillator (VCMO) based on a Lithium Niobate (LiNbO3) lateral overtone bulk acoustic resonator (LOBAR). The VCMO consists of a LOBAR in a closed loop with 2 amplification stages and a varactor-embedded tunable LC tank. By adjusting the bias voltage applied to the varactor, the tank can be tuned to change the closed-loop gain and phase responses of the oscillator so that Barkhausen conditions are satisfied for a particular resonance mode. The tank is designed to allow the proposed VCMO to lock to any of the ten overtones ranging from 300 to 500 MHz. Owing to the high-quality factors of the LiNbO3 LOBAR, the measured VCMO shows a low close-in phase noise of -100 dBc/Hz at 1 kHz offset from a 300 MHz carrier and a noise floor of -153 dBc/Hz while consuming 9 mW. With further optimization, this VCMO can lead to direct radio frequency (RF) synthesis for ultra-low-power transceivers in multi-mode Internet-of-Things (IoT) nodes. △ Less

Submitted 16 May, 2019; originally announced May 2019.

arXiv:1806.02727 [pdf, other]

Downlink Interference Management in Dense Interference-Aware Drone Small Cells Networks Using Mean-Field Game Theory

Authors: Zihe Zhang, Lixin Li, Wei Liang, Xu Li, Ang Gao, Wei Chen, Zhu Han

Abstract: The use of drone small cells (DSCs) has recently drawn significant attentions as one key enabler for providing air-to-ground communication services in various situations. This paper investigates the co-channel deployment of dense DSCs, which are mounted on captive unmanned aerial vehicles (UAVs). As the altitude of a DSC has a huge impact on the performance of downlink, the downlink interference c… ▽ More The use of drone small cells (DSCs) has recently drawn significant attentions as one key enabler for providing air-to-ground communication services in various situations. This paper investigates the co-channel deployment of dense DSCs, which are mounted on captive unmanned aerial vehicles (UAVs). As the altitude of a DSC has a huge impact on the performance of downlink, the downlink interference control problem is mapped to an altitude control problem in this paper. All DSCs adjust their altitude to improve the available signal-to-interference-plus-noise ratio (SINR). The control problem is modeled as a mean-field game (MFG), where the cost function is designed to combine the available SINR with the cost of altitude controling. The interference introduced from a big amount of DSCs is derived through a mean-field approximation approach. Within the proposed MFG framework, the related Hamilton-Jacobi-Bellman and Fokker-Planck-Kolmogorov equations are deduced to describe and explain the control policy. The optimal altitude control policy is obtained by solving the partial differential equations with a proposed finite difference algorithm based on the upwind scheme. The simulations illustrate the optimal power controls and corresponding mean field distribution of DSCs. The numerical results also validate that the proposed control policy achieves better SINR performance of DSCs compared to the uniform control scheme. △ Less

Submitted 7 June, 2018; originally announced June 2018.

arXiv:1803.10665 [pdf]

doi 10.1109/TMTT.2019.2895577

A Radio Frequency Non-reciprocal Network Based on Switched Acoustic Delay Lines

Authors: Ruochen Lu, Tomas Manzaneque, Yansong Yang, Liuqing Gao, Anming Gao, Songbin Gong

Abstract: This work demonstrates the first non-reciprocal network based on switched low-loss acoustic delay lines. The 4-port circulator is built upon a recently reported frequency-independent, programmable, non-reciprocal framework based on switched delay lines. The design space for such a system, including the origins of the insertion loss and harmonic responses, is theoretically investigated, illustratin… ▽ More This work demonstrates the first non-reciprocal network based on switched low-loss acoustic delay lines. The 4-port circulator is built upon a recently reported frequency-independent, programmable, non-reciprocal framework based on switched delay lines. The design space for such a system, including the origins of the insertion loss and harmonic responses, is theoretically investigated, illustrating that the key to better performance and low-cost modulation signal synthesis lies in a large delay. To implement a large delay, we resort to in-house fabricated low-loss, wide-band lithium niobate (LiNbO3) SH0 mode acoustic delay lines employing single-phase unidirectional transducers (SPUDT). The 4-port circulator, consisting of two switch modules and one delay line module, has been modularly designed, assembled, and tested. The design process employs time-domain full circuit simulation and the results match well with measurements. A 18.8 dB non-reciprocal contrast between insertion loss (IL = 6.6 dB) and isolation (25.4 dB) has been achieved over a fractional bandwidth of 8.8% at a center frequency 155 MHz, using a record low switching frequency of 877.19 kHz. The circulator also shows 25.9 dB suppression for the intra-modulated tone and 30 dBm for IIP3. Upon further development, such a system can potentially lead to future wide-band, low-loss chip-scale nonreciprocal RF systems with unprecedented programmability. △ Less

Submitted 28 March, 2018; originally announced March 2018.

Comments: 13 pages, 23 figures

Journal ref: IEEE Transactions on Microwave Theory and Techniques, vol. 67, no. 4, pp. 1516-1530, April 2019

arXiv:1801.03814 [pdf]

doi 10.1109/ULTSYM.2018.8579758

A Radio Frequency Non-reciprocal Network Based on Switched Low-loss Acoustic Delay Lines

Authors: Ruochen Lu, Tomas Manzaneque, Yansong Yang, Anming Gao, Liuqing Gao, Songbin Gong

Abstract: This work demonstrates the first non-reciprocal network based on switched low-loss acoustic delay lines. A 21 dB non-reciprocal contrast between insertion loss (IL=6.7 dB) and isolation (28.3 dB) has been achieved over a fractional bandwidth of 8.8% at a center frequency 155MHz, using a record low switching frequency of 877.22 kHz. The 4-port circulator is built upon a newly reported framework by… ▽ More This work demonstrates the first non-reciprocal network based on switched low-loss acoustic delay lines. A 21 dB non-reciprocal contrast between insertion loss (IL=6.7 dB) and isolation (28.3 dB) has been achieved over a fractional bandwidth of 8.8% at a center frequency 155MHz, using a record low switching frequency of 877.22 kHz. The 4-port circulator is built upon a newly reported framework by the authors, but using two in-house fabricated low-loss, wide-band lithium niobate (LiNbO3) delay lines with single-phase unidirectional transducers (SPUDT) and commercial available switches. Such a system can potentially lead to future wide-band, low-loss chip-scale nonreciprocal RF systems with unprecedented programmability. △ Less

Submitted 9 January, 2018; originally announced January 2018.

Comments: 4 pages, 7 figures

Showing 1–12 of 12 results for author: Gao, A