Showing 1–50 of 53 results for author: Gao, R

  1. arXiv:2406.16083  [pdf, other]

    eess.IV cs.CV

    Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning

    Authors: Ruisheng Gao, Zeyu Xiao, Zhiwei Xiong

    Abstract: Transformer-based methods have demonstrated impressive performance in 4D light field (LF) super-resolution by effectively modeling long-range spatial-angular correlations, but their quadratic complexity hinders the efficient processing of high resolution 4D inputs, resulting in slow inference speed and high memory cost. As a compromise, most prior work adopts a patch-based strategy, which fails to…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 17 pages, 7 figures

  2. arXiv:2406.07532  [pdf, other]

    cs.SD cs.CV cs.LG eess.AS

    Hearing Anything Anywhere

    Authors: Mason Wang, Ryosuke Sawata, Samuel Clarke, Ruohan Gao, Shangzhe Wu, Jiajun Wu

    Abstract: Recent years have seen immense progress in 3D computer vision and computer graphics, with emerging tools that can virtualize real-world 3D environments for numerous Mixed Reality (XR) applications. However, alongside immersive visual experiences, immersive auditory experiences are equally vital to our holistic perception of an environment. In this paper, we aim to reconstruct the spatial acoustic…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR 2024. The first two authors contributed equally. Project page: https://masonlwang.com/hearinganythinganywhere/

    ACM Class: I.2.10; I.4.8

  3. arXiv:2405.14559  [pdf, other]

    eess.IV

    HemSeg-200: A Voxel-Annotated Dataset for Intracerebral Hemorrhages Segmentation in Brain CT Scans

    Authors: Changwei Song, Qing Zhao, Jianqiang Li, Xin Yue, Ruoyun Gao, Zhaoxuan Wang, An Gao, Guanghui Fu

    Abstract: Acute intracerebral hemorrhage is a life-threatening condition that demands immediate medical intervention. Intraparenchymal hemorrhage (IPH) and intraventricular hemorrhage (IVH) are critical subtypes of this condition. Clinically, when such hemorrhages are suspected, immediate CT scanning is essential to assess the extent of the bleeding and to facilitate the formulation of a targeted treatment…

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2402.17718  [pdf]

    cs.LG eess.SP

    Towards a Digital Twin Framework in Additive Manufacturing: Machine Learning and Bayesian Optimization for Time Series Process Optimization

    Authors: Vispi Karkaria, Anthony Goeckner, Ru**g Zha, Jie Chen, Jian**g Zhang, Qi Zhu, Jian Cao, Robert X. Gao, Wei Chen

    Abstract: Laser-directed-energy deposition (DED) offers advantages in additive manufacturing (AM) for creating intricate geometries and material grading. Yet, challenges like material inconsistency and part variability remain, mainly due to its layer-wise fabrication. A key issue is heat accumulation during DED, which affects the material microstructure and properties. While closed-loop control methods for…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 12 Pages, 10 Figures, 1 Table, NAMRC Conference

  5. arXiv:2402.06841  [pdf]

    eess.IV cs.CV

    Point cloud-based registration and image fusion between cardiac SPECT MPI and CTA

    Authors: Shaojie Tang, Penpen Miao, Xingyu Gao, Yu Zhong, Dantong Zhu, Haixing Wen, Zhihui Xu, Qiuyue Wei, Hong** Yao, Xin Huang, Rui Gao, Chen Zhao, Weihua Zhou

    Abstract: A method was proposed for the point cloud-based registration and image fusion between cardiac single photon emission computed tomography (SPECT) myocardial perfusion images (MPI) and cardiac computed tomography angiograms (CTA). Firstly, the left ventricle (LV) epicardial regions (LVERs) in SPECT and CTA images were segmented by using different U-Net neural networks trained to generate the point c…

    Submitted 9 February, 2024; originally announced February 2024.

  6. arXiv:2402.02735  [pdf, other]

    eess.SY

    Timed-Elastic-Band Based Variable Splitting for Autonomous Trajectory Planning

    Authors: Hao Zhu, Kefan **, Rui Gao, Jialin Wang, C. -J. Richard Shi

    Abstract: Existing trajectory planning methods are struggling to handle the issue of autonomous track swinging during navigation, resulting in significant errors when reaching the destination. In this article, we address autonomous trajectory planning problems, which aim at developing innovative solutions to enhance the adaptability and robustness of unmanned systems in navigating complex and dynamic envir…

    Submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2311.03517  [pdf, other]

    cs.SD cs.CV eess.AS

    SoundCam: A Dataset for Finding Humans Using Room Acoustics

    Authors: Mason Wang, Samuel Clarke, Jui-Hsien Wang, Ruohan Gao, Jiajun Wu

    Abstract: A room's acoustic properties are a product of the room's geometry, the objects within the room, and their specific positions. A room's acoustic properties can be characterized by its impulse response (RIR) between a source and listener location, or roughly inferred from recordings of natural signals present in the room. Variations in the positions of objects in a room can effect measurable changes…

    Submitted 15 January, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: In NeurIPS 2023 Datasets and Benchmarks Track. Project page: https://masonlwang.com/soundcam/. Wang and Clarke contributed equally to this work

  8. arXiv:2309.09392  [pdf, other]

    eess.IV cs.CV

    Deep conditional generative models for longitudinal single-slice abdominal computed tomography harmonization

    Authors: Xin Yu, Qi Yang, Yucheng Tang, Riqiang Gao, Shunxing Bao, Leon Y. Cai, Ho Hin Lee, Yuankai Huo, Ann Zenobia Moore, Luigi Ferrucci, Bennett A. Landman

    Abstract: Two-dimensional single-slice abdominal computed tomography (CT) provides a detailed tissue map with high resolution allowing quantitative characterization of relationships between health conditions and aging. However, longitudinal analysis of body composition changes using these scans is difficult due to positional variation between slices acquired in different years, which leads to different or…

    Submitted 17 September, 2023; originally announced September 2023.

  9. arXiv:2306.09944  [pdf, other]

    cs.SD cs.CV cs.GR eess.AS

    RealImpact: A Dataset of Impact Sound Fields for Real Objects

    Authors: Samuel Clarke, Ruohan Gao, Mason Wang, Mark Rau, Julia Xu, Jui-Hsien Wang, Doug L. James, Jiajun Wu

    Abstract: Objects make unique sounds under different perturbations, environment conditions, and poses relative to the listener. While prior works have modeled impact sounds and sound propagation in simulation, we lack a standard dataset of impact sound fields of real objects for audio-visual learning and calibration of the sim-to-real gap. We present RealImpact, a large-scale dataset of real object impact s…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: CVPR 2023 (Highlight). Project page: https://samuelpclarke.com/realimpact/

  10. arXiv:2305.18994  [pdf, other]

    cs.CV eess.IV

    Toward Real-World Light Field Super-Resolution

    Authors: Zeyu Xiao, Ruisheng Gao, Yutong Liu, Yueyi Zhang, Zhiwei Xiong

    Abstract: Deep learning has opened up new possibilities for light field super-resolution (SR), but existing methods trained on synthetic datasets with simple degradations (e.g., bicubic downsampling) suffer from poor performance when applied to complex real-world scenarios. To address this problem, we introduce LytroZoom, the first real-world light field SR dataset capturing paired low- and high-resolution…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: CVPRW 2023

  11. arXiv:2304.02836  [pdf, other]

    eess.IV cs.CV cs.LG

    Longitudinal Multimodal Transformer Integrating Imaging and Latent Clinical Signatures From Routine EHRs for Pulmonary Nodule Classification

    Authors: Thomas Z. Li, John M. Still, Kaiwen Xu, Ho Hin Lee, Leon Y. Cai, Aravind R. Krishnan, Riqiang Gao, Mirza S. Khan, Sanja Antic, Michael Kammer, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman, Thomas A. Lasko

    Abstract: The accuracy of predictive models for solitary pulmonary nodule (SPN) diagnosis can be greatly increased by incorporating repeat imaging and medical context, such as electronic health records (EHRs). However, clinically routine modalities such as imaging and diagnostic codes can be asynchronous and irregularly sampled over different time scales which are obstacles to longitudinal multimodal learni…

    Submitted 29 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to MICCAI 2023

  12. arXiv:2301.00322  [pdf, other]

    eess.SY cs.CR

    Encrypted Data-driven Predictive Cloud Control with Disturbance Observer

    Authors: Qiwen Li, Runze Gao, Yuanqing Xia

    Abstract: In data-driven predictive cloud control tasks, the privacy of data stored and used in cloud services could be leaked to malicious attackers or curious eavesdroppers. Homomorphic encryption technique could be used to protect data privacy while allowing computation. However, extra errors are introduced by the homomorphic encryption extension to ensure the privacy-preserving properties, and the real…

    Submitted 6 February, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

  13. Reducing Positional Variance in Cross-sectional Abdominal CT Slices with Deep Conditional Generative Models

    Authors: Xin Yu, Qi Yang, Yucheng Tang, Riqiang Gao, Shunxing Bao, Leon Y. Cai, Ho Hin Lee, Yuankai Huo, Ann Zenobia Moore, Luigi Ferrucci, Bennett A. Landman

    Abstract: 2D low-dose single-slice abdominal computed tomography (CT) slice enables direct measurements of body composition, which are critical to quantitatively characterizing health relationships on aging. However, longitudinal analysis of body composition changes using 2D abdominal slices is challenging due to positional variance between longitudinal slices acquired in different years. To reduce the posi…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 11 pages, 4 figures

    Journal ref: Medical Image Computing and Computer Assisted Intervention MICCAI 2022, Cham, 2022, pp. 202-212

  14. arXiv:2209.14378  [pdf, other]

    eess.IV cs.CV

    UNesT: Local Spatial Representation Learning with Hierarchical Transformer for Efficient Medical Segmentation

    Authors: Xin Yu, Qi Yang, Yinchi Zhou, Leon Y. Cai, Riqiang Gao, Ho Hin Lee, Thomas Li, Shunxing Bao, Zhoubing Xu, Thomas A. Lasko, Richard G. Abramson, Zizhao Zhang, Yuankai Huo, Bennett A. Landman, Yucheng Tang

    Abstract: Transformer-based models, capable of learning better global dependencies, have recently demonstrated exceptional representation learning capabilities in computer vision and medical image analysis. Transformer reformats the image into separate patches and realizes global communication via the self-attention mechanism. However, positional information between patches is hard to preserve in such 1D se…

    Submitted 7 September, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 19 pages, 17 figures. arXiv admin note: text overlap with arXiv:2203.02430

  15. arXiv:2209.07884  [pdf, ps, other]

    eess.SY cs.DC

    Workflow-based Fast Data-driven Predictive Control with Disturbance Observer in Cloud-edge Collaborative Architecture

    Authors: Runze Gao, Qiwen Li, Li Dai, Yufeng Zhan, Yuanqing Xia

    Abstract: Data-driven predictive control (DPC) has been studied and used in various scenarios, since it could generate the predicted control sequence only relying on the historical input and output data. Recently, based on cloud computing, data-driven predictive cloud control system (DPCCS) has been proposed with the advantage of sufficient computational resources. However, the existing computation mode of…

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: 58 pages and 23 figures

  16. arXiv:2209.01676  [pdf]

    eess.IV cs.CV q-bio.QM

    Time-distance vision transformers in lung cancer diagnosis from longitudinal computed tomography

    Authors: Thomas Z. Li, Kaiwen Xu, Riqiang Gao, Yucheng Tang, Thomas A. Lasko, Fabien Maldonado, Kim Sandler, Bennett A. Landman

    Abstract: Features learned from single radiologic images are unable to provide information about whether and how much a lesion may be changing over time. Time-dependent features computed from repeated images can capture those changes and help identify malignant lesions by their temporal behavior. However, longitudinal medical imaging presents the unique challenge of sparse, irregular time intervals in data…

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: Submitted to SPIE 2023 - Medical Imaging. 10 pages

  17. arXiv:2207.06551  [pdf, other]

    eess.IV cs.CV

    Body Composition Assessment with Limited Field-of-view Computed Tomography: A Semantic Image Extension Perspective

    Authors: Kaiwen Xu, Thomas Li, Mirza S. Khan, Riqiang Gao, Sanja L. Antic, Yuankai Huo, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman

    Abstract: Field-of-view (FOV) tissue truncation beyond the lungs is common in routine lung screening computed tomography (CT). This poses limitations for opportunistic CT-based body composition (BC) assessment as key anatomical structures are missing. Traditionally, extending the FOV of CT is considered as a CT reconstruction problem using limited data. However, this approach relies on the projection domai…

    Submitted 15 April, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Updated with additional evaluation and clarification

  18. arXiv:2206.07599  [pdf, other]

    eess.IV cs.CV

    How GNNs Facilitate CNNs in Mining Geometric Information from Large-Scale Medical Images

    Authors: Yiqing Shen, Bingxin Zhou, Xinye Xiong, Ruitian Gao, Yu Guang Wang

    Abstract: Gigapixel medical images provide massive data, both morphological textures and spatial information, to be mined. Due to the large data scale in histology, deep learning methods play an increasingly significant role as feature extractors. Existing solutions heavily rely on convolutional neural networks (CNNs) for global pixel-level analysis, leaving the underlying local geometric structure such as…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: 21 pages

  19. arXiv:2205.05898  [pdf]

    eess.IV cs.CV cs.LG

    Pseudo-Label Guided Multi-Contrast Generalization for Non-Contrast Organ-Aware Segmentation

    Authors: Ho Hin Lee, Yucheng Tang, Riqiang Gao, Qi Yang, Xin Yu, Shunxing Bao, James G. Terry, J. Jeffrey Carr, Yuankai Huo, Bennett A. Landman

    Abstract: Non-contrast computed tomography (NCCT) is commonly acquired for lung cancer screening, assessment of general abdominal pain or suspected renal stones, trauma evaluation, and many other indications. However, the absence of contrast limits distinguishing organ in-between boundaries. In this paper, we propose a novel unsupervised approach that leverages pairwise contrast-enhanced CT (CECT) context t…

    Submitted 12 May, 2022; originally announced May 2022.

  20. arXiv:2204.02389  [pdf, other]

    cs.CV cs.LG cs.RO cs.SD eess.AS

    ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer

    Authors: Ruohan Gao, Zilin Si, Yen-Yu Chang, Samuel Clarke, Jeannette Bohg, Li Fei-Fei, Wenzhen Yuan, Jiajun Wu

    Abstract: Objects play a crucial role in our everyday activities. Though multisensory object-centric learning has shown great potential lately, the modeling of objects in prior work is rather unrealistic. ObjectFolder 1.0 is a recent dataset that introduces 100 virtualized objects with visual, acoustic, and tactile sensory data. However, the dataset is small in scale and the multisensory data is of limited…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: In CVPR 2022. Gao, Si, and Chang contributed equally to this work. Project page: https://ai.stanford.edu/~rhgao/objectfolder2.0/

  21. arXiv:2203.12910  [pdf, other]

    eess.SP

    SSGCNet: A Sparse Spectra Graph Convolutional Network for Epileptic EEG Signal Classification

    Authors: Jialin Wang, Rui Gao, Haotian Zheng, Hao Zhu, C. -J. Richard Shi

    Abstract: In this article, we propose a sparse spectra graph convolutional network (SSGCNet) for solving Epileptic EEG signal classification problems. The aim is to achieve a lightweight deep learning model without losing model classification accuracy. We propose a weighted neighborhood field graph (WNFG) to represent EEG signals, which reduces the redundant edges between graph nodes. WNFG has lower time co…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: 12 pages, 7 figures

  22. arXiv:2203.09477  [pdf, other]

    eess.SP cs.HC cs.LG

    A Decomposition-Based Hybrid Ensemble CNN Framework for Driver Fatigue Recognition

    Authors: Ruilin Li, Ruobin Gao, P. N. Suganthan

    Abstract: Electroencephalogram (EEG) has become increasingly popular in driver fatigue monitoring systems. Several decomposition methods have been attempted to analyze the EEG signals that are complex, nonlinear and non-stationary and improve the EEG decoding performance in different applications. However, it remains challenging to extract more distinguishable features from different decomposed components f…

    Submitted 10 January, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

    Journal ref: Information Sciences. 2023

  23. arXiv:2203.02430  [pdf, other]

    eess.IV cs.CV

    Characterizing Renal Structures with 3D Block Aggregate Transformers

    Authors: Xin Yu, Yucheng Tang, Yinchi Zhou, Riqiang Gao, Qi Yang, Ho Hin Lee, Thomas Li, Shunxing Bao, Yuankai Huo, Zhoubing Xu, Thomas A. Lasko, Richard G. Abramson, Bennett A. Landman

    Abstract: Efficiently quantifying renal structures can provide distinct spatial context and facilitate biomarker discovery for kidney morphology. However, the development and evaluation of the transformer model to segment the renal cortex, medulla, and collecting system remains challenging due to data inefficiency. Inspired by the hierarchical structures in vision transformer, we propose a novel method usin…

    Submitted 4 March, 2022; originally announced March 2022.

  24. arXiv:2202.06875  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Visual Acoustic Matching

    Authors: Changan Chen, Ruohan Gao, Paul Calamia, Kristen Grauman

    Abstract: We introduce the visual acoustic matching task, in which an audio clip is transformed to sound like it was recorded in a target environment. Given an image of the target environment and a waveform for the source audio, the goal is to re-synthesize the audio to match the target room acoustics as suggested by its visible geometry and materials. To address this novel task, we propose a cross-modal tr…

    Submitted 13 June, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Project page: https://vision.cs.utexas.edu/projects/visual-acoustic-matching. Accepted at CVPR 2022

  25. arXiv:2202.06012   

    math.OC eess.SY

    Cloud-based computational model predictive control using a parallel multi-block ADMM approach

    Authors: Yaling Ma, Runze Gao, Li Dai, **xian Wu, Yuanqing Xia

    Abstract: Heavy computational load for solving nonconvex problems for large-scale systems or systems with real-time demands at each sample step has been recognized as one of the reasons for preventing a wider application of nonlinear model predictive control (NMPC). To improve the real-time feasibility of NMPC with input nonlinearity, we devise an innovative scheme called cloud-based computational model pre…

    Submitted 15 April, 2022; v1 submitted 12 February, 2022; originally announced February 2022.

    Comments: Statements and experiments are flawed

  26. arXiv:2112.14349  [pdf, ps, other]

    eess.SY cs.DC

    Fast Subspace Identification Method Based on Containerised Cloud Workflow Processing System

    Authors: Runze Gao, Yuanqing Xia, Guan Wang, Liwen Yang, Yufeng Zhan

    Abstract: Subspace identification (SID) has been widely used in system identification and control fields since it can estimate system models only relying on the input and output data by reliable numerical operations such as singular value decomposition (SVD). However, high-dimension Hankel matrices are involved to store these data and used to obtain the system models, which increases the computation amount…

    Submitted 28 December, 2021; originally announced December 2021.

  27. arXiv:2112.14347  [pdf, ps, other]

    eess.SY

    Design and Implementation of Data-driven Predictive Cloud Control System

    Authors: Runze Gao, Yuanqing Xia, Li Dai, Zhongqi Sun

    Abstract: Nowadays, the rapid increases of the scale and complexity of the controlled plants bring new challenges such as computing power and storage for conventional control systems. Cloud computing is concerned as a powerful solution to handle the complex large-scale control missions using sufficient computing resources. However, the developed computing ability enables more complex devices and mass data b…

    Submitted 28 December, 2021; originally announced December 2021.

  28. arXiv:2111.10882  [pdf, other]

    cs.CV cs.SD eess.AS

    Geometry-Aware Multi-Task Learning for Binaural Audio Generation from Video

    Authors: Rishabh Garg, Ruohan Gao, Kristen Grauman

    Abstract: Binaural audio provides human listeners with an immersive spatial sound experience, but most existing videos lack binaural audio recordings. We propose an audio spatialization method that draws on visual information in videos to convert their monaural (single-channel) audio to binaural audio. Whereas existing approaches leverage visual features extracted directly from video frames, our approach ex…

    Submitted 21 November, 2021; originally announced November 2021.

    Comments: Published in BMVC 2021, project page: http://vision.cs.utexas.edu/projects/geometry-aware-binaural/

  29. arXiv:2111.09957  [pdf, other]

    cs.CV eess.IV

    Rethinking Dilated Convolution for Real-time Semantic Segmentation

    Authors: Roland Gao

    Abstract: The field-of-view is an important metric when designing a model for semantic segmentation. To obtain a large field-of-view, previous approaches generally choose to rapidly downsample the resolution, usually with average poolings or stride 2 convolutions. We take a different approach by using dilated convolutions with large dilation rates throughout the backbone, allowing the backbone to easily tun…

    Submitted 27 November, 2023; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: CVPR 2023 Efficient CV workshop

  30. arXiv:2111.02493  [pdf]

    eess.SP cs.AI cs.CV physics.ins-det

    Roadmap on Signal Processing for Next Generation Measurement Systems

    Authors: D. K. Iakovidis, M. Ooi, Y. C. Kuang, S. Demidenko, A. Shestakov, V. Sinitsin, M. Henry, A. Sciacchitano, A. Discetti, S. Donati, M. Norgia, A. Menychtas, I. Maglogiannis, S. C. Wriessnegger, L. A. Barradas Chacon, G. Dimas, D. Filos, A. H. Aletras, J. Töger, F. Dong, S. Ren, A. Uhl, J. Paziewski, J. Geng, F. Fioranelli, et al. (9 additional authors not shown)

    Abstract: Signal processing is a fundamental component of almost any sensor-enabled system, with a wide range of applications across different scientific disciplines. Time series data, images, and video sequences comprise representative forms of signals that can be enhanced and analysed for information extraction and quantification. The recent advances in artificial intelligence and machine learning are shi…

    Submitted 28 January, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 48 pages, https://iopscience.iop.org/article/10.1088/1361-6501/ac2dbd

    Journal ref: Measurement Science and Technology 33(1) (2022) 1-48

  31. arXiv:2110.08718  [pdf, other]

    cs.CV eess.IV

    AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

    Authors: Ligong Han, Sri Harsha Musunuri, Martin Renqiang Min, Ruijiang Gao, Yu Tian, Dimitris Metaxas

    Abstract: StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad hoc after the generator is trained in a two-stage fashion. In this paper, we focus on style-based generators asking a scientific question: Does forcing such a g…

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: Accepted at WACV-22

  32. arXiv:2109.09004  [pdf, other]

    eess.IV cs.CV

    Random Multi-Channel Image Synthesis for Multiplexed Immunofluorescence Imaging

    Authors: Shunxing Bao, Yucheng Tang, Ho Hin Lee, Riqiang Gao, Sophie Chiron, Ilwoo Lyu, Lori A. Coburn, Keith T. Wilson, Joseph T. Roland, Bennett A. Landman, Yuankai Huo

    Abstract: Multiplex immunofluorescence (MxIF) is an emerging imaging technique that produces the high sensitivity and specificity of single-cell mapping. With a tenet of 'seeing is believing', MxIF enables iterative staining and imaging extensive antibodies, which provides comprehensive biomarkers to segment and group different cells on a single tissue section. However, considerable depletion of the scarce…

    Submitted 18 September, 2021; originally announced September 2021.

    Comments: Accepted at the third MICCAI workshop on Computational Pathology (COMPAY 2021)

  33. arXiv:2107.14385  [pdf, other]

    cs.LG cs.AI eess.SP

    Random vector functional link neural network based ensemble deep learning for short-term load forecasting

    Authors: Ruobin Gao, Liang Du, P. N. Suganthan, Qin Zhou, Kum Fai Yuen

    Abstract: Electricity load forecasting is crucial for the power systems' planning and maintenance. However, its un-stationary and non-linear characteristics impose significant difficulties in anticipating future demand. This paper proposes a novel ensemble deep Random Vector Functional Link (edRVFL) network for electricity load forecasting. The weights of hidden layers are randomly initialized and kept fixe…

    Submitted 29 July, 2021; originally announced July 2021.

    Journal ref: Expert Systems with Applications, 2022

  34. arXiv:2107.11882  [pdf, other]

    eess.IV cs.CV cs.LG

    Lung Cancer Risk Estimation with Incomplete Data: A Joint Missing Imputation Perspective

    Authors: Riqiang Gao, Yucheng Tang, Kaiwen Xu, Ho Hin Lee, Steve Deppen, Kim Sandler, Pierre Massion, Thomas A. Lasko, Yuankai Huo, Bennett A. Landman

    Abstract: Data from multi-modality provide complementary information in clinical prediction, but missing data in clinical cohorts limits the number of subjects in multi-modal learning context. Multi-modal missing imputation is challenging with existing methods when 1) the missing data span across heterogeneous modalities (e.g., image vs. non-image); or 2) one modality is largely missing. In this paper, we a…

    Submitted 25 July, 2021; originally announced July 2021.

    Comments: Early Accepted by MICCAI 2021. Traveling Award

  35. arXiv:2105.11332  [pdf, other]

    cs.NI eess.SP

    Physical Layer Security for UAV Communications in 5G and Beyond Networks

    Authors: Jue Wang, Xuanxuan Wang, Ruifeng Gao, Chengleyang Lei, Wei Feng, Ning Ge, Shi **, Tony Q. S. Quek

    Abstract: Due to its high mobility and flexible deployment, unmanned aerial vehicle (UAV) is drawing unprecedented interest in both military and civil applications to enable agile wireless communications and provide ubiquitous connectivity. Mainly operating in an open environment, UAV communications can benefit from dominant line-of-sight links; however, it on the other hand renders the UAVs more vulnerable…

    Submitted 24 May, 2021; originally announced May 2021.

  36. arXiv:2101.03711  [pdf, other]

    eess.IV cs.CV

    Generalize Ultrasound Image Segmentation via Instant and Plug & Play Style Transfer

    Authors: Zhendong Liu, Xiaoqiong Huang, Xin Yang, Rui Gao, Rui Li, Yuanji Zhang, Yankai Huang, Guangquan Zhou, Yi Xiong, Alejandro F Frangi, Dong Ni

    Abstract: Deep segmentation models that generalize to images with unknown appearance are important for real-world medical image analysis. Retraining models leads to high latency and complex pipelines, which are impractical in clinical settings. The situation becomes more severe for ultrasound image analysis because of their large appearance shifts. In this paper, we propose a novel method for robust segment…

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: Accepted by IEEE ISBI 2021

  37. arXiv:2101.03149  [pdf, other]

    cs.CV cs.SD eess.IV

    VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency

    Authors: Ruohan Gao, Kristen Grauman

    Abstract: We introduce a new approach for audio-visual speech separation. Given a video, the goal is to extract the speech associated with a face in spite of simultaneous background sounds and/or other human speakers. Whereas existing methods focus on learning the alignment between the speaker's lip movements and the sounds they generate, we propose to leverage the speaker's face appearance as an additional…

    Submitted 6 April, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

    Comments: In CVPR 2021. Project page: http://vision.cs.utexas.edu/projects/VisualVoice/

  38. arXiv:2012.03124  [pdf]

    eess.IV cs.CV

    Development and Characterization of a Chest CT Atlas

    Authors: Kaiwen Xu, Riqiang Gao, Mirza S. Khan, Shunxing Bao, Yucheng Tang, Steve A. Deppen, Yuankai Huo, Kim L. Sandler, Pierre P. Massion, Mattias P. Heinrich, Bennett A. Landman

    Abstract: A major goal of lung cancer screening is to identify individuals with particular phenotypes that are associated with high risk of cancer. Identifying relevant phenotypes is complicated by the variation in body position and body composition. In the brain, standardized coordinate systems (e.g., atlases) have enabled separate consideration of local features from gross/global structure. To date, no an…

    Submitted 5 December, 2020; originally announced December 2020.

    Comments: Accepted by SPIE2021 Medical Imaging (oral)

  39. A Data-Fusion-Assisted Telemetry Layer for Autonomous Optical Networks

    Authors: Xiaomin Liu, Huazhi Lun, Ruoxuan Gao, Meng Cai, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: For further improving the capacity and reliability of optical networks, a closed-loop autonomous architecture is preferred. Considering a large number of optical components in an optical network and many digital signal processing modules in each optical transceiver, massive real-time data can be collected. However, for a traditional monitoring structure, collecting, storing and processing a large…

    Submitted 24 November, 2020; originally announced November 2020.

  40. arXiv:2010.09524  [pdf]

    eess.IV cs.CV

    Deep Multi-path Network Integrating Incomplete Biomarker and Chest CT Data for Evaluating Lung Cancer Risk

    Authors: Riqiang Gao, Yucheng Tang, Kaiwen Xu, Michael N. Kammer, Sanja L. Antic, Steve Deppen, Kim L. Sandler, Pierre P. Massion, Yuankai Huo, Bennett A. Landman

    Abstract: Clinical data elements (CDEs) (e.g., age, smoking history), blood markers and chest computed tomography (CT) structural features have been regarded as effective means for assessing lung cancer risk. These independent variables can provide complementary information and we hypothesize that combining them will improve the prediction accuracy. In practice, not all patients have all these variables ava…

    Submitted 9 February, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: RFW all-conference best paper finalist, SPIE2021 Medical Imaging

  41. arXiv:2010.04928  [pdf, other]

    eess.IV cs.CV cs.LG

    Contrastive Rendering for Ultrasound Image Segmentation

    Authors: Haoming Li, Xin Yang, Jiamin Liang, Wenlong Shi, Chaoyu Chen, Haoran Dou, Rui Li, Rui Gao, Guangquan Zhou, Jinghui Fang, Xiaowen Liang, Ruobing Huang, Alejandro Frangi, Zhiyi Chen, Dong Ni

    Abstract: Ultrasound (US) image segmentation embraced its significant improvement in deep learning era. However, the lack of sharp boundaries in US images still remains an inherent challenge for segmentation. Previous methods often resort to global context, multi-scale cues or auxiliary guidance to estimate the boundaries. It is hard for these methods to approach pixel-level learning for fine-grained bounda…

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: 10 pages, 5 figures, 2 tables, 13 references

  42. arXiv:2005.01616  [pdf, other]

    cs.CV cs.SD eess.AS

    VisualEchoes: Spatial Image Representation Learning through Echolocation

    Authors: Ruohan Gao, Changan Chen, Ziad Al-Halah, Carl Schissler, Kristen Grauman

    Abstract: Several animal species (e.g., bats, dolphins, and whales) and even visually impaired humans have the remarkable ability to perform echolocation: a biological sonar used to perceive spatial layout and locate objects in the world. We explore the spatial cues contained in echoes and how they can benefit vision tasks that require spatial reasoning. First we capture echo responses in photo-realistic 3D…

    Submitted 17 July, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Appears in ECCV 2020

  43. arXiv:2002.05844  [pdf, other]

    eess.IV cs.CV

    Remove Appearance Shift for Ultrasound Image Segmentation via Fast and Universal Style Transfer

    Authors: Zhendong Liu, Xin Yang, Rui Gao, Shengfeng Liu, Haoran Dou, Shuangchi He, Yuhao Huang, Yankai Huang, Huanjia Luo, Yuanji Zhang, Yi Xiong, Dong Ni

    Abstract: Deep Neural Networks (DNNs) suffer from the performance degradation when image appearance shift occurs, especially in ultrasound (US) image segmentation. In this paper, we propose a novel and intuitive framework to remove the appearance shift, and hence improve the generalization ability of DNNs. Our work has three highlights. First, we follow the spirit of universal style transfer to remove appea…

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: IEEE International Symposium on Biomedical Imaging (IEEE ISBI 2020)

  44. arXiv:2002.04102  [pdf]

    eess.IV cs.CV

    Validation and Optimization of Multi-Organ Segmentation on Clinical Imaging Archives

    Authors: Yuchen Xu, Olivia Tang, Yucheng Tang, Ho Hin Lee, Yunqiang Chen, Dashan Gao, Shizhong Han, Riqiang Gao, Michael R. Savona, Richard G. Abramson, Yuankai Huo, Bennett A. Landman

    Abstract: Segmentation of abdominal computed tomography (CT) provides spatial context, morphological properties, and a framework for tissue-specific radiomics to guide quantitative Radiological assessment. A 2015 MICCAI challenge spurred substantial innovation in multi-organ abdominal CT segmentation with both traditional and deep learning methods. Recent innovations in deep methods have driven performance t…

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: SPIE2020 Medical Imaging

  45. arXiv:1912.04487  [pdf, other]

    cs.CV cs.LG cs.SD eess.AS

    Listen to Look: Action Recognition by Previewing Audio

    Authors: Ruohan Gao, Tae-Hyun Oh, Kristen Grauman, Lorenzo Torresani

    Abstract: In the face of the video data deluge, today's expensive clip-level classifiers are increasingly impractical. We propose a framework for efficient action recognition in untrimmed video that uses audio as a preview mechanism to eliminate both short-term and long-term visual redundancies. First, we devise an ImgAud2Vid framework that hallucinates clip-level features by distilling from lighter modalit…

    Submitted 28 March, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: Appears in CVPR 2020; Project page: http://vision.cs.utexas.edu/projects/listen_to_look/

  46. arXiv:1911.07372  [pdf, other]

    eess.IV

    Deep Learning for the Digital Pathologic Diagnosis of Cholangiocarcinoma and Hepatocellular Carcinoma: Evaluating the Impact of a Web-based Diagnostic Assistant

    Authors: Bora Uyumazturk, Amirhossein Kiani, Pranav Rajpurkar, Alex Wang, Robyn L. Ball, Rebecca Gao, Yifan Yu, Erik Jones, Curtis P. Langlotz, Brock Martin, Gerald J. Berry, Michael G. Ozawa, Florette K. Hazard, Ryanne A. Brown, Simon B. Chen, Mona Wood, Libby S. Allard, Lourdes Ylagan, Andrew Y. Ng, Jeanne Shen

    Abstract: While artificial intelligence (AI) algorithms continue to rival human performance on a variety of clinical tasks, the question of how best to incorporate these algorithms into clinical workflows remains relatively unexplored. We investigated how AI can affect pathologist performance on the task of differentiating between two subtypes of primary liver cancer, hepatocellular carcinoma (HCC) and chol…

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  47. arXiv:1911.06395  [pdf]

    eess.IV cs.CV

    Contrast Phase Classification with a Generative Adversarial Network

    Authors: Yucheng Tang, Ho Hin Lee, Yuchen Xu, Olivia Tang, Yunqiang Chen, Dashan Gao, Shizhong Han, Riqiang Gao, Camilo Bermudez, Michael R. Savona, Richard G. Abramson, Yuankai Huo, Bennett A. Landman

    Abstract: Dynamic contrast enhanced computed tomography (CT) is an imaging technique that provides critical information on the relationship of vascular structure and dynamics in the context of underlying anatomy. A key challenge for image processing with contrast enhanced CT is that phase discrepancies are latent in different tissues due to contrast protocols, vascular dynamics, and metabolism variance. Pre…

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: 8 pages, 4 figures

    Journal ref: SPIE2020

  48. arXiv:1911.05113  [pdf]

    eess.IV cs.CV cs.LG

    Semi-Supervised Multi-Organ Segmentation through Quality Assurance Supervision

    Authors: Ho Hin Lee, Yucheng Tang, Olivia Tang, Yuchen Xu, Yunqiang Chen, Dashan Gao, Shizhong Han, Riqiang Gao, Michael R. Savona, Richard G. Abramson, Yuankai Huo, Bennett A. Landman

    Abstract: Human in-the-loop quality assurance (QA) is typically performed after medical image segmentation to ensure that the systems are performing as intended, as well as identifying and excluding outliers. By performing QA on large-scale, previously unlabeled testing data, categorical QA scores can be generated. In this paper, we propose a semi-supervised multi-organ segmentation deep neural network consis…

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: 7 pages, 5 figures, Accepted by SPIE 2020: Medical Imaging

  49. arXiv:1909.05321  [pdf]

    eess.IV cs.CV

    Distanced LSTM: Time-Distanced Gates in Long Short-Term Memory Models for Lung Cancer Detection

    Authors: Riqiang Gao, Yuankai Huo, Shunxing Bao, Yucheng Tang, Sanja L. Antic, Emily S. Epstein, Aneri B. Balar, Steve Deppen, Alexis B. Paulson, Kim L. Sandler, Pierre P. Massion, Bennett A. Landman

    Abstract: The field of lung nodule detection and cancer prediction has been rapidly developing with the support of large public data archives. Previous studies have largely focused on cross-sectional (single) CT data. Herein, we consider longitudinal data. The Long Short-Term Memory (LSTM) model addresses learning with regularly spaced time points (i.e., equal temporal intervals). However, clinical imaging…

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: This paper is accepted by MLMI (oral), MICCAI workshop

  50. arXiv:1907.03994  [pdf, other]

    eess.SP cs.HC

    FarSense: Pushing the Range Limit of WiFi-based Respiration Sensing with CSI Ratio of Two Antennas

    Authors: Youwei Zeng, Dan Wu, Jie Xiong, Enze Yi, Ruiyang Gao, Daqing Zhang

    Abstract: The past few years have witnessed the great potential of exploiting channel state information retrieved from commodity WiFi devices for respiration monitoring. However, existing approaches only work when the target is close to the WiFi transceivers and the performance degrades significantly when the target is far away. On the other hand, most home environments only have one WiFi access point and i…

    Submitted 10 August, 2019; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: This work is a pre-print version to appear at UbiComp 2019