Skip to main content

Showing 1–29 of 29 results for author: Yao, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.12367  [pdf, other

    eess.IV cs.CV

    Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning

    Authors: Zheyuan Zhang, Elif Keles, Gorkem Durak, Yavuz Taktak, Onkar Susladkar, Vandan Gorade, Debesh Jha, Asli C. Ormeci, Alpay Medetalibeyoglu, Lanhong Yao, Bin Wang, Ilkin Sevgi Isler, Linkai Peng, Hongyi Pan, Camila Lopes Vendrami, Amir Bourhani, Yury Velichko, Boqing Gong, Concetto Spampinato, Ayis Pyrros, Pallavi Tiwari, Derk C. F. Klatte, Megan Engels, Sanne Hoogenboom, Candice W. Bolan , et al. (13 additional authors not shown)

    Abstract: Automated volumetric segmentation of the pancreas on cross-sectional imaging is needed for diagnosis and follow-up of pancreatic diseases. While CT-based pancreatic segmentation is more established, MRI-based segmentation methods are understudied, largely due to a lack of publicly available datasets, benchmarking research efforts, and domain-specific deep learning methods. In this retrospective st… ▽ More

    Submitted 25 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: under review version

  2. arXiv:2405.03178  [pdf, other

    cs.SD eess.AS

    POPDG: Popular 3D Dance Generation with PopDanceSet

    Authors: Zhenye Luo, Min Ren, Xuecai Hu, Yongzhen Huang, Li Yao

    Abstract: Generating dances that are both lifelike and well-aligned with music continues to be a challenging task in the cross-modal domain. This paper introduces PopDanceSet, the first dataset tailored to the preferences of young audiences, enabling the generation of aesthetically oriented dances. And it surpasses the AIST++ dataset in music genre diversity and the intricacy and depth of dance movements. M… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2402.10642  [pdf, other

    eess.AS cs.AI

    Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

    Authors: Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia, Eng Siong Chng, Lina Yao

    Abstract: Recently, Denoising Diffusion Probabilistic Models (DDPMs) have attained leading performances across a diverse range of generative tasks. However, in the field of speech synthesis, although DDPMs exhibit impressive performance, their long training duration and substantial inference costs hinder practical deployment. Existing approaches primarily focus on enhancing inference speed, while approaches… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  4. arXiv:2311.08225  [pdf, other

    eess.IV cs.CV

    Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images

    Authors: Zhiyun Song, Zengxin Qi, Xin Wang, Xiangyu Zhao, Zhenrong Shen, Sheng Wang, Manman Fei, Zhe Wang, Di Zang, Dongdong Chen, Linlin Yao, Qian Wang, Xuehai Wu, Lichi Zhang

    Abstract: Cross-modality synthesis (CMS), super-resolution (SR), and their combination (CMSR) have been extensively studied for magnetic resonance imaging (MRI). Their primary goals are to enhance the imaging quality by synthesizing the desired modality and reducing the slice thickness. Despite the promising synthetic results, these techniques are often tailored to specific tasks, thereby limiting their ada… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  5. arXiv:2309.05857  [pdf, other

    eess.IV cs.CV

    Radiomics Boosts Deep Learning Model for IPMN Classification

    Authors: Lanhong Yao, Zheyuan Zhang, Ugur Demir, Elif Keles, Camila Vendrami, Emil Agarunov, Candice Bolan, Ivo Schoots, Marc Bruno, Rajesh Keswani, Frank Miller, Tamas Gonda, Cemal Yazici, Temel Tirkes, Michael Wallace, Concetto Spampinato, Ulas Bagci

    Abstract: Intraductal Papillary Mucinous Neoplasm (IPMN) cysts are pre-malignant pancreas lesions, and they can progress into pancreatic cancer. Therefore, detecting and stratifying their risk level is of ultimate importance for effective treatment planning and disease control. However, this is a highly challenging task because of the diverse and irregular shape, texture, and size of the IPMN cysts as well… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 10 pages, MICCAI MLMI 2023

  6. arXiv:2309.00971  [pdf, other

    eess.IV cs.CV

    AdLER: Adversarial Training with Label Error Rectification for One-Shot Medical Image Segmentation

    Authors: Xiangyu Zhao, Sheng Wang, Zhiyun Song, Zhenrong Shen, Linlin Yao, Haolei Yuan, Qian Wang, Lichi Zhang

    Abstract: Accurate automatic segmentation of medical images typically requires large datasets with high-quality annotations, making it less applicable in clinical settings due to limited training data. One-shot segmentation based on learned transformations (OSSLT) has shown promise when labeled data is extremely limited, typically including unsupervised deformable registration, data augmentation with learne… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  7. arXiv:2308.04956  [pdf, other

    eess.IV cs.CV q-bio.BM q-bio.QM

    Improved Cryo-EM Pose Estimation and 3D Classification through Latent-Space Disentanglement

    Authors: Weijie Chen, Yuhang Wang, Lin Yao

    Abstract: Due to the extremely low signal-to-noise ratio (SNR) and unknown poses (projection angles and image shifts) in cryo-electron microscopy (cryo-EM) experiments, reconstructing 3D volumes from 2D images is very challenging. In addition to these challenges, heterogeneous cryo-EM reconstruction requires conformational classification. In popular cryo-EM reconstruction algorithms, poses and conformation… ▽ More

    Submitted 22 April, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: 21 pages

    ACM Class: J.2; J.3; I.4.5

  8. arXiv:2308.00128  [pdf, other

    eess.IV cs.CV cs.LG

    Ensemble Learning with Residual Transformer for Brain Tumor Segmentation

    Authors: Lanhong Yao, Zheyuan Zhang, Ulas Bagci

    Abstract: Brain tumor segmentation is an active research area due to the difficulty in delineating highly complex shaped and textured tumors as well as the failure of the commonly used U-Net architectures. The combination of different neural architectures is among the mainstream research recently, particularly the combination of U-Net with Transformers because of their innate attention mechanism and pixel-w… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: 9 pages, 4 figures, ISBI 2023

  9. arXiv:2304.02720  [pdf, other

    eess.IV cs.CR cs.CV

    Domain Generalization with Adversarial Intensity Attack for Medical Image Segmentation

    Authors: Zheyuan Zhang, Bin Wang, Lanhong Yao, Ugur Demir, Debesh Jha, Ismail Baris Turkbey, Boqing Gong, Ulas Bagci

    Abstract: Most statistical learning algorithms rely on an over-simplified assumption, that is, the train and test data are independent and identically distributed. In real-world scenarios, however, it is common for models to encounter data from new and different domains to which they were not exposed to during training. This is often the case in medical imaging applications due to differences in acquisition… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Code is available upon publication

  10. arXiv:2303.03598  [pdf, other

    cs.CV eess.IV

    Guided Image-to-Image Translation by Discriminator-Generator Communication

    Authors: Yuanjiang Cao, Lina Yao, Le Pan, Quan Z. Sheng, Xiaojun Chang

    Abstract: The goal of Image-to-image (I2I) translation is to transfer an image from a source domain to a target domain, which has recently drawn increasing attention. One major branch of this research is to formulate I2I translation based on Generative Adversarial Network (GAN). As a zero-sum game, GAN can be reformulated as a Partially-observed Markov Decision Process (POMDP) for generators, where generato… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  11. arXiv:2205.03231  [pdf, other

    eess.SP cs.LG

    Side-aware Meta-Learning for Cross-Dataset Listener Diagnosis with Subjective Tinnitus

    Authors: Yun Li, Zhe Liu, Lina Yao, Molly Lucas, Jessica J. M. Monaghan, Yu Zhang

    Abstract: With the development of digital technology, machine learning has paved the way for the next generation of tinnitus diagnoses. Although machine learning has been widely applied in EEG-based tinnitus analysis, most current models are dataset-specific. Each dataset may be limited to a specific range of symptoms, overall disease severity, and demographic attributes; further, dataset formats may differ… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  12. arXiv:2205.03230  [pdf, other

    eess.SP cs.LG

    Disentangled and Side-aware Unsupervised Domain Adaptation for Cross-dataset Subjective Tinnitus Diagnosis

    Authors: Yun Li, Zhe Liu, Lina Yao, Jessica J. M. Monaghan, David McAlpine

    Abstract: EEG-based tinnitus classification is a valuable tool for tinnitus diagnosis, research, and treatments. Most current works are limited to a single dataset where data patterns are similar. But EEG signals are highly non-stationary, resulting in model's poor generalization to new users, sessions or datasets. Thus, designing a model that can generalize to new datasets is beneficial and indispensable.… ▽ More

    Submitted 7 November, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

  13. arXiv:2204.10461  [pdf, other

    cs.CL cs.SD eess.AS

    WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment

    Authors: Lin Yao, Jianfei Song, Ruizhuo Xu, Yingfang Yang, Zijian Chen, Yafeng Deng

    Abstract: Historically lower-level tasks such as automatic speech recognition (ASR) and speaker identification are the main focus in the speech field. Interest has been growing in higher-level spoken language understanding (SLU) tasks recently, like sentiment analysis (SA). However, improving performances on SLU tasks remains a big challenge. Basically, there are two main methods for SLU tasks: (1) Two-stag… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  14. arXiv:2203.14044  [pdf, other

    cs.LG eess.IV

    Contrastive Graph Learning for Population-based fMRI Classification

    Authors: Xuesong Wang, Lina Yao, Islem Rekik, Yu Zhang

    Abstract: Contrastive self-supervised learning has recently benefited fMRI classification with inductive biases. Its weak label reliance prevents overfitting on small medical datasets and tackles the high intraclass variances. Nonetheless, existing contrastive methods generate resemblant pairs only on pixel-level features of 3D medical images, while the functional connectivity that reveals critical cognitiv… ▽ More

    Submitted 17 July, 2022; v1 submitted 26 March, 2022; originally announced March 2022.

  15. arXiv:2112.02965  [pdf, other

    eess.SP

    A Novel Full-Polarization SAR Images Ship Detector Based on the Scattering Mechanisms and the Wave Polarization Anisotropy

    Authors: Chuan Zhang, Gui Gao, Linlin Zhang, C. Chen, S. Gao, Libo Yao, Shiquan Gou

    Abstract: Synthetic aperture radar (SAR) is considered being a good option for earth observation with its unique advantages. In this paper, we proposed an adaptive ship detector using full-polarization SAR images. First, by thoroughly investigating the scattering characteristics between ships and their background, and the wave polarization anisotropy, a novel ship detector is proposed by jointing the two ch… ▽ More

    Submitted 6 December, 2021; v1 submitted 6 December, 2021; originally announced December 2021.

  16. arXiv:2106.10277  [pdf, other

    eess.AS cs.LG cs.SD

    GPLA-12: An Acoustic Signal Dataset of Gas Pipeline Leakage

    Authors: Jie Li, Lizhong Yao

    Abstract: In this paper, we introduce a new acoustic leakage dataset of gas pipelines, called as GPLA-12, which has 12 categories over 684 training/testing acoustic signals. Unlike massive image and voice datasets, there have relatively few acoustic signal datasets, especially for engineering fault detection. In order to enhance the development of fault diagnosis, we collect acoustic leakage signals on the… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

  17. arXiv:2103.12954  [pdf, ps, other

    math.OC cs.LG eess.SY

    Convergence Analysis of Nonconvex Distributed Stochastic Zeroth-order Coordinate Method

    Authors: Shengjun Zhang, Yunlong Dong, Dong Xie, Lisha Yao, Colleen P. Bailey, Shengli Fu

    Abstract: This paper investigates the stochastic distributed nonconvex optimization problem of minimizing a global cost function formed by the summation of $n$ local cost functions. We solve such a problem by involving zeroth-order (ZO) information exchange. In this paper, we propose a ZO distributed primal-dual coordinate method (ZODIAC) to solve the stochastic optimization problem. Agents approximate thei… ▽ More

    Submitted 13 October, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  18. arXiv:2101.04793  [pdf, other

    eess.IV cs.CV

    Generative Adversarial U-Net for Domain-free Medical Image Augmentation

    Authors: Xiaocong Chen, Yun Li, Lina Yao, Ehsan Adeli, Yu Zhang

    Abstract: The shortage of annotated medical images is one of the biggest challenges in the field of medical image computing. Without a sufficient number of training samples, deep learning based models are very likely to suffer from over-fitting problem. The common solution is image manipulation such as image rotation, crop**, or resizing. Those methods can help relieve the over-fitting problem as more tra… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

  19. Momentum Contrastive Learning for Few-Shot COVID-19 Diagnosis from Chest CT Images

    Authors: Xiaocong Chen, Lina Yao, Tao Zhou, **ming Dong, Yu Zhang

    Abstract: The current pandemic, caused by the outbreak of a novel coronavirus (COVID-19) in December 2019, has led to a global emergency that has significantly impacted economies, healthcare systems and personal wellbeing all around the world. Controlling the rapidly evolving disease requires highly sensitive and specific diagnostics. While real-time RT-PCR is the most commonly used, these can take up to 8… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

  20. arXiv:2005.08221   

    eess.SY

    Harmonic Mitigation Schemes for Wind Power Plants by Embedding Control in Wind Turbines

    Authors: Qiupin Lai, Chengxi Liu, Liangzhong Yao

    Abstract: Harmonic pollution may damage the electric devices in wind power plants (WPPs), and propagate to the external grid. This paper proposes a harmonic mitigation scheme by embedding harmonic control functions in wind turbines (WTs) to manage the harmonics in WPPs. It can improve the power quality at the remote Point of Common Coupling (PCC), regulated by grid codes. The proposed scheme detects the har… ▽ More

    Submitted 15 June, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: The study developed in the paper is few clear. The justification of the proposed method is poor, the numerical example should be enhanced

  21. arXiv:2004.05645  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    Residual Attention U-Net for Automated Multi-Class Segmentation of COVID-19 Chest CT Images

    Authors: Xiaocong Chen, Lina Yao, Yu Zhang

    Abstract: The novel coronavirus disease 2019 (COVID-19) has been spreading rapidly around the world and caused significant impact on the public health and economy. However, there is still lack of studies on effectively quantifying the lung infection caused by COVID-19. As a basic but challenging task of the diagnostic framework, segmentation plays a crucial role in accurate quantification of COVID-19 infect… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

  22. arXiv:2002.09821  [pdf, other

    eess.AS cs.LG cs.SD

    A Multi-view CNN-based Acoustic Classification System for Automatic Animal Species Identification

    Authors: Weitao Xu, Xiang Zhang, Lina Yao, Wanli Xue, Bo Wei

    Abstract: Automatic identification of animal species by their vocalization is an important and challenging task. Although many kinds of audio monitoring system have been proposed in the literature, they suffer from several disadvantages such as non-trivial feature selection, accuracy degradation because of environmental noise or intensive local computation. In this paper, we propose a deep learning based ac… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

    Journal ref: Ad Hoc Networks 2020

  23. arXiv:1910.06154  [pdf

    eess.IV cs.CV physics.med-ph

    Direct Energy-resolving CT Imaging via Energy-integrating CT images using a Unified Generative Adversarial Network

    Authors: Lisha Yao, Sui Li, Manman Zhu, Dong Zeng, Zhaoying Bian, Jianhua Ma

    Abstract: Energy-resolving computed tomography (ErCT) has the ability to acquire energy-dependent measurements simultaneously and quantitative material information with improved contrast-to-noise ratio. Meanwhile, ErCT imaging system is usually equipped with an advanced photon counting detector, which is expensive and technically complex. Therefore, clinical ErCT scanners are not yet commercially available,… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: 5 pages, 3 figures, Accepted by MIC/NSS 2019

  24. arXiv:1909.10868  [pdf, other

    eess.SP cs.LG

    Adversarial Representation Learning for Robust Patient-Independent Epileptic Seizure Detection

    Authors: Xiang Zhang, Lina Yao, Manqing Dong, Zhe Liu, Yu Zhang, Yong Li

    Abstract: Objective: Epilepsy is a chronic neurological disorder characterized by the occurrence of spontaneous seizures, which affects about one percent of the world's population. Most of the current seizure detection approaches strongly rely on patient history records and thus fail in the patient-independent situation of detecting the new patients. To overcome such limitation, we propose a robust and expl… ▽ More

    Submitted 31 January, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: Accepted by the IEEE Journal of Biomedical and Health Informatics (J-BHI)

  25. arXiv:1907.13351  [pdf, other

    eess.SP cs.LG

    Multi-task Generative Adversarial Learning on Geometrical Shape Reconstruction from EEG Brain Signals

    Authors: Xiang Zhang, Xiaocong Chen, Manqing Dong, Huan Liu, Chang Ge, Lina Yao

    Abstract: Synthesizing geometrical shapes from human brain activities is an interesting and meaningful but very challenging topic. Recently, the advancements of deep generative models like Generative Adversarial Networks (GANs) have supported the object generation from neurological signals. However, the Electroencephalograph (EEG)-based shape generation still suffer from the low realism problem. In particul… ▽ More

    Submitted 28 February, 2020; v1 submitted 31 July, 2019; originally announced July 2019.

    Comments: 12 pages

    Journal ref: Published on ICONIP 2019

  26. arXiv:1905.08948  [pdf, other

    cs.HC cs.LG eess.SP

    Multi-agent Attentional Activity Recognition

    Authors: Kaixuan Chen, Lina Yao, Dalin Zhang, Bin Guo, Zhiwen Yu

    Abstract: Multi-modality is an important feature of sensor based activity recognition. In this work, we consider two inherent characteristics of human activities, the spatially-temporally varying salience of features and the relations between activities and corresponding body part motions. Based on these, we propose a multi-agent spatial-temporal attention model. The spatial-temporal attention mechanism hel… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

    Comments: Accepted by IJCAI 2019

  27. arXiv:1905.04149  [pdf, other

    cs.HC cs.LG eess.SP q-bio.NC

    A Survey on Deep Learning-based Non-Invasive Brain Signals:Recent Advances and New Frontiers

    Authors: Xiang Zhang, Lina Yao, Xianzhi Wang, Jessica Monaghan, David Mcalpine, Yu Zhang

    Abstract: Brain-Computer Interface (BCI) bridges the human's neural world and the outer physical world by decoding individuals' brain signals into commands recognizable by computer devices. Deep learning has lifted the performance of brain-computer interface systems significantly in recent years. In this article, we systematically investigate brain signal types for BCI and related deep learning concepts for… ▽ More

    Submitted 21 October, 2020; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: Accepted by Journal of Neural Engineering. Summarized more than 200+ brain signal-related papers, systematically covering 8 Brain-Computer Interface (BCI) categories and 10+ deep learning models

  28. arXiv:1905.02283  [pdf, other

    cs.CL cs.CV eess.IV

    Caveats in Generating Medical Imaging Labels from Radiology Reports

    Authors: Tobi Olatunji, Li Yao, Ben Covington, Alexander Rhodes, Anthony Upton

    Abstract: Acquiring high-quality annotations in medical imaging is usually a costly process. Automatic label extraction with natural language processing (NLP) has emerged as a promising workaround to bypass the need of expert annotation. Despite the convenience, the limitation of such an approximation has not been carefully examined and is not well understood. With a challenging set of 1,000 chest X-ray stu… ▽ More

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: Accepted workshop contribution for Medical Imaging with Deep Learning (MIDL), 2019

  29. arXiv:1904.01638  [pdf, other

    cs.CV cs.AI eess.IV stat.ML

    A Strong Baseline for Domain Adaptation and Generalization in Medical Imaging

    Authors: Li Yao, Jordan Prosky, Ben Covington, Kevin Lyman

    Abstract: This work provides a strong baseline for the problem of multi-source multi-target domain adaptation and generalization in medical imaging. Using a diverse collection of ten chest X-ray datasets, we empirically demonstrate the benefits of training medical imaging deep learning models on varied patient populations for generalization to out-of-sample domains.

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: Extended abstract of a journal submission