Search | arXiv e-print repository

Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI

Authors: Minhui Yu, Mengqi Wu, Ling Yue, Andrea Bozoki, Mingxia Liu

Abstract: Magnetic resonance imaging (MRI) and positron emission tomography (PET) are increasingly used in multimodal analysis of neurodegenerative disorders. While MRI is broadly utilized in clinical settings, PET is less accessible. Many studies have attempted to use deep generative models to synthesize PET from MRI scans. However, they often suffer from unstable training and inadequately preserve brain f… ▽ More Magnetic resonance imaging (MRI) and positron emission tomography (PET) are increasingly used in multimodal analysis of neurodegenerative disorders. While MRI is broadly utilized in clinical settings, PET is less accessible. Many studies have attempted to use deep generative models to synthesize PET from MRI scans. However, they often suffer from unstable training and inadequately preserve brain functional information conveyed by PET. To this end, we propose a functional imaging constrained diffusion (FICD) framework for 3D brain PET image synthesis with paired structural MRI as input condition, through a new constrained diffusion model (CDM). The FICD introduces noise to PET and then progressively removes it with CDM, ensuring high output fidelity throughout a stable training phase. The CDM learns to predict denoised PET with a functional imaging constraint introduced to ensure voxel-wise alignment between each denoised PET and its ground truth. Quantitative and qualitative analyses conducted on 293 subjects with paired T1-weighted MRI and 18F-fluorodeoxyglucose (FDG)-PET scans suggest that FICD achieves superior performance in generating FDG-PET data compared to state-of-the-art methods. We further validate the effectiveness of the proposed FICD on data from a total of 1,262 subjects through three downstream tasks, with experimental results suggesting its utility and generalizability. △ Less

Submitted 8 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

arXiv:2401.10070 [pdf, other]

Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks

Authors: Yichao Du, Zhirui Zhang, Linan Yue, Xu Huang, Yuqing Zhang, Tong Xu, Linli Xu, Enhong Chen

Abstract: To protect privacy and meet legal regulations, federated learning (FL) has gained significant attention for training speech-to-text (S2T) systems, including automatic speech recognition (ASR) and speech translation (ST). However, the commonly used FL approach (i.e., \textsc{FedAvg}) in S2T tasks typically suffers from extensive communication overhead due to multi-round interactions based on the wh… ▽ More To protect privacy and meet legal regulations, federated learning (FL) has gained significant attention for training speech-to-text (S2T) systems, including automatic speech recognition (ASR) and speech translation (ST). However, the commonly used FL approach (i.e., \textsc{FedAvg}) in S2T tasks typically suffers from extensive communication overhead due to multi-round interactions based on the whole model and performance degradation caused by data heterogeneity among clients.To address these issues, we propose a personalized federated S2T framework that introduces \textsc{FedLoRA}, a lightweight LoRA module for client-side tuning and interaction with the server to minimize communication overhead, and \textsc{FedMem}, a global model equipped with a $k$-nearest-neighbor ($k$NN) classifier that captures client-specific distributional shifts to achieve personalization and overcome data heterogeneity. Extensive experiments based on Conformer and Whisper backbone models on CoVoST and GigaSpeech benchmarks show that our approach significantly reduces the communication overhead on all S2T tasks and effectively personalizes the global model to overcome data heterogeneity. △ Less

Submitted 18 January, 2024; originally announced January 2024.

Comments: ICASSP 2024

arXiv:2207.12002 [pdf, ps, other]

An Optimal Motion Planning Framework for Quadruped Jum**

Authors: Zhitao Song, Linzhu Yue, Guangli Sun, Yihu Ling, Hongshuo Wei, Linhai Gui, Yun-Hui Liu

Abstract: This paper presents an optimal motion planning framework to generate versatile energy-optimal quadrupedal jum** motions automatically (e.g., flips, spin). The jum** motions via the centroidal dynamics are formulated as a 12-dimensional black-box optimization problem subject to the robot kino-dynamic constraints. Gradient-based approaches offer great success in addressing trajectory optimizatio… ▽ More This paper presents an optimal motion planning framework to generate versatile energy-optimal quadrupedal jum** motions automatically (e.g., flips, spin). The jum** motions via the centroidal dynamics are formulated as a 12-dimensional black-box optimization problem subject to the robot kino-dynamic constraints. Gradient-based approaches offer great success in addressing trajectory optimization (TO), yet, prior knowledge (e.g., reference motion, contact schedule) is required and results in sub-optimal solutions. The new proposed framework first employed a heuristics-based optimization method to avoid these problems. Moreover, a prioritization fitness function is created for heuristics-based algorithms in robot ground reaction force (GRF) planning, enhancing convergence and searching performance considerably. Since heuristics-based algorithms often require significant time, motions are planned offline and stored as a pre-motion library. A selector is designed to automatically choose motions with user-specified or perception information as input. The proposed framework has been successfully validated only with a simple continuously tracking PD controller in an open-source Mini-Cheetah by several challenging jum** motions, including jum** over a window-shaped obstacle with 30 cm height and left-flip** over a rectangle obstacle with 27 cm height. △ Less

Submitted 25 July, 2022; originally announced July 2022.

Comments: Accept by IROS 2022

arXiv:2206.12480 [pdf, other]

Attention-Guided Autoencoder for Automated Progression Prediction of Subjective Cognitive Decline with Structural MRI

Authors: Hao Guan, Ling Yue, Pew-Thian Yap, Shifu Xiao, Andrea Bozoki, Mingxia Liu

Abstract: Subjective cognitive decline (SCD) is a preclinical stage of Alzheimer's disease (AD) which occurs even before mild cognitive impairment (MCI). Progressive SCD will convert to MCI with the potential of further evolving to AD. Therefore, early identification of progressive SCD with neuroimaging techniques (e.g., structural MRI) is of great clinical value for early intervention of AD. However, exist… ▽ More Subjective cognitive decline (SCD) is a preclinical stage of Alzheimer's disease (AD) which occurs even before mild cognitive impairment (MCI). Progressive SCD will convert to MCI with the potential of further evolving to AD. Therefore, early identification of progressive SCD with neuroimaging techniques (e.g., structural MRI) is of great clinical value for early intervention of AD. However, existing MRI-based machine/deep learning methods usually suffer the small-sample-size problem which poses a great challenge to related neuroimaging analysis. The central question we aim to tackle in this paper is how to leverage related domains (e.g., AD/NC) to assist the progression prediction of SCD. Meanwhile, we are concerned about which brain areas are more closely linked to the identification of progressive SCD. To this end, we propose an attention-guided autoencoder model for efficient cross-domain adaptation which facilitates the knowledge transfer from AD to SCD. The proposed model is composed of four key components: 1) a feature encoding module for learning shared subspace representations of different domains, 2) an attention module for automatically locating discriminative brain regions of interest defined in brain atlases, 3) a decoding module for reconstructing the original input, 4) a classification module for identification of brain diseases. Through joint training of these four modules, domain invariant features can be learned. Meanwhile, the brain disease related regions can be highlighted by the attention mechanism. Extensive experiments on the publicly available ADNI dataset and a private CLAS dataset have demonstrated the effectiveness of the proposed method. The proposed model is straightforward to train and test with only 5-10 seconds on CPUs and is suitable for medical tasks with small datasets. △ Less

Submitted 16 February, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

Comments: 10 pages, 12 figures

arXiv:1907.03095 [pdf]

Map** PM2.5 concentration at sub-km level resolution: a dual-scale retrieval method

Authors: Qianqian Yang, Qiangqiang Yuan, Linwei Yue, Huanfeng Shen, Liangpei Zhang

Abstract: Satellite-based retrieval has become a popular PM2.5 monitoring method currently. To improve the retrieval performance, multiple variables are usually introduced as auxiliary variable in addition to aerosol optical depth (AOD). Different kinds of variables are usually at different resolutions varying from sub-kilometers to dozens of kilometers. Generally, when doing the retrieval, variables at dif… ▽ More Satellite-based retrieval has become a popular PM2.5 monitoring method currently. To improve the retrieval performance, multiple variables are usually introduced as auxiliary variable in addition to aerosol optical depth (AOD). Different kinds of variables are usually at different resolutions varying from sub-kilometers to dozens of kilometers. Generally, when doing the retrieval, variables at different resolutions are resampled to the same resolution as the AOD product to keep the scale consistency. A deficiency of doing this is that the information contained in the scale difference is discarded. To fully utilize the information contained at different scales, a dual-scale retrieval method is proposed in this study. At the first stage, variables which influence PM2.5 concentration at large scale were used for PM2.5 retrieval at coarse resolution. Then at the second stage, variables which affect PM2.5 distribution in finer scale, were used for the further PM2.5 retrieval at high resolution (sub-km level resolution) with the retrieved PM2.5 at the first stage at coarser resolution also as input. In this study, four different retrieval models including multiple linear regression (MLR), geographically weighted regression (GWR), random forest (RF) and generalized regression neural network (GRNN) are adopted to test the performance of the dual-scale retrieval method. Compared with the traditional retrieval method, the proposed dual-scale retrieval method can achieve PM2.5 map** at finer resolution and with higher accuracy. Dual-scale retrieval can fully utilize the information contained at different scales, thus achieving a higher resolution and accuracy. It can be used for the generation of quantitative remote sensing products in various fields, and promote the improvement of the quality of quantitative remote sensing products. △ Less

Submitted 6 July, 2019; originally announced July 2019.

Showing 1–5 of 5 results for author: Yue, L