Skip to main content

Showing 1–50 of 131 results for author: Liu, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.10395  [pdf, other

    eess.IV cs.CV q-bio.NC

    BrainFounder: Towards Brain Foundation Models for Neuroimage Analysis

    Authors: Joseph Cox, Peng Liu, Skylar E. Stolte, Yunchao Yang, Kang Liu, Kyle B. See, Huiwen Ju, Ruogu Fang

    Abstract: The burgeoning field of brain health research increasingly leverages artificial intelligence (AI) to interpret and analyze neurological data. This study introduces a novel approach towards the creation of medical foundation models by integrating a large-scale multi-modal magnetic resonance imaging (MRI) dataset derived from 41,400 participants in its own. Our method involves a novel two-stage pret… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 17 pages, 5 figures, to be published in Medical Image Analysis

  2. arXiv:2406.09950  [pdf, other

    cs.SD cs.CL eess.AS

    An efficient text augmentation approach for contextualized Mandarin speech recognition

    Authors: Naijun Zheng, Xucheng Wan, Kai Liu, Ziqing Du, Zhou Huan

    Abstract: Although contextualized automatic speech recognition (ASR) systems are commonly used to improve the recognition of uncommon words, their effectiveness is hindered by the inherent limitations of speech-text data availability. To address this challenge, our study proposes to leverage extensive text-only datasets and contextualize pre-trained ASR models using a straightforward text-augmentation (TA)… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: accepted to interspeech2024

  3. arXiv:2406.06649  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution

    Authors: Kai Liu, Haotong Qin, Yong Guo, Xin Yuan, Linghe Kong, Guihai Chen, Yulun Zhang

    Abstract: Low-bit quantization has become widespread for compressing image super-resolution (SR) models for edge deployment, which allows advanced SR models to enjoy compact low-bit parameters and efficient integer/bitwise constructions for storage compression and inference acceleration, respectively. However, it is notorious that low-bit quantization degrades the accuracy of SR models compared to their ful… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages, 6 figures. The code and models will be available at https://github.com/Kai-Liu001/2DQuant

  4. arXiv:2406.06337  [pdf, other

    physics.optics eess.IV physics.bio-ph

    System- and Sample-agnostic Isotropic 3D Microscopy by Weakly Physics-informed, Domain-shift-resistant Axial Deblurring

    Authors: Jiashu Han, Kunzan Liu, Keith B. Isaacson, Kristina Monakhova, Linda G. Griffith, Sixian You

    Abstract: Three-dimensional (3D) subcellular imaging is essential for biomedical research, but the diffraction limit of optical microscopy compromises axial resolution, hindering accurate 3D structural analysis. This challenge is particularly pronounced in label-free imaging of thick, heterogeneous tissues, where assumptions about data distribution (e.g. sparsity, label-specific distribution, and lateral-ax… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 27 pages, 6 figures

  5. arXiv:2405.16905  [pdf, other

    eess.SY

    Privacy and Security Trade-off in Interconnected Systems with Known or Unknown Privacy Noise Covariance

    Authors: Haojun Wang, Kun Liu, Baojia Li, Emilia Fridman, Yuanqing Xia

    Abstract: This paper is concerned with the security problem for interconnected systems, where each subsystem is required to detect local attacks using locally available information and the information received from its neighboring subsystems. Moreover, we consider that there exists an additional eavesdropper being able to infer the private information by eavesdrop** transmitted data between subsystems. Th… ▽ More

    Submitted 1 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.14905  [pdf, other

    eess.IV cs.AI cs.CL

    Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation

    Authors: Kang Liu, Zhuoqi Ma, Xiaolu Kang, Zhusi Zhong, Zhicheng Jiao, Grayson Baird, Harrison Bai, Qiguang Miao

    Abstract: The automated generation of imaging reports proves invaluable in alleviating the workload of radiologists. A clinically applicable reports generation algorithm should demonstrate its effectiveness in producing reports that accurately describe radiology findings and attend to patient-specific indications. In this paper, we introduce a novel method, \textbf{S}tructural \textbf{E}ntities extraction a… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: The code is available at https://github.com/mk-runner/SEI-Temp or https://github.com/mk-runner/SEI

  7. arXiv:2405.11211  [pdf

    eess.SY cs.LG

    Excess Delay from GDP: Measurement and Causal Analysis

    Authors: Ke Liu, Mark Hansen

    Abstract: Ground Delay Programs (GDPs) have been widely used to resolve excessive demand-capacity imbalances at arrival airports by shifting foreseen airborne delay to pre-departure ground delay. While offering clear safety and efficiency benefits, GDPs may also create additional delay because of imperfect execution and uncertainty in predicting arrival airport capacity. This paper presents a methodology fo… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: International Conference on Research in Air Transportation (ICRAT 2022) link: https://www.icrat.org/previous-conferences/10th-international-conference/papers/

    Journal ref: International Conference on Research in Air Transportation (ICRAT 2022)

  8. arXiv:2405.09586  [pdf, other

    eess.IV cs.AI cs.CV

    Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation

    Authors: Kang Liu, Zhuoqi Ma, Mengmeng Liu, Zhicheng Jiao, Xiaolu Kang, Qiguang Miao, Kun Xie

    Abstract: The automation of writing imaging reports is a valuable tool for alleviating the workload of radiologists. Crucial steps in this process involve the cross-modal alignment between medical images and reports, as well as the retrieval of similar historical cases. However, the presence of presentation-style vocabulary (e.g., sentence structure and grammar) in reports poses challenges for cross-modal a… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  9. arXiv:2405.09207  [pdf, other

    cs.IT eess.SY

    An Exact Theory of Causal Emergence for Linear Stochastic Iteration Systems

    Authors: Kaiwei Liu, Bing Yuan, Jiang Zhang

    Abstract: After coarse-graining a complex system, the dynamics of its macro-state may exhibit more pronounced causal effects than those of its micro-state. This phenomenon, known as causal emergence, is quantified by the indicator of effective information. However, two challenges confront this theory: the absence of well-developed frameworks in continuous stochastic dynamical systems and the reliance on coa… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  10. Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer

    Authors: Whenty Ariyanti, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao

    Abstract: Respiratory disease, the third leading cause of deaths globally, is considered a high-priority ailment requiring significant research on identification and treatment. Stethoscope-recorded lung sounds and artificial intelligence-powered devices have been used to identify lung disorders and aid specialists in making accurate diagnoses. In this study, audio-spectrogram vision transformer (AS-ViT), a… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Published in 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

    Journal ref: 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (2023) 1-4

  11. arXiv:2405.08306  [pdf, other

    math.OC eess.SY

    Flight Path Optimization with Optimal Control Method

    Authors: Gaofeng Su, Xi Cheng, Siyuan Feng, Ke Liu, Jilin Song, Jianan Chen, Chen Zhu, Hui Lin

    Abstract: This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control(CFTOC). The first step is to d… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  12. arXiv:2405.01200  [pdf, other

    eess.SY cs.LG

    Learning-to-solve unit commitment based on few-shot physics-guided spatial-temporal graph convolution network

    Authors: Mei Yang, Gao Qiu andJunyong Liu, Kai Liu

    Abstract: This letter proposes a few-shot physics-guided spatial temporal graph convolutional network (FPG-STGCN) to fast solve unit commitment (UC). Firstly, STGCN is tailored to parameterize UC. Then, few-shot physics-guided learning scheme is proposed. It exploits few typical UC solutions yielded via commercial optimizer to escape from local minimum, and leverages the augmented Lagrangian method for cons… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  13. arXiv:2404.19182  [pdf, other

    eess.SP

    Robust Proximity Detection using On-Device Gait Monitoring

    Authors: Yuqian Hu, Guozhen Zhu, Beibei Wang, K. J. Ray Liu

    Abstract: Proximity detection in indoor environments based on WiFi signals has gained significant attention in recent years. Existing works rely on the dynamic signal reflections and their extracted features are dependent on motion strength. To address this issue, we design a robust WiFi-based proximity detector by considering gait monitoring. Specifically, we propose a gait score that accurately evaluates… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: This work has been accepted in IEEE 9th World Forum on Internet of Things (WFIoT)

  14. arXiv:2404.15290  [pdf

    eess.SP

    A point cloud processing method of mmWave radar over automotive scenario

    Authors: Qingmian Wan, Hongli Peng, Xing Liao, Kuayue Liu

    Abstract: This paper introduces in detail the effective method of comprehensive target judgment by using radar RA map and point cloud map. Different output of radar can effectively judge the road boundary of target and the relative coordinates of target, avoid the error of output caused by excessive processing information, and greatly improve the processing efficiency of DBSCAN of the measured target.

    Submitted 23 March, 2024; originally announced April 2024.

  15. arXiv:2404.13550  [pdf, other

    cs.CV eess.IV

    Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes

    Authors: Kang You, Kai Liu, Li Yu, Pan Gao, Dandan Ding

    Abstract: Despite considerable progress being achieved in point cloud geometry compression, there still remains a challenge in effectively compressing large-scale scenes with sparse surfaces. Another key challenge lies in reducing decoding latency, a crucial requirement in real-world application. In this paper, we propose Pointsoup, an efficient learning-based geometry codec that attains high-performance an… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  16. arXiv:2404.06693  [pdf, other

    cs.CV eess.IV

    Binomial Self-compensation for Motion Error in Dynamic 3D Scanning

    Authors: Geyou Zhang, Ce Zhu, Kai Liu

    Abstract: Phase shifting profilometry (PSP) is favored in high-precision 3D scanning due to its high accuracy, robustness, and pixel-wise property. However, a fundamental assumption of PSP that the object should remain static is violated in dynamic measurement, making PSP susceptible to object moving, resulting in ripple-like errors in the point clouds. We propose a pixel-wise and frame-wise loopable binomi… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  17. arXiv:2402.05482  [pdf, other

    eess.SP cs.LG

    A Non-Intrusive Neural Quality Assessment Model for Surface Electromyography Signals

    Authors: Cho-Yuan Lee, Kuan-Chen Wang, Kai-Chun Liu, Yu-Te Wang, Xugang Lu, **-Cheng Yeh, Yu Tsao

    Abstract: In practical scenarios involving the measurement of surface electromyography (sEMG) in muscles, particularly those areas near the heart, one of the primary sources of contamination is the presence of electrocardiogram (ECG) signals. To assess the quality of real-world sEMG data more effectively, this study proposes QASE-net, a new non-intrusive model that predicts the SNR of sEMG signals. QASE-net… ▽ More

    Submitted 13 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures

  18. SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising

    Authors: Yu-Tung Liu, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao

    Abstract: Surface electromyography (sEMG) recordings can be influenced by electrocardiogram (ECG) signals when the muscle being monitored is close to the heart. Several existing methods use signal-processing-based approaches, such as high-pass filter and template subtraction, while some derive map** functions to restore clean sEMG signals from noisy sEMG (sEMG with ECG interference). Recently, the score-b… ▽ More

    Submitted 23 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: This paper is accepted by ICASSP 2024

  19. arXiv:2402.00996  [pdf, other

    cs.CV eess.SP

    mmID: High-Resolution mmWave Imaging for Human Identification

    Authors: Sakila S. Jayaweera, Sai Deepika Regani, Yuqian Hu, Beibei Wang, K. J. Ray Liu

    Abstract: Achieving accurate human identification through RF imaging has been a persistent challenge, primarily attributed to the limited aperture size and its consequent impact on imaging resolution. The existing imaging solution enables tasks such as pose estimation, activity recognition, and human tracking based on deep neural networks by estimating skeleton joints. In contrast to estimating joints, this… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: This paper was published in the IEEE 9th World Forum on Internet of Things

  20. arXiv:2401.16714  [pdf

    eess.IV

    A Point Cloud Enhancement Method for 4D mmWave Radar Imagery

    Authors: Qingmian Wan, Hongli Peng, Xing Liao, Kuayue Liu, Junfa Mao

    Abstract: A point cloud enhancement method for 4D mmWave radar imagery is proposed in this paper. Based on the patch antenna and MIMO array theories, the MIMO array with small redundancy and high SNR is designed to provide the probability of high angular resolution and detection rate. The antenna array is deployed using a ladder shape in vertical direction to decrease the redundancy and improve the resoluti… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  21. arXiv:2312.12653  [pdf, other

    eess.IV cs.CV

    Diagnosis Of Takotsubo Syndrome By Robust Feature Selection From The Complex Latent Space Of DL-based Segmentation Network

    Authors: Fahim Ahmed Zaman, Wahidul Alam, Tarun Kanti Roy, Amanda Chang, Kan Liu, Xiaodong Wu

    Abstract: Researchers have shown significant correlations among segmented objects in various medical imaging modalities and disease related pathologies. Several studies showed that using hand crafted features for disease prediction neglects the immense possibility to use latent features from deep learning (DL) models which may reduce the overall accuracy of differential diagnosis. However, directly using cl… ▽ More

    Submitted 18 January, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, conference

  22. arXiv:2312.12649  [pdf, other

    eess.IV cs.CV

    Surf-CDM: Score-Based Surface Cold-Diffusion Model For Medical Image Segmentation

    Authors: Fahim Ahmed Zaman, Mathews Jacob, Amanda Chang, Kan Liu, Milan Sonka, Xiaodong Wu

    Abstract: Diffusion models have shown impressive performance for image generation, often times outperforming other generative models. Since their introduction, researchers have extended the powerful noise-to-image denoising pipeline to discriminative tasks, including image segmentation. In this work we propose a conditional score-based generative modeling framework for medical image segmentation which relie… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 5 pages, 5 figures, conference

  23. arXiv:2312.08267  [pdf, other

    eess.IV cs.CV q-bio.QM

    TABSurfer: a Hybrid Deep Learning Architecture for Subcortical Segmentation

    Authors: Aaron Cao, Vishwanatha M. Rao, Kejia Liu, Xinru Liu, Andrew F. Laine, Jia Guo

    Abstract: Subcortical segmentation remains challenging despite its important applications in quantitative structural analysis of brain MRI scans. The most accurate method, manual segmentation, is highly labor intensive, so automated tools like FreeSurfer have been adopted to handle this task. However, these traditional pipelines are slow and inefficient for processing large datasets. In this study, we propo… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, 2 tables

  24. arXiv:2311.15221  [pdf, other

    cs.IT cs.LG eess.SP math.OC math.ST stat.ML

    The Local Landscape of Phase Retrieval Under Limited Samples

    Authors: Kaizhao Liu, Zihao Wang, Lei Wu

    Abstract: In this paper, we provide a fine-grained analysis of the local landscape of phase retrieval under the regime with limited samples. Our aim is to ascertain the minimal sample size necessary to guarantee a benign local landscape surrounding global minima in high dimensions. Let $n$ and $d$ denote the sample size and input dimension, respectively. We first explore the local convexity and establish th… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 41 pages

  25. arXiv:2311.10568  [pdf, other

    eess.IV cs.CV

    Phase Guided Light Field for Spatial-Depth High Resolution 3D Imaging

    Authors: Geyou Zhang, Ce Zhu, Kai Liu, Yipeng Liu

    Abstract: On 3D imaging, light field cameras typically are of single shot, and however, they heavily suffer from low spatial resolution and depth accuracy. In this paper, by employing an optical projector to project a group of single high-frequency phase-shifted sinusoid patterns, we propose a phase guided light field algorithm to significantly improve both the spatial and depth resolutions for off-the-shel… ▽ More

    Submitted 9 April, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  26. arXiv:2310.20148  [pdf, other

    cs.AI cs.RO eess.SY

    Decision-Making for Autonomous Vehicles with Interaction-Aware Behavioral Prediction and Social-Attention Neural Network

    Authors: Xiao Li, Kaiwen Liu, H. Eric Tseng, Anouck Girard, Ilya Kolmanovsky

    Abstract: Autonomous vehicles need to accomplish their tasks while interacting with human drivers in traffic. It is thus crucial to equip autonomous vehicles with artificial reasoning to better comprehend the intentions of the surrounding traffic, thereby facilitating the accomplishments of the tasks. In this work, we propose a behavioral model that encodes drivers' interacting intentions into latent social… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  27. arXiv:2310.16102  [pdf, other

    eess.IV cs.CV physics.optics

    Learned, Uncertainty-driven Adaptive Acquisition for Photon-Efficient Multiphoton Microscopy

    Authors: Cassandra Tong Ye, Jiashu Han, Kunzan Liu, Anastasios Angelopoulos, Linda Griffith, Kristina Monakhova, Sixian You

    Abstract: Multiphoton microscopy (MPM) is a powerful imaging tool that has been a critical enabler for live tissue imaging. However, since most multiphoton microscopy platforms rely on point scanning, there is an inherent trade-off between acquisition time, field of view (FOV), phototoxicity, and image quality, often resulting in noisy measurements when fast, large FOV, and/or gentle imaging is needed. Deep… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  28. arXiv:2309.14497  [pdf, other

    cs.AI cs.RO eess.SY

    Interaction-Aware Decision-Making for Autonomous Vehicles in Forced Merging Scenario Leveraging Social Psychology Factors

    Authors: Xiao Li, Kaiwen Liu, H. Eric Tseng, Anouck Girard, Ilya Kolmanovsky

    Abstract: Understanding the intention of vehicles in the surrounding traffic is crucial for an autonomous vehicle to successfully accomplish its driving tasks in complex traffic scenarios such as highway forced merging. In this paper, we consider a behavioral model that incorporates both social behaviors and personal objectives of the interacting drivers. Leveraging this model, we develop a receding-horizon… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  29. arXiv:2308.15943  [pdf

    eess.SP

    Improved Ultrasound Attenuation Coefficient Estimation Using Spectral Normalization on Local Interference-Free Single-Scatterer Power Spectrum

    Authors: Kun-Lin Liu, Yu-Heng Chen, Chiao-Yin Wang, Po-Hsiang Tsui, Meng-Lin Li

    Abstract: Ultrasound attenuation coefficient estimation (ACE) can be utilized to quantify liver fat content, offering significant diagnostic potential in addressing the growing global public health issue of non-alcoholic fatty liver and other chronic liver diseases. Among ACE methods, the reference frequency method (RFM) proposed recently possesses the advantages of being system-independent and not requirin… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  30. arXiv:2308.02190  [pdf, other

    cs.SD cs.CL eess.AS

    Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition

    Authors: Jiaxin Ye, Yujie Wei, Xin-Cheng Wen, Chenglong Ma, Zhizhong Huang, Kunhong Liu, Hongming Shan

    Abstract: Cross-corpus speech emotion recognition (SER) seeks to generalize the ability of inferring speech emotion from a well-labeled corpus to an unlabeled one, which is a rather challenging task due to the significant discrepancy between two corpora. Existing methods, typically based on unsupervised domain adaptation (UDA), struggle to learn corpus-invariant features by global distribution alignment, bu… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  31. arXiv:2307.15510  [pdf, other

    eess.SY cs.MA cs.RO

    Formation Control for Moving Target Enclosing via Relative Localization

    Authors: Xueming Liu, Kunda Liu, Tianjiang Hu, Qingrui Zhang

    Abstract: In this paper, we investigate the problem of controlling multiple unmanned aerial vehicles (UAVs) to enclose a moving target in a distributed fashion based on a relative distance and self-displacement measurements. A relative localization technique is developed based on the recursive least square estimation (RLSE) technique with a forgetting factor to estimates both the ``UAV-UAV'' and ``UAV-targe… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: 8 Pages, accepted by IEEE CDC 2023

  32. arXiv:2307.09714  [pdf

    physics.optics eess.IV

    Flexible single multimode fiber imaging using white LED

    Authors: Minyu Fan, Kun Liu, Jie Zhu, Yu Cao, Sha Wang

    Abstract: Multimode fiber (MMF) has been proven to have good potential in imaging and optical communication because of its advantages of small diameter and large mode numbers. However, due to the mode coupling and modal dispersion, it is very sensitive to environmental changes. Minor changes in the fiber shape can lead to difficulties in information reconstruction. Here, white LED and cascaded Unet are used… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  33. arXiv:2305.00216  [pdf, other

    eess.SY cs.LG

    Physics-Guided Graph Neural Networks for Real-time AC/DC Power Flow Analysis

    Authors: Mei Yang, Gao Qiu, Yong Wu, Junyong Liu, Nina Dai, Yue Shui, Kai Liu, Lijie Ding

    Abstract: The increasing scale of alternating current and direct current (AC/DC) hybrid systems necessitates a faster power flow analysis tool than ever. This letter thus proposes a specific physics-guided graph neural network (PG-GNN). The tailored graph modelling of AC and DC grids is firstly advanced to enhance the topology adaptability of the PG-GNN. To eschew unreliable experience emulation from data,… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  34. arXiv:2304.07813  [pdf, other

    cs.IT eess.SP

    Deep Reinforcement Learning-Assisted Age-optimal Transmission Policy for HARQ-aided NOMA Networks

    Authors: Kunpeng Liu, Aimin Li, Shaohua Wu

    Abstract: The recent interweaving of AI-6G technologies has sparked extensive research interest in further enhancing reliable and timely communications. \emph{Age of Information} (AoI), as a novel and integrated metric implying the intricate trade-offs among reliability, latency, and update frequency, has been well-researched since its conception. This paper contributes new results in this area by employing… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  35. arXiv:2304.06335  [pdf

    cs.LG eess.SP

    Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks

    Authors: Chien-Pin Liu, Ju-Hsuan Li, En-** Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao

    Abstract: Falls are the public health issue for the elderly all over the world since the fall-induced injuries are associated with a large amount of healthcare cost. Falls can cause serious injuries, even leading to death if the elderly suffers a "long-lie". Hence, a reliable fall detection (FD) system is required to provide an emergency alarm for first aid. Due to the advances in wearable device technology… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  36. arXiv:2304.05685  [pdf

    eess.IV eess.SP eess.SY

    Multisensor fusion-based digital twin in additive manufacturing for in-situ quality monitoring and defect correction

    Authors: Lequn Chen, Xiling Yao, Kui Liu, Chaolin Tan, Seung Ki Moon

    Abstract: Early detection and correction of defects are critical in additive manufacturing (AM) to avoid build failures. In this paper, we present a multisensor fusion-based digital twin for in-situ quality monitoring and defect correction in a robotic laser direct energy deposition process. Multisensor fusion sources consist of an acoustic sensor, an infrared thermal camera, a coaxial vision camera, and a… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 11 pages, 9 figures. Accepted at 24th International Conference on Engineering Design (ICED23)

  37. arXiv:2303.13243  [pdf, other

    eess.AS cs.SD

    Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition

    Authors: Kai Liu, Hailiang Xiong, Gangqiang Yang, Zhengfeng Du, Yewen Cao, Danyal Shah

    Abstract: As one of the major branches of automatic speech recognition, attention-based models greatly improves the feature representation ability of the model. In particular, the multi-head mechanism is employed in the attention, ho** to learn speech features of more aspects in different attention subspaces. For speech recognition of complex languages, on the one hand, a small head size will lead to an o… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  38. Non-Orthogonal Multiple Access Enhanced Multi-User Semantic Communication

    Authors: Weizhi Li, Haotai Liang, Chen Dong, Xiaodong Xu, ** Zhang, Kaijun Liu

    Abstract: Semantic communication serves as a novel paradigm and attracts the broad interest of researchers. One critical aspect of it is the multi-user semantic communication theory, which can further promote its application to the practical network environment. While most existing works focused on the design of end-to-end single-user semantic transmission, a novel non-orthogonal multiple access (NOMA)-base… ▽ More

    Submitted 20 November, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: accepted by IEEE Transactions on Cognitive Communications and Networking

  39. arXiv:2303.05023  [pdf, other

    eess.AS cs.AI cs.SD

    X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion

    Authors: Kai Liu, Ziqing Du, Xucheng Wan, Huan Zhou

    Abstract: Target speech extraction (TSE) systems are designed to extract target speech from a multi-talker mixture. The popular training objective for most prior TSE networks is to enhance reconstruction performance of extracted speech waveform. However, it has been reported that a TSE system delivers high reconstruction performance may still suffer low-quality experience problems in practice. One such expe… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  40. arXiv:2303.03634  [pdf

    eess.SP cs.LG

    PreFallKD: Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation

    Authors: Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao, Chia-Tai Chan

    Abstract: Fall accidents are critical issues in an aging and aged society. Recently, many researchers developed pre-impact fall detection systems using deep learning to support wearable-based fall protection systems for preventing severe injuries. However, most works only employed simple neural network models instead of complex models considering the usability in resource-constrained mobile devices and stri… ▽ More

    Submitted 28 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  41. arXiv:2302.06727  [pdf, other

    cs.LG cs.CV eess.IV

    Deep Learning Predicts Prevalent and Incident Parkinson's Disease From UK Biobank Fundus Imaging

    Authors: Charlie Tran, Kai Shen, Kang Liu, Akshay Ashok, Adolfo Ramirez-Zamora, **ghua Chen, Yulin Li, Ruogu Fang

    Abstract: Parkinson's disease is the world's fastest-growing neurological disorder. Research to elucidate the mechanisms of Parkinson's disease and automate diagnostics would greatly improve the treatment of patients with Parkinson's disease. Current diagnostic methods are expensive and have limited availability. Considering the insidious and preclinical onset and progression of the disease, a desirable scr… ▽ More

    Submitted 18 February, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 17 pages, 4 figures, 2 tables, 4 supplementary tables

  42. arXiv:2301.06277  [pdf, ps, other

    cs.SD cs.AI cs.LG eess.AS

    Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings

    Authors: Kai Liu, Xucheng Wan, Ziqing Du, Huan Zhou

    Abstract: As a practical alternative of speech separation, target speaker extraction (TSE) aims to extract the speech from the desired speaker using additional speaker cue extracted from the speaker. Its main challenge lies in how to properly extract and leverage the speaker cue to benefit the extracted speech quality. The cue extraction method adopted in majority existing TSE studies is to directly utilize… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: ACCEPTED by NCMMSC 2022

  43. arXiv:2301.02383  [pdf, other

    q-bio.QM cs.LG eess.IV q-bio.GN

    Deep Biological Pathway Informed Pathology-Genomic Multimodal Survival Prediction

    Authors: Lin Qiu, Aminollah Khormali, Kai Liu

    Abstract: The integration of multi-modal data, such as pathological images and genomic data, is essential for understanding cancer heterogeneity and complexity for personalized treatments, as well as for enhancing survival predictions. Despite the progress made in integrating pathology and genomic data, most existing methods cannot mine the complex inter-modality relations thoroughly. Additionally, identify… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

  44. arXiv:2301.00553  [pdf, other

    eess.IV

    Lightweight Image Inpainting by Stripe Window Transformer with Joint Attention to CNN

    Authors: Tsung-Jung Liu, Bo-Wei Chen, Kuan-Hsien Liu

    Abstract: Image inpainting is an important task in computer vision. As admirable methods are presented, the inpainted image is getting closer to reality. However, the result is still not good enough in the reconstructed texture and structure based on human vision. Although recent advances in computer hardware have enabled the development of larger and more complex models, there is still a need for lightweig… ▽ More

    Submitted 23 August, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: 6 pages and 5 images, contributions to MLSP 2023

  45. arXiv:2212.03515  [pdf, other

    eess.SP cs.IT cs.LG stat.ML

    FPGA Implementation of Multi-Layer Machine Learning Equalizer with On-Chip Training

    Authors: Keren Liu, Erik Börjeson, Christian Häger, Per Larsson-Edefors

    Abstract: We design and implement an adaptive machine learning equalizer that alternates multiple linear and nonlinear computational layers on an FPGA. On-chip training via gradient backpropagation is shown to allow for real-time adaptation to time-varying channel impairments.

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: To be presented at the 2023 Optical Fiber Communication Conference (OFC)

  46. arXiv:2211.10658  [pdf, other

    cs.SD cs.CV cs.GR eess.AS

    EDGE: Editable Dance Generation From Music

    Authors: Jonathan Tseng, Rodrigo Castellon, C. Karen Liu

    Abstract: Dance is an important human art form, but creating new dances can be difficult and time-consuming. In this work, we introduce Editable Dance GEneration (EDGE), a state-of-the-art method for editable dance generation that is capable of creating realistic, physically-plausible dances while remaining faithful to the input music. EDGE uses a transformer-based diffusion model paired with Jukebox, a str… ▽ More

    Submitted 27 November, 2022; v1 submitted 19 November, 2022; originally announced November 2022.

    Comments: Project website: https://edge-dance.github.io

  47. Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition

    Authors: Jiaxin Ye, Xin-cheng Wen, Yujie Wei, Yong Xu, Kunhong Liu, Hongming Shan

    Abstract: Speech emotion recognition (SER) plays a vital role in improving the interactions between humans and machines by inferring human emotion and affective states from speech signals. Whereas recent works primarily focus on mining spatiotemporal information from hand-crafted features, we explore how to model the temporal patterns of speech emotions from dynamic temporal scales. Towards that goal, we in… ▽ More

    Submitted 14 August, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: ICASSP 2023

    Journal ref: IEEE ICASSP 2023

  48. arXiv:2210.17386  [pdf, other

    cs.NI eess.SP

    Cooperative Sensing and Uploading for Quality-Cost Tradeoff of Digital Twins in VEC

    Authors: Kai Liu, Xincao Xu, Penglin Dai, Biwen Chen

    Abstract: Recent advances in sensing technologies, wireless communications, and computing paradigms drive the evolution of vehicles in becoming an intelligent and electronic consumer products. This paper investigates enabling digital twins in vehicular edge computing (DT-VEC) via cooperative sensing and uploading, and makes the first attempt to achieve the quality-cost tradeoff in DT-VEC. First, a DT-VEC ar… ▽ More

    Submitted 27 January, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2209.12265

  49. arXiv:2210.15834  [pdf, other

    cs.SD cs.AI cs.HC eess.AS

    GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition

    Authors: Jia-Xin Ye, Xin-Cheng Wen, Xuan-Ze Wang, Yong Xu, Yan Luo, Chang-Li Wu, Li-Yan Chen, Kun-Hong Liu

    Abstract: In human-computer interaction, Speech Emotion Recognition (SER) plays an essential role in understanding the user's intent and improving the interactive experience. While similar sentimental speeches own diverse speaker characteristics but share common antecedents and consequences, an essential challenge for SER is how to produce robust and discriminative representations through causality between… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: The source code is available at: https://github.com/Jiaxin-Ye/GM-TCNet

    Journal ref: speech communication, 145, November 2022, 21-35

  50. arXiv:2210.13271  [pdf, other

    eess.SP cs.LG

    ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks

    Authors: Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao

    Abstract: Electrocardiogram (ECG) artifact contamination often occurs in surface electromyography (sEMG) applications when the measured muscles are in proximity to the heart. Previous studies have developed and proposed various methods, such as high-pass filtering, template subtraction and so forth. However, these methods remain limited by the requirement of reference signals and distortion of original sEMG… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 5 pages, 5 figures