Skip to main content

Showing 1–21 of 21 results for author: Liang, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.16446  [pdf, ps, other

    eess.SP

    A New Solution for MU-MISO Symbol-Level Precoding: Extrapolation and Deep Unfolding

    Authors: Mu Liang, Ang Li, Xiaoyan Hu, Christos Masouros

    Abstract: Constructive interference (CI) precoding, which converts the harmful multi-user interference into beneficial signals, is a promising and efficient interference management scheme in multi-antenna communication systems. However, CI-based symbol-level precoding (SLP) experiences high computational complexity as the number of symbol slots increases within a transmission block, rendering it unaffordabl… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  2. arXiv:2405.14802  [pdf, other

    eess.IV cs.CV

    Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation

    Authors: Hongxu Jiang, Muhammad Imran, Linhai Ma, Teng Zhang, Yuyin Zhou, Muxuan Liang, Kuang Gong, Wei Shao

    Abstract: Denoising diffusion probabilistic models (DDPMs) have achieved unprecedented success in computer vision. However, they remain underutilized in medical imaging, a field crucial for disease diagnosis and treatment planning. This is primarily due to the high computational cost associated with (1) the use of large number of time steps (e.g., 1,000) in diffusion processes and (2) the increased dimensio… ▽ More

    Submitted 23 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2404.09841  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Anatomy of Industrial Scale Multilingual ASR

    Authors: Francis McCann Ramirez, Luka Chkhetiani, Andrew Ehrenberg, Robert McHardy, Rami Botros, Yash Khare, Andrea Vanzo, Taufiquzzaman Peyash, Gabriel Oexle, Michael Liang, Ilya Sklyar, Enver Fakhan, Ahmed Etefy, Daniel McCrystal, Sam Flamini, Domenic Donato, Takuya Yoshioka

    Abstract: This paper describes AssemblyAI's industrial-scale automatic speech recognition (ASR) system, designed to meet the requirements of large-scale, multilingual ASR serving various application needs. Our system leverages a diverse training dataset comprising unsupervised (12.5M hours), supervised (188k hours), and pseudo-labeled (1.6M hours) data across four languages. We provide a detailed descriptio… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2404.07341  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrap**

    Authors: Kevin Zhang, Luka Chkhetiani, Francis McCann Ramirez, Yash Khare, Andrea Vanzo, Michael Liang, Sergio Ramirez Martin, Gabriel Oexle, Ruben Bousbib, Taufiquzzaman Peyash, Michael Nguyen, Dillon Pulliam, Domenic Donato

    Abstract: This paper presents Conformer-1, an end-to-end Automatic Speech Recognition (ASR) model trained on an extensive dataset of 570k hours of speech audio data, 91% of which was acquired from publicly available sources. To achieve this, we perform Noisy Student Training after generating pseudo-labels for the unlabeled public data using a strong Conformer RNN-T baseline model. The addition of these pseu… ▽ More

    Submitted 12 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  5. arXiv:2402.11954  [pdf, other

    cs.SD cs.MM eess.AS

    Multimodal Emotion Recognition from Raw Audio with Sinc-convolution

    Authors: Xiaohui Zhang, Wenjie Fu, Mangui Liang

    Abstract: Speech Emotion Recognition (SER) is still a complex task for computers with average recall rates usually about 70% on the most realistic datasets. Most SER systems use hand-crafted features extracted from audio signal such as energy, zero crossing rate, spectral information, prosodic, mel frequency cepstral coefficient (MFCC), and so on. More recently, using raw waveform for training neural networ… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  6. arXiv:2402.11931  [pdf, other

    cs.SD eess.AS q-bio.NC

    Soft-Weighted CrossEntropy Loss for Continous Alzheimer's Disease Detection

    Authors: Xiaohui Zhang, Wenjie Fu, Mangui Liang

    Abstract: Alzheimer's disease is a common cognitive disorder in the elderly. Early and accurate diagnosis of Alzheimer's disease (AD) has a major impact on the progress of research on dementia. At present, researchers have used machine learning methods to detect Alzheimer's disease from the speech of participants. However, the recognition accuracy of current methods is unsatisfactory, and most of them focus… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  7. arXiv:2312.15564  [pdf, ps, other

    eess.SP

    A Belief Propagation Approach for Direct Multipath-Based SLAM

    Authors: Mingchao Liang, Erik Leitinger, Florian Meyer

    Abstract: In this work, we develop a multipath-based simultaneous localization and map** (SLAM) method that can directly be applied to received radio signals. In existing multipath-based SLAM approaches, a channel estimator is used as a preprocessing stage that reduces data flow and computational complexity by extracting features related to multipath components (MPCs). We aim to avoid any preprocessing st… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  8. arXiv:2310.18529  [pdf, other

    physics.optics eess.IV

    FPM-INR: Fourier ptychographic microscopy image stack reconstruction using implicit neural representations

    Authors: Haowen Zhou, Brandon Y. Feng, Haiyun Guo, Siyu Lin, Mingshu Liang, Christopher A. Metzler, Changhuei Yang

    Abstract: Image stacks provide invaluable 3D information in various biological and pathological imaging applications. Fourier ptychographic microscopy (FPM) enables reconstructing high-resolution, wide field-of-view image stacks without z-stack scanning, thus significantly accelerating image acquisition. However, existing FPM methods take tens of minutes to reconstruct and gigabytes of memory to store a hig… ▽ More

    Submitted 31 October, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Project Page: https://hwzhou2020.github.io/FPM-INR-Web/

  9. arXiv:2307.08323  [pdf, other

    cs.SD eess.AS

    TST: Time-Sparse Transducer for Automatic Speech Recognition

    Authors: Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao

    Abstract: End-to-end model, especially Recurrent Neural Network Transducer (RNN-T), has achieved great success in speech recognition. However, transducer requires a great memory footprint and computing time when processing a long decoding sequence. To solve this problem, we propose a model named time-sparse transducer, which introduces a time-sparse mechanism into transducer. In this mechanism, we obtain th… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 10 pages

    Journal ref: International Conference on Artificial Intelligence (CICAI 2023)

  10. arXiv:2307.00765  [pdf, ps, other

    eess.SP

    A BP Method for Track-Before-Detect

    Authors: Mingchao Liang, Thomas Kropfreiter, Florian Meyer

    Abstract: Tracking an unknown number of low-observable objects is notoriously challenging. This letter proposes a sequential Bayesian estimation method based on the track-before-detect (TBD) approach. In TBD, raw sensor measurements are directly used by the tracking algorithm without any preprocessing. Our proposed method is based on a new statistical model that introduces a new object hypothesis for each d… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  11. arXiv:2305.19956  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    MicroSegNet: A Deep Learning Approach for Prostate Segmentation on Micro-Ultrasound Images

    Authors: Hongxu Jiang, Muhammad Imran, Preethika Muralidharan, Anjali Patel, Jake Pensa, Muxuan Liang, Tarik Benidir, Joseph R. Grajo, Jason P. Joseph, Russell Terry, John Michael DiBianco, Li-Ming Su, Yuyin Zhou, Wayne G. Brisbane, Wei Shao

    Abstract: Micro-ultrasound (micro-US) is a novel 29-MHz ultrasound technique that provides 3-4 times higher resolution than traditional ultrasound, potentially enabling low-cost, accurate diagnosis of prostate cancer. Accurate prostate segmentation is crucial for prostate volume measurement, cancer diagnosis, prostate biopsy, and treatment planning. However, prostate segmentation on micro-US is challenging… ▽ More

    Submitted 25 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Computerized Medical Imaging and Graphics (2024): 102326

  12. arXiv:2305.19939  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Image Registration of In Vivo Micro-Ultrasound and Ex Vivo Pseudo-Whole Mount Histopathology Images of the Prostate: A Proof-of-Concept Study

    Authors: Muhammad Imran, Brianna Nguyen, Jake Pensa, Sara M. Falzarano, Anthony E. Sisk, Muxuan Liang, John Michael DiBianco, Li-Ming Su, Yuyin Zhou, Wayne G. Brisbane, Wei Shao

    Abstract: Early diagnosis of prostate cancer significantly improves a patient's 5-year survival rate. Biopsy of small prostate cancers is improved with image-guided biopsy. MRI-ultrasound fusion-guided biopsy is sensitive to smaller tumors but is underutilized due to the high cost of MRI and fusion equipment. Micro-ultrasound (micro-US), a novel high-resolution ultrasound technology, provides a cost-effecti… ▽ More

    Submitted 16 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  13. arXiv:2303.04432  [pdf, ps, other

    eess.SP

    Deep Learning-Based Channel Extrapolation for Pattern Reconfigurable Massive MIMO

    Authors: Mu Liang, Ang Li

    Abstract: Reconfigurable antennas that can dynamically change their operation state exhibit excellent adaptivity and flexibility over traditional antennas, and MIMO arrays that consist of multifunctional and reconfigurable antennas (MRAs) are foreseen as one promising solution towards future Holographic MIMO. Specifically, in pattern reconfigurable MIMO (PR-MIMO) communication systems, accurate acquisition… ▽ More

    Submitted 6 April, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  14. arXiv:2212.08340  [pdf, ps, other

    cs.CV cs.AI cs.LG eess.SP

    Neural Enhanced Belief Propagation for Multiobject Tracking

    Authors: Mingchao Liang, Florian Meyer

    Abstract: Algorithmic solutions for multi-object tracking (MOT) are a key enabler for applications in autonomous navigation and applied ocean sciences. State-of-the-art MOT methods fully rely on a statistical model and typically use preprocessed sensor data as measurements. In particular, measurements are produced by a detector that extracts potential object locations from the raw sensor data collected for… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  15. arXiv:2212.06414  [pdf, other

    eess.SY math.SG

    Even Order Explicit Symplectic Geometric Algorithms for Quaternion Kinematical Differential Equation in Guidance Navigation and Control via Diagonal Padè Approximation and Cayley Transform

    Authors: Hong-Yan Zhang, Fei Liu, Yu Zhou, Man Liang

    Abstract: The Quaternion kinematical differential equation (QKDE) plays a key role in navigation, control and guidance systems. Although explicit symplectic geometric algorithms (ESGA) for this problem are available, there is a lack of a unified way for constructing high order symplectic difference schemes with configurable order parameter. We present even order explicit symplectic geometric algorithms to s… ▽ More

    Submitted 12 January, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  16. arXiv:2206.09746  [pdf, other

    eess.SP

    Data Fusion for Radio Frequency SLAM with Robust Sampling

    Authors: Erik Leitinger, Bryan Teague, Wenyu Zhang, Mingchao Liang, Florian Meyer

    Abstract: Precise indoor localization remains a challenging problem for a variety of essential applications. A promising approach to address this problem is to exchange radio signals between mobile agents and static physical anchors (PAs) that bounce off flat surfaces in the indoor environment. Radio frequency simultaneous localization and map** (RF-SLAM) methods can be used to jointly estimates the time-… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: published at FUSION 2022

  17. arXiv:2203.09948  [pdf, ps, other

    cs.CV cs.LG eess.SP

    Neural Enhanced Belief Propagation for Data Association in Multiobject Tracking

    Authors: Mingchao Liang, Florian Meyer

    Abstract: Situation-aware technologies enabled by multiobject tracking (MOT) methods will create new services and applications in fields such as autonomous navigation and applied ocean sciences. Belief propagation (BP) is a state-of-the-art method for Bayesian MOT but fully relies on a statistical model and preprocessed sensor measurements. In this paper, we establish a hybrid method for model-based and dat… ▽ More

    Submitted 15 June, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

  18. arXiv:2105.12903  [pdf, ps, other

    cs.LG cs.MA cs.RO eess.SP

    Neural Enhanced Belief Propagation for Cooperative Localization

    Authors: Mingchao Liang, Florian Meyer

    Abstract: Location-aware networks will introduce innovative services and applications for modern convenience, applied ocean sciences, and public safety. In this paper, we establish a hybrid method for model-based and data-driven inference. We consider a cooperative localization (CL) scenario where the mobile agents in a wireless network aim to localize themselves by performing pairwise observations with oth… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  19. arXiv:2102.10260  [pdf, other

    eess.SY

    Wireless sensor network for in situ soil moisture monitoring

    Authors: Jianing Fang, Chuheng Hu, Nour Smaoui, Doug Carlson, Jayant Gupchup, Razvan Musaloiu-E., Chieh-Jan Mike Liang, Marcus Chang, Omprakash Gnawali, Tamas Budavari, Andreas Terzis, Katalin Szlavecz, Alexander S. Szalay

    Abstract: We discuss the history and lessons learned from a series of deployments of environmental sensors measuring soil parameters and CO2 fluxes over the last fifteen years, in an outdoor environment. We present the hardware and software architecture of our current Gen-3 system, and then discuss how we are simplifying the user facing part of the software, to make it easier and friendlier for the environm… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

    Comments: 12 pages, 16 figures, Sensornets 2021 Conference

  20. arXiv:2005.05288  [pdf, other

    physics.optics eess.IV eess.SP

    Non-iterative complex wave-field reconstruction based on Kramers-Kronig relations

    Authors: Cheng Shen, An Pan, Mingshu Liang, Changhuei Yang

    Abstract: A new computational imaging method to reconstruct the complex wave-field is reported. Due to the existence of zero frequency component, the measured signal by amplitude modulation of pupil has a spectrum similar to the one of off-axis hologram. The mathematical analogy between them is established in this paper. Based on this observation and analyticity of band-limited signal under any diffraction-… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

  21. FeederGAN: Synthetic Feeder Generation via Deep Graph Adversarial Nets

    Authors: Ming Liang, Yao Meng, Jiyu Wang, David Lubkeman, Ning Lu

    Abstract: This paper presents a novel, automated, generative adversarial networks (GAN) based synthetic feeder generation mechanism, abbreviated as FeederGAN. FeederGAN digests real feeder models represented by directed graphs via a deep learning framework powered by GAN and graph convolutional networks (GCN). Information of a distribution feeder circuit is extracted from its model input files so that the d… ▽ More

    Submitted 16 September, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: Accepted by IEEE Trans. on Smart Grid