Showing 1–46 of 46 results for author: Huang, M

Searching in archive eess.
  1. arXiv:2402.09658  [pdf]

    eess.IV cs.CV

    Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm

    Authors: Amir Mohammad Naderi, Jennifer G. Casey, Mao-Hsiang Huang, Rachelle Victorio, David Y. Chiang, Calum MacRae, Hung Cao, Vandana A. Gupta

    Abstract: Quantifying cardiovascular parameters like ejection fraction in zebrafish as a host of biological investigations has been extensively studied. Since current manual monitoring techniques are time-consuming and fallible, several image processing frameworks have been proposed to automate the process. Most of these works rely on supervised deep-learning architectures. However, supervised methods tend…

    Submitted 14 February, 2024; originally announced February 2024.

  2. arXiv:2402.01031  [pdf]

    eess.IV cs.CV

    MRAnnotator: A Multi-Anatomy Deep Learning Model for MRI Segmentation

    Authors: Alexander Zhou, Zelong Liu, Andrew Tieu, Nikhil Patel, Sean Sun, Anthony Yang, Peter Choi, Valentin Fauveau, George Soultanidis, Mingqian Huang, Amish Doshi, Zahi A. Fayad, Timothy Deyer, Xueyan Mei

    Abstract: Purpose To develop a deep learning model for multi-anatomy and many-class segmentation of diverse anatomic structures on MRI imaging. Materials and Methods In this retrospective study, two datasets were curated and annotated for model development and evaluation. An internal dataset of 1022 MRI sequences from various clinical sites within a health system and an external dataset of 264 MRI sequenc…

    Submitted 1 February, 2024; originally announced February 2024.

  3. arXiv:2302.13172  [pdf]

    eess.IV cs.CV

    Deep Learning-based Multi-Organ CT Segmentation with Adversarial Data Augmentation

    Authors: Shaoyan Pan, Shao-Yuan Lo, Min Huang, Chaoqiong Ma, Jacob Wynne, Tonghe Wang, Tian Liu, Xiaofeng Yang

    Abstract: In this work, we propose an adversarial attack-based data augmentation method to improve the deep-learning-based segmentation algorithm for the delineation of Organs-At-Risk (OAR) in abdominal Computed Tomography (CT) to facilitate radiation therapy. We introduce Adversarial Feature Attack for Medical Image (AFA-MI) augmentation, which forces the segmentation network to learn out-of-distribution s…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted at SPIE Medical Imaging 2023

  4. arXiv:2301.02858  [pdf, ps, other]

    eess.SP

    Two Efficient Beamforming Methods for Hybrid IRS-aided AF Relay Wireless Networks

    Authors: Xuehui Wang, Feng Shu, Mengxing Huang, Fuhui Zhou, Riqing Chen, Cunhua Pan, Yongpeng Wu, Jiangzhou Wang

    Abstract: Due to the double fading effect caused by conventional passive intelligent reflecting surface (IRS), the signal via the reflection link is weak. To enhance the received signal, active elements with the ability to amplify the reflected signal are introduced to the passive IRS forming hybrid IRS. In this paper, we propose a hybrid IRS-aided amplify-and-forward (AF) relay wireless network, where an o…

    Submitted 23 November, 2023; v1 submitted 7 January, 2023; originally announced January 2023.

  5. arXiv:2211.07849  [pdf, other]

    cs.MA eess.SY

    Linear Convergent Distributed Nash Equilibrium Seeking with Compression

    Authors: Xiaomeng Chen, Yuchi Wu, Xinlei Yi, Minyi Huang, Ling Shi

    Abstract: Information compression techniques are majorly employed to address the concern of reducing communication cost over peer-to-peer links. In this paper, we investigate distributed Nash equilibrium (NE) seeking problems in a class of non-cooperative games over directed graphs with information compression. To improve communication efficiency, a compressed distributed NE seeking (C-DNES) algorithm is pr…

    Submitted 21 September, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  6. arXiv:2211.01607  [pdf, other]

    eess.IV cs.LG

    ImageCAS: A Large-Scale Dataset and Benchmark for Coronary Artery Segmentation based on Computed Tomography Angiography Images

    Authors: An Zeng, Chunbiao Wu, Meiping Huang, Jian Zhuang, Shanshan Bi, Dan Pan, Najeeb Ullah, Kaleem Nawaz Khan, Tianchen Wang, Yiyu Shi, Xiaomeng Li, Guisen Lin, Xiaowei Xu

    Abstract: Cardiovascular disease (CVD) accounts for about half of non-communicable diseases. Vessel stenosis in the coronary artery is considered to be the major risk of CVD. Computed tomography angiography (CTA) is one of the widely used noninvasive imaging modalities in coronary artery diagnosis due to its superior image resolution. Clinically, segmentation of coronary arteries is essential for the diagno…

    Submitted 17 October, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: 17 pages, 12 figures, 4 tables

    Journal ref: Computerized Medical Imaging and Graphics, 2023

  7. arXiv:2210.16584  [pdf, other]

    eess.IV cs.CV cs.LG

    Interpretable CNN-Multilevel Attention Transformer for Rapid Recognition of Pneumonia from Chest X-Ray Images

    Authors: Shengchao Chen, Sufen Ren, Guanjun Wang, Mengxing Huang, Chenyang Xue

    Abstract: Chest imaging plays an essential role in diagnosing and predicting patients with COVID-19 with evidence of worsening respiratory status. Many deep learning-based approaches for pneumonia recognition have been developed to enable computer-aided diagnosis. However, the long training and inference time makes them inflexible, and the lack of interpretability reduces their credibility in clinical medic…

    Submitted 13 January, 2024; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: Accepted by the IEEE Journal of Biomedical and Health Informatics, doi: 10.1109/JBHI.2023.3247949

  8. arXiv:2206.08019  [pdf, other]

    eess.IV cs.CV

    Multi-View Imputation and Cross-Attention Network Based on Incomplete Longitudinal and Multimodal Data for Conversion Prediction of Mild Cognitive Impairment

    Authors: Tao Wang, Xiumei Chen, Xiaoling Zhang, Shuoling Zhou, Qianjin Feng, Meiyan Huang

    Abstract: Predicting whether subjects with mild cognitive impairment (MCI) will convert to Alzheimer's disease is a significant clinical challenge. Longitudinal variations and complementary information inherent in longitudinal and multimodal data are crucial for MCI conversion prediction, but the persistent issue of missing data in these data may hinder their effective application. Additionally, conversion pred…

    Submitted 25 May, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

  9. arXiv:2204.06230  [pdf, ps, other]

    cs.IT eess.SP

    Performance Analysis of Wireless Network Aided by Discrete-Phase-Shifter IRS

    Authors: Rongen Dong, Yin Teng, Zhongwen Sun, Jun Zou, Mengxing Huang, Jun Li, Feng Shu, Jiangzhou Wang

    Abstract: Discrete phase shifters of intelligent reflecting surface (IRS) generate phase quantization error (QE) and degrade the receive performance at the receiver. To make an analysis of the performance loss caused by IRS with phase QE, based on the law of large numbers, the closed-form expressions of signal-to-noise ratio (SNR) performance loss (PL), achievable rate (AR), and bit error rate (BER) are s…

    Submitted 13 April, 2022; originally announced April 2022.

  10. arXiv:2204.03173  [pdf, other]

    cs.LG cs.AI eess.SP

    Automated Sleep Staging via Parallel Frequency-Cut Attention

    Authors: Zheng Chen, Ziwei Yang, Lingwei Zhu, Wei Chen, Toshiyo Tamura, Naoaki Ono, MD Altaf-Ul-Amin, Shigehiko Kanaya, Ming Huang

    Abstract: This paper proposes a novel framework for automatically capturing the time-frequency nature of electroencephalogram (EEG) signals of human sleep based on the authoritative sleep medicine guidance. The framework consists of two parts: the first part extracts informative features by partitioning the input EEG spectrograms into a sequence of time-frequency patches. The second part is constituted by a…

    Submitted 12 January, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: 10 pages, 9 figures

  11. arXiv:2203.12151  [pdf, other]

    eess.IV cs.CV cs.LG

    Semi-Supervised Hybrid Spine Network for Segmentation of Spine MR Images

    Authors: Meiyan Huang, Shuoling Zhou, Xiumei Chen, Haoran Lai, Qianjin Feng

    Abstract: Automatic segmentation of vertebral bodies (VBs) and intervertebral discs (IVDs) in 3D magnetic resonance (MR) images is vital in diagnosing and treating spinal diseases. However, segmenting the VBs and IVDs simultaneously is not trivial. Moreover, problems exist, including blurry segmentation caused by anisotropy resolution, high computational cost, inter-class similarity and intra-class variabil…

    Submitted 22 March, 2022; originally announced March 2022.

  12. arXiv:2203.04583  [pdf, other]

    eess.AS cs.SD

    Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks

    Authors: Yizhou Lu, Mingkun Huang, Xinghua Qu, Pengfei Wei, Zejun Ma

    Abstract: Unsupervised cross-lingual speech representation learning (XLSR) has recently shown promising results in speech recognition by leveraging vast amounts of unlabeled data across multiple languages. However, the standard XLSR model suffers from a language interference problem due to the lack of language-specific modeling ability. In this work, we investigate language adaptive training on XLSR models. More…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: To appear in ICASSP 2022

  13. Robust Dynamic State Estimator of Integrated Energy Systems based on Natural Gas Partial Differential Equations

    Authors: Liang Chen, Yang Li, Manyun Huang, Xinxin Hui, Songlin Gu

    Abstract: The reliability and precision of dynamic database are vital for the optimal operating and global control of integrated energy systems. One of the effective ways to obtain the accurate states is state estimations. A novel robust dynamic state estimation methodology for integrated natural gas and electric power systems is proposed based on Kalman filter. To take full advantage of measurement redunda…

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: Accepted by IEEE transactions on Industry Applications. arXiv admin note: text overlap with arXiv:2107.05891

    Journal ref: IEEE Transactions on Industry Applications 58 (2022) 3303-3312

  14. arXiv:2201.04452  [pdf, other]

    cs.IT eess.SP

    Machine-learning-aided Massive Hybrid Analog and Digital MIMO DOA Estimation for Future Wireless Networks

    Authors: Feng Shu, Yiwen Chen, Xichao Zhan, Wenlong Cai, Mengxing Huang, Qijuan Jie, Yifang Li, Baihua Shi, Jiangzhou Wang, Xiaohu You

    Abstract: Due to a high spatial angle resolution and low circuit cost of massive hybrid analog and digital (HAD) multiple-input multiple-output (MIMO), it is viewed as a valuable green communication technology for future wireless networks. Combining a massive HAD-MIMO with direction of arrival (DOA) will provide a high-precision even ultra-high-precision DOA measurement performance approaching the fully-dig…

    Submitted 5 August, 2023; v1 submitted 12 January, 2022; originally announced January 2022.

  15. arXiv:2110.12274  [pdf, other]

    eess.IV cs.CV

    "One-Shot" Reduction of Additive Artifacts in Medical Images

    Authors: Yu-Jen Chen, Yen-Jung Chang, Shao-Cheng Wen, Yiyu Shi, Xiaowei Xu, Tsung-Yi Ho, Meiping Huang, Haiyun Yuan, Jian Zhuang

    Abstract: Medical images may contain various types of artifacts with different patterns and mixtures, which depend on many factors such as scan setting, machine condition, patients' characteristics, surrounding environment, etc. However, existing deep-learning-based artifact reduction methods are restricted by their training set with specific predetermined artifact types and patterns. As such, they have lim…

    Submitted 23 October, 2021; originally announced October 2021.

  16. arXiv:2109.11115  [pdf]

    cs.SD eess.AS

    Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning

    Authors: Rui Li, Dong Pu, Minnie Huang, Bill Huang

    Abstract: One-shot voice cloning aims to transform speaker voice and speaking style in speech synthesized from a text-to-speech (TTS) system, where only a shot recording from the target reference speech can be used. Out-of-domain transfer is still a challenging task, and one important aspect that impacts the accuracy and similarity of synthetic speech is the conditional representations carrying speaker or s…

    Submitted 24 February, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: 6 pages, 5 figures, Accepted to IEEE ICASSP 2022

  17. arXiv:2109.06909  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Hardware-aware Real-time Myocardial Segmentation Quality Control in Contrast Echocardiography

    Authors: Dewen Zeng, Yukun Ding, Haiyun Yuan, Meiping Huang, Xiaowei Xu, Jian Zhuang, Jingtong Hu, Yiyu Shi

    Abstract: Automatic myocardial segmentation of contrast echocardiography has shown great potential in the quantification of myocardial perfusion parameters. Segmentation quality control is an important step to ensure the accuracy of segmentation results for quality research as well as its clinical application. Usually, the segmentation quality control happens after the data acquisition. At the data acquisit…

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: 4 pages, DAC'21 invited paper

  18. arXiv:2109.00374  [pdf, other]

    eess.IV cs.CV

    ImageTBAD: A 3D Computed Tomography Angiography Image Dataset for Automatic Segmentation of Type-B Aortic Dissection

    Authors: Zeyang Yao, Jiawei Zhang, Hailong Qiu, Tianchen Wang, Yiyu Shi, Jian Zhuang, Yuhao Dong, Meiping Huang, Xiaowei Xu

    Abstract: Type-B Aortic Dissection (TBAD) is one of the most serious cardiovascular events characterized by a growing yearly incidence, and the severity of disease prognosis. Currently, computed tomography angiography (CTA) has been widely adopted for the diagnosis and prognosis of TBAD. Accurate segmentation of true lumen (TL), false lumen (FL), and false lumen thrombus (FLT) in CTA are crucial for the prec…

    Submitted 1 September, 2021; originally announced September 2021.

  19. Dynamic State Estimation for Integrated Natural Gas and Electric Power Systems

    Authors: Liang Chen, Xinxin Hui, Songlin Gu, Manyun Huang, Yang Li

    Abstract: A dynamic state estimation method of integrated natural gas and electric power systems (IGESs) is proposed. Firstly, the coupling model of gas pipeline networks and power systems by gas turbine units (GTUs) is established. Secondly, the Kalman filter based linear DSE model for the IGES is built. The gas density and mass flow rate, as well as the real and imaginary parts of bus voltages are taken a…

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: Accepted by 2021 IEEE/IAS Industrial and Commercial Power System Asia (I&CPS Asia)

    Journal ref: 2021 IEEE/IAS Industrial and Commercial Power System Asia (I&CPS Asia)

  20. Beamforming and Transmit Power Design for Intelligent Reconfigurable Surface-aided Secure Spatial Modulation

    Authors: Feng Shu, Xinyi Jiang, Wenlong Cai, Weiping Shi, Mengxing Huang, Jiangzhou Wang, Xiaohu You

    Abstract: Intelligent reflecting surface (IRS) is a promising solution to build a programmable wireless environment for future communication systems, in which the reflector elements steer the incident signal in fully customizable ways by passive beamforming. In this paper, an IRS-aided secure spatial modulation (SM) is proposed, where the IRS performs passive beamforming and information transfer simultaneous…

    Submitted 21 October, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  21. arXiv:2105.08267  [pdf, other]

    eess.IV cs.CV

    EchoCP: An Echocardiography Dataset in Contrast Transthoracic Echocardiography for Patent Foramen Ovale Diagnosis

    Authors: Tianchen Wang, Zhihe Li, Meiping Huang, Jian Zhuang, Shanshan Bi, Jiawei Zhang, Yiyu Shi, Hongwen Fei, Xiaowei Xu

    Abstract: Patent foramen ovale (PFO) is a potential separation between the septum primum and septum secundum located in the anterosuperior portion of the atrial septum. PFO is one of the main factors causing cryptogenic stroke which is the fifth leading cause of death in the United States. For PFO diagnosis, contrast transthoracic echocardiography (cTTE) is preferred as being a more robust method compared…

    Submitted 15 September, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: MICCAI2021

  22. arXiv:2104.12044  [pdf, other]

    eess.IV cs.CV cs.LG

    Multi-Cycle-Consistent Adversarial Networks for Edge Denoising of Computed Tomography Images

    Authors: Xiaowei Xu, Jiawei Zhang, Jinglan Liu, Yukun Ding, Tianchen Wang, Hailong Qiu, Haiyun Yuan, Jian Zhuang, Wen Xie, Yuhao Dong, Qianjun Jia, Meiping Huang, Yiyu Shi

    Abstract: As one of the most commonly ordered imaging tests, computed tomography (CT) scan comes with inevitable radiation exposure that increases the cancer risk to patients. However, CT image quality is directly related to radiation dose, thus it is desirable to obtain high-quality CT images with as little dose as possible. CT image denoising tries to obtain high dose like high-quality CT images (domain X…

    Submitted 24 April, 2021; originally announced April 2021.

    Comments: 16 pages, 7 figures, 4 tables, accepted by the ACM Journal on Emerging Technologies in Computing Systems (JETC). arXiv admin note: substantial text overlap with arXiv:2002.12130

  23. Earnings-21: A Practical Benchmark for ASR in the Wild

    Authors: Miguel Del Rio, Natalie Delworth, Ryan Westerman, Michelle Huang, Nishchal Bhandari, Joseph Palakapilly, Quinten McNamara, Joshua Dong, Piotr Zelasko, Miguel Jette

    Abstract: Commonly used speech corpora inadequately challenge academic and commercial ASR systems. In particular, speech corpora lack metadata needed for detailed analysis and WER measurement. In response, we present Earnings-21, a 39-hour corpus of earnings calls containing entity-dense speech from nine different financial sectors. This corpus is intended to benchmark ASR systems in the wild with special a…

    Submitted 15 June, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to INTERSPEECH 2021. June 15 2021: Addressing the comments of reviewers and updating the results of our internal ESPNet model. The results do not change our conclusions. April 28th, 2021: We found and resolved an issue in our experimental evaluation that scored the LibriSpeech model at ~20% worse relative WER than the actual WER. The updated results do not affect our conclusions

  24. arXiv:2104.10747  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Accented Speech Recognition: A Survey

    Authors: Arthur Hinsvark, Natalie Delworth, Miguel Del Rio, Quinten McNamara, Joshua Dong, Ryan Westerman, Michelle Huang, Joseph Palakapilly, Jennifer Drexler, Ilya Pirkin, Nishchal Bhandari, Miguel Jette

    Abstract: Automatic Speech Recognition (ASR) systems generalize poorly on accented speech. The phonetic and linguistic variability of accents present hard challenges for ASR systems today in both data collection and modeling strategies. The resulting bias in ASR performance across accents comes at a cost to both users and providers of ASR. We present a survey of current promising approaches to accented sp…

    Submitted 2 June, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  25. arXiv:2104.02066  [pdf, other]

    cs.CV cs.LG eess.IV

    Dopamine Transporter SPECT Image Classification for Neurodegenerative Parkinsonism via Diffusion Maps and Machine Learning Classifiers

    Authors: Jun-En Ding, Chi-Hsiang Chu, Mong-Na Lo Huang, Chien-Ching Hsu

    Abstract: Neurodegenerative parkinsonism can be assessed by dopamine transporter single photon emission computed tomography (DaT-SPECT). Although generating images is time consuming, these images can show interobserver variability and they have been visually interpreted by nuclear medicine physicians to date. Accordingly, this study aims to provide an automatic and robust method based on Diffusion Maps and…

    Submitted 7 May, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Journal ref: 24th Annual Conference, MIUA 2021, Oxford, UK, July 12-14, 2021, Proceedings

  26. arXiv:2103.08259  [pdf, other]

    eess.IV cs.CV cs.LG

    The QXS-SAROPT Dataset for Deep Learning in SAR-Optical Data Fusion

    Authors: Meiyu Huang, Yao Xu, Lixin Qian, Weili Shi, Yaqin Zhang, Wei Bao, Nan Wang, Xuejiao Liu, Xueshuang Xiang

    Abstract: Deep learning techniques have made an increasing impact on the field of remote sensing. However, deep neural networks based fusion of multimodal data from different remote sensors with heterogenous characteristics has not been fully explored, due to the lack of availability of big amounts of perfectly aligned multi-sensor image data with diverse scenes of high resolutions, especially for synthetic…

    Submitted 25 April, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

  27. arXiv:2102.12173  [pdf]

    eess.IV

    Deep learning-based framework for cardiac function assessment in embryonic zebrafish from heart beating videos

    Authors: Amir Mohammad Naderi, Haisong Bu, Jingcheng Su, Mao-Hsiang Huang, Khuong Vo, Ramses Seferino Trigo Torres, J. -C. Chiao, Juhyun Lee, Michael P. H. Lau, Xiaolei Xu, Hung Cao

    Abstract: Zebrafish is a powerful and widely-used model system for a host of biological investigations including cardiovascular studies and genetic screening. Zebrafish are readily assessable during developmental stages; however, the current methods for quantification and monitoring of cardiac functions mostly involve tedious manual work and inconsistent estimations. In this paper, we developed and validate…

    Submitted 24 February, 2021; originally announced February 2021.

  28. arXiv:2101.10799  [pdf, other]

    eess.IV cs.CV cs.LG

    ImageCHD: A 3D Computed Tomography Image Dataset for Classification of Congenital Heart Disease

    Authors: Xiaowei Xu, Tianchen Wang, Jian Zhuang, Haiyun Yuan, Meiping Huang, Jianzheng Cen, Qianjun Jia, Yuhao Dong, Yiyu Shi

    Abstract: Congenital heart disease (CHD) is the most common type of birth defect, which occurs 1 in every 110 births in the United States. CHD usually comes with severe variations in heart structure and great artery connections that can be classified into many types. Thus highly specialized domain knowledge and the time-consuming human process is needed to analyze the associated medical images. On the other…

    Submitted 11 May, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: 11 pages, 6 figures, 2 tables, published at MICCAI 2020. The diagnosis info of the dataset is updated (thanks to the help of Kadirbarut from Bilgiuzayi)

  29. Myocardial Segmentation of Cardiac MRI Sequences with Temporal Consistency for Coronary Artery Disease Diagnosis

    Authors: Yutian Chen, Xiaowei Xu, Dewen Zeng, Yiyu Shi, Haiyun Yuan, Jian Zhuang, Yuhao Dong, Qianjun Jia, Meiping Huang

    Abstract: Coronary artery disease (CAD) is the most common cause of death globally, and its diagnosis is usually based on manual myocardial segmentation of Magnetic Resonance Imaging (MRI) sequences. As the manual segmentation is tedious, time-consuming and with low applicability, automatic myocardial segmentation using machine learning techniques has been widely explored recently. However, almost all the e…

    Submitted 28 December, 2020; originally announced December 2020.

    Comments: 9 pages, 9 figures

  30. arXiv:2011.10634  [pdf]

    eess.SY

    Distributed Robust State Estimation for Hybrid AC/DC Distribution Systems using Multi-Source Data

    Authors: Manyun Huang, Junbo Zhao, Zhinong Wei, Marco Pau, Guoqiang Sun

    Abstract: Hybrid AC/DC distribution systems are becoming a popular means to accommodate the increasing penetration of distributed energy resources and flexible loads. This paper proposes a distributed and robust state estimation (DRSE) method for hybrid AC/DC distribution systems using multiple sources of data. In the proposed distributed implementation framework, a unified robust linear state estimation mo…

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: 8 pages, 12 figures

  31. arXiv:2011.02155  [pdf, other]

    eess.IV cs.CV cs.LG

    Do Noises Bother Human and Neural Networks In the Same Way? A Medical Image Analysis Perspective

    Authors: Shao-Cheng Wen, Yu-Jen Chen, Zihao Liu, Wujie Wen, Xiaowei Xu, Yiyu Shi, Tsung-Yi Ho, Qianjun Jia, Meiping Huang, Jian Zhuang

    Abstract: Deep learning had already demonstrated its power in medical images, including denoising, classification, segmentation, etc. All these applications are proposed to automatically analyze medical images beforehand, which brings more information to radiologists during clinical assessment for accuracy improvement. Recently, many medical denoising methods had shown their significant artifact reduction r…

    Submitted 4 November, 2020; originally announced November 2020.

  32. arXiv:2011.01576  [pdf, other]

    eess.AS cs.SD

    Improving RNN transducer with normalized jointer network

    Authors: Mingkun Huang, Jun Zhang, Meng Cai, Yang Zhang, Jiali Yao, Yongbin You, Yi He, Zejun Ma

    Abstract: Recurrent neural transducer (RNN-T) is a promising end-to-end (E2E) model in automatic speech recognition (ASR). It has shown superior performance compared to traditional hybrid ASR systems. However, training RNN-T from scratch is still challenging. We observe a huge gradient variance during RNN-T training and suspect it hurts the performance. In this work, we analyze the cause of the huge gradien…

    Submitted 3 November, 2020; originally announced November 2020.

  33. arXiv:2011.01570  [pdf, other]

    eess.AS cs.SD

    Dynamic latency speech recognition with asynchronous revision

    Authors: Mingkun Huang, Meng Cai, Jun Zhang, Yang Zhang, Yongbin You, Yi He, Zejun Ma

    Abstract: In this work we propose an inference technique, asynchronous revision, to unify streaming and non-streaming speech recognition models. Specifically, we achieve dynamic latency with only one model by using arbitrary right context during inference. The model is composed of a stack of convolutional layers for audio encoding. In inference stage, the history states of encoder and decoder can be asynchr…

    Submitted 3 November, 2020; originally announced November 2020.

  34. arXiv:2008.07071  [pdf, other]

    eess.IV cs.CV cs.LG

    Towards Cardiac Intervention Assistance: Hardware-aware Neural Architecture Exploration for Real-Time 3D Cardiac Cine MRI Segmentation

    Authors: Dewen Zeng, Weiwen Jiang, Tianchen Wang, Xiaowei Xu, Haiyun Yuan, Meiping Huang, Jian Zhuang, Jingtong Hu, Yiyu Shi

    Abstract: Real-time cardiac magnetic resonance imaging (MRI) plays an increasingly important role in guiding various cardiac interventions. In order to provide better visual assistance, the cine MRI frames need to be segmented on-the-fly to avoid noticeable visual lag. In addition, considering reliability and patient data privacy, the computation is preferably done on local hardware. State-of-the-art MRI se…

    Submitted 13 December, 2020; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: 8 pages, conference

  35. arXiv:2008.05067  [pdf, ps, other]

    cs.IT eess.SP

    Enhanced Secrecy Rate Maximization for Directional Modulation Networks via IRS

    Authors: Feng Shu, Jiayu Li, Mengxing Huang, Weiping Shi, Yin Teng, Jun Li, Yongpeng Wu, Jiangzhou Wang

    Abstract: Intelligent reflecting surface (IRS) is of low-cost and energy-efficiency and will be a promising technology for the future wireless communications like sixth generation. To address the problem of conventional directional modulation (DM) that Alice only transmits single confidential bit stream (CBS) to Bob with multiple antennas in a line-of-sight channel, IRS is proposed to create friendly multip…

    Submitted 11 August, 2020; originally announced August 2020.

  36. arXiv:2008.00953  [pdf, other]

    eess.AS cs.SD

    Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model

    Authors: Qi Liu, Zhehuai Chen, Hao Li, Mingkun Huang, Yizhou Lu, Kai Yu

    Abstract: End-to-end (E2E) systems have played a more and more important role in automatic speech recognition (ASR) and achieved great performance. However, E2E systems recognize output word sequences directly with the input acoustic feature, which can only be trained on limited acoustic data. The extra text data is widely used to improve the results of traditional artificial neural network-hidden Markov mo…

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: Accepted by IEEE TASLP

  37. arXiv:2007.12072  [pdf, other]

    cs.CV cs.LG eess.IV

    TSIT: A Simple and Versatile Framework for Image-to-Image Translation

    Authors: Liming Jiang, Changxu Zhang, Mingyang Huang, Chunxiao Liu, Jianping Shi, Chen Change Loy

    Abstract: We introduce a simple and versatile framework for image-to-image translation. We unearth the importance of normalization layers, and provide a carefully designed two-stream generative model with newly proposed feature transformations in a coarse-to-fine fashion. This allows multi-scale semantic structure information and style representation to be effectively captured and fused by the network, perm…

    Submitted 25 July, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: ECCV 2020 (Spotlight). Table 2 is updated. GitHub: https://github.com/EndlessSora/TSIT

  38. arXiv:2007.09455  [pdf, other]

    eess.IV cs.CV

    ICA-UNet: ICA Inspired Statistical UNet for Real-time 3D Cardiac Cine MRI Segmentation

    Authors: Tianchen Wang, Xiaowei Xu, Jinjun Xiong, Qianjun Jia, Haiyun Yuan, Meiping Huang, Jian Zhuang, Yiyu Shi

    Abstract: Real-time cine magnetic resonance imaging (MRI) plays an increasingly important role in various cardiac interventions. In order to enable fast and accurate visual assistance, the temporal frames need to be segmented on-the-fly. However, state-of-the-art MRI segmentation methods are used either offline because of their high computation complexity, or in real-time but with significant accuracy loss…

    Submitted 18 July, 2020; originally announced July 2020.

    Comments: MICCAI2020, 12 pages, 3 figures

  39. arXiv:2006.03902  [pdf, ps, other]

    eess.SP cs.IT

    I/Q Imbalance Aware Nonlinear Wireless-Powered Relaying of B5G Networks: Security and Reliability Analysis

    Authors: Xingwang Li, Mengyan Huang, Yuanwei Liu, Varun G Menon, Anand Paul, Zhiguo Ding

    Abstract: Physical layer security is known as a promising paradigm to ensure security for the beyond 5G (B5G) networks in the presence of eavesdroppers. In this paper, we elaborate on a tractable analysis framework to evaluate the reliability and security of wireless-powered decode-and-forward (DF) multi-relay networks. The nonlinear energy harvesters, in-phase and quadrature-phase imbalance (IQI) and chann…

    Submitted 6 June, 2020; originally announced June 2020.

  40. arXiv:2004.00679  [pdf, other]

    eess.SY math.OC

    LQG Graphon Mean Field Games: Analysis via Graphon Invariant Subspaces

    Authors: Shuang Gao, Peter E. Caines, Minyi Huang

    Abstract: This paper studies approximate solutions to large-scale linear quadratic stochastic games with homogeneous nodal dynamics parameters and heterogeneous network couplings within the graphon mean field game framework in [2]-[4]. A graphon time-varying dynamical system model is first formulated to study the finite and then limit problems of linear quadratic Gaussian graphon mean field games (LQG-GMFG)…

    Submitted 21 October, 2021; v1 submitted 1 April, 2020; originally announced April 2020.

  41. arXiv:2002.12130  [pdf, other]

    eess.IV cs.CV

    Multi-Cycle-Consistent Adversarial Networks for CT Image Denoising

    Authors: Jinglan Liu, Yukun Ding, Jinjun Xiong, Qianjun Jia, Meiping Huang, Jian Zhuang, Bike Xie, Chun-Chen Liu, Yiyu Shi

    Abstract: CT image denoising can be treated as an image-to-image translation task where the goal is to learn the transform between a source domain $X$ (noisy images) and a target domain $Y$ (clean images). Recently, cycle-consistent adversarial denoising network (CCADN) has achieved state-of-the-art results by enforcing cycle-consistent loss without the need of paired training data. Our detailed analysis of…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: Accepted in ISBI 2020. 5 pages, 4 figures

  42. arXiv:1911.05426   

    eess.SY

    Mean-Field Transmission Power Control in Dense Networks, Part II -- Social Welfare Evaluation

    Authors: Yuchi Wu, Junfeng Wu, Minyi Huang, Ling Shi

    Abstract: We consider uplink power control in wireless communication when massive users compete over the channel resources. In Part I, we have formulated massive transmission power control contest in a mean-field game framework. In this part, our goal is to investigate whether the power-domain non-orthogonal multiple access (NOMA) protocol can regulate the non-cooperative channel access behaviors, i.e., ste…

    Submitted 8 May, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: Combined with 1911.05421 as a single manuscript

  43. Mean-Field Transmission Power Control in Dense Networks

    Authors: Yuchi Wu, Junfeng Wu, Minyi Huang, Ling Shi

    Abstract: We consider uplink power control in wireless communication when a large number of users compete over the channel resources. The CDMA protocol, as a supporting technology of 3G networks accommodating signal from different sources over the code domain, represents the orthogonal multiple access (OMA) techniques. With the development of 5G wireless networks, non-orthogonal multiple access (NOMA) is in…

    Submitted 28 November, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: 20 pages, 6 figures, in IEEE Transactions on Control of Network Systems

  44. arXiv:1909.06726  [pdf, other]

    eess.IV cs.CV

    MSU-Net: Multiscale Statistical U-Net for Real-time 3D Cardiac MRI Video Segmentation

    Authors: Tianchen Wang, Jinjun Xiong, Xiaowei Xu, Meng Jiang, Yiyu Shi, Haiyun Yuan, Meiping Huang, Jian Zhuang

    Abstract: Cardiac magnetic resonance imaging (MRI) is an essential tool for MRI-guided surgery and real-time intervention. The MRI videos are expected to be segmented on-the-fly in real practice. However, existing segmentation methods would suffer from drastic accuracy loss when modified for speedup. In this work, we propose Multiscale Statistical U-Net (MSU-Net) for real-time 3D MRI video segmentation in c…

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: MICCAI19

  45. arXiv:1907.05273  [pdf, other]

    eess.IV cs.CV

    Accurate Congenital Heart Disease Model Generation for 3D Printing

    Authors: Xiaowei Xu, Tianchen Wang, Dewen Zeng, Yiyu Shi, Qianjun Jia, Haiyun Yuan, Meiping Huang, Jian Zhuang

    Abstract: 3D printing has been widely adopted for clinical decision making and interventional planning of Congenital heart disease (CHD), while whole heart and great vessel segmentation is the most significant but time-consuming step in the model generation for 3D printing. While various automatic whole heart and great vessel segmentation frameworks have been developed in the literature, they are ineffectiv…

    Submitted 11 July, 2019; v1 submitted 6 July, 2019; originally announced July 2019.

    Comments: 6 figures, 2 tables, accepted by the IEEE International Workshop on Signal Processing Systems

  46. arXiv:1807.07963  [pdf, other]

    eess.IV stat.ML

    Deep Transfer Learning for Cross-domain Activity Recognition

    Authors: Jindong Wang, Vincent W. Zheng, Yiqiang Chen, Meiyu Huang

    Abstract: Human activity recognition plays an important role in people's daily life. However, it is often expensive and time-consuming to acquire sufficient labeled activity data. To solve this problem, transfer learning leverages the labeled samples from the source domain to annotate the target domain which has few or none labels. Unfortunately, when there are several source domains available, it is diffic…

    Submitted 19 August, 2018; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: ICCSE 2018 best paper; 8 pages