Skip to main content

Showing 1–46 of 46 results for author: Liang, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, **ming Guo, Xiaolin Chen, **gcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2406.08782  [pdf, other

    eess.IV cs.CV

    Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising

    Authors: Hao Liang, Chengjie, Kun Li, Xin Tian

    Abstract: Hyperspectral image (HSI) denoising is an essential procedure for HSI applications. Unfortunately, the existing Transformer-based methods mainly focus on non-local modeling, neglecting the importance of locality in image denoising. Moreover, deep learning methods employ complex spectral learning mechanisms, thus introducing large computation costs. To address these problems, we propose a hybrid… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.04685  [pdf, other

    eess.SY cs.NI

    Statistical QoS Provisioning Architecture for 6G Satellite-Terrestrial Integrated Networks

    Authors: **gqing Wang, Wenchi Cheng, Wei Zhang, Hui Liang

    Abstract: The emergence of massive ultra-reliable and low latency communications (mURLLC) as a category of time/reliability-sensitive service over 6G networks has received considerable research attention, which has presented unprecedented challenges. As one of the key enablers for 6G, satellite-terrestrial integrated networks (STIN) have been developed to offer more expansive connectivity and comprehensive… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2404.11171  [pdf, other

    cs.LG cs.AI eess.SP

    Personalized Heart Disease Detection via ECG Digital Twin Generation

    Authors: Yaojun Hu, **tai Chen, Lianting Hu, Dantong Li, Jiahuan Yan, Haochao Ying, Huiying Liang, Jian Wu

    Abstract: Heart diseases rank among the leading causes of global mortality, demonstrating a crucial need for early diagnosis and intervention. Most traditional electrocardiogram (ECG) based automated diagnosis methods are trained at population level, neglecting the customization of personalized ECGs to enhance individual healthcare management. A potential solution to address this limitation is to employ dig… ▽ More

    Submitted 11 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  5. arXiv:2402.01380  [pdf, other

    cs.CV eess.IV

    Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization

    Authors: Zhiyu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Song

    Abstract: Volumetric videos, benefiting from immersive 3D realism and interactivity, hold vast potential for various applications, while the tremendous data volume poses significant challenges for compression. Recently, NeRF has demonstrated remarkable potential in volumetric video compression thanks to its simple representation and powerful 3D modeling capabilities, where a notable work is ReRF. However, R… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  6. arXiv:2402.00080  [pdf, ps, other

    eess.SY eess.SP

    Arithmetic Average Density Fusion -- Part IV: Distributed Heterogeneous Fusion of RFS and LRFS Filters via Variational Approximation

    Authors: Tiancheng Li, Haozhe Liang, Guchong Li, Jesús García Herrero, Quan Pan

    Abstract: This paper, the fourth part of a series of papers on the arithmetic average (AA) density fusion approach and its application for target tracking, addresses the intricate challenge of distributed heterogeneous multisensor multitarget tracking, where each inter-connected sensor operates a probability hypothesis density (PHD) filter, a multiple Bernoulli (MB) filter or a labeled MB (LMB) filter and t… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: 13 pages,14 figures

  7. arXiv:2401.00225  [pdf

    eess.AS cs.AI eess.SP

    Enhancing dysarthria speech feature representation with empirical mode decomposition and Walsh-Hadamard transform

    Authors: Ting Zhu, Shufei Duan, Camille Dingam, Huizhi Liang, Wei Zhang

    Abstract: Dysarthria speech contains the pathological characteristics of vocal tract and vocal fold, but so far, they have not yet been included in traditional acoustic feature sets. Moreover, the nonlinearity and non-stationarity of speech have been ignored. In this paper, we propose a feature enhancement algorithm for dysarthria speech called WHFEMD. It combines empirical mode decomposition (EMD) and fast… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  8. arXiv:2312.16057  [pdf, other

    cs.IT eess.SP

    Semantic Importance-Aware Based for Multi-User Communication Over MIMO Fading Channels

    Authors: Haotai Liang, Zhicheng Bao, Wannian An, Chen Dong, Xiaodong Xu

    Abstract: Semantic communication, as a novel communication paradigm, has attracted the interest of many scholars, with multi-user, multi-input multi-output (MIMO) scenarios being one of the critical contexts. This paper presents a semantic importance-aware based communication system (SIA-SC) over MIMO Rayleigh fading channels. Combining the semantic symbols' inequality and the equivalent subchannels of MIMO… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  9. arXiv:2312.10051  [pdf, other

    eess.SP

    Semantic Synchronization for Enhanced Reliability in Communication Systems

    Authors: Xiaoyi Liu, Haotai Liang, Chen Dong, Xiaodong Xu

    Abstract: As a new communication paradigm, semantic communication has received widespread attention in communication fields. However, since the decoding of semantic signals relies on contextual knowledge, misalignment between the starting position of the semantic signal and the AI-based semantic decoder would prevent source signal recovery and reconstruction. To achieve more precise semantic communication,… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  10. arXiv:2312.08998  [pdf

    eess.AS cs.AI cs.SD eess.SP

    Design, construction and evaluation of emotional multimodal pathological speech database

    Authors: Ting Zhu, Shufei Duan, Huizhi Liang, Wei Zhang

    Abstract: The lack of an available emotion pathology database is one of the key obstacles in studying the emotion expression status of patients with dysarthria. The first Chinese multimodal emotional pathological speech database containing multi-perspective information is constructed in this paper. It includes 29 controls and 39 patients with different degrees of motor dysarthria, expressing happy, sad, ang… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  11. arXiv:2309.12849  [pdf, other

    cs.LG eess.SY

    DeepOPF-U: A Unified Deep Neural Network to Solve AC Optimal Power Flow in Multiple Networks

    Authors: Heng Liang, Changhong Zhao

    Abstract: The traditional machine learning models to solve optimal power flow (OPF) are mostly trained for a given power network and lack generalizability to today's power networks with varying topologies and growing plug-and-play distributed energy resources (DERs). In this paper, we propose DeepOPF-U, which uses one unified deep neural network (DNN) to solve alternating-current (AC) OPF problems in differ… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 3 pages, 2 figures

  12. arXiv:2308.16738  [pdf, other

    eess.IV cs.CV cs.LG

    SFUSNet: A Spatial-Frequency domain-based Multi-branch Network for diagnosis of Cervical Lymph Node Lesions in Ultrasound Images

    Authors: Yubiao Yue, Jun Xue, Haihua Liang, Bingchun Luo, Zhenzhang Li

    Abstract: Booming deep learning has substantially improved the diagnosis for diverse lesions in ultrasound images, but a conspicuous research gap concerning cervical lymph node lesions still remains. The objective of this work is to diagnose cervical lymph node lesions in ultrasound images by leveraging a deep learning model. To this end, we first collected 3392 cervical ultrasound images containing normal… ▽ More

    Submitted 4 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  13. arXiv:2308.14081   

    eess.IV cs.CV

    U-SEANNet: A Simple, Efficient and Applied U-Shaped Network for Diagnosis of Nasal Diseases on Nasal Endoscopic Images

    Authors: Yubiao Yue, Jun Xue, Chao Wang, Haihua Liang, Zhenzhang Li

    Abstract: Numerous studies have affirmed that deep learning models can facilitate early diagnosis of lesions in endoscopic images. However, the lack of available datasets stymies advancements in research on nasal endoscopy, and existing models fail to strike a good trade-off between model diagnosis performance, model complexity and parameters size, rendering them unsuitable for real-world application. To br… ▽ More

    Submitted 11 February, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: There are some descriptive errors in the manuscript

  14. arXiv:2308.04805  [pdf, other

    cs.IR cs.SD eess.AS

    DiVa: An Iterative Framework to Harvest More Diverse and Valid Labels from User Comments for Music

    Authors: Hongru Liang, **gyao Liu, Yuanxin Xiang, Jiachen Du, Lanjun Zhou, Shushen Pan, Wenqiang Lei

    Abstract: Towards sufficient music searching, it is vital to form a complete set of labels for each song. However, current solutions fail to resolve it as they cannot produce diverse enough map**s to make up for the information missed by the gold labels. Based on the observation that such missing information may already be presented in user comments, we propose to study the automated music labeling in an… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 11 pages, 5 figures, published to ACM MM 2023

  15. arXiv:2306.10772  [pdf, other

    cs.SD eess.AS

    Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming

    Authors: Hao Liang, Guanxing Zhou, Xiaotong Tu, Andreas Jakobsson, Xinghao Ding, Yue Huang

    Abstract: Recently, many forms of audio industrial applications, such as sound monitoring and source localization, have begun exploiting smart multi-modal devices equipped with a microphone array. Regrettably, model-based methods are often difficult to employ for such devices due to their high computational complexity, as well as the difficulty of appropriately selecting the user-determined parameters. As a… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 12 pages, 9 figures

  16. arXiv:2305.00149  [pdf, other

    eess.IV cs.CV

    X-ray Recognition: Patient identification from X-rays using a contrastive objective

    Authors: Hao Liang, Kevin Ni, Guha Balakrishnan

    Abstract: Recent research demonstrates that deep learning models are capable of precisely extracting bio-information (e.g. race, gender and age) from patients' Chest X-Rays (CXRs). In this paper, we further show that deep learning models are also surprisingly accurate at recognition, i.e., distinguishing CXRs belonging to the same patient from those belonging to different patients. These findings suggest po… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

  17. arXiv:2305.00147  [pdf, other

    eess.IV cs.CV

    Visualizing chest X-ray dataset biases using GANs

    Authors: Hao Liang, Kevin Ni, Guha Balakrishnan

    Abstract: Recent work demonstrates that images from various chest X-ray datasets contain visual features that are strongly correlated with protected demographic attributes like race and gender. This finding raises issues of fairness, since some of these factors may be used by downstream algorithms for clinical predictions. In this work, we propose a framework, using generative adversarial networks (GANs), t… ▽ More

    Submitted 5 September, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: Medical Imaging with Deep Learning(MIDL) 2023

  18. arXiv:2303.15206  [pdf, other

    cs.CV eess.IV

    Perceptual Quality Assessment of NeRF and Neural View Synthesis Methods for Front-Facing Views

    Authors: Hanxue Liang, Tianhao Wu, Param Hanji, Francesco Banterle, Hongyun Gao, Rafal Mantiuk, Cengiz Oztireli

    Abstract: Neural view synthesis (NVS) is one of the most successful techniques for synthesizing free viewpoint videos, capable of achieving high fidelity from only a sparse set of captured images. This success has led to many variants of the techniques, each evaluated on a set of test views typically using image quality metrics such as PSNR, SSIM, or LPIPS. There has been a lack of research on how NVS metho… ▽ More

    Submitted 24 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  19. arXiv:2303.11692  [pdf, other

    cs.SD cs.IR eess.AS

    ByteCover3: Accurate Cover Song Identification on Short Queries

    Authors: Xingjian Du, Zijie Wang, Xia Liang, Huidong Liang, Bilei Zhu, Zejun Ma

    Abstract: Deep learning based methods have become a paradigm for cover song identification (CSI) in recent years, where the ByteCover systems have achieved state-of-the-art results on all the mainstream datasets of CSI. However, with the burgeon of short videos, many real-world applications require matching short music excerpts to full-length music tracks in the database, which is still under-explored and w… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepeted by ICASSP 2023

  20. Non-Orthogonal Multiple Access Enhanced Multi-User Semantic Communication

    Authors: Weizhi Li, Haotai Liang, Chen Dong, Xiaodong Xu, ** Zhang, Kaijun Liu

    Abstract: Semantic communication serves as a novel paradigm and attracts the broad interest of researchers. One critical aspect of it is the multi-user semantic communication theory, which can further promote its application to the practical network environment. While most existing works focused on the design of end-to-end single-user semantic transmission, a novel non-orthogonal multiple access (NOMA)-base… ▽ More

    Submitted 20 November, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: accepted by IEEE Transactions on Cognitive Communications and Networking

  21. arXiv:2303.01175  [pdf, ps, other

    math.AC cs.IT eess.SP

    A Field-Theoretic Approach to Unlabeled Sensing

    Authors: Hao Liang, **gyu Lu, Manolis C. Tsakiris, Lihong Zhi

    Abstract: We study the recent problem of unlabeled sensing from the information sciences in a field-theoretic framework. Our main result asserts that, for sufficiently generic data, the unique solution can be obtained by solving n + 1 polynomial equations in n unknowns.

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 17 pages, 2 tables

  22. arXiv:2301.03331  [pdf, other

    cs.CV cs.AI eess.IV

    A Specific Task-oriented Semantic Image Communication System for substation patrol inspection

    Authors: Senran Fan, Haotai Liang, Chen Dong, Xiaodong Xu, Geng Liu

    Abstract: Intelligent inspection robots are widely used in substation patrol inspection, which can help check potential safety hazards by patrolling the substation and sending back scene images. However, when patrolling some marginal areas with weak signal, the scene images cannot be sucessfully transmissted to be used for hidden danger elimination, which greatly reduces the quality of robots'daily work. To… ▽ More

    Submitted 13 April, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: 9 pages, 8 figures

    Journal ref: IEEE Transactions on Power Delivery; vol. 39; no. 2; pp. 835-844; April 2024

  23. arXiv:2212.03093  [pdf

    eess.SY

    Cooperative Guidance Strategy for Active Defense Spacecraft with Imperfect Information via Deep Reinforcement Learning

    Authors: Li Zhi, Haizhao Liang, **ze Wu, Jianying Wang, Yu Zheng

    Abstract: In this paper, an adaptive cooperative guidance strategy for the active protection of a target spacecraft trying to evade an interceptor was developed. The target spacecraft performs evasive maneuvers, launching an active defense vehicle to divert the interceptor. Instead of classical strategies, which are based on optimal control or differential game theory, the problem was solved by using the de… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  24. arXiv:2211.02320  [pdf, other

    eess.SY

    Aircraft Ground Taxiing Deduction and Conflict Early Warning Method Based on Control Command Information

    Authors: **gchang Zhuge, Huiyuan Liang, Yiming Zhang, Shichao Li, Xinyu Yang, Jun Wu

    Abstract: Aircraft taxiing conflict is a threat to the safety of airport operations, mainly due to the human error in control command infor-mation. In order to solve the problem, The aircraft taxiing deduction and conflict early warning method based on control order information is proposed. This method does not need additional equipment and operating costs, and is completely based on his-torical data and co… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  25. arXiv:2210.00621  [pdf, other

    cs.LG cs.CV eess.SP math.OC

    Optimization for Robustness Evaluation beyond $\ell_p$ Metrics

    Authors: Hengyue Liang, Buyun Liang, Ying Cui, Tim Mitchell, Ju Sun

    Abstract: Empirical evaluation of deep learning models against adversarial attacks entails solving nontrivial constrained optimization problems. Popular algorithms for solving these constrained problems rely on projected gradient descent (PGD) and require careful tuning of multiple hyperparameters. Moreover, PGD can only handle $\ell_1$, $\ell_2$, and $\ell_\infty$ attack models due to the use of analytical… ▽ More

    Submitted 13 November, 2022; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: 5 pages, 1 figure, 3 tables, accepted by the 14th International OPT Workshop on Optimization for Machine Learning, and submitted to the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

  26. arXiv:2203.16988  [pdf

    cs.SD cs.LG eess.AS

    Acoustic-Net: A Novel Neural Network for Sound Localization and Quantification

    Authors: Guanxing Zhou, Hao Liang, Xinghao Ding, Yue Huang, Xiaotong Tu, Saqlain Abbas

    Abstract: Acoustic source localization has been applied in different fields, such as aeronautics and ocean science, generally using multiple microphones array data to reconstruct the source location. However, the model-based beamforming methods fail to achieve the high-resolution of conventional beamforming maps. Deep neural networks are also appropriate to locate the sound source, but in general, these met… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  27. arXiv:2203.10674  [pdf, other

    cs.LG cs.CR cs.NI eess.SY

    RareGAN: Generating Samples for Rare Classes

    Authors: Zinan Lin, Hao Liang, Giulia Fanti, Vyas Sekar

    Abstract: We study the problem of learning generative adversarial networks (GANs) for a rare class of an unlabeled dataset subject to a labeling budget. This problem is motivated from practical applications in domains including security (e.g., synthesizing packets for DNS amplification attacks), systems and networking (e.g., synthesizing workloads that trigger high resource usage), and machine learning (e.g… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Published in AAAI 2022

  28. arXiv:2203.05087  [pdf, other

    eess.SY

    False Data Injection Attack on Electric Vehicle-Assisted Voltage Regulation

    Authors: Yuan Liu, Omid Ardakanian, Ioanis Nikolaidis, Hao Liang

    Abstract: With the large scale penetration of electric vehicles (EVs) and the advent of bidirectional chargers, EV aggregators will become a major player in the voltage regulation market. This paper proposes a novel false data injection attack (FDIA) against the voltage regulation capacity estimation of EV charging stations, the process that underpins voltage regulation in distribution system. The proposed… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 10 pages

  29. arXiv:2202.09595  [pdf, other

    eess.SP

    Innovative semantic communication system

    Authors: Chen Dong, Haotai Liang, Xiaodong Xu, Shujun Han, Bizhu Wang, ** Zhang

    Abstract: Traditional communication systems focus on the transmission process, and the context-dependent meaning has been ignored. The fact that 5G system has approached Shannon limit and the increasing amount of data will cause communication bottleneck, such as the increased delay problems. Inspired by the ability of artificial intelligence to understand semantics, we propose a new communication paradigm,… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

  30. arXiv:2112.06074  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Early Stop** for Deep Image Prior

    Authors: Hengkang Wang, Taihui Li, Zhong Zhuang, Tiancong Chen, Hengyue Liang, Ju Sun

    Abstract: Deep image prior (DIP) and its variants have showed remarkable potential for solving inverse problems in computer vision, without any extra training data. Practical DIP models are often substantially overparameterized. During the fitting process, these models learn mostly the desired visual content first, and then pick up the potential modeling and observational noise, i.e., overfitting. Thus, the… ▽ More

    Submitted 11 December, 2023; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: Published in TMLR (https://openreview.net/forum?id=231ZzrLC8X)

    Journal ref: Transactions on Machine Learning Research (TMLR), 2835-8856 (12/2023)

  31. arXiv:2112.05844  [pdf, other

    eess.SY

    Economic MPC-based planning for marine vehicles: Tuning safety and energy efficiency

    Authors: Haojiao Liang, Hui** Li, Jian Gao, Rongxin Cui, Demin Xu

    Abstract: Energy efficiency and safety are two critical objectives for marine vehicles operating in environments with obstacles, and they generally conflict with each other. In this paper, we propose a novel online motion planning method of marine vehicles which can make trade-offs between the two design objectives based on the framework of economic model predictive control (EMPC). Firstly, the feasible tra… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

  32. arXiv:2110.12271  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Self-Validation: Early Stop** for Single-Instance Deep Generative Priors

    Authors: Taihui Li, Zhong Zhuang, Hengyue Liang, Le Peng, Hengkang Wang, Ju Sun

    Abstract: Recent works have shown the surprising effectiveness of deep generative models in solving numerous image reconstruction (IR) tasks, even without training data. We call these models, such as deep image prior and deep decoder, collectively as single-instance deep generative priors (SIDGPs). The successes, however, often hinge on appropriate early stop** (ES), which by far has largely been handled… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: To appear in British Machine Vision Conference (BMVC) 2021

  33. arXiv:2107.07988  [pdf, other

    cs.CV cs.LG cs.SD eess.AS eess.IV

    Controlled AutoEncoders to Generate Faces from Voices

    Authors: Hao Liang, Lulan Yu, Guikang Xu, Bhiksha Raj, Rita Singh

    Abstract: Multiple studies in the past have shown that there is a strong correlation between human vocal characteristics and facial features. However, existing approaches generate faces simply from voice, without exploring the set of features that contribute to these observed correlations. A computational methodology to explore this can be devised by rephrasing the question to: "how much would a target face… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  34. arXiv:2106.12511  [pdf

    eess.IV cs.CV cs.LG

    High-Throughput Precision Phenoty** of Left Ventricular Hypertrophy with Cardiovascular Deep Learning

    Authors: Grant Duffy, Paul P Cheng, Neal Yuan, Bryan He, Alan C. Kwan, Matthew J. Shun-Shin, Kevin M. Alexander, Joseph Ebinger, Matthew P. Lungren, Florian Rader, David H. Liang, Ingela Schnittger, Euan A. Ashley, James Y. Zou, Jignesh Patel, Ronald Witteles, Susan Cheng, David Ouyang

    Abstract: Left ventricular hypertrophy (LVH) results from chronic remodeling caused by a broad range of systemic and cardiovascular disease including hypertension, aortic stenosis, hypertrophic cardiomyopathy, and cardiac amyloidosis. Early detection and characterization of LVH can significantly impact patient care but is limited by under-recognition of hypertrophy, measurement error and variability, and di… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  35. arXiv:2106.05152  [pdf, other

    eess.IV cs.CV cs.LG

    Rethinking Transfer Learning for Medical Image Classification

    Authors: Le Peng, Hengyue Liang, Gaoxiang Luo, Taihui Li, Ju Sun

    Abstract: Transfer learning (TL) from pretrained deep models is a standard practice in modern medical image classification (MIC). However, what levels of features to be reused are problem-dependent, and uniformly finetuning all layers of pretrained models may be suboptimal. This insight has partly motivated the recent differential TL strategies, such as TransFusion (TF) and layer-wise finetuning (LWFT), whi… ▽ More

    Submitted 26 May, 2024; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted by BMVC2023 (oral)

  36. arXiv:2103.00345  [pdf, other

    cs.RO cs.CR cs.LG eess.SY

    End-to-end Uncertainty-based Mitigation of Adversarial Attacks to Automated Lane Centering

    Authors: Ruochen Jiao, Hengyi Liang, Takami Sato, Junjie Shen, Qi Alfred Chen, Qi Zhu

    Abstract: In the development of advanced driver-assistance systems (ADAS) and autonomous vehicles, machine learning techniques that are based on deep neural networks (DNNs) have been widely used for vehicle perception. These techniques offer significant improvement on average perception accuracy over traditional methods, however, have been shown to be susceptible to adversarial attacks, where small perturba… ▽ More

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: 8 pages for conference

  37. arXiv:2012.09154  [pdf

    eess.IV cs.CV physics.optics

    Exploration of Whether Skylight Polarization Patterns Contain Three-dimensional Attitude Information

    Authors: Huaju Liang, Hongyang Bai, Tong Zhou

    Abstract: Our previous work has demonstrated that Rayleigh model, which is widely used in polarized skylight navigation to describe skylight polarization patterns, does not contain three-dimensional (3D) attitude information [1]. However, it is still necessary to further explore whether the skylight polarization patterns contain 3D attitude information. So, in this paper, a social spider optimization (SSO)… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

  38. arXiv:2010.08091  [pdf, other

    cs.SD cs.MM eess.AS

    PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

    Authors: Hongru Liang, Wenqiang Lei, Paul Yaozhu Chan, Zhenglu Yang, Maosong Sun, Tat-Seng Chua

    Abstract: Definitive embeddings remain a fundamental challenge of computational musicology for symbolic music in deep learning today. Analogous to natural language, music can be modeled as a sequence of tokens. This motivates the majority of existing solutions to explore the utilization of word embedding models to build music embeddings. However, music differs from natural languages in two key aspects: (1)… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: ACM Multimedia 2020 -- best paper

  39. Leveraging Weakly-hard Constraints for Improving System Fault Tolerance with Functional and Timing Guarantees

    Authors: Hengyi Liang, Zhilu Wang, Ruochen Jiao, Qi Zhu

    Abstract: Many safety-critical real-time systems operate under harsh environment and are subject to soft errors caused by transient or intermittent faults. It is critical and yet often very challenging to apply fault tolerance techniques in these systems, due to their resource limitations and stringent constraints on timing and functionality. In this work, we leverage the concept of weakly-hard constraints,… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: ICCAD 2020

  40. arXiv:2007.12578  [pdf, other

    eess.IV cs.CV cs.LG

    Stain Style Transfer of Histopathology Images Via Structure-Preserved Generative Learning

    Authors: Hanwen Liang, Konstantinos N. Plataniotis, Xingyu Li

    Abstract: Computational histopathology image diagnosis becomes increasingly popular and important, where images are segmented or classified for disease diagnosis by computers. While pathologists do not struggle with color variations in slides, computational solutions usually suffer from this critical issue. To address the issue of color variations in histopathology images, this study proposes two stain styl… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

  41. arXiv:2005.11842  [pdf, other

    eess.SY

    Cross-Layer Design of Automotive Systems

    Authors: Zhilu Wang, Hengyi Liang, Chao Huang, Qi Zhu

    Abstract: With growing system complexity and closer cyber-physical interaction, there are increasingly stronger dependencies between different function and architecture layers in automotive systems. This paper first introduces several cross-layer approaches we developed in the past for holistically addressing multiple system layers in the design of individual vehicles and of connected vehicle applications;… ▽ More

    Submitted 31 May, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

  42. arXiv:2003.10689  [pdf

    eess.IV cs.CV

    Learning regularization and intensity-gradient-based fidelity for single image super resolution

    Authors: Hu Liang, Shengrong Zhao

    Abstract: How to extract more and useful information for single image super resolution is an imperative and difficult problem. Learning-based method is a representative method for such task. However, the results are not so stable as there may exist big difference between the training data and the test data. The regularization-based method can effectively utilize the self-information of observation. However,… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

  43. arXiv:2003.00342  [pdf, other

    cs.RO cs.AI cs.LG cs.SD eess.AS

    Robust Robotic Pouring using Audition and Haptics

    Authors: Hongzhuo Liang, Chuangchuang Zhou, Shuang Li, Xiaojian Ma, Norman Hendrich, Timo Gerkmann, Fuchun Sun, Marcus Stoffel, Jianwei Zhang

    Abstract: Robust and accurate estimation of liquid height lies as an essential part of pouring tasks for service robots. However, vision-based methods often fail in occluded conditions while audio-based methods cannot work well in a noisy environment. We instead propose a multimodal pouring network (MP-Net) that is able to robustly predict liquid height by conditioning on both audition and haptics input. MP… ▽ More

    Submitted 14 October, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: accepted by IROS2020

    Journal ref: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  44. Making Sense of Audio Vibration for Liquid Height Estimation in Robotic Pouring

    Authors: Hongzhuo Liang, Shuang Li, Xiaojian Ma, Norman Hendrich, Timo Gerkmann, Fuchun Sun, Jianwei Zhang

    Abstract: In this paper, we focus on the challenging perception problem in robotic pouring. Most of the existing approaches either leverage visual or haptic information. However, these techniques may suffer from poor generalization performances on opaque containers or concerning measuring precision. To tackle these drawbacks, we propose to make use of audio vibration sensing and design a deep neural network… ▽ More

    Submitted 21 July, 2019; v1 submitted 2 March, 2019; originally announced March 2019.

    Comments: Accepted to IROS 2019

    Journal ref: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  45. arXiv:1806.01989  [pdf

    eess.SP cs.ET physics.app-ph

    Design of Voltage Pulse Control Module for Free Space Measurement-Device-Independent Quantum Key Distribution

    Authors: Sijie Zhang, Nan Zhou, Fanshui Deng, Hao Liang

    Abstract: Measurement-Device-Independent Quantum Key Distribution (MDIQKD) protocol has been proved that it is unaffected by all hacking attacks, and ensures the security of information theory even when the performance of single-photon detectors is not ideal. Fiber channel has been used by the previous MDIQKD experimental device. However, the signal attenuation increases exponentially as the transmission di… ▽ More

    Submitted 20 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  46. arXiv:1806.01490  [pdf

    physics.ins-det eess.SP

    Design of 32-channel TDC Based on Single FPGA for μSR Spectrometer at CSNS

    Authors: Fanshui Deng, Hao Liang, Bangjiao Ye, **gyu Tang

    Abstract: Muon Spin Rotation, Relaxation and Resonance (μSR) technology has an irreplaceable role in studying the microstructure and properties of materials, especially micro-magnetic properties. An experimental muon source is being built in China Spallation Neutron Source (CSNS) now. At the same time, a 128-channel μSR spectrometer as China's first μSR spectrometer is being developed. The time spectrum of… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.