Skip to main content

Showing 1–50 of 51 results for author: Liang, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.15354  [pdf, other

    eess.SP cs.AI cs.LG math.NA

    Elevating Spectral GNNs through Enhanced Band-pass Filter Approximation

    Authors: Guoming Li, Jian Yang, Shangsong Liang, Dongsheng Luo

    Abstract: Spectral Graph Neural Networks (GNNs) have attracted great attention due to their capacity to capture patterns in the frequency domains with essential graph filters. Polynomial-based ones (namely poly-GNNs), which approximately construct graph filters with conventional or rational polynomials, are routinely adopted in practice for their substantial performances on graph learning tasks. However, pr… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Preprint

  2. arXiv:2404.11861  [pdf, other

    eess.SP

    sEMG-based Fine-grained Gesture Recognition via Improved LightGBM Model

    Authors: Xiupeng Qiao, Zekun Chen, Shili Liang

    Abstract: Surface electromyogram (sEMG), as a bioelectrical signal reflecting the activity of human muscles, has a wide range of applications in the control of prosthetics, human-computer interaction and so on. However, the existing recognition methods are all discrete actions, that is, every time an action is executed, it is necessary to restore the resting state before the next action, and it is unable to… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  3. arXiv:2404.11383  [pdf, other

    eess.SP

    Lower Limb Movements Recognition Based on Feature Recursive Elimination and Backpropagation Neural Network

    Authors: Yongkai Ma, Shili Liang, Zekun Chen

    Abstract: Surface electromyographic (sEMG) signal serve as a signal source commonly used for lower limb movement recognition, reflecting the intent of human movement. However, it has been a challenge to improve the movements recognition rate while using fewer features in this area of research area. In this paper, a method for lower limb movements recognition based on recursive feature elimination and backpr… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  4. arXiv:2404.07444  [pdf, other

    cs.NI eess.SP

    Two-Way Aerial Secure Communications via Distributed Collaborative Beamforming under Eavesdropper Collusion

    Authors: Jiahui Li, Geng Sun, Qingqing Wu, Shuang Liang, Pengfei Wang, Dusit Niyato

    Abstract: Unmanned aerial vehicles (UAVs)-enabled aerial communication provides a flexible, reliable, and cost-effective solution for a range of wireless applications. However, due to the high line-of-sight (LoS) probability, aerial communications between UAVs are vulnerable to eavesdrop** attacks, particularly when multiple eavesdroppers collude. In this work, we aim to introduce distributed collaborativ… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by IEEE INFOCOM 2024

  5. arXiv:2404.04597  [pdf, other

    eess.SY

    A Two Time-Scale Joint Optimization Approach for UAV-assisted MEC

    Authors: Zemin Sun, Geng Sun, Long He, Fang Mei, Shuang Liang, Yanheng Liu

    Abstract: Unmanned aerial vehicles (UAV)-assisted mobile edge computing (MEC) is emerging as a promising paradigm to provide aerial-terrestrial computing services close to mobile devices (MDs). However, meeting the demands of computation-intensive and delay-sensitive tasks for MDs poses several challenges, including the demand-supply contradiction between MDs and MEC servers, the demand-supply heterogeneity… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2403.15828

  6. arXiv:2404.04559  [pdf, ps, other

    cs.LG eess.SP math.NA

    Spectral GNN via Two-dimensional (2-D) Graph Convolution

    Authors: Guoming Li, Jian Yang, Shangsong Liang, Dongsheng Luo

    Abstract: Spectral Graph Neural Networks (GNNs) have achieved tremendous success in graph learning. As an essential part of spectral GNNs, spectral graph convolution extracts crucial frequency information in graph data, leading to superior performance of spectral GNNs in downstream tasks. However, in this paper, we show that existing spectral GNNs remain critical drawbacks in performing the spectral graph c… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Preprint

  7. arXiv:2403.15828  [pdf, other

    eess.SY

    TJCCT: A Two-timescale Approach for UAV-assisted Mobile Edge Computing

    Authors: Zemin Sun, Geng Sun, Qingqing Wu, Long He, Shuang Liang, Hongyang Pan, Dusit Niyato, Chau Yuen, Victor C. M. Leung

    Abstract: Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) is emerging as a promising paradigm to provide aerial-terrestrial computing services in close proximity to mobile devices (MDs). However, meeting the demands of computation-intensive and delay-sensitive tasks for MDs poses several challenges, including the demand-supply contradiction between MDs and MEC servers, the demand-supply h… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  8. arXiv:2403.05247  [pdf, other

    cs.CV eess.IV

    Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds

    Authors: Tianrui Lou, Xiaojun Jia, **dong Gu, Li Liu, Siyuan Liang, Bangyan He, Xiaochun Cao

    Abstract: Adversarial attack methods based on point manipulation for 3D point cloud classification have revealed the fragility of 3D models, yet the adversarial examples they produce are easily perceived or defended against. The trade-off between the imperceptibility and adversarial strength leads most point attack methods to inevitably introduce easily detectable outlier points upon a successful attack. An… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  9. arXiv:2402.04097  [pdf, other

    cs.CV eess.IV

    Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction

    Authors: Shijun Liang, Evan Bell, Qing Qu, Rongrong Wang, Saiprasad Ravishankar

    Abstract: The ability of deep image prior (DIP) to recover high-quality images from incomplete or corrupted measurements has made it popular in inverse problems in image restoration and medical imaging including magnetic resonance imaging (MRI). However, conventional DIP suffers from severe overfitting and spectral bias effects. In this work, we first provide an analysis of how DIP recovers information from… ▽ More

    Submitted 7 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  10. arXiv:2312.07784  [pdf, other

    eess.IV cs.AI cs.CV cs.LG eess.SP

    Robust MRI Reconstruction by Smoothed Unrolling (SMUG)

    Authors: Shijun Liang, Van Hoang Minh Nguyen, **ghan Jia, Ismail Alkhouri, Sijia Liu, Saiprasad Ravishankar

    Abstract: As the popularity of deep learning (DL) in the field of magnetic resonance imaging (MRI) continues to rise, recent research has indicated that DL-based MRI reconstruction models might be excessively sensitive to minor input disturbances, including worst-case additive perturbations. This sensitivity often leads to unstable, aliased images. This raises the question of how to devise DL techniques for… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  11. arXiv:2310.00396  [pdf, other

    eess.SY

    Joint Scheduling and Trajectory Optimization of Charging UAV in Wireless Rechargeable Sensor Networks

    Authors: Yanheng Liu, Hongyang Pan, Geng Sun, Aimin Wang, Jiahui Li, Shuang Liang

    Abstract: Wireless rechargeable sensor networks with a charging unmanned aerial vehicle (CUAV) have the broad application prospects in the power supply of the rechargeable sensor nodes (SNs). However, how to schedule a CUAV and design the trajectory to improve the charging efficiency of the entire system is still a vital problem. In this paper, we formulate a joint-CUAV scheduling and trajectory optimizatio… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  12. arXiv:2310.00384  [pdf, ps, other

    eess.SY

    Joint Power and 3D Trajectory Optimization for UAV-enabled Wireless Powered Communication Networks with Obstacles

    Authors: Hongyang Pan, Yanheng Liu, Geng Sun, Junsong Fan, Shuang Liang, Chau Yuen

    Abstract: Unmanned aerial vehicle (UAV)-enabled wireless powered communication networks (WPCNs) are promising technologies in 5G/6G wireless communications, while there are several challenges about UAV power allocation and scheduling to enhance the energy utilization efficiency, considering the existence of obstacles. In this work, we consider a UAV-enabled WPCN scenario that a UAV needs to cover the ground… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  13. arXiv:2310.00288  [pdf

    cs.AR cs.ET eess.SY physics.app-ph

    Parallel in-memory wireless computing

    Authors: Cong Wang, Gong-Jie Ruan, Zai-Zheng Yang, Xing-Jian Yangdong, Yixiang Li, Liang Wu, Yingmeng Ge, Yichen Zhao, Chen Pan, Wei Wei, Li-Bo Wang, Bin Cheng, Zaichen Zhang, Chuan Zhang, Shi-Jun Liang, Feng Miao

    Abstract: Parallel wireless digital communication with ultralow power consumption is critical for emerging edge technologies such as 5G and Internet of Things. However, the physical separation between digital computing units and analogue transmission units in traditional wireless technology leads to high power consumption. Here we report a parallel in-memory wireless computing scheme. The approach combines… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Journal ref: Nat Electron 6, 381-389 (2023)

  14. arXiv:2309.16709  [pdf, other

    eess.SP cs.GT cs.NI

    Joint Task Offloading and Resource Allocation in Aerial-Terrestrial UAV Networks with Edge and Fog Computing for Post-Disaster Rescue

    Authors: Geng Sun, Long He, Zemin Sun, Qingqing Wu, Shuang Liang, Jiahui Li, Dusit Niyato, Victor C. M. Leung

    Abstract: Unmanned aerial vehicles (UAVs) play an increasingly important role in assisting fast-response post-disaster rescue due to their fast deployment, flexible mobility, and low cost. However, UAVs face the challenges of limited battery capacity and computing resources, which could shorten the expected flight endurance of UAVs and increase the rescue response delay during performing mission-critical ta… ▽ More

    Submitted 6 October, 2023; v1 submitted 17 August, 2023; originally announced September 2023.

    Comments: 18 pages, 6 figures

  15. arXiv:2309.15977  [pdf, other

    cs.SD cs.CV eess.AS

    Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields

    Authors: Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

    Abstract: Room impulse response (RIR), which measures the sound propagation within an environment, is critical for synthesizing high-fidelity audio for a given environment. Some prior work has proposed representing RIR as a neural field function of the sound emitter and receiver positions. However, these methods do not sufficiently consider the acoustic properties of an audio scene, leading to unsatisfactor… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  16. arXiv:2309.05794  [pdf, other

    eess.IV

    Robust Physics-based Deep MRI Reconstruction Via Diffusion Purification

    Authors: Ismail Alkhouri, Shijun Liang, Rongrong Wang, Qing Qu, Saiprasad Ravishankar

    Abstract: Deep learning (DL) techniques have been extensively employed in magnetic resonance imaging (MRI) reconstruction, delivering notable performance enhancements over traditional non-DL methods. Nonetheless, recent studies have identified vulnerabilities in these models during testing, namely, their susceptibility to (\textit{i}) worst-case measurement perturbations and to (\textit{ii}) variations in t… ▽ More

    Submitted 24 October, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

  17. arXiv:2308.00122  [pdf, other

    cs.CV cs.SD eess.AS

    DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models

    Authors: Chao Huang, Susan Liang, Yapeng Tian, Anurag Kumar, Chenliang Xu

    Abstract: We propose DAVIS, a Diffusion model-based Audio-VIusal Separation framework that solves the audio-visual sound source separation task through a generative manner. While existing discriminative methods that perform mask regression have made remarkable progress in this field, they face limitations in capturing the complex data distribution required for high-quality separation of sounds from diverse… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

  18. arXiv:2305.13774  [pdf, other

    cs.SD eess.AS

    ADD 2023: the Second Audio Deepfake Detection Challenge

    Authors: Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li

    Abstract: Audio deepfake detection is an emerging topic in the artificial intelligence community. The second Audio Deepfake Detection Challenge (ADD 2023) aims to spur researchers around the world to build new innovative technologies that can further accelerate and foster research on detecting and analyzing deepfake speech utterances. Different from previous challenges (e.g. ADD 2022), ADD 2023 focuses on s… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  19. arXiv:2304.08038  [pdf, other

    cs.IT eess.SP

    Orthogonal AMP for Problems with Multiple Measurement Vectors and/or Multiple Transforms

    Authors: Yiyao Cheng, Lei Liu, Shansuo Liang, Jonathan. H. Manton, Li **

    Abstract: Approximate message passing (AMP) algorithms break a (high-dimensional) statistical problem into parts then repeatedly solve each part in turn, akin to alternating projections. A distinguishing feature is their asymptotic behaviours can be accurately predicted via their associated state evolution equations. Orthogonal AMP (OAMP) was recently developed to avoid the need for computing the so-called… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  20. arXiv:2303.15299  [pdf, other

    eess.SY cs.AI

    Resilient Output Consensus Control of Heterogeneous Multi-agent Systems against Byzantine Attacks: A Twin Layer Approach

    Authors: Xin Gong, Yiwen Liang, Yukang Cui, Shi Liang, Tingwen Huang

    Abstract: This paper studies the problem of cooperative control of heterogeneous multi-agent systems (MASs) against Byzantine attacks. The agent affected by Byzantine attacks sends different wrong values to all neighbors while applying wrong input signals for itself, which is aggressive and difficult to be defended. Inspired by the concept of Digital Twin, a new hierarchical protocol equipped with a virtual… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  21. arXiv:2303.12735  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    SMUG: Towards robust MRI reconstruction by smoothed unrolling

    Authors: Hui Li, **ghan Jia, Shijun Liang, Yuguang Yao, Saiprasad Ravishankar, Sijia Liu

    Abstract: Although deep learning (DL) has gained much popularity for accelerated magnetic resonance imaging (MRI), recent studies have shown that DL-based MRI reconstruction models could be oversensitive to tiny input perturbations (that are called 'adversarial perturbations'), which cause unstable, low-quality reconstructed images. This raises the question of how to design robust DL methods for MRI reconst… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  22. arXiv:2302.02088  [pdf, other

    cs.CV cs.GR cs.SD eess.AS

    AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis

    Authors: Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

    Abstract: Can machines recording an audio-visual scene produce realistic, matching audio-visual experiences at novel positions and novel view directions? We answer it by studying a new task -- real-world audio-visual scene synthesis -- and a first-of-its-kind NeRF-based approach for multimodal learning. Concretely, given a video recording of an audio-visual scene, the task is to synthesize new videos with s… ▽ More

    Submitted 16 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023

  23. arXiv:2301.09321  [pdf, ps, other

    eess.SY

    Optimal Inter-area Oscillation Dam** Control: A Transfer Deep Reinforcement Learning Approach with Switching Control Strategy

    Authors: Siyuan Liang, Long Huo, Xin Chen, Peiyuan Sun

    Abstract: Wide-area dam** control for inter-area oscillation (IAO) is critical to modern power systems. The recent breakthroughs in deep learning and the broad deployment of phasor measurement units (PMU) promote the development of datadriven IAO dam** controllers. In this paper, the dam** control of IAOs is modeled as a Markov Decision Process (MDP) and solved by the proposed Deep Deterministic Polic… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  24. arXiv:2211.03577  [pdf

    physics.optics eess.SP physics.app-ph

    Regrowth-free AlGaInAs MQW polarization controller integrated with sidewall grating DFB laser

    Authors: Xiao Sun, Song Liang, Weiqing Cheng, Shengwei Ye, Yiming Sun, Yongguang Huang, Ruikang Zhang, Jichuan Xiong, Xuefeng Liu, John H. Marsh, Lian** Hou

    Abstract: We report an AlGaInAs multiple quantum well integrated source of polarization controlled light consisting of a polarization mode converter PMC, differential phase shifter(DPS), and a side wall grating distributed-feedback DFB laser. We demonstrate an asymmetrical stepped-height ridge waveguide PMC to realize TE to TM polarization conversion and a symmetrical straight waveguide DPS to enable polari… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2210.10519

  25. arXiv:2206.14635  [pdf, ps, other

    eess.SY

    Prespecified-time observer-based distributed control of battery energy storage systems

    Authors: Wu Yang, Shu-Ming Liang, Yan-Wu Wang, Zhi-Wei Liu

    Abstract: This paper studies the state-of-charge (SoC) balancing and the total charging/discharging power tracking issues for battery energy storage systems (BESSs) with multiple distributed heterogeneous battery units. Different from the traditional cooperative control strategies based on the asymptotical or finite-time distributed observers, two distributed prespecified-time observers are proposed to esti… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

  26. arXiv:2206.11680  [pdf, other

    cs.IT cs.AI cs.LG eess.SP

    Capacity Optimality of OAMP in Coded Large Unitarily Invariant Systems

    Authors: Lei Liu, Shansuo Liang, Li **

    Abstract: This paper investigates a large unitarily invariant system (LUIS) involving a unitarily invariant sensing matrix, an arbitrary fixed signal distribution, and forward error control (FEC) coding. Several area properties are established based on the state evolution of orthogonal approximate message passing (OAMP) in an un-coded LUIS. Under the assumptions that the state evolution for joint OAMP and F… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted by the 2022 IEEE International Symposium on Information Theory (ISIT). arXiv admin note: substantial text overlap with arXiv:2108.08503

  27. arXiv:2206.10861  [pdf, other

    cs.CV cs.SD eess.AS

    UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022

    Authors: Yuanhang Zhang, Susan Liang, Shuang Yang, Shiguang Shan

    Abstract: This report presents a brief description of our winning solution to the AVA Active Speaker Detection (ASD) task at ActivityNet Challenge 2022. Our underlying model UniCon+ continues to build on our previous work, the Unified Context Network (UniCon) and Extended UniCon which are designed for robust scene-level ASD. We augment the architecture with a simple GRU-based module that allows information… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 5 pages, 3 figures; technical report for AVA Challenge (see https://research.google.com/ava/challenge.html) at the International Challenge on Activity Recognition (ActivityNet), CVPR 2022

  28. arXiv:2206.00775  [pdf, other

    eess.IV cs.LG

    Adaptive Local Neighborhood-based Neural Networks for MR Image Reconstruction from Undersampled Data

    Authors: Shijun Liang, Anish Lahiri, Saiprasad Ravishankar

    Abstract: Recent medical image reconstruction techniques focus on generating high-quality medical images suitable for clinical use at the lowest possible cost and with the fewest possible adverse effects on patients. Recent works have shown significant promise for reconstructing MR images from sparsely sampled k-space data using deep learning. In this work, we propose a technique that rapidly estimates deep… ▽ More

    Submitted 23 January, 2024; v1 submitted 1 June, 2022; originally announced June 2022.

  29. arXiv:2205.14942  [pdf

    cs.CV cs.LG eess.SP

    Edge YOLO: Real-Time Intelligent Object Detection System Based on Edge-Cloud Cooperation in Autonomous Vehicles

    Authors: Siyuan Liang, Hao Wu

    Abstract: Driven by the ever-increasing requirements of autonomous vehicles, such as traffic monitoring and driving assistant, deep learning-based object detection (DL-OD) has been increasingly attractive in intelligent transportation systems. However, it is difficult for the existing DL-OD schemes to realize the responsible, cost-saving, and energy-efficient autonomous vehicle systems due to low their inhe… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  30. arXiv:2205.07646  [pdf, other

    cs.CL cs.SD eess.AS

    A Fast Attention Network for Joint Intent Detection and Slot Filling on Edge Devices

    Authors: Liang Huang, Senjie Liang, Feiyang Ye, Nan Gao

    Abstract: Intent detection and slot filling are two main tasks in natural language understanding and play an essential role in task-oriented dialogue systems. The joint learning of both tasks can improve inference accuracy and is popular in recent works. However, most joint models ignore the inference latency and cannot meet the need to deploy dialogue systems at the edge. In this paper, we propose a Fast A… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 9 pages, 4 figures

  31. arXiv:2204.01731  [pdf, ps, other

    cs.LG eess.SP

    Gan-Based Joint Activity Detection and Channel Estimation For Grant-free Random Access

    Authors: Shuang Liang, Yinan Zou, Yong Zhou

    Abstract: Joint activity detection and channel estimation (JADCE) for grant-free random access is a critical issue that needs to be addressed to support massive connectivity in IoT networks. However, the existing model-free learning method can only achieve either activity detection or channel estimation, but not both. In this paper, we propose a novel model-free learning method based on generative adversari… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: 5 pages, 5 figures IEEE ICASSP2022

  32. arXiv:2203.03836  [pdf, other

    cs.IT eess.SP

    An Efficient Two-Stage SPARC Decoder for Massive MIMO Unsourced Random Access

    Authors: Juntao You, Wenjie Wang, Shansuo Liang, Wei Han, Bo Bai

    Abstract: In this paper, we study a concatenate coding scheme based on sparse regression code (SPARC) and tree code for unsourced random access in massive multiple-input and multiple-output systems. Our focus is concentrated on efficient decoding for the inner SPARC with practical concerns. A two-stage method is proposed to achieve near-optimal performance while maintaining low computational complexity. Spe… ▽ More

    Submitted 11 August, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 30 pages, 4 figures

  33. arXiv:2203.00224  [pdf, other

    cs.IT eess.SP

    On Orthogonal Approximate Message Passing

    Authors: Lei Liu, Yiyao Cheng, Shansuo Liang, Jonathan H. Manton, Li **

    Abstract: Approximate Message Passing (AMP) is an efficient iterative parameter-estimation technique for certain high-dimensional linear systems with non-Gaussian distributions, such as sparse systems. In AMP, a so-called Onsager term is added to keep estimation errors approximately Gaussian. Orthogonal AMP (OAMP) does not require this Onsager term, relying instead on an orthogonalization procedure to keep… ▽ More

    Submitted 13 January, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: 15 pages, 2 figure

  34. arXiv:2202.08433  [pdf, ps, other

    cs.SD cs.LG eess.AS

    ADD 2022: the First Audio Deep Synthesis Detection Challenge

    Authors: Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu

    Abstract: Audio deepfake detection is an emerging topic, which was included in the ASVspoof 2021. However, the recent shared tasks have not covered many real-life and challenging scenarios. The first Audio Deep synthesis Detection challenge (ADD) was motivated to fill in the gap. The ADD 2022 includes three tracks: low-quality fake audio detection (LF), partially fake audio detection (PF) and audio fake gam… ▽ More

    Submitted 26 February, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by ICASSP 2022

  35. arXiv:2201.09245  [pdf, other

    eess.SY cs.LG

    Fast Transient Stability Prediction Using Grid-informed Temporal and Topological Embedding Deep Neural Network

    Authors: Peiyuan Sun, Long Huo, Siyuan Liang, Xin Chen

    Abstract: Transient stability prediction is critically essential to the fast online assessment and maintaining the stable operation in power systems. The wide deployment of phasor measurement units (PMUs) promotes the development of data-driven approaches for transient stability assessment. This paper proposes the temporal and topological embedding deep neural network (TTEDNN) model to forecast transient st… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

  36. arXiv:2112.14839  [pdf, ps, other

    eess.SY cs.AI physics.data-an

    An overview of the quantitative causality analysis and causal graph reconstruction based on a rigorous formalism of information flow

    Authors: X. San Liang

    Abstract: Inference of causal relations from data now has become an important field in artificial intelligence. During the past 16 years, causality analysis (in a quantitative sense) has been developed independently in physics from first principles. This short note is a brief summary of this line of work, including part of the theory and several representative applications.

    Submitted 31 December, 2021; originally announced December 2021.

    Comments: 7 pages, 1 figure. Presented at the First International AIxIA Workshop on Causality, Causal-ITALY, Italian Conference on Artificial Intelligence, November 30, 2021

  37. arXiv:2112.02629  [pdf, other

    eess.SP math.OC

    A Tensor-BTD-based Modulation for Massive Unsourced Random Access

    Authors: Zhenting Luan, Yuchi Wu, Shansuo Liang, Li** Zhang, Wei Han, Bo Bai

    Abstract: In this letter, we propose a novel tensor-based modulation scheme for massive unsourced random access. The proposed modulation can be deemed as a summation of third-order tensors, of which the factors are representatives of subspaces. A constellation design based on high-dimensional Grassmann manifold is presented for information encoding. The uniqueness of tensor decomposition provides theoretica… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

  38. arXiv:2111.10006  [pdf

    eess.IV physics.med-ph physics.optics

    Image enhancement in acoustic-resolution photoacoustic microscopy enabled by a novel directional algorithm

    Authors: Fei Feng, Siqi Liang, Sung-Liang Chen

    Abstract: Acoustic-resolution photoacoustic microscopy (AR-PAM) is a promising tool for microvascular imaging. In the focal region, resolution of AR-PAM is determined by the ultrasound transducer and ultimately limited by acoustic diffraction. In the out-of-focus region, resolution deteriorates with increasing distance from the focal plane, which restricts depth of focus (DOF). Besides, a trade-off exists b… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: 34 pages (including 16 pages of supplementary materials)

  39. arXiv:2108.09116  [pdf, other

    eess.SP

    Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming

    Authors: Shuang Liang, Yuanming Shi, Yong Zhou

    Abstract: Massive connectivity is a critical challenge of Internet of Things (IoT) networks. In this paper, we consider the grant-free uplink transmission of an IoT network with a multi-antenna base station (BS) and a large number of single-antenna IoT devices. Due to the sporadic nature of IoT devices, we formulate the joint activity detection and channel estimation (JADCE) problem as a group-sparse matrix… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: IEEE/CIC ICCC 2021

  40. arXiv:2108.08503  [pdf, other

    cs.IT eess.SP

    On Capacity Optimality of OAMP: Beyond IID Sensing Matrices and Gaussian Signaling

    Authors: Lei Liu, Shansuo Liang, Li **

    Abstract: This paper investigates a large unitarily invariant system (LUIS) involving a unitarily invariant sensing matrix, an arbitrarily fixed signal distribution, and forward error control (FEC) coding. A universal Gram-Schmidt orthogonalization is considered for constructing orthogonal approximate message passing (OAMP), enabling its applicability to a wide range of prototypes without the constraint of… ▽ More

    Submitted 9 November, 2023; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Double columns, 17 pages, 9 figures

  41. arXiv:2108.02607  [pdf, other

    cs.CV cs.MM cs.SD eess.AS eess.IV

    UniCon: Unified Context Network for Robust Active Speaker Detection

    Authors: Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan, Xilin Chen

    Abstract: We introduce a new efficient framework, the Unified Context Network (UniCon), for robust active speaker detection (ASD). Traditional methods for ASD usually operate on each candidate's pre-cropped face track separately and do not sufficiently consider the relationships among the candidates. This potentially limits performance, especially in challenging scenarios with low-resolution faces, multiple… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: 10 pages, 6 figures; to appear at ACM Multimedia 2021

  42. arXiv:2107.03904  [pdf, other

    eess.IV cs.CV

    A hybrid deep learning framework for Covid-19 detection via 3D Chest CT Images

    Authors: Shuang Liang

    Abstract: In this paper, we present a hybrid deep learning framework named CTNet which combines convolutional neural network and transformer together for the detection of COVID-19 via 3D chest CT images. It consists of a CNN feature extractor module with SE attention to extract sufficient features from CT scans, together with a transformer model to model the discriminative features of the 3D CT scans. Compa… ▽ More

    Submitted 9 July, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: 5 pages, 1 figure, 2 tables

  43. arXiv:2105.12718  [pdf

    physics.app-ph eess.SY

    Magnetic Particle Spectroscopy (MPS) with One-stage Lock-in Implementation for Magnetic Bioassays with Improved Sensitivities

    Authors: Vinit Kumar Chugh, Kai Wu, Venkatramana D. Krishna, Arturo di Girolamo, Robert P. Bloom, Yongqiang Andrew Wang, Renata Saha, Shuang Liang, Maxim C-J Cheeran, Jian-** Wang

    Abstract: In recent years, magnetic particle spectroscopy (MPS) has become a highly sensitive and versatile sensing technique for quantitative bioassays. It relies on the dynamic magnetic responses of magnetic nanoparticles (MNPs) for the detection of target analytes in liquid phase. There are many research studies reporting the application of MPS for detecting a variety of analytes including viruses, toxin… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: 26 Pages, 11 Figures

  44. arXiv:2104.10832  [pdf, other

    eess.AS cs.SD

    Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss

    Authors: Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li

    Abstract: Building cross-lingual voice conversion (VC) systems for multiple speakers and multiple languages has been a challenging task for a long time. This paper describes a parallel non-autoregressive network to achieve bilingual and code-switched voice conversion for multiple speakers when there are only mono-lingual corpora for each language. We achieve cross-lingual VC between Mandarin speech with mul… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: Submitted to Interspeech 2021

  45. arXiv:2012.03500  [pdf, other

    eess.AS cs.SD

    EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture

    Authors: Chenfeng Miao, Shuang Liang, Zhencheng Liu, Minchuan Chen, Jun Ma, Shaojun Wang, **g Xiao

    Abstract: In this work, we address the Text-to-Speech (TTS) task by proposing a non-autoregressive architecture called EfficientTTS. Unlike the dominant non-autoregressive TTS models, which are trained with the need of external aligners, EfficientTTS optimizes all its parameters with a stable, end-to-end training procedure, while allowing for synthesizing high quality speech in a fast and efficient manner.… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 15 pages, 9 figures

  46. arXiv:2010.14035  [pdf, other

    eess.SP

    Two-Parametric Nyquist Pulses with Better Performance Based on Inverse Hyperbolic Functions

    Authors: Songbing Liang, Stylianos D. Assimonis

    Abstract: In this article, three new inter-symbol interference (ISI)-free pulses with enhanced performance compared to the state-of-the-art are proposed and studied in terms of frequency and time domain characteristics. They are based on inverse hyperbolic functions and on the concept of inner and outer functions, which was first introduced by the authors. New pulses are two-parametric, i.e., their design d… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 6 pages, 3 figures, submitted to the IEEE Communication Letters

  47. arXiv:2009.08605  [pdf, other

    eess.SP cs.AR

    Hardware Accelerator for Multi-Head Attention and Position-Wise Feed-Forward in the Transformer

    Authors: Siyuan Lu, Meiqi Wang, Shuang Liang, Jun Lin, Zhongfeng Wang

    Abstract: Designing hardware accelerators for deep neural networks (DNNs) has been much desired. Nonetheless, most of these existing accelerators are built for either convolutional neural networks (CNNs) or recurrent neural networks (RNNs). Recently, the Transformer model is replacing the RNN in the natural language processing (NLP) area. However, because of intensive matrix computations and complicated dat… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: 6 pages, 8 figures. This work has been accepted by IEEE SOCC (System-on-chip Conference) 2020, and peresnted by Siyuan Lu in SOCC2020. It also received the Best Paper Award in the Methdology Track in this conference

  48. arXiv:2008.00217  [pdf, other

    cs.CV cs.LG eess.IV

    Efficient Adversarial Attacks for Visual Object Tracking

    Authors: Siyuan Liang, Xingxing Wei, Siyuan Yao, Xiaochun Cao

    Abstract: Visual object tracking is an important task that requires the tracker to find the objects quickly and accurately. The existing state-ofthe-art object trackers, i.e., Siamese based trackers, use DNNs to attain high accuracy. However, the robustness of visual tracking models is seldom explored. In this paper, we analyze the weakness of object trackers based on the Siamese network and then extend adv… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

    Journal ref: eccv 2020

  49. arXiv:1909.07820  [pdf, other

    cs.OS cs.LG eess.SY

    Data Centers Job Scheduling with Deep Reinforcement Learning

    Authors: Sisheng Liang, Zhou Yang, Fang **, Yong Chen

    Abstract: Efficient job scheduling on data centers under heterogeneous complexity is crucial but challenging since it involves the allocation of multi-dimensional resources over time and space. To adapt the complex computing environment in data centers, we proposed an innovative Advantage Actor-Critic (A2C) deep reinforcement learning based approach called A2cScheduler for job scheduling. A2cScheduler consi… ▽ More

    Submitted 1 March, 2020; v1 submitted 15 September, 2019; originally announced September 2019.

    Comments: 13 pages

  50. arXiv:1811.00883  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Deep Segment Attentive Embedding for Duration Robust Speaker Verification

    Authors: Bin Liu, Shuai Nie, Ya** Zhang, Shan Liang, Wenju Liu

    Abstract: LSTM-based speaker verification usually uses a fixed-length local segment randomly truncated from an utterance to learn the utterance-level speaker embedding, while using the average embedding of all segments of a test utterance to verify the speaker, which results in a critical mismatch between testing and training. This mismatch degrades the performance of speaker verification, especially when t… ▽ More

    Submitted 31 October, 2018; originally announced November 2018.