
Showing 1–50 of 70 results for author: Liang, Z

Searching in archive eess.
  1. arXiv:2406.16942  [pdf, other]

    eess.IV cs.AI cs.CV

    Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images

    Authors: Yuanyuan Peng, Aidi Lin, Meng Wang, Tian Lin, Ke Zou, Yinglin Cheng, Tingkun Shi, Xulong Liao, Lixia Feng, Zhen Liang, Xinjian Chen, Huazhu Fu, Haoyu Chen

    Abstract: Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real-world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RE…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: All codes are available at https://github.com/yuanyuanpeng0129/FMUE

  2. arXiv:2406.11636  [pdf, other]

    eess.IV cs.CV cs.LG

    Feasibility of Federated Learning from Client Databases with Different Brain Diseases and MRI Modalities

    Authors: Felix Wagner, Wentian Xu, Pramit Saha, Ziyun Liang, Daniel Whitehouse, David Menon, Natalie Voets, J. Alison Noble, Konstantinos Kamnitsas

    Abstract: Segmentation models for brain lesions in MRI are commonly developed for a specific disease and trained on data with a predefined set of MRI modalities. Each such model cannot segment the disease using data with a different set of MRI modalities, nor can it segment any other type of disease. Moreover, this training paradigm does not allow a model to benefit from learning from heterogeneous database…

    Submitted 17 June, 2024; originally announced June 2024.

    ACM Class: I.4.9; I.4.6; I.2.11; I.4.0

  3. arXiv:2406.09989  [pdf, other]

    q-bio.NC eess.SY

    Suppressing seizure via optimal electrical stimulation to the hub of epileptic brain network

    Authors: Zhichao Liang, Guanyi Zhao, Yinuo Zhang, Weiting Sun, Jingzhe Lin, Jialin Wang, Quanying Liu

    Abstract: The electrical stimulation to the seizure onset zone (SOZ) serves as an efficient approach to seizure suppression. Recently, seizure dynamics have gained widespread attendance in its network propagation mechanisms. Compared with the direct stimulation to SOZ, other brain network-level approaches that can effectively suppress epileptic seizures remain under-explored. In this study, we introduce a p…

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.08052  [pdf, other]

    cs.SD eess.AS

    FakeSound: Deepfake General Audio Detection

    Authors: Zeyu Xie, Baihan Li, Xuenan Xu, Zheng Liang, Kai Yu, Mengyue Wu

    Abstract: With the advancement of audio generation, generative models can produce highly realistic audios. However, the proliferation of deepfake general audio can pose negative consequences. Therefore, we propose a new task, deepfake general audio detection, which aims to identify whether audio content is manipulated and to locate deepfake regions. Leveraging an automated manipulation pipeline, a dataset n…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024

    MSC Class: 68Txx ACM Class: I.2

  5. arXiv:2406.02422  [pdf, other]

    eess.IV cs.CV cs.LG

    IterMask2: Iterative Unsupervised Anomaly Segmentation via Spatial and Frequency Masking for Brain Lesions in MRI

    Authors: Ziyun Liang, Xiaoqing Guo, J. Alison Noble, Konstantinos Kamnitsas

    Abstract: Unsupervised anomaly segmentation approaches to pathology segmentation train a model on images of healthy subjects, that they define as the 'normal' data distribution. At inference, they aim to segment any pathologies in new images as 'anomalies', as they exhibit patterns that deviate from those in 'normal' training data. Prevailing methods follow the 'corrupt-and-reconstruct' paradigm. They inten…

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2405.11163  [pdf, other]

    cs.HC eess.SP

    Domain Generalization for Zero-calibration BCIs with Knowledge Distillation-based Phase Invariant Feature Extraction

    Authors: Zilin Liang, Zheng Zheng, Weihai Chen, Xinzhi Ma, Zhongcai Pei, Xiantao Sun

    Abstract: The distribution shift of electroencephalography (EEG) data causes poor generalization of brain-computer interfaces (BCIs) in unseen domains. Some methods try to tackle this challenge by collecting a portion of user data for calibration. However, it is time-consuming, mentally fatiguing, and user-unfriendly. To achieve zero-calibration BCIs, most studies employ domain generalization (DG) techniques…

    Submitted 17 May, 2024; originally announced May 2024.

  7. arXiv:2405.11155  [pdf, other]

    eess.SY cs.CC

    Inner-approximate Reachability Computation via Zonotopic Boundary Analysis

    Authors: Dejin Ren, Zhen Liang, Chenyu Wu, Jianqiang Ding, Taoran Wu, Bai Xue

    Abstract: Inner-approximate reachability analysis involves calculating subsets of reachable sets, known as inner-approximations. This analysis is crucial in the fields of dynamic systems analysis and control theory as it provides a reliable estimation of the set of states that a system can reach from given initial states at a specific time instant. In this paper, we study the inner-approximate reachability…

    Submitted 21 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: the extended version of the paper accepted by CAV 2024

  8. arXiv:2405.06971  [pdf, other]

    eess.SY

    Controlling network-coupled neural dynamics with nonlinear network control theory

    Authors: Zhongye Xia, Weibin Li, Zhichao Liang, Kexin Lou, Quanying Liu

    Abstract: This paper addresses the problem of controlling the temporal dynamics of complex nonlinear network-coupled dynamical systems, specifically in terms of neurodynamics. Based on the Lyapunov direct method, we derive a control strategy with theoretical guarantees of controllability. To verify the performance of the derived control strategy, we perform numerical experiments on two nonlinear network-cou…

    Submitted 11 May, 2024; originally announced May 2024.

  9. arXiv:2405.03123  [pdf, other]

    math.OC eess.SY

    Revealing Decision Conservativeness Through Inverse Distributionally Robust Optimization

    Authors: Qi Li, Zhirui Liang, Andrey Bernstein, Yury Dvorkin

    Abstract: This paper introduces Inverse Distributionally Robust Optimization (I-DRO) as a method to infer the conservativeness level of a decision-maker, represented by the size of a Wasserstein metric-based ambiguity set, from the optimal decisions made using Forward Distributionally Robust Optimization (F-DRO). By leveraging the Karush-Kuhn-Tucker (KKT) conditions of the convex F-DRO model, we formulate I…

    Submitted 5 May, 2024; originally announced May 2024.

  10. arXiv:2405.00734  [pdf, other]

    eess.SP cs.AI cs.LG

    EEG-MACS: Manifold Attention and Confidence Stratification for EEG-based Cross-Center Brain Disease Diagnosis under Unreliable Annotations

    Authors: Zhenxi Song, Ruihan Qin, Huixia Ren, Zhen Liang, Yi Guo, Min Zhang, Zhiguo Zhang

    Abstract: Cross-center data heterogeneity and annotation unreliability significantly challenge the intelligent diagnosis of diseases using brain signals. A notable example is the EEG-based diagnosis of neurodegenerative diseases, which features subtler abnormal neural dynamics typically observed in small-group settings. To advance this area, in this work, we introduce a transferable framework employing Mani…

    Submitted 29 April, 2024; originally announced May 2024.

  11. arXiv:2404.19214  [pdf, other]

    cs.SD eess.AS

    EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization

    Authors: Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao

    Abstract: In recent years, Transformer networks have shown remarkable performance in speech recognition tasks. However, their deployment poses challenges due to high computational and storage resource requirements. To address this issue, a lightweight model called EfficientASR is proposed in this paper, aiming to enhance the versatility of Transformer models. EfficientASR employs two primary modules: Shared…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)

  12. arXiv:2404.19212  [pdf, other]

    cs.SD eess.AS

    EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning

    Authors: Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao

    Abstract: Using unsupervised learning to disentangle speech into content, rhythm, pitch, and timbre for voice conversion has become a hot research topic. Existing works generally take into account disentangling speech components through human-crafted bottleneck features which can not achieve sufficient information disentangling, while pitch and rhythm may still be mixed together. There is a risk of informat…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)

  13. arXiv:2404.16357  [pdf, other]

    q-bio.NC eess.SY

    Reverse engineering the brain input: Network control theory to identify cognitive task-related control nodes

    Authors: Zhichao Liang, Yinuo Zhang, Jushen Wu, Quanying Liu

    Abstract: The human brain receives complex inputs when performing cognitive tasks, which range from external inputs via the senses to internal inputs from other brain regions. However, the explicit inputs to the brain during a cognitive task remain unclear. Here, we present an input identification framework for reverse engineering the control nodes and the corresponding inputs to the brain. The framework is…

    Submitted 25 April, 2024; originally announced April 2024.

  14. arXiv:2403.08164  [pdf, other]

    cs.SD cs.LG eess.AS

    EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech

    Authors: Ziqi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu

    Abstract: Recently, deep learning-based Text-to-Speech (TTS) systems have achieved high-quality speech synthesis results. Recurrent neural networks have become a standard modeling technique for sequential data in TTS systems and are widely used. However, training a TTS model which includes RNN components requires powerful GPU performance and takes a long time. In contrast, CNN-based sequence synthesis techn…

    Submitted 17 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by the 27th IEEE International Conference on Computer Supported Cooperative Work in Design (IEEE CSCWD 2024). arXiv admin note: substantial text overlap with arXiv:2211.01948

  15. arXiv:2312.08862  [pdf, other]

    cs.IT eess.SP

    Semantics-Division Duplexing: A Novel Full-Duplex Paradigm

    Authors: Kai Niu, Zijian Liang, Chao Dong, Jincheng Dai, Zhongwei Si, Ping Zhang

    Abstract: In-band full-duplex (IBFD) is a theoretically effective solution to increase the overall throughput for the future wireless communications system by enabling transmission and reception over the same time-frequency resources. However, reliable source reconstruction remains a great challenge in the practical IBFD systems due to the non-ideal elimination of the self-interference and the inherent limi…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures, submitted to IEEE Wireless Communications Magazine

  16. arXiv:2312.01727  [pdf]

    eess.IV physics.bio-ph

    Deep learning acceleration of iterative model-based light fluence correction for photoacoustic tomography

    Authors: Zhaoyong Liang, Shuangyang Zhang, Zhichao Liang, Zhongxin Mo, Xiaoming Zhang, Yutian Zhong, Wufan Chen, Li Qi

    Abstract: Photoacoustic tomography (PAT) is a promising imaging technique that can visualize the distribution of chromophores within biological tissue. However, the accuracy of PAT imaging is compromised by light fluence (LF), which hinders the quantification of light absorbers. Currently, model-based iterative methods are used for LF correction, but they require significant computational resources due to r…

    Submitted 7 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

  17. Multi-Objective Transmission Expansion: An Offshore Wind Power Integration Case Study

    Authors: Saroj Khanal, Christoph Graf, Zhirui Liang, Yury Dvorkin, Burçin Ünel

    Abstract: Despite ambitious offshore wind targets in the U.S. and globally, offshore grid planning guidance remains notably scarce, contrasting with well-established frameworks for onshore grids. This gap, alongside the increasing penetration of offshore wind and other clean-energy resources in onshore grids, highlights the urgent need for a coordinated planning framework. Our paper describes a multi-object…

    Submitted 21 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  18. arXiv:2311.05188  [pdf, other]

    eess.AS

    Sound field reconstruction using neural processes with dynamic kernels

    Authors: Zining Liang, Wen Zhang, Thushara D. Abhayapala

    Abstract: Accurately representing the sound field with the high spatial resolution is critical for immersive and interactive sound field reproduction technology. To minimize experimental effort, data-driven methods have been proposed to estimate sound fields from a small number of discrete observations. In particular, kernel-based methods using Gaussian Processes (GPs) with a covariance function to model sp…

    Submitted 9 November, 2023; originally announced November 2023.

  19. arXiv:2309.07648  [pdf, other]

    eess.AS cs.CL cs.SD

    Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

    Authors: Peng Wang, Yifan Yang, Zheng Liang, Tian Tan, Shiliang Zhang, Xie Chen

    Abstract: Despite advancements of end-to-end (E2E) models in speech recognition, named entity recognition (NER) is still challenging but critical for semantic understanding. Previous studies mainly focus on various rule-based or attention-based contextual biasing algorithms. However, their performance might be sensitive to the biasing weight or degraded by excessive attention to the named entity list, along…

    Submitted 8 June, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted in INTERSPEECH 2024

  20. arXiv:2308.16150  [pdf, other]

    eess.IV cs.CV cs.LG

    Modality Cycles with Masked Conditional Diffusion for Unsupervised Anomaly Segmentation in MRI

    Authors: Ziyun Liang, Harry Anthony, Felix Wagner, Konstantinos Kamnitsas

    Abstract: Unsupervised anomaly segmentation aims to detect patterns that are distinct from any patterns processed during training, commonly called abnormal or out-of-distribution patterns, without providing any associated manual segmentations. Since anomalies during deployment can lead to model failure, detecting the anomaly can enhance the reliability of models, which is valuable in high-risk domains like…

    Submitted 2 November, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted in Multiscale Multimodal Medical Imaging workshop in MICCAI 2023

  21. arXiv:2308.11635  [pdf, other]

    eess.SP cs.HC cs.LG

    Semi-Supervised Dual-Stream Self-Attentive Adversarial Graph Contrastive Learning for Cross-Subject EEG-based Emotion Recognition

    Authors: Weishan Ye, Zhiguo Zhang, Min Zhang, Fei Teng, Li Zhang, Linling Li, Gan Huang, Jianhong Wang, Dong Ni, Zhen Liang

    Abstract: Electroencephalography (EEG) is an objective tool for emotion recognition with promising applications. However, the scarcity of labeled data remains a major challenge in this field, limiting the widespread use of EEG-based emotion recognition. In this paper, a semi-supervised Dual-stream Self-Attentive Adversarial Graph Contrastive learning framework (termed as DS-AGC) is proposed to tackle the ch…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2304.06496

  22. arXiv:2308.08968  [pdf, other]

    eess.SP cs.IT

    On the Performance of Multidimensional Constellation Shaping for Linear and Nonlinear Optical Fiber Channel

    Authors: Bin Chen, Zhiwei Liang, Shen Li, Yi Lei, Gabriele Liga, Alex Alvarado

    Abstract: Multidimensional constellation shaping of up to 32 dimensions with different spectral efficiencies are compared through AWGN and fiber-optic simulations. The results show that no constellation is universal and the balance of required and effective SNRs should be jointly considered for the specific optical transmission scenario.

    Submitted 18 October, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: The paper has been accepted by the ECOC 2023

  23. arXiv:2306.10494  [pdf, other]

    eess.SP cs.AI

    Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction: A Multi-Dataset Study

    Authors: Rushuang Zhou, Lei Lu, Zijun Liu, Ting Xiang, Zhen Liang, David A. Clifton, Yining Dong, Yuan-Ting Zhang

    Abstract: Electrocardiography (ECG) is a non-invasive tool for predicting cardiovascular diseases (CVDs). Current ECG-based diagnosis systems show promising performance owing to the rapid development of deep learning techniques. However, the label scarcity problem, the co-occurrence of multiple CVDs and the poor performance on unseen datasets greatly hinder the widespread application of deep learning-based…

    Submitted 18 June, 2023; originally announced June 2023.

  24. arXiv:2306.08588  [pdf, other]

    cs.CL cs.SD eess.AS

    Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

    Authors: Zheng Liang, Zheshu Song, Ziyang Ma, Chenpeng Du, Kai Yu, Xie Chen

    Abstract: Recently, end-to-end (E2E) automatic speech recognition (ASR) models have made great strides and exhibit excellent performance in general speech recognition. However, there remain several challenging scenarios that E2E models are not competent in, such as code-switching and named entity recognition (NER). Data augmentation is a common and effective practice for these two scenarios. However, the cu…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted by Interspeech 2023

  25. UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding

    Authors: Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu

    Abstract: The utilization of discrete speech tokens, divided into semantic tokens and acoustic tokens, has been proven superior to traditional acoustic feature mel-spectrograms in terms of naturalness and robustness for text-to-speech (TTS) synthesis. Recent popular models, such as VALL-E and SPEAR-TTS, allow zero-shot speaker adaptation through auto-regressive (AR) continuation of acoustic tokens extracted…

    Submitted 28 March, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted to AAAI 2024

  26. arXiv:2305.15193  [pdf, other]

    cs.LG eess.SY

    Adaptive Policy Learning to Additional Tasks

    Authors: Wenjian Hao, Zehui Lu, Zihao Liang, Tianyu Zhou, Shaoshuai Mou

    Abstract: This paper develops a policy learning method for tuning a pre-trained policy to adapt to additional tasks without altering the original task. A method named Adaptive Policy Gradient (APG) is proposed in this paper, which combines Bellman's principle of optimality with the policy gradient approach to improve the convergence rate. This paper provides theoretical analysis which guarantees the converg…

    Submitted 24 May, 2023; originally announced May 2023.

  27. arXiv:2305.15188  [pdf, other]

    cs.LG eess.SY

    Policy Learning based on Deep Koopman Representation

    Authors: Wenjian Hao, Paulo C. Heredia, Bowen Huang, Zehui Lu, Zihao Liang, Shaoshuai Mou

    Abstract: This paper proposes a policy learning algorithm based on the Koopman operator theory and policy gradient approach, which seeks to approximate an unknown dynamical system and search for optimal policy simultaneously, using the observations gathered through interaction with the environment. The proposed algorithm has two innovations: first, it introduces the so-called deep Koopman representation int…

    Submitted 24 May, 2023; originally announced May 2023.

  28. arXiv:2305.07926  [pdf, other]

    physics.flu-dyn eess.SY

    Characteristic time of transient response of solid oxide cells (SOCs) to changes in voltage/current: from theory to applications

    Authors: Zhaojian Liang, Jingyi Wang, Liang An, Yang Wang, Meng Ni, Mengying Li

    Abstract: The intermittency of solar and wind power can be addressed by integrating them with Solid Oxide Cells (SOCs). This study delves into the transient characteristics of SOCs and their dependence on dynamic heat and mass transfer processes. Non-dimensional analysis was used to identify influential parameters, followed by a 3-D numerical simulation-based parametric analysis to examine the dynamic gaseo…

    Submitted 30 May, 2024; v1 submitted 13 May, 2023; originally announced May 2023.

    Journal ref: Nat Commun 15, 4587 (2024)

  29. arXiv:2304.06496  [pdf, other]

    eess.SP cs.HC cs.LG

    EEGMatch: Learning with Incomplete Labels for Semi-Supervised EEG-based Cross-Subject Emotion Recognition

    Authors: Rushuang Zhou, Weishan Ye, Zhiguo Zhang, Yanyang Luo, Li Zhang, Linling Li, Gan Huang, Yining Dong, Yuan-Ting Zhang, Zhen Liang

    Abstract: Electroencephalography (EEG) is an objective tool for emotion recognition and shows promising performance. However, the label scarcity problem is a main challenge in this field, which limits the wide application of EEG-based emotion recognition. In this paper, we propose a novel semi-supervised learning framework (EEGMatch) to leverage both labeled and unlabeled EEG data. First, an EEG-Mixup based…

    Submitted 27 March, 2023; originally announced April 2023.

  30. arXiv:2304.00100  [pdf, other]

    eess.SY

    A Data-Driven Approach for Inverse Optimal Control

    Authors: Zihao Liang, Wenjian Hao, Shaoshuai Mou

    Abstract: This paper proposes a data-driven, iterative approach for inverse optimal control (IOC), which aims to learn the objective function of a nonlinear optimal control system given its states and inputs. The approach solves the IOC problem in a challenging situation when the system dynamics is unknown. The key idea of the proposed approach comes from the deep Koopman representation of the unknown syste…

    Submitted 31 March, 2023; originally announced April 2023.

  31. arXiv:2304.00062  [pdf, other]

    cs.LG eess.SY math.OC

    A Physics-Informed Machine Learning for Electricity Markets: A NYISO Case Study

    Authors: Robert Ferrando, Laurent Pagnier, Robert Mieth, Zhirui Liang, Yury Dvorkin, Daniel Bienstock, Michael Chertkov

    Abstract: This paper addresses the challenge of efficiently solving the optimal power flow problem in real-time electricity markets. The proposed solution, named Physics-Informed Market-Aware Active Set learning OPF (PIMA-AS-OPF), leverages physical constraints and market properties to ensure physical and economic feasibility of market-clearing outcomes. Specifically, PIMA-AS-OPF employs the active set lear…

    Submitted 31 March, 2023; originally announced April 2023.

  32. arXiv:2303.12360  [pdf]

    cs.CV eess.IV

    Automatically Predict Material Properties with Microscopic Image Example Polymer Compatibility

    Authors: Zhilong Liang, Zhenzhi Tan, Ruixin Hong, Wanli Ouyang, Jinying Yuan, Changshui Zhang

    Abstract: Many material properties are manifested in the morphological appearance and characterized with microscopic image, such as scanning electron microscopy (SEM). Polymer miscibility is a key physical quantity of polymer material and commonly and intuitively judged by SEM images. However, human observation and judgement for the images is time-consuming, labor-intensive and hard to be quantified. Comput…

    Submitted 3 August, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  33. arXiv:2302.06831  [pdf, other]

    eess.SP cs.IT

    Analytical Model of Nonlinear Fiber Propagation for General Dual-Polarization Four-Dimensional Modulation Format

    Authors: Zhiwei Liang, Bin Chen, Yi Lei, Gabriele Liga, Alex Alvarado

    Abstract: Coherent dual-polarization (DP) optical transmission systems encode information on the four available degrees of freedom of an optical field: the two polarization states, each with two quadrature components. Such systems naturally operate based on a four-dimensional (4D) signal space. Having a general analytical model to accurately estimate nonlinear interference (NLI) is key to analyze such trans…

    Submitted 9 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 12 pages, 8 figures

  34. arXiv:2302.05498  [pdf, other]

    math.OC eess.SY

    Data-Driven Inverse Optimization for Marginal Offer Price Recovery in Electricity Markets

    Authors: Zhirui Liang, Yury Dvorkin

    Abstract: This paper presents a data-driven inverse optimization (IO) approach to recover the marginal offer prices of generators in a wholesale energy market. By leveraging underlying market-clearing processes, we establish a closed-form relationship between the unknown parameters and the publicly available market-clearing results. Based on this relationship, we formulate the data-driven IO problem as a co…

    Submitted 16 May, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

  35. arXiv:2212.00661  [pdf, other]

    quant-ph eess.SY

    Hybrid Gate-Pulse Model for Variational Quantum Algorithms

    Authors: Zhiding Liang, Zhixin Song, Jinglei Cheng, Zichang He, Ji Liu, Hanrui Wang, Ruiyang Qin, Yiru Wang, Song Han, Xuehai Qian, Yiyu Shi

    Abstract: Current quantum programs are mostly synthesized and compiled on the gate-level, where quantum circuits are composed of quantum gates. The gate-level workflow, however, introduces significant redundancy when quantum gates are eventually transformed into control signals and applied on quantum devices. For superconducting quantum computers, the control signals are microwave pulses. Therefore, pulse-l…

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 8 pages, 6 figures

  36. arXiv:2211.09854  [pdf, other]

    eess.SY

    An Iterative Method to Learn a Linear Control Barrier Function

    Authors: Zihao Liang, Jason King Ching Lo

    Abstract: Control barrier function (CBF) has recently started to serve as a basis to develop approaches for enforcing safety requirements in control systems. However, constructing such function for a general system is a non-trivial task. This paper proposes an iterative, optimization-based framework to obtain a CBF from a given user-specified set for a general control affine system. Without losing generalit…

    Submitted 17 November, 2022; originally announced November 2022.

  37. arXiv:2211.09381  [pdf, other]

    cs.SD eess.AS

    Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire

    Authors: Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu

    Abstract: In multi-talker scenarios such as meetings and conversations, speech processing systems are usually required to segment the audio and then transcribe each segmentation. These two stages are addressed separately by speaker change detection (SCD) and automatic speech recognition (ASR). Most previous SCD systems rely solely on speaker information and ignore the importance of speech content. In this p…

    Submitted 17 November, 2022; originally announced November 2022.

  38. arXiv:2210.05713  [pdf, other]

    q-bio.NC cs.NE eess.SP

    Explainable fMRI-based Brain Decoding via Spatial Temporal-pyramid Graph Convolutional Network

    Authors: Ziyuan Ye, Youzhi Qu, Zhichao Liang, Mo Wang, Quanying Liu

    Abstract: Brain decoding, aiming to identify the brain states using neural activity, is important for cognitive neuroscience and neural engineering. However, existing machine learning methods for fMRI-based brain decoding either suffer from low classification performance or poor explainability. Here, we address this issue by proposing a biologically inspired architecture, Spatial Temporal-pyramid Graph Conv…

    Submitted 8 October, 2022; originally announced October 2022.

  39. arXiv:2209.02604  [pdf, other]

    cs.MM cs.AI cs.CV cs.SD eess.AS

    Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

    Authors: Yihe Liu, Ziqi Yuan, Huisheng Mao, Zhiyun Liang, Wanqiuyue Yang, Yuanzhe Qiu, Tie Cheng, Xiaoteng Li, Hua Xu, Kai Gao

    Abstract: Multimodal sentiment analysis (MSA), which supposes to improve text-based sentiment analysis with associated acoustic and visual modalities, is an emerging research area due to its potential applications in Human-Computer Interaction (HCI). However, the existing researches observe that the acoustic and visual modalities contribute much less than the textual modality, termed as text-predominant. Un…

    Submitted 21 August, 2022; originally announced September 2022.

    Comments: 16 pages, 7 figures, accepted by ICMI 2022

  40. arXiv:2209.00707  [pdf, other]

    eess.SY

    Weather-Driven Flexibility Reserve Procurement: A NYISO Offshore Wind Power Case Study

    Authors: Zhirui Liang, Robert Mieth, Yury Dvorkin, Miguel A. Ortega-Vazquez

    Abstract: The growing penetration of variable renewable energy sources (VRES) requires additional flexibility reserve to ensure reliable power system operations. Current industry practice typically assumes a certain fraction of the VRES power production forecast as flexibility reserve, thus ignoring other relevant information, such as weather conditions. To address this, probability- and risk-based reserve…

    Submitted 10 December, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

  41. arXiv:2207.10282  [pdf]

    cs.NI cs.AI eess.SY

    An Evolutionary Game based Secure Clustering Protocol with Fuzzy Trust Evaluation and Outlier Detection for Wireless Sensor Networks

    Authors: Liu Yang, Yinzhi Lu, Simon X. Yang, Yuanchang Zhong, Tan Guo, Zhifang Liang

    Abstract: Trustworthy and reliable data delivery is a challenging task in Wireless Sensor Networks (WSNs) due to unique characteristics and constraints. To acquire secured data delivery and address the conflict between security and energy, in this paper we present an evolutionary game based secure clustering protocol with fuzzy trust evaluation and outlier detection for WSNs. Firstly, a fuzzy trust evaluati…

    Submitted 20 July, 2022; originally announced July 2022.

  42. arXiv:2207.09936  [pdf]

    cs.NI cs.AI eess.SY

    A Secure Clustering Protocol with Fuzzy Trust Evaluation and Outlier Detection for Industrial Wireless Sensor Networks

    Authors: Liu Yang, Yinzhi Lu, Simon X. Yang, Tan Guo, Zhifang Liang

    Abstract: Security is one of the major concerns in Industrial Wireless Sensor Networks (IWSNs). To assure the security in clustered IWSNs, this paper presents a secure clustering protocol with fuzzy trust evaluation and outlier detection (SCFTO). Firstly, to deal with the transmission uncertainty in an open wireless medium, an interval type-2 fuzzy logic controller is adopted to estimate the trusts. And the…

    Submitted 20 July, 2022; originally announced July 2022.

  43. Geometrically-Shaped Multi-Dimensional Modulation Formats in Coherent Optical Transmission Systems

    Authors: Bin Chen, Yi Lei, Gabriele Liga, Zhiwei Liang, Wei Ling, Xuwei Xue, Alex Alvarado

    Abstract: Shaping modulation formats in multi-dimensional (MD) space is an effective approach to harvest spectral efficiency gains in both the additive white Gaussian noise (AWGN) channel and the optical fiber channel. In the first part of this paper, existing MD geometrically-shaped modulations for fiber optical communications are reviewed. It is shown that large gains can be obtained by exploiting correla…

    Submitted 31 August, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: 14 pages, 10 figures, accepted by JLT

  44. arXiv:2206.06214  [pdf, other

    cs.CV eess.IV

    Real-World Light Field Image Super-Resolution via Degradation Modulation

    Authors: Yingqian Wang, Zhengyu Liang, Longguang Wang, Jungang Yang, Wei An, Yulan Guo

    Abstract: Recent years have witnessed the great advances of deep neural networks (DNNs) in light field (LF) image super-resolution (SR). However, existing DNN-based LF image SR methods are developed on a single fixed degradation (e.g., bicubic downsampling), and thus cannot be applied to super-resolve real LF images with diverse degradation. In this paper, we propose a simple yet effective method for real-w…

    Submitted 30 November, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 15 pages, 10 figures

  45. arXiv:2206.00866  [pdf, other

    eess.SP cs.IT

    Analytical SNR Prediction in Long-Haul Optical Transmission using General Dual-Polarization 4D Formats

    Authors: Zhiwei Liang, Bin Chen, Yi Lei, Gabriele Liga, Alex Alvarado

    Abstract: Nonlinear interference models for dual-polarization 4D (DP-4D) modulation have only been used so far to predict signal-signal nonlinear interference. We show that including the signal-noise term in the prediction of the effective signal-to-noise ratio in long distance DP-4D transmission improves the accuracy by up to 0.2 dB.

    Submitted 15 July, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: 4 pages

  46. arXiv:2205.14029  [pdf

    eess.IV cs.CV

    Lesion classification by model-based feature extraction: A differential affine invariant model of soft tissue elasticity

    Authors: Weiguo Cao, Marc J. Pomeroy, Zhengrong Liang, Yongfeng Gao, Yongyi Shi, Jiaxing Tan, Fangfang Han, Jing Wang, Jianhua Ma, Hongbin Lu, Almas F. Abbasi, Perry J. Pickhardt

    Abstract: The elasticity of soft tissues has been widely considered as a characteristic property to differentiate between healthy and vicious tissues and, therefore, motivated several elasticity imaging modalities, such as Ultrasound Elastography, Magnetic Resonance Elastography, and Optical Coherence Elastography. This paper proposes an alternative approach of modeling the elasticity using Computed Tomogra…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 12 pages, 4 figures, 3 tables

  47. arXiv:2201.12806  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection

    Authors: Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu

    Abstract: Nowadays, most methods in end-to-end contextual speech recognition bias the recognition process towards contextual knowledge. Since all-neural contextual biasing methods rely on phrase-level contextual modeling and attention-based relevance modeling, they may encounter confusion between similar context-specific phrases, which hurts predictions at the token level. In this work, we focus on mitigati…

    Submitted 2 March, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: Accepted by ICASSP 2022

  48. arXiv:2112.14948  [pdf, other

    eess.SY

    Data-Driven State Estimation for Light-Emitting Diode Underwater Optical Communication

    Authors: Yingquan Li, Zhenwen Liang, Ibrahima N'Doye, Xiangliang Zhang, Mohamed-Slim Alouini, Taous-Meriem Laleg-Kirati

    Abstract: Light-Emitting Diodes (LEDs) based underwater optical wireless communications (UOWCs), a technology with low latency and high data rates, have attracted significant importance for underwater robots. However, maintaining a controlled line of sight link between transmitter and receiver is challenging due to the constant movement of the underlying optical platform caused by the dynamic uncertainties…

    Submitted 30 December, 2021; originally announced December 2021.

    Comments: 12 pages, 11 figures

  49. arXiv:2111.11112  [pdf, other

    cs.IT eess.SP

    Data Sensing and Offloading in Edge Computing Networks: TDMA or NOMA?

    Authors: Zezu Liang, Hanbiao Chen, Yuan Liu, Fangjiong Chen

    Abstract: With the development of Internet-of-Things (IoT), we witness the explosive growth in the number of devices with sensing, computing, and communication capabilities, along with a large amount of raw data generated at the network edge. Mobile (multi-access) edge computing (MEC), acquiring and processing data at network edge (like base station (BS)) via wireless links, has emerged as a promising techn…

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: To appear in IEEE Transactions on Wireless Communications

  50. arXiv:2110.02152  [pdf, other

    eess.SY

    Operation-Adversarial Scenario Generation

    Authors: Zhirui Liang, Robert Mieth, Yury Dvorkin

    Abstract: This paper proposes a modified conditional generative adversarial network (cGAN) model to generate net load scenarios for power systems that are statistically credible, conditioned by given labels (e.g., seasons), and, at the same time, "stressful" to the system operations and dispatch decisions. The measure of stress used in this paper is based on the operating cost increases due to net load chan…

    Submitted 11 April, 2022; v1 submitted 5 October, 2021; originally announced October 2021.