Skip to main content

Showing 1–50 of 154 results for author: Ma, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.16571  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Differentiable Distributionally Robust Optimization Layers

    Authors: Xutao Ma, Chao Ning, Wenli Du

    Abstract: In recent years, there has been a growing research interest in decision-focused learning, which embeds optimization problems as a layer in learning pipelines and demonstrates a superior performance than the prediction-focused approach. However, for distributionally robust optimization (DRO), a popular paradigm for decision-making under uncertainty, it is still unknown how to embed it as a layer, i… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: In Forty-first International Conference on Machine Learning (2024)

  2. arXiv:2406.09869  [pdf, ps, other

    cs.SD eess.AS

    MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model

    Authors: Jiatong Shi, Xutai Ma, Hirofumi Inaguma, Anna Sun, Shinji Watanabe

    Abstract: Speech discrete representation has proven effective in various downstream applications due to its superior compression rate of the waveform, fast convergence during training, and compatibility with other modalities. Discrete units extracted from self-supervised learning (SSL) models have emerged as a prominent approach for obtaining speech discrete representation. However, while discrete units hav… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech2024

  3. arXiv:2406.08714  [pdf, other

    eess.SP

    Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator

    Authors: Mandovi Mukherjee, Xiangyu Mao, Nael Rahman, Coleman DeLude, Joe Driscoll, Sudarshan Sharma, Payman Behnam, Uday Kamal, Jongseok Woo, Daehyun Kim, Sharjeel Khan, Jianming Tong, Jamin Seo, Prachi Sinha, Madhavan Swaminathan, Tushar Krishna, Santosh Pande, Justin Romberg, Saibal Mukhopadhyay

    Abstract: A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2406.08710  [pdf, other

    eess.SP

    Real-time Digital RF Emulation -- I: The Direct Path Computational Model

    Authors: Coleman DeLude, Joe Driscoll, Mandovi Mukherjee, Nael Rahman, Uday Kamal, Xiangyu Mao, Sharjeel Khan, Hariharan Sivaraman, Eric Huang, Jeffrey McHarg, Madhavan Swaminathan, Santosh Pande, Saibal Mukhopadhyay, Justin Romberg

    Abstract: In this paper we consider the problem of develo** a computational model for emulating an RF channel. The motivation for this is that an accurate and scalable emulator has the potential to minimize the need for field testing, which is expensive, slow, and difficult to replicate. Traditionally, emulators are built using a tapped delay line model where long filters modeling the physical interaction… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2406.02247  [pdf, other

    physics.ins-det eess.SY

    A Study of the Latest Updates of the Readout System for the Hybird-Pixel Detector at HEPS

    Authors: Hangxu Li, Jie Zhang, Wei Wei, Zhenjie Li, Xiaolu Ji, Yan Zhang, Xuanzheng Yang, Shuihan Zhang, Xueke Ma, Peng Liu, Zheng Wang, Yuanbai Chen

    Abstract: The High Energy Photon Source (HEPS) represents a fourth-generation light source. This facility has made unprecedented advancements in accelerator technology, necessitating the development of new detectors to satisfy physical requirements such as single-photon resolution, large dynamic range, and high frame rates. Since 2016, the Institute of High Energy Physics has introduced the first user-exper… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2405.16516  [pdf, other

    eess.IV cs.CV

    Memory-efficient High-resolution OCT Volume Synthesis with Cascaded Amortized Latent Diffusion Models

    Authors: Kun Huang, Xiao Ma, Yuhan Zhang, Na Su, Songtao Yuan, Yong Liu, Qiang Chen, Huazhu Fu

    Abstract: Optical coherence tomography (OCT) image analysis plays an important role in the field of ophthalmology. Current successful analysis models rely on available large datasets, which can be challenging to be obtained for certain tasks. The use of deep generative models to create realistic data emerges as a promising approach. However, due to limitations in hardware resources, it is still difficulty t… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Provisionally accepted for medical image computing and computer-assisted intervention (MICCAI) 2024

  7. arXiv:2405.11163  [pdf, other

    cs.HC eess.SP

    Domain Generalization for Zero-calibration BCIs with Knowledge Distillation-based Phase Invariant Feature Extraction

    Authors: Zilin Liang, Zheng Zheng, Weihai Chen, Xinzhi Ma, Zhongcai Pei, Xiantao Sun

    Abstract: The distribution shift of electroencephalography (EEG) data causes poor generalization of braincomputer interfaces (BCIs) in unseen domains. Some methods try to tackle this challenge by collecting a portion of user data for calibration. However, it is time-consuming, mentally fatiguing, and user-unfriendly. To achieve zerocalibration BCIs, most studies employ domain generalization (DG) techniques… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  8. arXiv:2405.09552  [pdf, other

    eess.IV cs.AI cs.CV

    ODFormer: Semantic Fundus Image Segmentation Using Transformer for Optic Nerve Head Detection

    Authors: Jiayi Wang, Yi-An Mao, Xiaoyu Ma, Sicen Guo, Yuting Shao, Xiao Lv, Wenting Han, Mark Christopher, Linda M. Zangwill, Yanlong Bi, Rui Fan

    Abstract: Optic nerve head (ONH) detection has been a crucial area of study in ophthalmology for years. However, the significant discrepancy between fundus image datasets, each generated using a single type of fundus camera, poses challenges to the generalizability of ONH detection approaches developed based on semantic segmentation networks. Despite the numerous recent advancements in general-purpose seman… ▽ More

    Submitted 2 June, 2024; v1 submitted 15 April, 2024; originally announced May 2024.

  9. I$^3$Net: Inter-Intra-slice Interpolation Network for Medical Slice Synthesis

    Authors: Haofei Song, Xintian Mao, **g Yu, Qingli Li, Yan Wang

    Abstract: Medical imaging is limited by acquisition time and scanning equipment. CT and MR volumes, reconstructed with thicker slices, are anisotropic with high in-plane resolution and low through-plane resolution. We reveal an intriguing phenomenon that due to the mentioned nature of data, performing slice-wise interpolation from the axial view can yield greater benefits than performing super-resolution fr… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  10. Non-overshooting sliding mode for UAV control

    Authors: Xinhua Wang, Xuerui Mao

    Abstract: For a class of uncertain systems, a non-overshooting sliding mode control is presented to make them globally exponentially stable and without overshoot. Even when the unknown stochastic disturbance exists, and the time-variant reference trajectory is required, the strict non-overshooting stabilization is still achieved. The control law design is based on a desired second-order sliding mode (2-slid… ▽ More

    Submitted 12 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 57 pages, 29 figures

    Journal ref: The Aeronautical Journal, 2024

  11. arXiv:2404.19242  [pdf, other

    cs.CV eess.IV stat.ME

    A Minimal Set of Parameters Based Depth-Dependent Distortion Model and Its Calibration Method for Stereo Vision Systems

    Authors: Xin Ma, Puchen Zhu, Xiao Li, Xiaoyin Zheng, Jianshu Zhou, Xuchen Wang, Kwok Wai Samuel Au

    Abstract: Depth position highly affects lens distortion, especially in close-range photography, which limits the measurement accuracy of existing stereo vision systems. Moreover, traditional depth-dependent distortion models and their calibration methods have remained complicated. In this work, we propose a minimal set of parameters based depth-dependent distortion model (MDM), which considers the radial an… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for publication in IEEE Transactions on Instrumentation and Measurement

  12. arXiv:2403.16830  [pdf, other

    cs.NI eess.SP

    Exploring Communication Technologies, Standards, and Challenges in Electrified Vehicle Charging

    Authors: Xiang Ma, Yuan Zhou, Hanwen Zhang, Qun Wang, Haijian Sun, Hongjie Wang, Rose Qingyang Hu

    Abstract: As public awareness of environmental protection continues to grow, the trend of integrating more electric vehicles (EVs) into the transportation sector is rising. Unlike conventional internal combustion engine (ICE) vehicles, EVs can minimize carbon emissions and potentially achieve autonomous driving. However, several obstacles hinder the widespread adoption of EVs, such as their constrained driv… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: submitted to IET Communication as a survey paper

  13. arXiv:2403.11078  [pdf, other

    eess.IV cs.CV

    Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution

    Authors: Jialu Sui, ** Ma, Xiaokang Zhang, Man-On Pun

    Abstract: Remote sensing image super-resolution (SR) is a crucial task to restore high-resolution (HR) images from low-resolution (LR) observations. Recently, the Denoising Diffusion Probabilistic Model (DDPM) has shown promising performance in image reconstructions by overcoming problems inherent in generative models, such as over-smoothing and mode collapse. However, the high-frequency details generated b… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  14. arXiv:2403.06798  [pdf, other

    eess.IV cs.CV cs.LG

    Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification

    Authors: Shuai Li, Xiaoguang Ma, Shancheng Jiang, Lu Meng

    Abstract: Remarkable successes were made in Medical Image Classification (MIC) recently, mainly due to wide applications of convolutional neural networks (CNNs). However, adversarial examples (AEs) exhibited imperceptible similarity with raw data, raising serious concerns on network robustness. Although adversarial training (AT), in responding to malevolent AEs, was recognized as an effective approach to im… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, 2 tables

  15. arXiv:2402.15982  [pdf

    eess.SP

    Spherical Geometry Algorithm for Space-borne Synthetic Aperture Radar Imaging

    Authors: Xinhua Mao

    Abstract: Higher spatial resolution and larger imaging scene are always the goals pursued by advanced space-borne SAR system.High resolution and wide swath SAR imaging can provide more information about the illuminated scene of interest on one hand,but also come with some new challenges on the other hand.The induced new challenging problems include curved orbit,Earth rotation,and spherical ground surface,et… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 13 pages, 15 figures

  16. arXiv:2402.08445  [pdf, other

    eess.SP

    $1$-Bit SubTHz RIS with Planar Tightly Coupled Dipoles: Beam Sha** and Prototypes

    Authors: Xianjun Ma, Yonggang Zhou, Qi Luo, Yihan Ma, Kyriakos Stylianopoulos, George C. Alexandropoulos

    Abstract: In this paper, a proof-of-concept study of a $1$-bit wideband reconfigurable intelligent surface (RIS) comprising planar tightly coupled dipoles (PTCD) is presented. The developed RIS operates at subTHz frequencies and a $3$-dB gain bandwidth of $27.4\%$ with the center frequency at $102$ GHz is shown to be obtainable via full-wave electromagnetic simulations. The binary phase shift offered by eac… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 5 pages, 11 figures, 18th European Conference on Antennas and Propagation (EuCAP) - to be presented

  17. arXiv:2401.01693  [pdf, other

    cs.CV eess.IV

    AID-DTI: Accelerating High-fidelity Diffusion Tensor Imaging with Detail-Preserving Model-based Deep Learning

    Authors: Wenxin Fan, Jian Cheng, Cheng Li, Xinrui Ma, **g Yang, Juan Zou, Ruoyou Wu, Qiegen Liu, Shanshan Wang

    Abstract: Deep learning has shown great potential in accelerating diffusion tensor imaging (DTI). Nevertheless, existing methods tend to suffer from Rician noise and detail loss in reconstructing the DTI-derived parametric maps especially when sparsely sampled q-space data are used. This paper proposes a novel method, AID-DTI (Accelerating hIgh fiDelity Diffusion Tensor Imaging), to facilitate fast and accu… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  18. arXiv:2312.15389  [pdf, other

    eess.IV cs.CV

    TJDR: A High-Quality Diabetic Retinopathy Pixel-Level Annotation Dataset

    Authors: **gxin Mao, Xiaoyu Ma, Yanlong Bi, Rongqing Zhang

    Abstract: Diabetic retinopathy (DR), as a debilitating ocular complication, necessitates prompt intervention and treatment. Despite the effectiveness of artificial intelligence in aiding DR grading, the progression of research toward enhancing the interpretability of DR grading through precise lesion segmentation faces a severe hindrance due to the scarcity of pixel-level annotated DR datasets. To mitigate… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  19. arXiv:2312.13319  [pdf, other

    eess.IV cs.CV

    In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging

    Authors: Xin Wang, Lizhi Wang, Xiangtian Ma, Maoqing Zhang, Lin Zhu, Hua Huang

    Abstract: Dual-Camera Compressed Hyperspectral Imaging (DCCHI) offers the capability to reconstruct 3D Hyperspectral Image (HSI) by fusing compressive and Panchromatic (PAN) image, which has shown great potential for snapshot hyperspectral imaging in practice. In this paper, we introduce a novel DCCHI reconstruction network, the Intra-Inter Similarity Exploiting Transformer (In2SET). Our key insight is to m… ▽ More

    Submitted 8 June, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  20. arXiv:2312.12565  [pdf, other

    cs.IT eess.SP

    Precise Coil Alignment for Dynamic Wireless Charging of Electric Vehicles with RFID Sensing

    Authors: Haijian Sun, Xiang Ma, Rose Qingyang Hu, Randy Christensen

    Abstract: Electric vehicle (EV) has emerged as a transformative force for the sustainable and environmentally friendly future. To alleviate range anxiety caused by battery and charging facility, dynamic wireless power transfer (DWPT) is increasingly recognized as a key enabler for widespread EV adoption, yet it faces significant technical challenges, primarily in precise coil alignment. This article begins… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: submitted to IEEE magazine for potential publication. 5 figure

  21. arXiv:2312.07818  [pdf

    eess.SY eess.SP

    Brain Computer Interface Technology for Future Battlefield

    Authors: Guodong Xiong, Xinyan Ma, Wei Li, Jiaqi Cao, Jian Zhong, Yicong Su

    Abstract: With the development of artificial intelligence and unmanned equipment, human-machine hybrid formations will be the main focus in future combat formations. With the development of big data and various situational awareness technologies, while enhancing the breadth and depth of information, decision-making has also become more complex. The operation mode of existing unmanned equipment often require… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 4 pages, 1 figure

  22. arXiv:2312.05187  [pdf, other

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek , et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  23. arXiv:2311.06916  [pdf

    eess.SY cs.AI

    TSViT: A Time Series Vision Transformer for Fault Diagnosis

    Authors: Shouhua Zhang, Jiehan Zhou, Xue Ma, Chenglin Wen, Susanna Pirttikangas, Chen Yu, Weishan Zhang, Chunsheng Yang

    Abstract: Traditional fault diagnosis methods using Convolutional Neural Networks (CNNs) face limitations in capturing temporal features (i.e., the variation of vibration signals over time). To address this issue, this paper introduces a novel model, the Time Series Vision Transformer (TSViT), specifically designed for fault diagnosis. On one hand, TSViT model integrates a convolutional layer to segment vib… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  24. arXiv:2311.01812  [pdf, ps, other

    eess.SP

    Carrier Frequency Offset Estimation for OCDM with Null Subchirps

    Authors: Sidong Guo, Yiyin Wang, Xiaoli Ma

    Abstract: In this paper, we investigate the carrier frequency offset (CFO) identifiability problem in orthogonal chirp division multiplexing (OCDM) systems. We propose a transmission scheme by inserting consecutive null subchirps. A CFO estimator is accordingly developed to achieve a full acquisition range. We further demonstrate that the proposed transmission scheme not only help to resolve CFO identifia… ▽ More

    Submitted 8 December, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 2 fig

  25. arXiv:2310.19756  [pdf

    eess.SP

    Transmission line condition prediction based on semi-supervised learning

    Authors: Sizhe Li, Xun Ma, Nan Liu, Yi **

    Abstract: Transmission line state assessment and prediction are of great significance for the rational formulation of operation and maintenance strategy and improvement of operation and maintenance level. Aiming at the problem that existing models cannot take into account the robustness and data demand, this paper proposes a state prediction method based on semi-supervised learning. Firstly, for the expande… ▽ More

    Submitted 6 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  26. arXiv:2310.16732  [pdf, other

    cs.CV eess.IV

    A No-Reference Quality Assessment Method for Digital Human Head

    Authors: Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiongkuo Min, Xianghe Ma, Guangtao Zhai

    Abstract: In recent years, digital humans have been widely applied in augmented/virtual reality (A/VR), where viewers are allowed to freely observe and interact with the volumetric content. However, the digital humans may be degraded with various distortions during the procedure of generation and transmission. Moreover, little effort has been put into the perceptual quality assessment of digital humans. The… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  27. arXiv:2310.14355  [pdf

    cs.LG eess.IV

    A global product of fine-scale urban building height based on spaceborne lidar

    Authors: Xiao Ma, Guang Zheng, Chi Xu, L. Monika Moskal, Peng Gong, Qinghua Guo, Huabing Huang, Xuecao Li, Yong Pang, Cheng Wang, Huan Xie, Bailang Yu, Bo Zhao, Yuyu Zhou

    Abstract: Characterizing urban environments with broad coverages and high precision is more important than ever for achieving the UN's Sustainable Development Goals (SDGs) as half of the world's populations are living in cities. Urban building height as a fundamental 3D urban structural feature has far-reaching applications. However, so far, producing readily available datasets of recent urban building heig… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  28. arXiv:2310.13177  [pdf

    eess.SY

    Enhancing Building Energy Efficiency through Advanced Sizing and Dispatch Methods for Energy Storage

    Authors: Min Gyung Yu, Xu Ma, Bowen Huang, Karthik Devaprasad, Fredericka Brown, Di Wu

    Abstract: Energy storage and electrification of buildings hold great potential for future decarbonized energy systems. However, there are several technical and economic barriers that prevent large-scale adoption and integration of energy storage in buildings. These barriers include integration with building control systems, high capital costs, and the necessity to identify and quantify value streams for dif… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  29. arXiv:2310.10513  [pdf, other

    cs.CV eess.IV

    Unifying Image Processing as Visual Prompting Question Answering

    Authors: Yihao Liu, Xiangyu Chen, Xianzheng Ma, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong

    Abstract: Image processing is a fundamental task in computer vision, which aims at enhancing image quality and extracting essential features for subsequent vision applications. Traditionally, task-specific models are developed for individual tasks and designing such models requires distinct expertise. Building upon the success of large language models (LLMs) in natural language processing (NLP), there is a… ▽ More

    Submitted 20 February, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 16 pages, 12 figures

  30. arXiv:2310.02720  [pdf, other

    cs.SD eess.AS

    Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction

    Authors: Jiatong Shi, Hirofumi Inaguma, Xutai Ma, Ilia Kulikov, Anna Sun

    Abstract: Existing Self-Supervised Learning (SSL) models for speech typically process speech signals at a fixed resolution of 20 milliseconds. This approach overlooks the varying informational content present at different resolutions in speech signals. In contrast, this paper aims to incorporate multi-resolution information into speech self-supervised representation learning. We introduce a SSL model that l… ▽ More

    Submitted 30 January, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR2024 as spotlight

  31. arXiv:2310.00255  [pdf, other

    eess.SP

    Identifying Distribution Network Faults Using Adaptive Transition Probability

    Authors: Xinliang Ma, Weihua Liu, Bingying **

    Abstract: A novel approach is suggested for improving the accuracy of fault detection in distribution networks. This technique combines adaptive probability learning and waveform decomposition to optimize the similarity of features. Its objective is to discover the most appropriate linear map** between simulated and real data to minimize distribution differences. By aligning the data in the same feature s… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  32. arXiv:2309.04990  [pdf, ps, other

    eess.SP

    On the Impact of Mutual Coupling on RIS-Assisted Channel Estimation

    Authors: Pinjun Zheng, Xiuxiu Ma, Tareq Y. Al-Naffouri

    Abstract: Amid the demand for densely integrated elements in techniques such as holographic reconfigurable intelligent surfaces (RISs), the mutual coupling effect has gained prominence. By performing a misspecified Cramer-Rao bound analysis within an electromagnetics-compliant communication model, this letter offers a quantitative evaluation of the impact of mutual coupling on RIS-assisted channel estimatio… ▽ More

    Submitted 20 February, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

  33. arXiv:2308.15493  [pdf, other

    eess.SY

    Unidentifiability of System Dynamics: Conditions and Controller Design

    Authors: Xiangyu Mao, Jian** He

    Abstract: How to make a dynamic system unidentifiable is an important but still open issue. It not only requires that the parameters of the systems but also the equivalent systems cannot be identified by any identification approaches. Thus, it is a much more challenging problem than the existing analysis of parameter identifiability. In this paper, we investigate the problem of dynamic unidentifiability and… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  34. arXiv:2308.12299  [pdf

    eess.IV cs.AR cs.CV cs.LG

    Inverse Lithography Physics-informed Deep Neural Level Set for Mask Optimization

    Authors: Xing-Yu Ma, Shaogang Hao

    Abstract: As the feature size of integrated circuits continues to decrease, optical proximity correction (OPC) has emerged as a crucial resolution enhancement technology for ensuring high printability in the lithography process. Recently, level set-based inverse lithography technology (ILT) has drawn considerable attention as a promising OPC solution, showcasing its powerful pattern fidelity, especially in… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Journal ref: 2023, Applied Optics

  35. arXiv:2308.03727  [pdf, ps, other

    eess.SY

    Adaptive robust tracking control with active learning for linear systems with ellipsoidal bounded uncertainties

    Authors: Xuehui Ma, Shiliang Zhang, Yushuai Li, Fucai Qian, Tingwen Huang

    Abstract: This paper is concerned with the robust tracking control of linear uncertain systems, whose unknown system parameters and disturbances are bounded within ellipsoidal sets. We propose an adaptive robust control that can actively learn the ellipsoid sets. Particularly, the proposed approach utilizes the recursive set-membership state estimation in learning the ellipsoidal sets, aiming at mitigating… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  36. arXiv:2307.12498  [pdf, other

    cs.SD cs.CL eess.AS

    Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training

    Authors: Gege Qi, Yuefeng Chen, Xiaofeng Mao, Xiaojun Jia, Ranjie Duan, Rong Zhang, Hui Xue

    Abstract: Develo** a practically-robust automatic speech recognition (ASR) is challenging since the model should not only maintain the original performance on clean samples, but also achieve consistent efficacy under small volume perturbations and large domain shifts. To address this problem, we propose a novel WavAugment Guided Phoneme Adversarial Training (wapat). wapat use adversarial examples in phone… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  37. arXiv:2307.04327  [pdf

    cs.RO eess.SY

    Legal Decision-making for Highway Automated Driving

    Authors: Xiaohan Ma, Wenhao Yu, Chengxiang Zhao, Changjun Wang, Wenhui Zhou, Guangming Zhao, Mingyue Ma, Weida Wang, Lin Yang, Rui Mu, Hong Wang, Jun Li

    Abstract: Compliance with traffic laws is a fundamental requirement for human drivers on the road, and autonomous vehicles must adhere to traffic laws as well. However, current autonomous vehicles prioritize safety and collision avoidance primarily in their decision-making and planning, which will lead to misunderstandings and distrust from human drivers and may even result in accidents in mixed traffic flo… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: 14 pages, 17 figures

  38. arXiv:2307.02146  [pdf, other

    cs.CL cs.SD eess.AS

    LOAF-M2L: Joint Learning of Wording and Formatting for Singable Melody-to-Lyric Generation

    Authors: Longshen Ou, Xichu Ma, Ye Wang

    Abstract: Despite previous efforts in melody-to-lyric generation research, there is still a significant compatibility gap between generated lyrics and melodies, negatively impacting the singability of the outputs. This paper bridges the singability gap with a novel approach to generating singable lyrics by jointly Learning wOrding And Formatting during Melody-to-Lyric training (LOAF-M2L). After general-doma… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: An extension of our previous work arXiv:2305.16816 [cs.CL]

  39. arXiv:2307.01445  [pdf, ps, other

    eess.SP

    Distributed fusion filter over lossy wireless sensor networks with the presence of non-Gaussian noise

    Authors: Jiacheng He, Bei Peng, Zhenyu Feng, Xuemei Mao, Song Gao, Gang Wang

    Abstract: The information transmission between nodes in a wireless sensor networks (WSNs) often causes packet loss due to denial-of-service (DoS) attack, energy limitations, and environmental factors, and the information that is successfully transmitted can also be contaminated by non-Gaussian noise. The presence of these two factors poses a challenge for distributed state estimation (DSE) over WSNs. In thi… ▽ More

    Submitted 6 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

  40. Precheck Sequence Based False Base Station Detection During Handover: A Physical Layer Security Scheme

    Authors: Xiangyu Li, Kaiwen Zheng, Sidong Guo, Xiaoli Ma

    Abstract: False Base Station (FBS) attack has been a severe security problem for the cellular network since 2G era. During handover, the user equipment (UE) periodically receives state information from surrounding base stations (BSs) and uploads it to the source BS. The source BS compares the uploaded signal power and shifts UE to another BS that can provide the strongest signal. An FBS can transmit signal… ▽ More

    Submitted 3 November, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

  41. arXiv:2306.11476  [pdf, other

    eess.SP

    A Model Fusion Distributed Kalman Filter For Non-Gaussian Observation Noise

    Authors: Xuemei Mao, Gang Wang, Bei Peng, Jiacheng He, Kun Zhang, Song Gao

    Abstract: The distributed Kalman filter (DKF) has attracted extensive research as an information fusion method for wireless sensor systems(WSNs). And the DKF in non-Gaussian environments is still a pressing problem. In this paper, we approximate the non-Gaussian noise as a Gaussian mixture model and estimate the parameters through the expectation-maximization algorithm. A DKF, called model fusion DKF (MFDKF… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  42. arXiv:2305.03101  [pdf, other

    cs.CL cs.SD eess.AS

    Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks

    Authors: Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden D. Tomasello, Juan Pino

    Abstract: Transducer and Attention based Encoder-Decoder (AED) are two widely used frameworks for speech-to-text tasks. They are designed for different purposes and each has its own benefits and drawbacks for speech-to-text tasks. In order to leverage strengths of both modeling methods, we propose a solution by combining Transducer and Attention based Encoder-Decoder (TAED) for speech-to-text tasks. The new… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: ACL 2023 main conference

  43. arXiv:2304.06662  [pdf, other

    eess.IV cs.CV

    Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions

    Authors: Luyang Luo, Xi Wang, Yi Lin, Xiaoqi Ma, Andong Tan, Ronald Chan, Varut Vardhanabhuti, Winnie CW Chu, Kwang-Ting Cheng, Hao Chen

    Abstract: Breast cancer has reached the highest incidence rate worldwide among all malignancies since 2020. Breast imaging plays a significant role in early diagnosis and intervention to improve the outcome of breast cancer patients. In the past decade, deep learning has shown remarkable progress in breast cancer imaging analysis, holding great promise in interpreting the rich information and complex contex… ▽ More

    Submitted 20 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: IEEE RBME 2024

  44. arXiv:2304.03359  [pdf, other

    cs.DC eess.SY

    Approximate Wireless Communication for Federated Learning

    Authors: Xiang Ma, Haijian Sun, Rose Qingyang Hu, Yi Qian

    Abstract: This paper presents an approximate wireless communication scheme for federated learning (FL) model aggregation in the uplink transmission. We consider a realistic channel that reveals bit errors during FL model exchange in wireless networks. Our study demonstrates that random bit errors during model transmission can significantly affect FL performance. To overcome this challenge, we propose an app… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  45. arXiv:2303.15940  [pdf, other

    cs.SD eess.AS

    TransAudio: Towards the Transferable Adversarial Audio Attack via Learning Contextualized Perturbations

    Authors: Qi Gege, Yuefeng Chen, Xiaofeng Mao, Yao Zhu, Binyuan Hui, Xiaodan Li, Rong Zhang, Hui Xue

    Abstract: In a transfer-based attack against Automatic Speech Recognition (ASR) systems, attacks are unable to access the architecture and parameters of the target model. Existing attack methods are mostly investigated in voice assistant scenarios with restricted voice commands, prohibiting their applicability to more general ASR related applications. To tackle this challenge, we propose a novel contextuali… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  46. arXiv:2303.08331  [pdf, other

    cs.CV cs.LG cs.NE eess.IV

    Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting

    Authors: Gen Li, Jie Ji, Minghai Qin, Wei Niu, Bin Ren, Fatemeh Afghah, Linke Guo, Xiaolong Ma

    Abstract: As deep convolutional neural networks (DNNs) are widely used in various fields of computer vision, leveraging the overfitting ability of the DNN to achieve video resolution upscaling has become a new trend in the modern video delivery system. By dividing videos into chunks and overfitting each chunk with a super-resolution model, the server encodes videos before transmitting them to the clients, t… ▽ More

    Submitted 18 June, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 Highlight Paper

  47. arXiv:2303.04557  [pdf, other

    cs.CV eess.IV

    Scene Matters: Model-based Deep Video Compression

    Authors: Lv Tang, Xinfeng Zhang, Gai Zhang, Xiaoqi Ma

    Abstract: Video compression has always been a popular research area, where many traditional and deep video compression methods have been proposed. These methods typically rely on signal prediction theory to enhance compression performance by designing high efficient intra and inter prediction strategies and compressing video frames one by one. In this paper, we propose a novel model-based video compression… ▽ More

    Submitted 30 August, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  48. arXiv:2303.04266  [pdf, other

    eess.SY

    Learning to Influence Vehicles' Routing in Mixed-Autonomy Networks by Dynamically Controlling the Headway of Autonomous Cars

    Authors: Xiaoyu Ma, Negar Mehr

    Abstract: It is known that autonomous cars can increase road capacities by maintaining a smaller headway through vehicle platooning. Recent works have shown that these capacity increases can influence vehicles' route choices in unexpected ways similar to the well-known Braess's paradox, such that the network congestion might increase. In this paper, we propose that in mixed-autonomy networks, i.e., networks… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: 7 pages, 7 figures, The 40th IEEE Conference on Robotics and Automation (ICRA 2023)

  49. arXiv:2302.13854  [pdf, other

    eess.SP astro-ph.IM cs.LG cs.SD eess.AS

    A Deep Neural Network Based Reverse Radio Spectrogram Search Algorithm

    Authors: Peter Xiangyuan Ma, Steve Croft, Chris Lintott, Andrew P. V. Siemion

    Abstract: Modern radio astronomy instruments generate vast amounts of data, and the increasingly challenging radio frequency interference (RFI) environment necessitates ever-more sophisticated RFI rejection algorithms. The "needle in a haystack" nature of searches for transients and technosignatures requires us to develop methods that can determine whether a signal of interest has unique properties, or is a… ▽ More

    Submitted 18 January, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 8 pages, 8 figures

    Journal ref: RAS Techniques and Instruments 2023

  50. arXiv:2301.12048  [pdf, other

    cs.CV eess.IV

    Making Reconstruction-based Method Great Again for Video Anomaly Detection

    Authors: Yizhou Wang, Can Qin, Yue Bai, Yi Xu, Xu Ma, Yun Fu

    Abstract: Anomaly detection in videos is a significant yet challenging problem. Previous approaches based on deep neural networks employ either reconstruction-based or prediction-based approaches. Nevertheless, existing reconstruction-based methods 1) rely on old-fashioned convolutional autoencoders and are poor at modeling temporal dependency; 2) are prone to overfit the training samples, leading to indist… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted by ICDM 2022