Showing 1–50 of 100 results for author: Feng, S

  1. arXiv:2406.16588  [pdf, other]

    eess.SY cs.FL

    Switching Controller Synthesis for Hybrid Systems Against STL Formulas

    Authors: Han Su, Shenghua Feng, Sinong Zhan, Naijun Zhan

    Abstract: Switching controllers play a pivotal role in directing hybrid systems (HSs) towards the desired objective, embodying a "correct-by-construction" approach to HS design. Identifying these objectives is thus crucial for the synthesis of effective switching controllers. While most existing works focus on safety and liveness, few of them consider timing constraints. In this paper, we delve into t…

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.11568  [pdf, other]

    cs.CL cs.SD eess.AS q-bio.NC

    Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

    Authors: Sheng Feng, Heyang Liu, Yu Wang, Yanfeng Wang

    Abstract: In this paper, we introduce a groundbreaking end-to-end (E2E) framework for decoding invasive brain signals, marking a significant advancement in the field of speech neuroprosthesis. Our methodology leverages the comprehensive reasoning abilities of large language models (LLMs) to facilitate direct decoding. By fully integrating LLMs, we achieve results comparable to the state-of-the-art cascade m…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2405.08306  [pdf, other]

    math.OC eess.SY

    Flight Path Optimization with Optimal Control Method

    Authors: Gaofeng Su, Xi Cheng, Siyuan Feng, Ke Liu, Jilin Song, Jianan Chen, Chen Zhu, Hui Lin

    Abstract: This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control (CFTOC). The first step is to d…

    Submitted 14 May, 2024; originally announced May 2024.

  4. arXiv:2405.06230  [pdf]

    eess.IV

    Fire in SRRN: Next-Gen 3D Temperature Field Reconstruction Technology

    Authors: Shenxiang Feng, Xiaojian Hao, Xiaodong Huang, Pan Pei, Tong Wei, Chenyang Xu

    Abstract: In aerospace and energy engineering, accurate 3D combustion field temperature measurement is critical. The resolution of traditional methods based on algebraic iteration is limited by the initial voxel division. This study introduces a novel method for reconstructing three-dimensional temperature fields using the Spatial Radiation Representation Network (SRRN). This method utilizes the flame therm…

    Submitted 9 May, 2024; originally announced May 2024.

  5. arXiv:2404.13748  [pdf, other]

    eess.SY math.NA

    Application of Kalman Filter in Stochastic Differential Equations

    Authors: Wencheng Bao, Shi Feng, Kaiwen Zhang

    Abstract: In areas such as finance, engineering, and science, we often face situations that change quickly and unpredictably. These situations are tough to handle and require special tools and methods capable of understanding and predicting what might happen next. Stochastic Differential Equations (SDEs) are renowned for modeling and analyzing real-world dynamical systems. However, obtaining the parameters,…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 18 pages, 14 figures

  6. arXiv:2404.09500  [pdf]

    physics.optics eess.IV

    On-chip Real-time Hyperspectral Imager with Full CMOS Resolution Enabled by Massively Parallel Neural Network

    Authors: Junren Wen, Haiqi Gao, Weiming Shi, Shuaibo Feng, Lingyun Hao, Yujie Liu, Liang Xu, Yuchuan Shao, Yueguang Zhang, Weidong Shen, Chenying Yang

    Abstract: Traditional spectral imaging methods are constrained by the time-consuming scanning process, limiting the application in dynamic scenarios. One-shot spectral imaging based on reconstruction has been a hot research topic recently and the primary challenges still lie in both efficient fabrication techniques suitable for mass production and the high-speed, high-accuracy reconstruction algorithm for r…

    Submitted 15 April, 2024; originally announced April 2024.

  7. arXiv:2404.07959  [pdf]

    eess.SP eess.SY

    Damage identification of offshore jacket platforms in a digital twin framework considering optimal sensor placement

    Authors: Mengmeng Wang, Atilla Incecik, Shizhe Feng, M. K. Gupta, Grzegorz Krlolczyk, Z Li

    Abstract: A new digital twin (DT) framework with optimal sensor placement (OSP) is proposed to accurately calculate the modal responses and identify the damage ratios of the offshore jacket platforms. The proposed damage identification framework consists of two models (namely one OSP model and one damage identification model). The OSP model adopts the multi-objective Lichtenberg algorithm (MOLA) to perform…

    Submitted 26 March, 2024; originally announced April 2024.

  8. arXiv:2402.19275  [pdf, other]

    eess.SY cs.LG

    Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning

    Authors: Jingxuan Yang, Ruoxuan Bai, Haoyuan Ji, Yi Zhang, Jianming Hu, Shuo Feng

    Abstract: The assessment of safety performance plays a pivotal role in the development and deployment of connected and automated vehicles (CAVs). A common approach involves designing testing scenarios based on prior knowledge of CAVs (e.g., surrogate models), conducting tests in these scenarios, and subsequently evaluating CAVs' safety performances. However, substantial differences between CAVs and the prio…

    Submitted 29 February, 2024; originally announced February 2024.

  9. arXiv:2402.01795  [pdf, other]

    eess.SY cs.LG cs.RO cs.SE

    Few-Shot Scenario Testing for Autonomous Vehicles Based on Neighborhood Coverage and Similarity

    Authors: Shu Li, Jingxuan Yang, Honglin He, Yi Zhang, Jianming Hu, Shuo Feng

    Abstract: Testing and evaluating the safety performance of autonomous vehicles (AVs) is essential before the large-scale deployment. Practically, the number of testing scenarios permissible for a specific AV is severely limited by tight constraints on testing budgets and time. With the restrictions imposed by strictly restricted numbers of tests, existing testing methods often lead to significant uncertaint…

    Submitted 22 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  10. arXiv:2312.15416  [pdf, other]

    eess.SY

    On Completeness of SDP-Based Barrier Certificate Synthesis over Unbounded Domains

    Authors: Hao Wu, Shenghua Feng, Ting Gan, Jie Wang, Bican Xia, Naijun Zhan

    Abstract: Barrier certificates, serving as differential invariants that witness system safety, play a crucial role in the verification of cyber-physical systems (CPS). Prevailing computational methods for synthesizing barrier certificates are based on semidefinite programming (SDP) by exploiting Putinar Positivstellensatz. Consequently, these approaches are limited by the Archimedean condition, which requir…

    Submitted 26 April, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Comments: 18 pages, 1 figure

  11. arXiv:2311.07418  [pdf, other]

    cs.CL cs.SD eess.AS

    Speech-based Slot Filling using Large Language Models

    Authors: Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gašić, Philip C. Woodland

    Abstract: Recently, advancements in large language models (LLMs) have shown an unprecedented ability across various language tasks. This paper investigates the potential application of LLMs to slot filling with noisy ASR transcriptions, via both in-context learning and task-specific fine-tuning. Dedicated prompt designs and fine-tuning approaches are proposed to improve the robustness of LLMs for slot filli…

    Submitted 13 November, 2023; originally announced November 2023.

  12. arXiv:2311.00263  [pdf, other]

    eess.SY

    The bottleneck and ceiling effects in quantized tracking control of heterogeneous multi-agent systems under DoS attacks

    Authors: Shuai Feng, Maopeng Ran, Baoyong Zhang, Lihua Xie, Shengyuan Xu

    Abstract: In this paper, we investigate tracking control of heterogeneous multi-agent systems under Denial-of-Service (DoS) attacks and state quantization. Dynamic quantized mechanisms are designed for inter-follower communication and leader-follower communication. Zooming-in and out factors, and data rates of both mechanisms for preventing quantizer saturation are provided. Our results show that by tuning…

    Submitted 31 October, 2023; originally announced November 2023.

  13. arXiv:2309.05908  [pdf, other]

    eess.SY

    Reset Controller Synthesis by Reach-avoid Analysis for Delay Hybrid Systems

    Authors: Han Su, Jiyu Zhu, Shenghua Feng, Yunjun Bai, Bin Gu, Jiang Liu, Mengfei Yang, Naijun Zhan

    Abstract: A reset controller plays a crucial role in designing hybrid systems. It restricts the initial set and redefines the reset map associated with discrete transitions, in order to guarantee the system to achieve its objective. Reset controller synthesis, together with feedback controller synthesis and switching logic controller synthesis, provides a correct-by-construction approach to designing hybrid…

    Submitted 27 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 15 pages, 10 figures

  14. arXiv:2308.12617  [pdf, ps, other]

    eess.SY cs.MA

    Quantized distributed Nash equilibrium seeking under DoS attacks: A quantized consensus based approach

    Authors: Shuai Feng, Maojiao Ye, Lihua Xie, Shengyuan Xu

    Abstract: This paper studies distributed Nash equilibrium (NE) seeking under Denial-of-Service (DoS) attacks and quantization. The players can only exchange information with their own direct neighbors. The transmitted information is subject to quantization and packet losses induced by malicious DoS attacks. We propose a quantized distributed NE seeking strategy based on the approach of dynamic quantized con…

    Submitted 24 August, 2023; originally announced August 2023.

  15. arXiv:2307.15388  [pdf, other]

    cs.LG eess.SP physics.geo-ph

    An Empirical Study of Large-Scale Data-Driven Full Waveform Inversion

    Authors: Peng Jin, Yinan Feng, Shihang Feng, Hanchen Wang, Yinpeng Chen, Benjamin Consolvo, Zicheng Liu, Youzuo Lin

    Abstract: This paper investigates the impact of big data on deep learning models to help solve the full waveform inversion (FWI) problem. While it is well known that big data can boost the performance of deep learning models in many tasks, its effectiveness has not been validated for FWI. To address this gap, we present an empirical study that investigates how deep learning models in FWI behave when trained…

    Submitted 24 April, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

  16. arXiv:2306.02982  [pdf, other]

    cs.CL eess.AS

    PolyVoice: Language Models for Speech to Speech Translation

    Authors: Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang

    Abstract: We propose PolyVoice, a language model-based framework for speech-to-speech translation (S2ST) system. Our framework consists of two language models: a translation language model and a speech synthesis language model. We use discretized speech units, which are generated in a fully unsupervised way, and thus our framework can be used for unwritten languages. For the speech synthesis part, we adopt…

    Submitted 13 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  17. Dynamic quantized consensus under DoS attacks: Towards a tight zooming-out factor

    Authors: Shuai Feng, Maopeng Ran, Hideaki Ishii, Shengyuan Xu

    Abstract: This paper deals with dynamic quantized consensus of dynamical agents in a general form under packet losses induced by Denial-of-Service (DoS) attacks. The communication channel has limited bandwidth and hence the transmitted signals over the network are subject to quantization. To deal with agent's output, an observer is implemented at each node. The state of the observer is quantized by a finite…

    Submitted 31 May, 2023; originally announced June 2023.

  18. arXiv:2305.15719  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Efficient Neural Music Generation

    Authors: Max W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang

    Abstract: Recent progress in music generation has been remarkably advanced by the state-of-the-art MusicLM, which comprises a hierarchy of three LMs, respectively, for semantic, coarse acoustic, and fine acoustic modelings. Yet, sampling with the MusicLM requires processing through these LMs one by one to obtain the fine-grained acoustic tokens, making it computationally expensive and prohibitive for a real…

    Submitted 25 May, 2023; originally announced May 2023.

  19. arXiv:2305.13314  [pdf, other]

    physics.geo-ph cs.LG eess.SP

    Auto-Linear Phenomenon in Subsurface Imaging

    Authors: Yinan Feng, Yinpeng Chen, Peng Jin, Shihang Feng, Zicheng Liu, Youzuo Lin

    Abstract: Subsurface imaging involves solving full waveform inversion (FWI) to predict geophysical properties from measurements. This problem can be reframed as an image-to-image translation, with the usual approach being to train an encoder-decoder network using paired data from two domains: geophysical property and measurement. A recent seminal work (InvLINT) demonstrates there is only a linear mapping be…

    Submitted 21 May, 2024; v1 submitted 27 April, 2023; originally announced May 2023.

  20. arXiv:2305.11576  [pdf, other]

    eess.AS cs.CL cs.SD

    Language-universal phonetic encoder for low-resource speech recognition

    Authors: Siyuan Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang

    Abstract: Multilingual training is effective in improving low-resource ASR, which may partially be explained by phonetic representation sharing between languages. In end-to-end (E2E) ASR systems, graphemes are often used as basic modeling units; however, graphemes may not be ideal for multilingual phonetic sharing. In this paper, we leverage International Phonetic Alphabet (IPA) based language-universal phon…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in INTERSPEECH 2023

  21. arXiv:2305.11569  [pdf, ps, other]

    eess.AS cs.CL cs.SD

    Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition

    Authors: Siyuan Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang

    Abstract: We improve low-resource ASR by integrating the ideas of multilingual training and self-supervised learning. Concretely, we leverage an International Phonetic Alphabet (IPA) multilingual model to create frame-level pseudo labels for unlabeled speech, and use these pseudo labels to guide hidden-unit BERT (HuBERT) based speech pretraining in a phonetically-informed manner. The experiments on the Mult…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in INTERSPEECH 2023

  22. arXiv:2303.14278  [pdf, other]

    cs.RO eess.SY

    Safe Hierarchical Navigation in Crowded Dynamic Uncertain Environments

    Authors: Hongyi Chen, Shiyu Feng, Ye Zhao, Changliu Liu, Patricio A. Vela

    Abstract: This paper describes a hierarchical solution consisting of a multi-phase planner and a low-level safe controller to jointly solve the safe navigation problem in crowded, dynamic, and uncertain environments. The planner employs dynamic gap analysis and trajectory optimization to achieve collision avoidance with respect to the predicted trajectories of dynamic agents within the sensing and planning…

    Submitted 24 March, 2023; originally announced March 2023.

  23. arXiv:2303.08243  [pdf, other]

    cs.RO eess.SY

    Safer Gap: A Gap-based Local Planner for Safe Navigation with Nonholonomic Mobile Robots

    Authors: Shiyu Feng, Ahmad Abuaish, Patricio A. Vela

    Abstract: This paper extends the gap-based navigation technique in Potential Gap by guaranteeing safety for nonholonomic robots for all tiers of the local planner hierarchy, so-called Safer Gap. The first tier generates a Bezier-based collision-free path through gaps. A subset of navigable free-space from the robot through a gap, called the keyhole, is defined to be the union of the largest collision-free d…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Submitted to IROS 2023

  24. arXiv:2302.08010  [pdf, other]

    cs.IT eess.SP

    Achieving Covert Communication in Large-Scale SWIPT-Enabled D2D Networks

    Authors: Shaohan Feng, Xiao Lu, Dusit Niyato, Ekram Hossain, Sumei Sun

    Abstract: We aim to secure a large-scale device-to-device (D2D) network against adversaries. The D2D network underlays a downlink cellular network to reuse the cellular spectrum and is enabled for simultaneous wireless information and power transfer (SWIPT). In the D2D network, the transmitters communicate with the receivers, and the receivers extract information and energy from their received radio-frequen…

    Submitted 15 February, 2023; originally announced February 2023.

  25. arXiv:2302.01745  [pdf, other]

    cs.CR cs.NI eess.SP

    Covert D2D Communication Underlaying Cellular Network: A System-Level Security Perspective

    Authors: Shaohan Feng, Xiao Lu, Kun Zhu, Dusit Niyato, Ping Wang

    Abstract: In this paper, we aim to secure the D2D communication of the D2D-underlaid cellular network by leveraging covert communication to hide its presence from the vigilant adversary. In particular, there are adversaries aiming to detect D2D communications based on their received signal powers. To avoid being detected, the legitimate entity, i.e., D2D-underlaid cellular network, performs power control so…

    Submitted 27 January, 2023; originally announced February 2023.

  26. arXiv:2212.08391  [pdf, ps, other]

    eess.SP

    Enhanced-rate Iterative Beamformers for Active IRS-assisted Wireless Communications

    Authors: Yeqing Lin, Feng Shu, Rongen Dong, Riqing Chen, Siling Feng, Weiping Shi, Jing Liu, Jiangzhou Wang

    Abstract: Compared to passive intelligent reflecting surface (IRS), active IRS is viewed as a more efficient promising technique to combat the double-fading impact in IRS-aided wireless network. In this paper, in order to boost the achievable rate of user in such a wireless network, three enhanced-rate iterative beamforming methods are proposed by designing the amplifying factors and the corresponding phase…

    Submitted 14 May, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  27. Adaptive Safety Evaluation for Connected and Automated Vehicles with Sparse Control Variates

    Authors: Jingxuan Yang, Haowei Sun, Honglin He, Yi Zhang, Shuo Feng, Henry X. Liu

    Abstract: Safety performance evaluation is critical for developing and deploying connected and automated vehicles (CAVs). One prevailing way is to design testing scenarios using prior knowledge of CAVs, test CAVs in these scenarios, and then evaluate their safety performances. However, significant differences between CAVs and prior knowledge could severely reduce the evaluation efficiency. Towards addressin…

    Submitted 1 December, 2022; originally announced December 2022.

  28. arXiv:2210.10506  [pdf, other]

    cs.SD eess.AS

    Audio Tampering Detection Based on Shallow and Deep Feature Representation Learning

    Authors: Zhifeng Wang, Yao Yang, Chunyan Zeng, Shuai Kong, Shixiong Feng, Nan Zhao

    Abstract: Digital audio tampering detection can be used to verify the authenticity of digital audio. However, most current methods use standard electronic network frequency (ENF) databases for visual comparison analysis of ENF continuity of digital audio or perform feature extraction for classification by machine learning methods. ENF databases are usually tricky to obtain, visual methods have weak feature…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Audio tampering detection, 21 pages, 4 figures

  29. arXiv:2209.15170  [pdf, other]

    cs.CR eess.SP

    Securing Large-Scale D2D Networks Using Covert Communication and Friendly Jamming

    Authors: Shaohan Feng, Xiao Lu, Sumei Sun, Dusit Niyato, Ekram Hossain

    Abstract: We exploit both covert communication and friendly jamming to propose a friendly jamming-assisted covert communication and use it to doubly secure a large-scale device-to-device (D2D) network against eavesdroppers (i.e., wardens). The D2D transmitters defend against the wardens by: 1) hiding their transmissions with enhanced covert communication, and 2) leveraging friendly jamming to ensure informa…

    Submitted 29 September, 2022; originally announced September 2022.

  30. arXiv:2209.00196  [pdf, other]

    eess.IV physics.optics

    Group frame neural network of moving object ghost imaging combined with frame merging algorithm

    Authors: Da Chen, Shan-Guo Feng, Hua-Hua Wang, Jia-Ning Cao, Zhi-Wei Zhang, Zhi-Xin Yang, Ao Yan, Lu Gao, Ze Zhang

    Abstract: The nature of multiple samples to extract correlation information limits the applications of ghost imaging of moving objects. A novel multi-to-one neural network is proposed and the concept of "batch frame" is introduced to improve the serial imaging method. The neural network extracts more correlation information from a small number of samples, thus reducing the sampling ratio of the ghost imagin…

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: 12 pages, 7 figures

  31. arXiv:2208.12753  [pdf, other]

    cs.SD cs.AI eess.AS

    Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings

    Authors: Chunyan Zeng, Shixiong Feng, Zhifeng Wang, Xiangkui Wan, Yunfan Chen, Nan Zhao

    Abstract: The existing source cell-phone recognition method lacks the long-term feature characterization of the source device, resulting in inaccurate representation of the source cell-phone related features which leads to insufficient recognition accuracy. In this paper, we propose a source cell-phone recognition method based on spatio-temporal representation learning, which includes two main parts: extrac…

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: 29 pages, 4 figures

  32. arXiv:2207.09259  [pdf, other]

    eess.SY

    Adaptive Testing for Connected and Automated Vehicles with Sparse Control Variates in Overtaking Scenarios

    Authors: Jingxuan Yang, Honglin He, Yi Zhang, Shuo Feng, Henry X. Liu

    Abstract: Testing and evaluation is a critical step in the development and deployment of connected and automated vehicles (CAVs). Due to the black-box property and various types of CAVs, how to test and evaluate CAVs adaptively remains a major challenge. Many approaches have been proposed to adaptively generate testing scenarios during the testing process. However, most existing approaches cannot be applied…

    Submitted 19 July, 2022; originally announced July 2022.

  33. arXiv:2207.08332  [pdf, other]

    eess.SY eess.SP

    Quantized Consensus under Data-Rate Constraints and DoS Attacks: A Zooming-In and Holding Approach

    Authors: Maopeng Ran, Shuai Feng, Juncheng Li, Lihua Xie

    Abstract: This paper is concerned with the quantized consensus problem for uncertain nonlinear multi-agent systems under data-rate constraints and Denial-of-Service (DoS) attacks. The agents are modeled in strict-feedback form with unknown nonlinear dynamics and external disturbance. Extended state observers (ESOs) are leveraged to estimate agents' total uncertainties along with their states. To mitigate th…

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: 16 pages, 8 figures

  34. arXiv:2207.01430  [pdf, ps, other]

    eess.SY math.OC

    Krasovskii and Shifted Passivity Based Output Consensus

    Authors: Yu Kawano, Michele Cucuzzella, Shuai Feng, Jacquelien M. A. Scherpen

    Abstract: Motivated by current sharing in power networks, we consider a class of output consensus (also called agreement) problems for nonlinear systems, where the consensus value is determined by external disturbances, e.g., power demand. This output consensus problem is solved by a simple distributed output feedback controller if a system is either Krasovskii or shifted passive, which is the only essentia…

    Submitted 4 July, 2022; originally announced July 2022.

  35. arXiv:2206.00105  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep learning pipeline for image classification on mobile phones

    Authors: Muhammad Muneeb, Samuel F. Feng, Andreas Henschel

    Abstract: This article proposes and documents a machine-learning framework and tutorial for classifying images using mobile phones. Compared to computers, the performance of deep learning models degrades when deployed on a mobile phone and requires a systematic approach to find a model that performs optimally on both computers and mobile phones. By following the proposed pipeline, which consists…

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: 20 pages

    Journal ref: 9th International Conference on Artificial Intelligence and Applications (AIAPP 2022)

  36. arXiv:2205.12633  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, ** Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang, et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  37. arXiv:2204.13731  [pdf, other]

    cs.LG eess.SP physics.geo-ph

    An Intriguing Property of Geophysics Inversion

    Authors: Yinan Feng, Yinpeng Chen, Shihang Feng, Peng Jin, Zicheng Liu, Youzuo Lin

    Abstract: Inversion techniques are widely used to reconstruct subsurface physical properties (e.g., velocity, conductivity) from surface-based geophysical measurements (e.g., seismic, electric/magnetic (EM) data). The problems are governed by partial differential equations (PDEs) like the wave or Maxwell's equations. Solving geophysical inversion problems is challenging due to the ill-posedness and high com…

    Submitted 16 June, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

  38. arXiv:2204.03178  [pdf, other]

    cs.SD cs.CL eess.AS

    3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

    Authors: Zhao You, Shulin Feng, Dan Su, Dong Yu

    Abstract: Recently, Conformer based CTC/AED model has become a mainstream architecture for ASR. In this paper, based on our prior work, we identify and integrate several approaches to achieve further improvements for ASR tasks, which we denote as multi-loss, multi-path and multi-level, summarized as "3M" model. Specifically, multi-loss refers to the joint CTC/AED loss and multi-path denotes the Mixture-of-E…

    Submitted 14 April, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: 5 pages, 1 figure. Submitted to INTERSPEECH 2022

  39. arXiv:2203.03969  [pdf, other]

    cs.GT cs.NI eess.SY

    A Dynamic Hierarchical Framework for IoT-assisted Metaverse Synchronization

    Authors: Yue Han, Dusit Niyato, Cyril Leung, Dong In Kim, Kun Zhu, Shaohan Feng, Sherman Xuemin Shen, Chunyan Miao

    Abstract: Metaverse has recently attracted much attention from both academia and industry. Virtual services, ranging from virtual driver training to online route optimization for smart goods delivery, are emerging in the Metaverse. To make the human experience of virtual life more real, digital twins (DTs), namely digital replicas of physical objects, are key enablers. However, DT status may not always accu…

    Submitted 14 March, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

  40. arXiv:2202.11880  [pdf, other]

    cs.GT eess.SY

    On Nash-Stackelberg-Nash Games under Decision-Dependent Uncertainties: Model and Equilibrium

    Authors: Yunfan Zhang, Feng Liu, Zhaojian Wang, Yue Chen, Shuanglei Feng, Qiuwei Wu, Yunhe Hou

    Abstract: In this paper, we discuss a class of two-stage hierarchical games with multiple leaders and followers, which is called Nash-Stackelberg-Nash (N-S-N) games. Particularly, we consider N-S-N games under decision-dependent uncertainties (DDUs). DDUs refer to the uncertainties that are affected by the strategies of decision-makers and have been rarely addressed in game equilibrium analysis. In this pap…

    Submitted 23 February, 2022; originally announced February 2022.

  41. arXiv:2202.04542  [pdf, other]

    eess.SP cs.HC

    Spectrally Adaptive Common Spatial Patterns

    Authors: Mahta Mousavi, Eric Lybrand, Shuangquan Feng, Shuai Tang, Rayan Saab, Virginia de Sa

    Abstract: The method of Common Spatial Patterns (CSP) is widely used for feature extraction of electroencephalography (EEG) data, such as in motor imagery brain-computer interface (BCI) systems. It is a data-driven method estimating a set of spatial filters so that the power of the filtered EEG signal is maximized for one motor imagery class and minimized for the other. This method, however, is prone to ove…

    Submitted 9 February, 2022; originally announced February 2022.

  42. arXiv:2201.11207  [pdf, other]

    cs.SD cs.CL eess.AS

    Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

    Authors: Piotr Żelasko, Siyuan Feng, Laureano Moro Velazquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak

    Abstract: The high cost of data acquisition makes Automatic Speech Recognition (ASR) model training problematic for most existing languages, including languages that do not even have a written script, or for which the phone inventories remain unknown. Past works explored multilingual training, transfer learning, as well as zero-shot learning in order to build ASR systems for these low-resource languages. Wh…

    Submitted 27 January, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in Computer Speech and Language

  43. arXiv:2201.04908  [pdf, ps, other]

    cs.SD cs.AI eess.AS

    The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition

    Authors: Luke Prananta, Bence Mark Halpern, Siyuan Feng, Odette Scharenborg

    Abstract: In this paper, we investigate several existing and a new state-of-the-art generative adversarial network-based (GAN) voice conversion method for enhancing dysarthric speech for improved dysarthric speech recognition. We compare key components of existing methods as part of a rigorous ablation study to find the most effective solution to improve dysarthric speech recognition. We find that straightf…

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: Extended version of paper to be submitted to Interspeech 2022. 6 pages, 2 tables

  44. arXiv:2111.11831  [pdf, other]

    eess.AS cs.CL cs.SD

    SpeechMoE2: Mixture-of-Experts Model with Improved Routing

    Authors: Zhao You, Shulin Feng, Dan Su, Dong Yu

    Abstract: Mixture-of-experts based acoustic models with dynamic routing mechanisms have shown promising results for speech recognition. The design principle of the router architecture is important for large model capacity and high computational efficiency. Our previous work, SpeechMoE, only uses local grapheme embedding to help routers make routing decisions. To further improve speech recognition performanc…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 5 pages, 1 figure. Submitted to ICASSP 2022

  45. arXiv:2111.02926  [pdf, other]

    cs.LG eess.SP

    OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic Full Waveform Inversion

    Authors: Chengyuan Deng, Shihang Feng, Hanchen Wang, Xitong Zhang, Peng Jin, Yinan Feng, Qili Zeng, Yinpeng Chen, Youzuo Lin

    Abstract: Full waveform inversion (FWI) is widely used in geophysics to reconstruct high-resolution velocity maps from seismic data. The recent success of data-driven FWI methods results in a rapidly increasing demand for open datasets to serve the geophysics community. We present OpenFWI, a collection of large-scale multi-structural benchmark datasets, to facilitate diversified, rigorous, and reproducible…

    Submitted 23 June, 2023; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: This manuscript has been accepted by NeurIPS 2022 dataset and benchmark track

  46. arXiv:2110.14879  [pdf, ps, other]

    cs.IT eess.SP

    Pilot Optimization and Channel Estimation for Two-way Relaying Network Aided by IRS with Finite Discrete Phase Shifters

    Authors: Zhongwen Sun, Xuehui Wang, Siling Feng, Xinrong Guan, Feng Shu, Jiangzhou Wang

    Abstract: In this paper, we investigate the problem of pilot optimization and channel estimation for a two-way relaying network (TWRN) aided by an intelligent reflecting surface (IRS) with finite discrete phase shifters. In a TWRN, a challenging problem is that the two cascaded channels, source-to-IRS-to-relay and destination-to-IRS-to-relay, interfere with each other. Via designing the initial p…

    Submitted 15 February, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: 5 pages, 5 figures

  47. arXiv:2109.00154  [pdf, other]

    cs.IT eess.SP

    DOA Estimation Using Massive Receive MIMO: Basic Principle and Key Techniques

    Authors: Jiangzhou Wang, Baihua Shi, Feng Shu, Qi Zhang, Di Wu, Qijuan Jie, Zhihong Zhuang, Siling Feng, Yijin Zhang

    Abstract: As massive multiple-input multiple-output (MIMO) becomes popular, direction of arrival (DOA) measurement has seen a real renaissance due to the high resolution achieved. Thus, there is no doubt about DOA estimation using massive MIMO. The purpose of this paper is to describe its basic principles and key techniques, to present the performance analysis, and to appreciate its engineering applica…

    Submitted 15 July, 2023; v1 submitted 31 August, 2021; originally announced September 2021.

  48. arXiv:2105.03643  [pdf, ps, other]

    eess.AS cs.SD

    Latency-Controlled Neural Architecture Search for Streaming Speech Recognition

    Authors: Liqiang He, Shulin Feng, Dan Su, Dong Yu

    Abstract: Neural architecture search (NAS) has attracted much attention and has been explored for automatic speech recognition (ASR). In this work, we focus on streaming ASR scenarios and propose the latency-controlled NAS for acoustic modeling. First, based on the vanilla neural architecture, normal cells are altered to causal cells to control the total latency of the architecture. Second, a revised operat…

    Submitted 13 September, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted to ASRU 2021

  49. arXiv:2105.03036  [pdf, other]

    cs.SD cs.CL eess.AS

    SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts

    Authors: Zhao You, Shulin Feng, Dan Su, Dong Yu

    Abstract: Recently, Mixture of Experts (MoE) based Transformer has shown promising results in many domains. This is largely due to the following advantages of this architecture: firstly, MoE based Transformer can increase model capacity without increasing computational cost at either training or inference time. Besides, MoE based Transformer is a dynamic network which can adapt to the varying complexity of i…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 5 pages, 2 figures. Submitted to Interspeech 2021

  50. arXiv:2104.13721  [pdf, other]

    eess.SY

    Optimal Cooperative Driving at Signal-Free Intersections with Polynomial-Time Complexity

    Authors: Huaxin Pei, Yuxiao Zhang, Yi Zhang, Shuo Feng

    Abstract: Cooperative driving at signal-free intersections, which aims to improve driving safety and efficiency for connected and automated vehicles, has attracted increasing interest in recent years. However, existing cooperative driving strategies either suffer from computational complexity or cannot guarantee global optimality. To fill this research gap, this paper proposes an optimal and computationally…

    Submitted 28 April, 2021; originally announced April 2021.