Skip to main content

Showing 1–50 of 121 results for author: Yang, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.00933  [pdf, other

    cs.DC eess.SP

    Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks: Design Optimization and Analysis

    Authors: Xueyao Zhang, Bo Yang, Zhiwen Yu, Xuelin Cao, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen

    Abstract: This paper investigates autonomous driving safety improvement via task offloading from cellular vehicles (CVs) to a multi-access edge computing (MEC) server using vehicle-to-infrastructure (V2I) links. Considering that the latter links can be reused by vehicle-to-vehicle (V2V) communications to improve spectrum utilization, the receiver of the V2I link may suffer from severe interference that can… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.19959  [pdf, other

    cs.SD eess.AS

    RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

    Authors: Bing Yang, Changsheng Quan, Yabo Wang, Pengyu Wang, Yujie Yang, Ying Fang, Nian Shao, Hui Bu, Xin Xu, Xiaofei Li

    Abstract: The training of deep learning-based multichannel speech enhancement and source localization systems relies heavily on the simulation of room impulse response and multichannel diffuse noise, due to the lack of large-scale real-recorded datasets. However, the acoustic mismatch between simulated and real-world data could degrade the model performance when applying in real-world scenarios. To bridge t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  3. arXiv:2406.18055  [pdf, other

    cs.IT eess.SP

    Filtering Reconfigurable Intelligent Computational Surface for RF Spectrum Purification

    Authors: Kaining Wang, Bo Yang, Zhiwen Yu, Xuelin Cao, Mérouane Debbah, Chau Yuen

    Abstract: The increasing demand for communication is degrading the electromagnetic (EM) transmission environment due to severe EM interference, significantly reducing the efficiency of the radio frequency (RF) spectrum. Metasurfaces, a promising technology for controlling desired EM waves, have recently received significant attention from both academia and industry. However, the potential impact of out-of-b… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  4. arXiv:2406.13335  [pdf, other

    cs.NI eess.SP

    AI-Empowered Multiple Access for 6G: A Survey of Spectrum Sensing, Protocol Designs, and Optimizations

    Authors: Xuelin Cao, Bo Yang, Kaining Wang, Xinghua Li, Zhiwen Yu, Chau Yuen, Yan Zhang, Zhu Han

    Abstract: With the rapidly increasing number of bandwidth-intensive terminals capable of intelligent computing and communication, such as smart devices equipped with shallow neural network models, the complexity of multiple access for these intelligent terminals is increasing due to the dynamic network environment and ubiquitous connectivity in 6G systems. Traditional multiple access (MA) design and optimiz… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.12447  [pdf, other

    eess.AS

    Text-aware Speech Separation for Multi-talker Keyword Spotting

    Authors: Haoyu Li, Baochen Yang, Yu Xi, Linfeng Yu, Tian Tan, Hao Li, Kai Yu

    Abstract: For noisy environments, ensuring the robustness of keyword spotting (KWS) systems is essential. While much research has focused on noisy KWS, less attention has been paid to multi-talker mixed speech scenarios. Unlike the usual cocktail party problem where multi-talker speech is separated using speaker clues, the key challenge here is to extract the target speech for KWS based on text clues. To ad… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH2024

  6. arXiv:2406.11546  [pdf, other

    eess.AS cs.CL cs.SD

    GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

    Authors: Yifan Yang, Zheshu Song, Jianheng Zhuo, Mingyu Cui, **peng Li, Bo Yang, Yexing Du, Ziyang Ma, Xunying Liu, Ziyuan Wang, Ke Li, Shuai Fan, Kai Yu, Wei-Qiang Zhang, Guoguo Chen, Xie Chen

    Abstract: The evolution of speech technology has been spurred by the rapid increase in dataset sizes. Traditional speech models generally depend on a large amount of labeled training data, which is scarce for low-resource languages. This paper presents GigaSpeech 2, a large-scale, multi-domain, multilingual speech recognition corpus. It is designed for low-resource languages and does not rely on paired spee… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  7. arXiv:2406.03391  [pdf, other

    eess.SP

    Joint Association, Beamforming, and Resource Allocation for Multi-IRS Enabled MU-MISO Systems With RSMA

    Authors: Chunjie Wang, Xuhui Zhang, Huijun Xing, Liang Xue, Shuqiang Wang, Yanyan Shen, Bo Yang, ** Guan

    Abstract: Intelligent reflecting surface (IRS) and rate-splitting multiple access (RSMA) technologies are at the forefront of enhancing spectrum and energy efficiency in the next generation multi-antenna communication systems. This paper explores a RSMA system with multiple IRSs, and proposes two purpose-driven scheduling schemes, i.e., the exhaustive IRS-aided (EIA) and opportunistic IRS-aided (OIA) scheme… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  8. arXiv:2405.07021  [pdf, other

    eess.AS cs.SD

    IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization

    Authors: Yabo Wang, Bing Yang, Xiaofei Li

    Abstract: Extracting direct-path spatial feature is crucial for sound source localization in adverse acoustic environments. This paper proposes the IPDnet, a neural network that estimates direct-path inter-channel phase difference (DP-IPD) of sound sources from microphone array signals. The estimated DP-IPD can be easily translated to source location based on the known microphone array geometry. First, a fu… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  9. arXiv:2404.13786  [pdf, other

    eess.SY cs.AI cs.DC cs.LG

    Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving

    Authors: Shuyao Shi, Neiwen Ling, Zhehao Jiang, Xuan Huang, Yuze He, Xiaoguang Zhao, Bufang Yang, Chen Bian, **gfei Xia, Zhenyu Yan, Raymond Yeung, Guoliang Xing

    Abstract: Recently,smart roadside infrastructure (SRI) has demonstrated the potential of achieving fully autonomous driving systems. To explore the potential of infrastructure-assisted autonomous driving, this paper presents the design and deployment of Soar, the first end-to-end SRI system specifically designed to support autonomous driving systems. Soar consists of both software and hardware components ca… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  10. arXiv:2404.07215  [pdf, other

    cs.NI cs.AI eess.SP

    Computation Offloading for Multi-server Multi-access Edge Vehicular Networks: A DDQN-based Method

    Authors: Siyu Wang, Bo Yang, Zhiwen Yu, Xuelin Cao, Yan Zhang, Chau Yuen

    Abstract: In this paper, we investigate a multi-user offloading problem in the overlap** domain of a multi-server mobile edge computing system. We divide the original problem into two stages: the offloading decision making stage and the request scheduling stage. To prevent the terminal from going out of service area during offloading, we consider the mobility parameter of the terminal according to the hum… ▽ More

    Submitted 20 February, 2024; originally announced April 2024.

  11. arXiv:2403.13332  [pdf, other

    eess.AS cs.SD

    TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer

    Authors: Yu Xi, Hao Li, Baochen Yang, Haoyu Li, Hainan Xu, Kai Yu

    Abstract: Designing an efficient keyword spotting (KWS) system that delivers exceptional performance on resource-constrained edge devices has long been a subject of significant attention. Existing KWS search algorithms typically follow a frame-synchronous approach, where search decisions are made repeatedly at each frame despite the fact that most frames are keyword-irrelevant. In this paper, we propose TDT… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted by ICASSP2024

  12. arXiv:2402.15932  [pdf, other

    cs.LG eess.SY

    Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach

    Authors: Alaa Selim, Yanzhu Ye, Junbo Zhao, Bo Yang

    Abstract: In the rapidly evolving domain of electrical power systems, the Volt-VAR optimization (VVO) is increasingly critical, especially with the burgeoning integration of renewable energy sources. Traditional approaches to learning-based VVO in expansive and dynamically changing power systems are often hindered by computational complexities. To address this challenge, our research presents a novel framew… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  13. arXiv:2402.04584  [pdf, other

    eess.IV cs.CV

    Troublemaker Learning for Low-Light Image Enhancement

    Authors: Yinghao Song, Zhiyuan Cao, Wanhong Xiang, Sifan Long, Bo Yang, Hongwei Ge, Yanchun Liang, Chunguo Wu

    Abstract: Low-light image enhancement (LLIE) restores the color and brightness of underexposed images. Supervised methods suffer from high costs in collecting low/normal-light image pairs. Unsupervised methods invest substantial effort in crafting complex loss functions. We address these two challenges through the proposed TroubleMaker Learning (TML) strategy, which employs normal-light images as inputs for… ▽ More

    Submitted 2 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  14. arXiv:2402.00398  [pdf, other

    cs.DC eess.SP

    Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks

    Authors: Bo Yang, Xueyao Zhang, Zhiwen Yu, Xuelin Cao, Chongwen Huang, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen

    Abstract: In this paper, we focus on improving autonomous driving safety via task offloading from cellular vehicles (CVs), using vehicle-to-infrastructure (V2I) links, to an multi-access edge computing (MEC) server. Considering that the frequencies used for V2I links can be reused for vehicle-to-vehicle (V2V) communications to improve spectrum utilization, the receiver of each V2I link may suffer from sever… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  15. arXiv:2401.12546  [pdf, other

    cs.LG eess.SY math.OC

    On Building Myopic MPC Policies using Supervised Learning

    Authors: Christopher A. Orrico, Bokan Yang, Dinesh Krishnamoorthy

    Abstract: The application of supervised learning techniques in combination with model predictive control (MPC) has recently generated significant interest, particularly in the area of approximate explicit MPC, where function approximators like deep neural networks are used to learn the MPC policy via optimal state-action pairs generated offline. While the aim of approximate explicit MPC is to closely replic… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  16. arXiv:2401.06485  [pdf, other

    eess.AS cs.SD

    Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech

    Authors: Yu Xi, Baochen Yang, Hao Li, Jiaqi Guo, Kai Yu

    Abstract: Customizable keyword spotting (KWS) in continuous speech has attracted increasing attention due to its real-world application potential. While contrastive learning (CL) has been widely used to extract keyword representations, previous CL approaches all operate on pre-segmented isolated words and employ only audio-text representations matching strategy. However, for KWS in continuous speech, co-art… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP2024

  17. arXiv:2312.14473  [pdf, other

    math.OC eess.SY

    Coordinated Active-Reactive Power Management of ReP2H Systems with Multiple Electrolyzers

    Authors: Yangjun Zeng, Buxiang Zhou, Jie Zhu, Jiarong Li, Bosen Yang, ** Lin, Yiwei Qiu

    Abstract: Utility-scale renewable power-to-hydrogen (ReP2H) production typically uses thyristor rectifiers (TRs) to supply power to multiple electrolyzers (ELZs). They exhibit a nonlinear and non-decouplable relation between active and reactive power. The on-off scheduling and load allocation of multiple ELZs simultaneously impact energy conversion efficiency and AC-side active and reactive power flow. Impr… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  18. Joint Trading and Scheduling among Coupled Carbon-Electricity-Heat-Gas Industrial Clusters

    Authors: Dafeng Zhu, Bo Yang, Yu Wu, Haoran Deng, Zhaoyang Dong, Kai Ma, ** Guan

    Abstract: This paper presents a carbon-energy coupling management framework for an industrial park, where the carbon flow model accompanying multi-energy flows is adopted to track and suppress carbon emissions on the user side. To deal with the quadratic constraint of gas flows, a bound tightening algorithm for constraints relaxation is adopted. The synergies among the carbon capture, energy storage, power-… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Smart Grid, 2023

  19. arXiv:2312.12789  [pdf, other

    eess.IV cs.CV cs.LG

    SLP-Net:An efficient lightweight network for segmentation of skin lesions

    Authors: Bo Yang, Hong Peng, Chenggang Guo, Xiaohui Luo, Jun Wang, Xianzhong Long

    Abstract: Prompt treatment for melanoma is crucial. To assist physicians in identifying lesion areas precisely in a quick manner, we propose a novel skin lesion segmentation technique namely SLP-Net, an ultra-lightweight segmentation network based on the spiking neural P(SNP) systems type mechanism. Most existing convolutional neural networks achieve high segmentation accuracy while neglecting the high hard… ▽ More

    Submitted 4 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  20. arXiv:2312.07631  [pdf, other

    physics.med-ph cs.AI eess.IV physics.bio-ph physics.optics

    AI-driven projection tomography with multicore fibre-optic cell rotation

    Authors: Jiawei Sun, Bin Yang, Nektarios Koukourakis, Jochen Guck, Juergen W. Czarske

    Abstract: Optical tomography has emerged as a non-invasive imaging method, providing three-dimensional insights into subcellular structures and thereby enabling a deeper understanding of cellular functions, interactions, and processes. Conventional optical tomography methods are constrained by a limited illumination scanning range, leading to anisotropic resolution and incomplete imaging of cellular structu… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 15 pages, 6 figures

  21. arXiv:2312.00476  [pdf, other

    cs.SD eess.AS

    Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer

    Authors: Bing Yang, Xiaofei Li

    Abstract: Supervised learning methods have shown effectiveness in estimating spatial acoustic parameters such as time difference of arrival, direct-to-reverberant ratio and reverberation time. However, they still suffer from the simulation-to-reality generalization problem due to the mismatch between simulated and real-world acoustic characteristics and the deficiency of annotated real-world data. To this e… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  22. arXiv:2311.18520  [pdf, other

    cs.HC cs.AI cs.LG eess.SP

    Calibration-free online test-time adaptation for electroencephalography motor imagery decoding

    Authors: Martin Wimpff, Mario Döbler, Bin Yang

    Abstract: Providing a promising pathway to link the human brain with external devices, Brain-Computer Interfaces (BCIs) have seen notable advancements in decoding capabilities, primarily driven by increasingly sophisticated techniques, especially deep learning. However, achieving high accuracy in real-world scenarios remains a challenge due to the distribution shift between sessions and subjects. In this pa… ▽ More

    Submitted 8 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures, 12th International Winter Conference on Brain-Computer Interface 2024

  23. arXiv:2310.04231  [pdf, other

    cs.RO eess.SY

    Indoor Positioning based on Active Radar Sensing and Passive Reflectors: Concepts & Initial Results

    Authors: Pascal Schlachter, Zhibin Yu, Naveed Iqbal, Xiaofeng Wu, Sven Hinderer, Bin Yang

    Abstract: To navigate reliably in indoor environments, an industrial autonomous vehicle must know its position. However, current indoor vehicle positioning technologies either lack accuracy, usability or are too expensive. Thus, we propose a novel concept called local reference point assisted active radar positioning, which is able to overcome these drawbacks. It is based on distributing passive retroreflec… ▽ More

    Submitted 31 January, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted as a work-in-progress paper at the 13th International Conference on Indoor Positioning and Indoor Navigation (IPIN 2023)

    Journal ref: Proceedings of the Work-in-Progress Papers at the 13th International Conference on Indoor Positioning and Indoor Navigation (IPIN-WiP 2023), September 25 - 28, 2023, Nuremberg, Germany (https://ceur-ws.org/Vol-3581/)

  24. arXiv:2309.14274  [pdf

    eess.SY

    Analysis and Experimental Validation of the WPT Efficiency of the Both-Sides Retrodirective System

    Authors: Charleston Dale M. Ambatali, Shinichi Nakasuka, Bo Yang, Naoki Shinohara

    Abstract: The retrodirective antenna array is considered as a mechanism to enable target tracking of a power receiver for long range wireless power transfer (WPT) due to its simplicity in implementation using only analog circuits. By installing the retrodirective capability on both the generator and rectenna arrays, a feedback loop that produces a high efficiency WPT channel is created. In this paper, we ch… ▽ More

    Submitted 27 March, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: This current version has been submitted to the Space Solar Power and Wireless Transmission on February 19, 2024 for possible publication. Compared to the previous version, this version is a major revision discussing existing works more thoroughly to the proposed idea and also adding more detail to the experiment setup so it can be reproducible

  25. arXiv:2309.13515  [pdf, other

    cs.RO eess.SY

    Learning-based Inverse Perception Contracts and Applications

    Authors: Dawei Sun, Benjamin C. Yang, Sayan Mitra

    Abstract: Perception modules are integral in many modern autonomous systems, but their accuracy can be subject to the vagaries of the environment. In this paper, we propose a learning-based approach that can automatically characterize the error of a perception module from data and use this for safe control. The proposed approach constructs an inverse perception contract (IPC) which generates a set that cont… ▽ More

    Submitted 3 March, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

  26. arXiv:2309.05964  [pdf, other

    cs.NI eess.SP

    Massive Access of Static and Mobile Users via Reconfigurable Intelligent Surfaces: Protocol Design and Performance Analysis

    Authors: Xuelin Cao, Bo Yang, Chongwen Huang, George C. Alexandropoulos, Chau Yuen, Zhu Han, H. Vincent Poor, Lajos Hanzo

    Abstract: The envisioned wireless networks of the future entail the provisioning of massive numbers of connections, heterogeneous data traffic, ultra-high spectral efficiency, and low latency services. This vision is spurring research activities focused on defining a next generation multiple access (NGMA) protocol that can accommodate massive numbers of users in different resource blocks, thereby, achieving… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  27. arXiv:2309.01168  [pdf, other

    physics.flu-dyn eess.SY

    Noise Measurement of a Wind Turbine using Thick Blades with Blunt Trailing Edge

    Authors: Weicheng Xue, Bing Yang

    Abstract: The noise generated by wind turbines can potentially cause significant harm to the ecological environment and the living conditions of residents. Therefore, a proper assessment of wind turbine noise is crucial. The IEC 61400-11 standard provides standardized guidelines for measuring turbine noise, facilitating the comparison of noise characteristics among different wind turbine models. This work a… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  28. arXiv:2309.00907  [pdf, other

    eess.SP cs.LG

    A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical Computation Offloading

    Authors: Ruihuai Liang, Bo Yang, Zhiwen Yu, Xuelin Cao, Derrick Wing Kwan Ng, Chau Yuen

    Abstract: Computation offloading has become a popular solution to support computationally intensive and latency-sensitive applications by transferring computing tasks to mobile edge servers (MESs) for execution, which is known as mobile/multi-access edge computing (MEC). To improve the MEC performance, it is required to design an optimal offloading strategy that includes offloading decision (i.e., whether o… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  29. arXiv:2308.06958  [pdf, other

    eess.SY

    Hydrogen Supply Infrastructure Network Planning Approach towards Chicken-egg Conundrum

    Authors: Haoran Deng, Bo Yang, Mo-Yuen Chow, Gang Yao, Cailian Chen, ** Guan

    Abstract: In the early commercialization stage of hydrogen fuel cell vehicles (HFCVs), reasonable hydrogen supply infrastructure (HSI) planning decisions is a premise for promoting the popularization of HFCVs. However, there is a strong causality between HFCVs and hydrogen refueling stations (HRSs): the planning decisions of HRSs could affect the hydrogen refueling demand of HFCVs, and the growth of demand… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  30. arXiv:2306.14276  [pdf, other

    cs.SD eess.AS

    Aeroacoustic Source Localization

    Authors: Weicheng Xue, Bing Yang, Shaohong Jia

    Abstract: The deconvolutional DAMAS algorithm can effectively eliminate the misconceptions in the usually-used beamforming localization algorithm, allowing for more accurate calculation of the source location as well as the intensity. When solving a linear system of equations, the DAMAS algorithm takes into account the mutual influence of different locations, reducing or even eliminating sidelobes and produ… ▽ More

    Submitted 5 July, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

  31. arXiv:2306.12604  [pdf, other

    cs.RO eess.SY

    Consecutive Inertia Drift of Autonomous RC Car via Primitive-based Planning and Data-driven Control

    Authors: Yiwen Lu, Bo Yang, Jiayun Li, Yihan Zhou, Hongshuai Chen, Yilin Mo

    Abstract: Inertia drift is an aggressive transitional driving maneuver, which is challenging due to the high nonlinearity of the system and the stringent requirement on control and planning performance. This paper presents a solution for the consecutive inertia drift of an autonomous RC car based on primitive-based planning and data-driven control. The planner generates complex paths via the concatenation o… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 9 pages, 10 figures, to appear to IROS 2023

  32. arXiv:2305.19610  [pdf, other

    eess.AS

    FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization

    Authors: Yabo Wang, Bing Yang, Xiaofei Li

    Abstract: Extracting direct-path spatial features is critical for sound source localization in adverse acoustic environments. This paper proposes a full-band and narrow-band fusion network for estimating direct-path inter-channel phase difference (DP-IPD) from microphone signals. The alternating full-band and narrow-band layers are responsible for learning the full-band correlation and narrow-band extractio… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  33. arXiv:2305.17937  [pdf, other

    eess.IV cs.CV

    Attention Mechanisms in Medical Image Segmentation: A Survey

    Authors: Yutong Xie, Bing Yang, Qingbiao Guan, Jianpeng Zhang, Qi Wu, Yong Xia

    Abstract: Medical image segmentation plays an important role in computer-aided diagnosis. Attention mechanisms that distinguish important parts from irrelevant parts have been widely used in medical image segmentation tasks. This paper systematically reviews the basic principles of attention mechanisms and their applications in medical image segmentation. First, we review the basic concepts of attention mec… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Submitted to Medical Image Analysis, survey paper, 34 pages, over 300 references

  34. arXiv:2305.09647  [pdf, other

    cs.CV eess.IV

    Wavelet-based Unsupervised Label-to-Image Translation

    Authors: George Eskandar, Mohamed Abdelsamad, Karim Armanious, Shuai Zhang, Bin Yang

    Abstract: Semantic Image Synthesis (SIS) is a subclass of image-to-image translation where a semantic layout is used to generate a photorealistic image. State-of-the-art conditional Generative Adversarial Networks (GANs) need a huge amount of paired data to accomplish this task while generic unpaired image-to-image translation frameworks underperform in comparison, because they color-code semantic layouts a… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2109.14715

  35. arXiv:2305.00170  [pdf, other

    cs.SD eess.AS

    Enhancing multilingual speech recognition in air traffic control by sentence-level language identification

    Authors: Peng Fan, Dongyue Guo, JianWei Zhang, Bo Yang, Yi Lin

    Abstract: Automatic speech recognition (ASR) technique is becoming increasingly popular to improve the efficiency and safety of air traffic control (ATC) operations. However, the conversation between ATC controllers and pilots using multilingual speech brings a great challenge to building high-accuracy ASR systems. In this work, we present a two-stage multilingual ASR framework. The first stage is to train… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  36. arXiv:2303.11420  [pdf, other

    eess.SP cs.AI cs.CV

    ADCNet: Learning from Raw Radar Data via Distillation

    Authors: Bo Yang, Ishan Khatri, Michael Happold, Chulong Chen

    Abstract: As autonomous vehicles and advanced driving assistance systems have entered wider deployment, there is an increased interest in building robust perception systems using radars. Radar-based systems are lower cost and more robust to adverse weather conditions than their LiDAR-based counterparts; however the point clouds produced are typically noisy and sparse by comparison. In order to combat these… ▽ More

    Submitted 13 December, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Update 12/13/2023: upgrade organization and presentation of the paper, adding appendix

  37. arXiv:2301.00656  [pdf, other

    eess.AS cs.CL cs.LG

    TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR

    Authors: Lixin Cao, Jun Wang, Ben Yang, Dan Su, Dong Yu

    Abstract: Self-supervised learning (SSL) models confront challenges of abrupt informational collapse or slow dimensional collapse. We propose TriNet, which introduces a novel triple-branch architecture for preventing collapse and stabilizing the pre-training. TriNet learns the SSL latent embedding space and incorporates it to a higher level space for predicting pseudo target vectors generated by a frozen te… ▽ More

    Submitted 14 March, 2023; v1 submitted 12 December, 2022; originally announced January 2023.

    Comments: Accepted by ICASSP 2023

  38. How to Share: Balancing Layer and Chain Sharing in Industrial Microservice Deployment

    Authors: Yuxiang Liu, Bo Yang, Yu Wu, Cailian Chen, ** Guan

    Abstract: With the rapid development of smart manufacturing, edge computing-oriented microservice platforms are emerging as an important part of production control. In the containerized deployment of microservices, layer sharing can reduce the huge bandwidth consumption caused by image pulling, and chain sharing can reduce communication overhead caused by communication between microservices. The two sharing… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

  39. Distributionally Robust Day-ahead Scheduling for Power-traffic Network under a Potential Game Framework

    Authors: Haoran Deng, Bo Yang, Chao Ning, Cailian Chen, ** Guan

    Abstract: Widespread utilization of electric vehicles (EVs) incurs more uncertainties and impacts on the scheduling of the power-transportation coupled network. This paper investigates optimal power scheduling for a power-transportation coupled network in the day-ahead energy market considering multiple uncertainties related to photovoltaic (PV) generation and the traffic demand of vehicles. The crux of thi… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.14209

    Journal ref: International Journal of Electrical Power and Energy Systems 2023

  40. arXiv:2211.15752  [pdf, other

    cs.RO eess.SY

    Hierarchical Control Strategy for Moving A Robot Manipulator Between Small Containers

    Authors: Paolo Torrado, Boling Yang, Joshua Smith

    Abstract: In this paper, we study the implementation of a model predictive controller (MPC) for the task of object manipulation in a highly uncertain environment (e.g., picking objects from a semi-flexible array of densely packed bins). As a real-time perception-driven feedback controller, MPC is robust to the uncertainties in this environment. However, our experiment shows MPC cannot control a robot to com… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  41. arXiv:2209.11112  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

    Authors: Sherif Abdulatif, Ruizhe Cao, Bin Yang

    Abstract: In this work, we further develop the conformer-based metric generative adversarial network (CMGAN) model for speech enhancement (SE) in the time-frequency (TF) domain. This paper builds on our previous work but takes a more in-depth look by conducting extensive ablation studies on model inputs and architectural design choices. We rigorously tested the generalization ability of the model to unseen… ▽ More

    Submitted 3 May, 2024; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 17 pages, 11 figures, and 6 tables. arXiv admin note: text overlap with arXiv:2203.15149

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 2477-2493, 2024

  42. arXiv:2208.08894  [pdf

    eess.SP

    EEG Machine Learning for Analysis of Mild Traumatic Brain Injury: A survey

    Authors: Weiqing Gu, Ryan Chang, Bohan Yang

    Abstract: Mild Traumatic Brain Injury (mTBI) is a common brain injury and affects a diverse group of people: soldiers, constructors, athletes, drivers, children, elders, and nearly everyone. Thus, having a well-established, fast, cheap, and accurate classification method is crucial for the well-being of people around the globe. Luckily, using Machine Learning (ML) on electroencephalography (EEG) data shows… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 27 pages

  43. arXiv:2208.04509  [pdf, other

    eess.SP cs.NI

    Reconfigurable Intelligent Computational Surfaces: When Wave Propagation Control Meets Computing

    Authors: Bo Yang, Xuelin Cao, **dan Xu, Chongwen Huang, George C. Alexandropoulos, Linglong Dai, M'erouane Debbah, H. Vincent Poor, Chau Yuen

    Abstract: The envisioned sixth-generation (6G) of wireless networks will involve an intelligent integration of communications and computing, thereby meeting the urgent demands of diverse applications. To realize the concept of the smart radio environment, reconfigurable intelligent surfaces (RISs) are a promising technology for offering programmable propagation of im**ing electromagnetic signals via exter… ▽ More

    Submitted 3 October, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

  44. arXiv:2207.00615  [pdf, other

    eess.SY

    Synthesis of General Decoupling Networks Using Transmission Lines

    Authors: Binbin Yang

    Abstract: In this paper, we introduce a synthesis technique for transmission line based decoupling networks, which find application in coupled systems such as multiple-antenna systems and antenna arrays. Employing the generalized $π$-network and the transmission line analysis technique, we reduce the decoupling network design into simple matrix calculations. The synthesized decoupling network is essentially… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: 4 pages

  45. arXiv:2206.00208  [pdf, other

    cs.SD eess.AS

    AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation

    Authors: Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su

    Abstract: Speaker adaptation in text-to-speech synthesis (TTS) is to finetune a pre-trained TTS model to adapt to new target speakers with limited data. While much effort has been conducted towards this task, seldom work has been performed for low computational resource scenarios due to the challenges raised by the requirement of the lightweight model and less computational complexity. In this paper, a tiny… ▽ More

    Submitted 2 November, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Accepted by ISCSLP 2022

  46. arXiv:2204.14059  [pdf

    cs.CE eess.IV

    Improving the estimation of directional area scattering factor (DASF) from canopy reflectance: theoretical basis and validation

    Authors: Yi Lin, Siyuan Liu, Lei Yan, Kai Yan, Yelu Zeng, Bin Yang

    Abstract: Directional area scattering factor (DASF) is a critical canopy structural parameter for vegetation monitoring. It provides an efficient tool for decoupling of canopy structure and leaf optics from canopy reflectance. Current standard approach to estimate DASF from canopy bidirectional reflectance factor (BRF) is based on the assumption that in the weakly absorbing 710 to 790 nm spectral interval,… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  47. arXiv:2204.04088  [pdf, other

    eess.SY

    Stochastic Gradient-based Fast Distributed Multi-Energy Management for an Industrial Park with Temporally-Coupled Constraints

    Authors: Dafeng Zhu, Bo Yang, Chengbin Ma, Zhaojian Wang, Shanying Zhu, Kai Ma, ** Guan

    Abstract: Contemporary industrial parks are challenged by the growing concerns about high cost and low efficiency of energy supply. Moreover, in the case of uncertain supply/demand, how to mobilize delay-tolerant elastic loads and compensate real-time inelastic loads to match multi-energy generation/storage and minimize energy cost is a key issue. Since energy management is hardly to be implemented offline… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted by Applied Energy

  48. arXiv:2204.01645  [pdf, other

    eess.IV cs.CV

    Three-dimensional Microstructural Image Synthesis from 2D Backscattered Electron Image of Cement Paste

    Authors: Xin Zhao, Xu Wu, Lin Wang, Pengkun Hou, Qinfei Li, Yuxuan Zhang, Bo Yang

    Abstract: The microstructure is significant for exploring the physical properties of hardened cement paste. In general, the microstructures of hardened cement paste are obtained by microscopy. As a popular method, scanning electron microscopy (SEM) can acquire high-quality 2D images but fails to obtain 3D microstructures.Although several methods, such as microtomography (Micro-CT) and Focused Ion Beam Scann… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: 25 pages, 9 figures

  49. arXiv:2203.15149  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    CMGAN: Conformer-based Metric GAN for Speech Enhancement

    Authors: Ruizhe Cao, Sherif Abdulatif, Bin Yang

    Abstract: Recently, convolution-augmented transformer (Conformer) has achieved promising performance in automatic speech recognition (ASR) and time-domain speech enhancement (SE), as it can capture both local and global dependencies in the speech signal. In this paper, we propose a conformer-based metric generative adversarial network (CMGAN) for SE in the time-frequency (TF) domain. In the generator, we ut… ▽ More

    Submitted 3 March, 2024; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: 5 pages, 1 figure, 2 tables, published in INTERSPEECH 2022

    Journal ref: Proceedings of INTERSPEECH, 2022, pp. 936--940

  50. arXiv:2203.00270  [pdf, other

    cs.GT eess.SP

    Bidirectional Pricing and Demand Response for Nanogrids with HVAC Systems

    Authors: Jiaxin Cao, Bo Yang, Shanying Zhu, Kai Ma, ** Guan

    Abstract: Owing to the fluctuant renewable generation and power demand, the energy surplus or deficit in each nanogrid is embodied differently across time. To stimulate local renewable energy consumption and minimize the long-term energy cost, some issues still remain to be explored: when and how the energy demand and bidirectional trading prices are scheduled considering personal comfort preferences and en… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.