Skip to main content

Showing 1–39 of 39 results for author: Tang, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.09663  [pdf

    eess.SP

    Design and Implementation of mmWave Surface Wave Enabled Fluid Antennas and Experimental Results for Fluid Antenna Multiple Access

    Authors: Yuanjun Shen, Boyi Tang, Shuai Gao, Kin-Fai Tong, Hang Wong, Kai-Kit Wong, Yangyang Zhang

    Abstract: While multiple-input multiple-output (MIMO) technologies continue to advance, concerns arise as to how MIMO can remain scalable if more users are to be accommodated with an increasing number of antennas at the base station (BS) in the upcoming sixth generation (6G). Recently, the concept of fluid antenna system (FAS) has emerged, which promotes position flexibility to enable transmitter channel st… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Transactions on Antennas and Propagation

  2. arXiv:2404.09192  [pdf, other

    cs.SD cs.AI eess.AS

    Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling

    Authors: Quanxiu Wang, Hui Huang, Mingjie Wang, Yong Dai, **zuomu Zhong, Benlai Tang

    Abstract: Over the past decade, a series of unflagging efforts have been dedicated to develo** highly expressive and controllable text-to-speech (TTS) systems. In general, the holistic TTS comprises two interconnected components: the frontend module and the backend module. The frontend excels in capturing linguistic representations from the raw text input, while the backend module converts linguistic cues… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  3. arXiv:2403.05834  [pdf, other

    cs.MM cs.SD eess.AS

    Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information

    Authors: Qiaochu Huang, Xu He, Boshi Tang, Haolin Zhuang, Liyang Chen, Shuochen Gao, Zhiyong Wu, Haozhi Huang, Helen Meng

    Abstract: Dance generation, as a branch of human motion generation, has attracted increasing attention. Recently, a few works attempt to enhance dance expressiveness, which includes genre matching, beat alignment, and dance dynamics, from certain aspects. However, the enhancement is quite limited as they lack comprehensive consideration of the aforementioned three factors. In this paper, we propose Expressi… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  4. arXiv:2402.05569  [pdf, other

    cs.LG cs.AI eess.SP stat.ML

    Simplifying Hypergraph Neural Networks

    Authors: Bohan Tang, Zexi Liu, Keyue Jiang, Siheng Chen, Xiaowen Dong

    Abstract: Hypergraphs are crucial for modeling higher-order interactions in real-world data. Hypergraph neural networks (HNNs) effectively utilise these structures by message passing to generate informative node features for various downstream tasks like node classification. However, the message passing block in existing HNNs typically requires a computationally intensive training process, which limits thei… ▽ More

    Submitted 22 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  5. arXiv:2402.00565  [pdf, other

    eess.SY

    A Review of Carsickness Mitigation: Navigating Challenges and Exploiting Opportunities in the Era of Intelligent Vehicles

    Authors: Daofei Li, Tingzhe Yu, Binbin Tang

    Abstract: Motion sickness (MS) has long been a common complaint in road transportation. However, in the era of driving automation, MS has become an increasingly significant issue. The future intelligent vehicle is envisioned as a mobile space for work or entertainment, but unfortunately passengers' engagement in non-driving tasks may exacerbate MS. Finding effective MS countermeasures is crucial to ensure a… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 19 pages, 5 figures, 5 tables

  6. arXiv:2401.07532  [pdf, other

    cs.SD cs.AI eess.AS

    Multi-view MidiVAE: Fusing Track- and Bar-view Representations for Long Multi-track Symbolic Music Generation

    Authors: Zhiwei Lin, Jun Chen, Boshi Tang, Binzhu Sha, **g Yang, Yaolong Ju, Fan Fan, Shiyin Kang, Zhiyong Wu, Helen Meng

    Abstract: Variational Autoencoders (VAEs) constitute a crucial component of neural symbolic music generation, among which some works have yielded outstanding results and attracted considerable attention. Nevertheless, previous VAEs still encounter issues with overly long feature sequences and generated results lack contextual coherence, thus the challenge of modeling long multi-track symbolic music still re… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  7. arXiv:2312.09778  [pdf, other

    cs.LG eess.SP

    Hypergraph-MLP: Learning on Hypergraphs without Message Passing

    Authors: Bohan Tang, Siheng Chen, Xiaowen Dong

    Abstract: Hypergraphs are vital in modelling data with higher-order relations containing more than two entities, gaining prominence in machine learning and signal processing. Many hypergraph neural networks leverage message passing over hypergraph structures to enhance node representation learning, yielding impressive performances in tasks like hypergraph node classification. However, these message-passing-… ▽ More

    Submitted 2 June, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  8. arXiv:2311.13787  [pdf, other

    eess.SP

    A Fast Power Spectrum Sensing Solution for Generalized Coprime Sampling

    Authors: Kaili Jiang, Dechang Wang, Kailun Tian, Hancong Feng, Yuxin Zhao, Junyu Yuan, Bin Tang

    Abstract: The growing scarcity of spectrum resources, wideband spectrum sensing is required to process a prohibitive volume of data at a high sampling rate. For some applications, spectrum estimation only requires second-order statistics. In this case, a fast power spectrum sensing solution is proposed based on the generalized coprime sampling. By exploring the sensing vector inherent structure, the autocor… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  9. arXiv:2310.12733  [pdf, other

    eess.IV cs.CV

    Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression

    Authors: Yiming Wang, Qian Huang, Bin Tang, Huashan Sun, Xing Li

    Abstract: Recently, learned video compression has achieved exciting performance. Following the traditional hybrid prediction coding framework, most learned methods generally adopt the motion estimation motion compensation (MEMC) method to remove inter-frame redundancy. However, inaccurate motion vector (MV) usually lead to the distortion of reconstructed frame. In addition, most approaches ignore the spatia… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 12pages,12 figures

  10. arXiv:2309.05423  [pdf, other

    eess.AS cs.AI cs.CL cs.SD

    Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP

    Authors: **zuomu Zhong, Yang Li, Hui Huang, Korin Richmond, Jie Liu, Zhiba Su, **g Guo, Benlai Tang, Fengjie Zhu

    Abstract: In expressive and controllable Text-to-Speech (TTS), explicit prosodic features significantly improve the naturalness and controllability of synthesised speech. However, manual prosody annotation is labor-intensive and inconsistent. To address this issue, a two-stage automatic annotation pipeline is novelly proposed in this paper. In the first stage, we use contrastive pretraining of Speech-Silenc… ▽ More

    Submitted 11 June, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  11. arXiv:2308.14172  [pdf, other

    cs.LG cs.AI cs.SI eess.SP stat.ML

    Hypergraph Structure Inference From Data Under Smoothness Prior

    Authors: Bohan Tang, Siheng Chen, Xiaowen Dong

    Abstract: Hypergraphs are important for processing data with higher-order relationships involving more than two entities. In scenarios where explicit hypergraphs are not readily available, it is desirable to infer a meaningful hypergraph structure from the node features to capture the intrinsic relations within the data. However, existing methods either adopt simple pre-defined rules that fail to precisely… ▽ More

    Submitted 31 August, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

  12. arXiv:2308.07079  [pdf

    eess.SP

    Wideband Spectrum Acquisition for UAV Swarm Using the Sparse Coding Fourier Transform

    Authors: Kaili Jiang, Kailun Tian, Hancong Feng, Junyu Yuan, Bin Tang

    Abstract: As the trend towards small, safe, smart, speedy and swarm development grows, unmanned aerial vehicles (UAVs) are becoming increasingly popular for a wide range of applications. In this letter, the challenge of wideband spectrum acquisition for the UAV swarms is studied by proposing a processing method that features lower power consumption, higher compression rates, and a lower signal-to-noise rati… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  13. arXiv:2308.07077  [pdf

    eess.SP

    Distributed UAV Swarm Augmented Wideband Spectrum Sensing Using Nyquist Folding Receiver

    Authors: Kaili Jiang, Kailun Tian, Hancong Feng, Yuxin Zhao, Dechang Wang, Sen Cao, Jian Gao, Xuying Zhang, Yanfei Li, Junyu Yuan, Ying Xiong, Bin Tang

    Abstract: Distributed unmanned aerial vehicle (UAV) swarms are formed by multiple UAVs with increased portability, higher levels of sensing capabilities, and more powerful autonomy. These features make them attractive for many recent applica-tions, potentially increasing the shortage of spectrum resources. In this paper, wideband spectrum sensing augmented technology is discussed for distributed UAV swarms… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  14. arXiv:2308.07075  [pdf, other

    eess.SP

    Wideband Power Spectrum Sensing: a Fast Practical Solution for Nyquist Folding Receiver

    Authors: Kaili Jiang, Dechang Wang, Kailun Tian, Hancong Feng, Yuxin Zhao, Sen Cao, Jian Gao, Xuying Zhang, Yanfei Li, Junyu Yuan, Ying Xiong, Bin Tang

    Abstract: The limited availability of spectrum resources has been growing into a critical problem in wireless communications, remote sensing, and electronic surveillance, etc. To address the high-speed sampling bottleneck of wideband spectrum sensing, a fast and practical solution of power spectrum estimation for Nyquist folding receiver (NYFR) is proposed in this paper. The NYFR architectures is can theore… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  15. arXiv:2307.05386  [pdf, other

    eess.SP physics.optics

    Exploring the Potential of Integrated Optical Sensing and Communication (IOSAC) Systems with Si Waveguides for Future Networks

    Authors: Xiangpeng Ou, Ying Qiu, Ming Luo, Fujun Sun, Peng Zhang, Gang Yang, Junjie Li, Jianfeng Gao, Xiaobin He, Anyan Du, Bo Tang, Bin Li, Zichen Liu, Zhihua Li, Ling Xie, Xi Xiao, Jun Luo, Wenwu Wang, ** Tao, Yan Yang

    Abstract: Advanced silicon photonic technologies enable integrated optical sensing and communication (IOSAC) in real time for the emerging application requirements of simultaneous sensing and communication for next-generation networks. Here, we propose and demonstrate the IOSAC system on the silicon nitride (SiN) photonics platform. The IOSAC devices based on microring resonators are capable of monitoring t… ▽ More

    Submitted 27 June, 2023; originally announced July 2023.

    Comments: 11pages, 5 figutres

  16. arXiv:2306.15212  [pdf, other

    cs.SD cs.LG eess.AS

    TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection

    Authors: Jie Liu, Zhiba Su, Hui Huang, Caiyan Wan, Quanxiu Wang, Jiangli Hong, Benlai Tang, Fengjie Zhu

    Abstract: Thanks to recent advancements in end-to-end speech modeling technology, it has become increasingly feasible to imitate and clone a user`s voice. This leads to a significant challenge in differentiating between authentic and fabricated audio segments. To address the issue of user voice abuse and misuse, the second Audio Deepfake Detection Challenge (ADD 2023) aims to detect and analyze deepfake spe… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  17. Multi-Spectrally Constrained Low-PAPR Waveform Optimization for MIMO Radar Space-Time Adaptive Processing

    Authors: Da Li, Bo Tang, Lei Xue

    Abstract: This paper focuses on the joint design of transmit waveforms and receive filters for airborne multiple-input-multiple-output (MIMO) radar systems in spectrally crowded environments. The purpose is to maximize the output signal-to-interference-plus-noise-ratio (SINR) in the presence of signal-dependent clutter. To improve the practicability of the radar waveforms, both a multi-spectral constraint a… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Journal ref: 2023 IEEE Transactions on Aerospace and Electronic Systems

  18. arXiv:2304.02409  [pdf, other

    cs.IT eess.SP

    Relative Entropy-Based Waveform Optimization for Rician Target Detection with Dual-Function Radar Communication Systems

    Authors: Xuyang Wang, Bo Tang, Wenjun Wu, Da Li

    Abstract: In this paper, we consider waveform design for dualfunction radar-communication systems based on multiple-inputmultiple-out arrays. To achieve better Rician target detection performance, we use the relative entropy associated with the formulated detection problem as the design metric. We also impose a multiuser interference energy constraint on the waveforms to ensure the achievable sum-rate of th… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  19. arXiv:2304.01822   

    eess.SP eess.SY

    Co-Design for Spectral Coexistence between RIS-aided MIMO Radar and MIMO Communication Systems

    Authors: Da Li, Bo Tang, Xuyang Wang, Wenjun Wu, Lei Xue

    Abstract: Reconfigurable intelligent surface (RIS) refers to a signal reflection surface containing a large number of low-cost passive reflecting elements. RIS can improve the performance of radar and communication systems by dynamically modulating the wireless channels. In this paper, we consider the co-design for improving the co-existence between multiple-input-multiple-output (MIMO) radar and MIMO commu… ▽ More

    Submitted 14 June, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: The paper has undergone significant rewriting and is currently being revised

  20. Constant-Modulus Waveform Design for Dual-Function Radar-Communication Systems in the Presence of Clutter

    Authors: Wenjun Wu, Bo Tang, Xuyang Wang

    Abstract: We investigate the constant-modulus (CM) waveform design for dual-function radar communication systems in the presence of clutter.To minimize the interference power and enhance the target acquisition performance, we use the signal-to-interference-plus-noise-ratio as the design metric.In addition, to ensure the quality of the service for each communication user, we enforce a constraint on the synth… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  21. arXiv:2211.03979  [pdf, other

    eess.SY

    AI Testing Framework for Next-G O-RAN Networks: Requirements, Design, and Research Opportunities

    Authors: Bo Tang, Vijay K. Shah, Vuk Marojevic, Jeffrey H. Reed

    Abstract: Openness and intelligence are two enabling features to be introduced in next generation wireless networks, e.g. Beyond 5G and 6G, to support service heterogeneity, open hardware, optimal resource utilization, and on-demand service deployment. The open radio access network (O-RAN) is a promising RAN architecture to achieve both openness and intelligence through virtualized network elements and well… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: To be published in IEEE Wireless Communications Magazine

  22. arXiv:2211.01717  [pdf, other

    cs.LG cs.SI eess.SP stat.ML

    Learning Hypergraphs From Signals With Dual Smoothness Prior

    Authors: Bohan Tang, Siheng Chen, Xiaowen Dong

    Abstract: Hypergraph structure learning, which aims to learn the hypergraph structures from the observed signals to capture the intrinsic high-order relationships among the entities, becomes crucial when a hypergraph topology is not readily available in the datasets. There are two challenges that lie at the heart of this problem: 1) how to handle the huge search space of potential hyperedges, and 2) how to… ▽ More

    Submitted 14 March, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  23. arXiv:2210.06973  [pdf, other

    eess.SP

    Contrastive Psudo-supervised Classification for Intra-Pulse Modulation of Radar Emitter Signals Using data augmentation

    Authors: HanCong Feng, XinHai Yan, KaiLi Jiang, XinYu Zhao, Bin Tang

    Abstract: The automatic classification of radar waveform is a fundamental technique in electronic countermeasures (ECM).Recent supervised deep learning-based methods have achieved great success in a such classification task.However, those methods require enough labeled samples to work properly and in many circumstances, it is not available.To tackle this problem, in this paper, we propose a three-stages dee… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

  24. MIMO Multifunction RF Systems: Detection Performance and Waveform Design

    Authors: Bo Tang, Petre Stoica

    Abstract: This paper studies the detection performance of a multiple-input-multiple-output (MIMO) multifunction radio frequency (MFRF) system, which simultaneously supports radar, communication, and jamming. We show that the detection performance of the MIMO MFRF system improves as the transmit signal-to-interference-plus-noise-ratio (SINR) increases. To analyze the achievable SINR of the system, we formula… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  25. arXiv:2208.04398  [pdf, other

    eess.SP

    Waveform Design for Mutual Interference Mitigation in Automotive Radar

    Authors: Arindam Bose, Bo Tang, Wenjie Huang, Mojtaba Soltanalian, Jian Li

    Abstract: The mutual interference between similar radar systems can result in reduced radar sensitivity and increased false alarm rates. To address the synchronous and asynchronous interference mitigation problems in similar radar systems, we first propose herein two slow-time coding schemes to modulate the pulses within a coherent processing interval (CPI) for a single-input-single-output (SISO) scenario.… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

  26. arXiv:2204.04408  [pdf, other

    eess.SP

    A Probabilistic Model-Based Robust Waveform Design for MIMO Radar Detection

    Authors: Xuyang Wang, Bo Tang, Ming Zhang

    Abstract: This paper addresses robust waveform design for multiple-input-multiple-output (MIMO) radar detection. A probabilistic model is proposed to describe the target uncertainty. Considering that waveform design based on maximizing the probability of detection is intractable, the relative entropy between the distributions of the observations under two hypotheses (viz., the target is present/absent) is e… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

  27. arXiv:2204.04332  [pdf, other

    cs.IT eess.SP

    Fundamental Limits on Detection With a Dual-function Radar Communication System

    Authors: Bo Tang, Zhongrui Huang, Lilong Qin, Hai Wang

    Abstract: This paper investigates the fundamental limits on the target detection performance with a dual-function multiple-input-multiple-output (MIMO) radar communication (RadCom) systems. By assuming the presence of a point-like target and a communication receiver, closed-form expressions for the maximum detection probability and the transmit waveforms achieving the optimal performance are derived. Result… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

  28. arXiv:2110.04754  [pdf, other

    cs.SD cs.CL eess.AS

    Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding

    Authors: Chao Wang, Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Yibiao Yu, Zejun Ma

    Abstract: Recently, phonetic posteriorgrams (PPGs) based methods have been quite popular in non-parallel singing voice conversion systems. However, due to the lack of acoustic information in PPGs, style and naturalness of the converted singing voices are still limited. To solve these problems, in this paper, we utilize an acoustic reference encoder to implicitly model singing characteristics. We experiment… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

  29. Constrained Radar Waveform Design for Range Profiling

    Authors: Bo Tang, Jun Liu, Hai Wang, Yihua Hu

    Abstract: Range profiling refers to the measurement of target response along the radar slant range. It plays an important role in automatic target recognition. In this paper, we consider the design of transmit waveform to improve the range profiling performance of radar systems. Two design metrics are adopted for the waveform optimization problem: one is to maximize the mutual information between the receiv… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  30. arXiv:2101.07770  [pdf

    cond-mat.mtrl-sci eess.IV

    Develo** and Evaluating Deep Neural Network-based Denoising for Nanoparticle TEM Images with Ultra-low Signal-to-Noise

    Authors: Joshua L. Vincent, Ramon Manzorro, Sreyas Mohan, Binh Tang, Dev Y. Sheth, Eero P. Simoncelli, David S. Matteson, Carlos Fernandez-Granda, Peter A. Crozier

    Abstract: A deep convolutional neural network has been developed to denoise atomic-resolution TEM image datasets of nanoparticles acquired using direct electron counting detectors, for applications where the image signal is severely limited by shot noise. The network was applied to a model system of CeO2-supported Pt nanoparticles. We leverage multislice image simulations to generate a large and flexible da… ▽ More

    Submitted 17 March, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

  31. arXiv:2010.14804  [pdf, other

    cs.SD cs.CL eess.AS

    PPG-based singing voice conversion with adversarial representation learning

    Authors: Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Ling Xu, Chen Shen, Zejun Ma

    Abstract: Singing voice conversion (SVC) aims to convert the voice of one singer to that of other singers while kee** the singing content and melody. On top of recent voice conversion works, we propose a novel model to steadily convert songs while kee** their naturalness and intonation. We build an end-to-end architecture, taking phonetic posteriorgrams (PPGs) as inputs and generating mel spectrograms.… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  32. arXiv:2010.12970  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Denoising For Scientific Discovery: A Case Study In Electron Microscopy

    Authors: Sreyas Mohan, Ramon Manzorro, Joshua L. Vincent, Binh Tang, Dev Yashpal Sheth, Eero P. Simoncelli, David S. Matteson, Peter A. Crozier, Carlos Fernandez-Granda

    Abstract: Denoising is a fundamental challenge in scientific imaging. Deep convolutional neural networks (CNNs) provide the current state of the art in denoising natural images, where they produce impressive results. However, their potential has barely been explored in the context of scientific imaging. Denoising CNNs are typically trained on real natural images artificially corrupted with simulated noise.… ▽ More

    Submitted 13 July, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: The dataset and the code used to train and evaluate and our models are available at https://sreyas-mohan.github.io/electron-microscopy-denoising/

  33. arXiv:2009.03289  [pdf

    eess.SP cs.AI

    Data-Driven Transferred Energy Management Strategy for Hybrid Electric Vehicles via Deep Reinforcement Learning

    Authors: Hao Chen, Gang Guo, Bangbei Tang, Guo Hu, Xiaolin Tang, Teng Liu

    Abstract: Real-time applications of energy management strategies (EMSs) in hybrid electric vehicles (HEVs) are the harshest requirements for researchers and engineers. Inspired by the excellent problem-solving capabilities of deep reinforcement learning (DRL), this paper proposes a real-time EMS via incorporating the DRL method and transfer learning (TL). The related EMSs are derived from and evaluated on t… ▽ More

    Submitted 12 December, 2022; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: 28 pages, 14 figures

  34. arXiv:2007.08690  [pdf

    eess.SP cs.AI cs.LG

    Transfer Deep Reinforcement Learning-enabled Energy Management Strategy for Hybrid Tracked Vehicle

    Authors: Xiaowei Guo, Teng Liu, Bangbei Tang, Xiaolin Tang, **wei Zhang, Wenhao Tan, Shufeng **

    Abstract: This paper proposes an adaptive energy management strategy for hybrid electric vehicles by combining deep reinforcement learning (DRL) and transfer learning (TL). This work aims to address the defect of DRL in tedious training time. First, an optimization control modeling of a hybrid tracked vehicle is built, wherein the elaborate powertrain components are introduced. Then, a bi-level control fram… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: 11 pages, 11 figures

  35. arXiv:2005.09271  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech

    Authors: Wenjie Li, Benlai Tang, Xiang Yin, Yushi Zhao, Wei Li, Kang Wang, Hao Huang, Yuxuan Wang, Zejun Ma

    Abstract: Accent conversion (AC) transforms a non-native speaker's accent into a native accent while maintaining the speaker's voice timbre. In this paper, we propose approaches to improving accent conversion applicability, as well as quality. First of all, we assume no reference speech is available at the conversion stage, and hence we employ an end-to-end text-to-speech system that is trained on native sp… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  36. arXiv:2004.11012  [pdf, other

    eess.AS cs.SD

    ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders

    Authors: Yu Gu, Xiang Yin, Yonghui Rao, Yuan Wan, Benlai Tang, Yang Zhang, Jitong Chen, Yuxuan Wang, Zejun Ma

    Abstract: This paper presents ByteSing, a Chinese singing voice synthesis (SVS) system based on duration allocated Tacotron-like acoustic models and WaveRNN neural vocoders. Different from the conventional SVS models, the proposed ByteSing employs Tacotron-like encoder-decoder structures as the acoustic models, in which the CBHG models and recurrent neural networks (RNNs) are explored as encoders and decode… ▽ More

    Submitted 24 January, 2021; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: Accepted by ISCSLP2021

  37. Polyphase Waveform Design for MIMO Radar Space Time Adaptive Processing

    Authors: Bo Tang, Jonathan Tuck, Peter Stoica

    Abstract: We consider the design of polyphase waveforms for ground moving target detection with airborne multiple-input-multiple-output (MIMO) radar. Due to the constant-modulus and finite-alphabet constraint on the waveforms, the associated design problem is non-convex and in general NP-hard. To tackle this problem, we develop an efficient algorithm based on relaxation and cyclic optimization. Moreover, we… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

  38. arXiv:1809.01046  [pdf, other

    stat.CO eess.SP stat.ML

    Group-Representative Functional Network Estimation from Multi-Subject fMRI Data via MRF-based Image Segmentation

    Authors: Aditi Iyer, Bing**g Tang, Vinayak Rao, Nan Kong

    Abstract: We propose a novel two-phase approach to functional network estimation of multi-subject functional Magnetic Resonance Imaging (fMRI) data, which applies model-based image segmentation to determine a group-representative connectivity map. In our approach, we first improve clustering-based Independent Component Analysis (ICA) to generate maps of components occurring consistently across subjects, and… ▽ More

    Submitted 29 August, 2018; originally announced September 2018.

  39. arXiv:1607.06015  [pdf, other

    eess.SY

    Detection of False Data Injection Attacks in Smart Grid under Colored Gaussian Noise

    Authors: Bo Tang, Jun Yan, Steven Kay, Haibo He

    Abstract: In this paper, we consider the problems of state estimation and false data injection detection in smart grid when the measurements are corrupted by colored Gaussian noise. By modeling the noise with the autoregressive process, we estimate the state of the power transmission networks and develop a generalized likelihood ratio test (GLRT) detector for the detection of false data injection attacks. W… ▽ More

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: 8 pages, 4 figures in IEEE Conference on Communications and Network Security (CNS) 2016