Showing 1–24 of 24 results for author: Xiong, W

  1. arXiv:2401.15619  [pdf, ps, other]

    eess.SP

    A semidefinite programming approach for robust elliptic localization

    Authors: Wenxin Xiong, Jiajun He, Zhang-Lei Shi, Keyuan Hu, Hing Cheung So, Chi-Sing Leung

    Abstract: This short communication addresses the problem of elliptic localization with outlier measurements, whose occurrences are prevalent in various location-enabled applications and can significantly compromise the positioning performance if not adequately handled. In contrast to the reliance on $M$-estimation adopted in the majority of existing solutions, we take a different path, specifically explorin…

    Submitted 28 January, 2024; originally announced January 2024.

  2. arXiv:2401.02831  [pdf, other]

    cs.CV eess.IV

    Two-stage Progressive Residual Dense Attention Network for Image Denoising

    Authors: Wencong Wu, An Ge, Guannan Lv, Yuelong Xia, Yungang Zhang, Wen Xiong

    Abstract: Deep convolutional neural networks (CNNs) for image denoising can effectively exploit rich hierarchical features and have achieved great success. However, many deep CNN-based denoising models equally utilize the hierarchical features of noisy images without paying attention to the more important and useful features, leading to relatively low performance. To address the issue, we design a new Two-s…

    Submitted 5 January, 2024; originally announced January 2024.

  3. arXiv:2312.00553  [pdf]

    cs.HC eess.SP

    A Spatio-Temporal Graph Convolutional Network for Gesture Recognition from High-Density Electromyography

    Authors: Wenjuan Zhong, Yuyang Zhang, Peiwen Fu, Wenxuan Xiong, Mingming Zhang

    Abstract: Accurate hand gesture prediction is crucial for effective upper-limb prosthetic limbs control. As the high flexibility and multiple degrees of freedom exhibited by human hands, there has been a growing interest in integrating deep networks with high-density surface electromyography (HD-sEMG) grids to enhance gesture recognition capabilities. However, many existing methods fall short in fully explo…

    Submitted 1 December, 2023; originally announced December 2023.

  4. arXiv:2311.14316  [pdf, other]

    eess.SP cs.AI

    Windformer:Bi-Directional Long-Distance Spatio-Temporal Network For Wind Speed Prediction

    Authors: Xuewei Li, Zewen Shang, Zhiqiang Liu, Jian Yu, Wei Xiong, Mei Yu

    Abstract: Wind speed prediction is critical to the management of wind power generation. Due to the large range of wind speed fluctuations and wake effect, there may also be strong correlations between long-distance wind turbines. This difficult-to-extract feature has become a bottleneck for improving accuracy. History and future time information includes the trend of airflow changes, whether this dynamic in…

    Submitted 24 November, 2023; originally announced November 2023.

  5. arXiv:2307.11795  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG

    Prompting Large Language Models with Speech Recognition Abilities

    Authors: Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer

    Abstract: Large language models have proven themselves highly flexible, able to solve a wide range of generative tasks, such as abstractive summarization and open-ended question answering. In this paper we extend the capabilities of LLMs by directly attaching a small audio encoder allowing it to perform speech recognition. By directly prepending a sequence of audial embeddings to the text token embeddings,…

    Submitted 21 July, 2023; originally announced July 2023.

  6. Robust time-of-arrival localization via ADMM

    Authors: Wenxin Xiong, Christian Schindelhauer, Hing Cheung So

    Abstract: This article considers the problem of source localization (SL) using possibly unreliable time-of-arrival (TOA) based range measurements. Adopting the strategy of statistical robustification, we formulate TOA SL as minimization of a versatile loss that possesses resistance against the occurrence of outliers. We then present an alternating direction method of multipliers (ADMM) to tackle the nonconv…

    Submitted 17 January, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Should you have any questions regarding this contribution, please don't hesitate to reach out to me via email at [email protected]

  7. arXiv:2305.12498  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Multi-Head State Space Model for Speech Recognition

    Authors: Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales

    Abstract: State space models (SSMs) have recently shown promising results on small-scale sequence and language modelling tasks, rivalling and outperforming many attention-based approaches. In this paper, we propose a multi-head state space (MH-SSM) architecture equipped with special gating mechanisms, where parallel heads are taught to learn local and global temporal dynamics on sequence data. As a drop-in…

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Interspeech 2023

  8. arXiv:2305.02441  [pdf, other]

    stat.ML cs.IT cs.LG eess.SP

    Reward Teaching for Federated Multi-armed Bandits

    Authors: Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

    Abstract: Most of the existing federated multi-armed bandits (FMAB) designs are based on the presumption that clients will implement the specified design to collaborate with the server. In reality, however, it may not be possible to modify the clients' existing protocols. To address this challenge, this work focuses on clients who always maximize their individual cumulative rewards, and introduces a novel i…

    Submitted 20 November, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Signal Processing

  9. Globally Optimized TDOA High Frequency Source Localization Based on Quasi-Parabolic Ionosphere Modeling and Collaborative Gradient Projection

    Authors: Wenxin Xiong, Christian Schindelhauer, Hing Cheung So

    Abstract: We investigate the problem of high frequency (HF) source localization using the time-difference-of-arrival (TDOA) observations of ionosphere-refracted radio rays based on quasi-parabolic (QP) modeling. An unresolved but pertinent issue in such a field is that the existing gradient-type scheme can easily get trapped in local optima for practical use. This will lead to the difficulty in initializing…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: This is the accepted version. The final version of this paper has been published in the IEEE Transactions on Aerospace and Electronic Systems. The copyright is with IEEE. This version prevails, as there are unfortunately uncorrected editing mistakes in the final one

    Journal ref: in IEEE TAES, vol. 59, no. 1, pp. 580-590, Feb. 2023

  10. arXiv:2212.02084  [pdf, other]

    cs.SD eess.AS

    End-to-end Recording Device Identification Based on Deep Representation Learning

    Authors: Chunyan Zeng, Dongliang Zhu, Zhifeng Wang, Minghu Wu, Wei Xiong, Nan Zhao

    Abstract: Deep learning techniques have achieved specific results in recording device source identification. The recording device source features include spatial information and certain temporal information. However, most recording device source identification methods based on deep learning only use spatial representation learning from recording device source features, which cannot make full use of recordin…

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: 20 pages, 5 figures, recording device identification

  11. arXiv:2205.05675  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, et al. (86 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of e…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

  12. arXiv:2203.10645  [pdf, other]

    eess.IV cs.CV

    Breast Cancer Induced Bone Osteolysis Prediction Using Temporal Variational Auto-Encoders

    Authors: Wei Xiong, Neil Yeung, Shubo Wang, Haofu Liao, Liyun Wang, Jiebo Luo

    Abstract: Objective and Impact Statement. We adopt a deep learning model for bone osteolysis prediction on computed tomography (CT) images of murine breast cancer bone metastases. Given the bone CT scans at previous time steps, the model incorporates the bone-cancer interactions learned from the sequential images and generates future CT images. Its ability of predicting the development of bone lesions in ca…

    Submitted 28 March, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: 18 pages

  13. Deep Instance Segmentation with Automotive Radar Detection Points

    Authors: Jianan Liu, Weiyi Xiong, Liping Bai, Yuxuan Xia, Tao Huang, Wanli Ouyang, Bing Zhu

    Abstract: Automotive radar provides reliable environmental perception in all-weather conditions with affordable cost, but it hardly supplies semantic and geometry information due to the sparsity of radar detection points. With the development of automotive radar technologies in recent years, instance segmentation becomes possible by using automotive radar. Its data contain contexts such as radar cross secti…

    Submitted 5 February, 2023; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: 11 pages, 9 figures, 3 tables, accepted by IEEE Transactions on Intelligent Vehicles

  14. arXiv:2107.10071  [pdf, other]

    eess.SP

    Two Efficient and Easy-to-Use NLOS Mitigation Solutions to Indoor 3-D AOA-Based Localization

    Authors: Wenxin Xiong, Joan Bordoy, Andrea Gabbrielli, Georg Fischer, Dominik Jan Schott, Fabian Hoeflinger, Johannes Wendeberg, Christian Schindelhauer, Stefan Johann Rupitsch

    Abstract: This paper proposes two efficient and easy-to-use error mitigation solutions to the problem of three-dimensional (3-D) angle-of-arrival (AOA) source localization in the mixed line-of-sight (LOS) and non-line-of-sight (NLOS) indoor environments. A weighted linear least squares estimator is derived first for the LOS AOA components in terms of the direction vectors of arrival, albeit in a sub-optimal…

    Submitted 13 August, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: This paper has been accepted for oral presentation at 2021 International Conference on Indoor Positioning and Indoor Navigation (IPIN), 29 Nov. -- 2 Dec. 2021, Lloret de Mar, Spain

  15. arXiv:2009.06281  [pdf, other]

    eess.SP

    Neurodynamic TDOA localization with NLOS mitigation via maximum correntropy criterion

    Authors: Wenxin Xiong, Christian Schindelhauer, Hing Cheung So, Junli Liang, Zhi Wang

    Abstract: In this paper, we exploit the maximum correntropy criterion (MCC) to robustify the traditional time-difference-of-arrival (TDOA) location estimator in the presence of non-line-of-sight (NLOS) propagation conditions. For the sake of statistical efficiency, the correntropy-based robust loss is imposed on the underlying time-of-arrival composition via joint estimation of the source position and onset…

    Submitted 9 November, 2021; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: Submitted to DSP

  16. Maximum correntropy criterion for robust TOA-based localization in NLOS environments

    Authors: Wenxin Xiong, Christian Schindelhauer, Hing Cheung So, Zhi Wang

    Abstract: We investigate the problem of time-of-arrival (TOA) based localization under possible non-line-of-sight (NLOS) propagation conditions. To robustify the squared-range-based location estimator, we follow the maximum correntropy criterion, essentially the Welsch $M$-estimator with a redescending influence function which behaves like $\ell_0$-minimization towards the grossly biased measurements, to de…

    Submitted 10 September, 2021; v1 submitted 13 September, 2020; originally announced September 2020.

    Comments: Published in CSSP

  17. arXiv:2005.04149  [pdf, other]

    eess.SP cs.LG cs.NI

    LinksIQ: Robust and Efficient Modulation Recognition with Imperfect Spectrum Scans

    Authors: Wei Xiong, Karyn Doke, Petko Bogdanov, Mariya Zheleva

    Abstract: While critical for the practical progress of spectrum sharing, modulation recognition has so far been investigated under unrealistic assumptions: (i) a transmitter's bandwidth must be scanned alone and in full, (ii) prior knowledge of the technology must be available and (iii) a transmitter must be trustworthy. In reality these assumptions cannot be readily met, as a transmitter's bandwidth may on…

    Submitted 7 May, 2020; originally announced May 2020.

  18. arXiv:2005.02818  [pdf, other]

    eess.IV cs.CV

    Unsupervised Low-light Image Enhancement with Decoupled Networks

    Authors: Wei Xiong, Ding Liu, Xiaohui Shen, Chen Fang, Jiebo Luo

    Abstract: In this paper, we tackle the problem of enhancing real-world low-light images with significant noise in an unsupervised fashion. Conventional unsupervised learning-based approaches usually tackle the low-light image enhancement problem using an image-to-image translation model. They focus primarily on illumination or contrast enhancement but fail to suppress the noise that ubiquitously exists in i…

    Submitted 28 March, 2022; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 12 pages

  19. TDOA-based localization with NLOS mitigation via robust model transformation and neurodynamic optimization

    Authors: Wenxin Xiong, Christian Schindelhauer, Hing Cheung So, Joan Bordoy, Andrea Gabbrielli, Junli Liang

    Abstract: This paper revisits the problem of locating a signal-emitting source from time-difference-of-arrival (TDOA) measurements under non-line-of-sight (NLOS) propagation. Many currently fashionable methods for NLOS mitigation in TDOA-based localization tend to solve their optimization problems by means of convex relaxation and, thus, are computationally inefficient. Besides, previous studies show that m…

    Submitted 20 August, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: This paper has been accepted for publication by Signal Processing

    Journal ref: Signal Process. Vol. 178, 107774, Jan 2021

  20. arXiv:1912.04979  [pdf, other]

    eess.AS cs.CL cs.CV cs.SD eess.IV

    Advances in Online Audio-Visual Meeting Transcription

    Authors: Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, et al. (1 additional author not shown)

    Abstract: This paper describes a system that generates speaker-annotated transcripts of meetings by using a microphone array and a 360-degree camera. The hallmark of the system is its ability to handle overlapped speech, which has been an unsolved problem in realistic settings for over a decade. We show that this problem can be addressed by using a continuous speech separation approach. In addition, we desc…

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: To appear in Proc. IEEE ASRU Workshop 2019

  21. arXiv:1812.04967   

    eess.SP cs.CR

    Security and Privacy Issues for Connected Vehicles

    Authors: Wenjun Xiong, Robert Lagerström

    Abstract: Modern vehicles contain more than a hundred Electronic Control Units (ECUs) that communicate over different in-vehicle networks, and they are often connected to the Internet, which makes them vulnerable to various cyber-attacks. Besides, data collected by the connected vehicles is directly connected to the vehicular network. Thus, big vehicular data are collected, which are valuable and generate i…

    Submitted 18 December, 2018; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: There is a crucial mistake with the code, and the model is far from completion. With the agreement of all of the author, we decide to withdraw this paper

  22. An adaptive software defined radio design based on a standard space telecommunication radio system API

    Authors: Wenhao Xiong, Xin Tian, Genshe Chen, Khanh Pham, Erik Blasch

    Abstract: Software defined radio (SDR) has become a popular tool for the implementation and testing for communications performance. The advantage of the SDR approach includes: a re-configurable design, adaptive response to changing conditions, efficient development, and highly versatile implementation. In order to understand the benefits of SDR, the space telecommunication radio system (STRS) was proposed b…

    Submitted 25 November, 2017; originally announced November 2017.

  23. arXiv:1610.05256  [pdf, other]

    cs.CL eess.AS

    Achieving Human Parity in Conversational Speech Recognition

    Authors: W. Xiong, J. Droppo, X. Huang, F. Seide, M. Seltzer, A. Stolcke, D. Yu, G. Zweig

    Abstract: Conversational speech recognition has served as a flagship speech recognition task since the release of the Switchboard corpus in the 1990s. In this paper, we measure the human error rate on the widely used NIST 2000 test set, and find that our latest automated system has reached human parity. The error rate of professional transcribers is 5.9% for the Switchboard portion of the data, in which new…

    Submitted 17 February, 2017; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: Revised for publication, updated results

    Report number: MSR-TR-2016-71, revised Feb. 2017

  24. The Microsoft 2016 Conversational Speech Recognition System

    Authors: W. Xiong, J. Droppo, X. Huang, F. Seide, M. Seltzer, A. Stolcke, D. Yu, G. Zweig

    Abstract: We describe Microsoft's conversational speech recognition system, in which we combine recent developments in neural-network-based acoustic and language modeling to advance the state of the art on the Switchboard recognition task. Inspired by machine learning ensemble techniques, the system uses a range of convolutional and recurrent neural networks. I-vector modeling and lattice-free MMI training…

    Submitted 25 January, 2017; v1 submitted 12 September, 2016; originally announced September 2016.

    Journal ref: Proc. IEEE ICASSP, March 2017, pp. 5255-5259