Skip to main content

Showing 1–50 of 75 results for author: Jiang, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.17286  [pdf

    cs.RO eess.SY

    Prioritized experience replay-based DDQN for Unmanned Vehicle Path Planning

    Authors: Liu Lipeng, Letian Xu, Jiabei Liu, Haopeng Zhao, Tongzhou Jiang, Tianyao Zheng

    Abstract: Path planning module is a key module for autonomous vehicle navigation, which directly affects its operating efficiency and safety. In complex environments with many obstacles, traditional planning algorithms often cannot meet the needs of intelligence, which may lead to problems such as dead zones in unmanned vehicles. This paper proposes a path planning algorithm based on DDQN and combines it wi… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 4 pages, 6 figures, 2024 5th International Conference on Information Science, Parallel and Distributed Systems

  2. arXiv:2406.00690  [pdf, other

    eess.SP

    Electromagnetic Wave Property Inspired Radio Environment Knowledge Construction and AI-based Verification for 6G Digital Twin Channel

    Authors: Jialin Wang, Jianhua Zhang, Yutong Sun, Yuxiang Zhang, Tao Jiang, Liang Xia

    Abstract: As the underlying foundation of a digital twin network (DTN), a digital twin channel (DTC) can accurately depict the process of radio propagation in the air interface to support the DTN-based 6G wireless network. Since radio propagation is affected by the environment, constructing the relationship between the environment and radio wave propagation is the key to improving the accuracy of DTC, and t… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  3. arXiv:2405.03129  [pdf, other

    eess.SP cs.IT

    Active Sensing for Multiuser Beam Tracking with Reconfigurable Intelligent Surface

    Authors: Han Han, Tao Jiang, Wei Yu

    Abstract: This paper studies a beam tracking problem in which an access point (AP), in collaboration with a reconfigurable intelligent surface (RIS), dynamically adjusts its downlink beamformers and the reflection pattern at the RIS in order to maintain reliable communications with multiple mobile user equipments (UEs). Specifically, the mobile UEs send uplink pilots to the AP periodically during the channe… ▽ More

    Submitted 31 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  4. arXiv:2404.13277  [pdf, other

    eess.IV cs.CV

    Beyond Score Changes: Adversarial Attack on No-Reference Image Quality Assessment from Two Perspectives

    Authors: Chenxi Yang, Yujia Liu, Dingquan Li, Yan Zhong, Tingting Jiang

    Abstract: Deep neural networks have demonstrated impressive success in No-Reference Image Quality Assessment (NR-IQA). However, recent researches highlight the vulnerability of NR-IQA models to subtle adversarial perturbations, leading to inconsistencies between model predictions and subjective ratings. Current adversarial attacks, however, focus on perturbing predicted scores of individual images, neglecti… ▽ More

    Submitted 24 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: Submitted to a conference

  5. arXiv:2403.11397  [pdf, other

    cs.CV eess.IV

    Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization

    Authors: Yujia Liu, Chenxi Yang, Dingquan Li, Jianhao Ding, Tingting Jiang

    Abstract: The task of No-Reference Image Quality Assessment (NR-IQA) is to estimate the quality score of an input image without additional information. NR-IQA models play a crucial role in the media industry, aiding in performance evaluation and optimization guidance. However, these models are found to be vulnerable to adversarial attacks, which introduce imperceptible perturbations to input images, resulti… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: accepted by CVPR 2024

  6. arXiv:2403.09004  [pdf, ps, other

    cs.IT eess.SP

    Meta-Learning-Based Fronthaul Compression for Cloud Radio Access Networks

    Authors: Ruihua Qiao, Tao Jiang, Wei Yu

    Abstract: This paper investigates the fronthaul compression problem in a user-centric cloud radio access network, in which single-antenna users are served by a central processor (CP) cooperatively via a cluster of remote radio heads (RRHs). To satisfy the fronthaul capacity constraint, this paper proposes a transform-compress-forward scheme, which consists of well-designed transformation matrices and unifor… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 15 Pages, 13 Figures; accepted in IEEE Transactions on Wireless Communications

  7. arXiv:2403.00134  [pdf, other

    cs.IT eess.SP

    Active Sensing for Reciprocal MIMO Channels

    Authors: Tao Jiang, Wei Yu

    Abstract: This paper addresses the design of transmit precoder and receive combiner matrices to support $N_{\rm s}$ independent data streams over a time-division duplex (TDD) point-to-point massive multiple-input multiple-output (MIMO) channel with either a fully digital or a hybrid structure. The optimal precoder and combiner design amounts to finding the top-$N_{\rm s}$ singular vectors of the channel mat… ▽ More

    Submitted 6 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: This paper is accepted in IEEE Transactions on Signal Processing

  8. arXiv:2402.16153  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    ChatMusician: Understanding and Generating Music Intrinsically with LLM

    Authors: Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, **gcheng Wu, Chenghua Lin, Qifeng Liu , et al. (10 additional authors not shown)

    Abstract: While Large Language Models (LLMs) demonstrate impressive capabilities in text generation, we find that their ability has yet to be generalized to music, humanity's creative language. We introduce ChatMusician, an open-source LLM that integrates intrinsic musical abilities. It is based on continual pre-training and finetuning LLaMA2 on a text-compatible music representation, ABC notation, and the… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: GitHub: https://shanghaicannon.github.io/ChatMusician/

  9. arXiv:2402.11164  [pdf

    eess.IV

    TinyLIC-High efficiency lossy image compression method

    Authors: Gaocheng Ma, Yinfeng Chai, Tianhao Jiang, Ming Lu, Tong Chen

    Abstract: Image compression has been the subject of extensive research for several decades, resulting in the development of well-known standards such as JPEG, JPEG2000, and H.264/AVC. However, recent advancements in deep learning have led to the emergence of learned image compression methods that offer significant improvements in coding efficiency compared to traditional codecs. These learned compression te… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  10. Localization of Dummy Data Injection Attacks in Power Systems Considering Incomplete Topological Information: A Spatio-Temporal Graph Wavelet Convolutional Neural Network Approach

    Authors: Zhaoyang Qu, Yunchang Dong, Yang Li, Siqi Song, Tao Jiang, Min Li, Qiming Wang, Lei Wang, Xiaoyong Bo, Jiye Zang, Qi Xu

    Abstract: The emergence of novel the dummy data injection attack (DDIA) poses a severe threat to the secure and stable operation of power systems. These attacks are particularly perilous due to the minimal Euclidean spatial separation between the injected malicious data and legitimate data, rendering their precise detection challenging using conventional distance-based methods. Furthermore, existing researc… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted by Applied Energy

    Journal ref: Applied Energy 360 (2024) 122736

  11. arXiv:2401.13276  [pdf, other

    eess.AS

    SCNet: Sparse Compression Network for Music Source Separation

    Authors: Weinan Tong, Jiaxu Zhu, Jun Chen, Shiyin Kang, Tao Jiang, Yang Li, Zhiyong Wu, Helen Meng

    Abstract: Deep learning-based methods have made significant achievements in music source separation. However, obtaining good results while maintaining a low model complexity remains challenging in super wide-band music source separation. Previous works either overlook the differences in subbands or inadequately address the problem of information loss when generating subband features. In this paper, we propo… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  12. arXiv:2401.05217  [pdf, other

    cs.CV eess.IV

    Exploring Vulnerabilities of No-Reference Image Quality Assessment Models: A Query-Based Black-Box Method

    Authors: Chenxi Yang, Yujia Liu, Dingquan Li, Tingting Jiang

    Abstract: No-Reference Image Quality Assessment (NR-IQA) aims to predict image quality scores consistent with human perception without relying on pristine reference images, serving as a crucial component in various visual tasks. Ensuring the robustness of NR-IQA methods is vital for reliable comparisons of different image processing techniques and consistent user experiences in recommendations. The attack m… ▽ More

    Submitted 25 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  13. arXiv:2312.09002  [pdf, other

    cs.IT cs.LG eess.SP

    Localization with Reconfigurable Intelligent Surface: An Active Sensing Approach

    Authors: Zhongze Zhang, Tao Jiang, Wei Yu

    Abstract: This paper addresses an uplink localization problem in which a base station (BS) aims to locate a remote user with the help of reconfigurable intelligent surfaces (RISs). We propose a strategy in which the user transmits pilots sequentially and the BS adaptively adjusts the sensing vectors, including the BS beamforming vector and multiple RIS reflection coefficients based on the observations alrea… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted in IEEE Transactions on Wireless Communications. This is an extended version of the previous arXiv paper arXiv:2310.13160

  14. arXiv:2311.12273  [pdf, other

    cs.NI eess.SY

    How AI-driven Digital Twins Can Empower Mobile Networks

    Authors: Tong Li, Fenyu Jiang, Qiaohong Yu, Wenzhen Huang, Tao Jiang, Depeng **

    Abstract: The growing complexity of next-generation networks exacerbates the modeling and algorithmic flaws of conventional network optimization methodology. In this paper, we propose a mobile network digital twin (MNDT) architecture for 6G networks. To address the modeling and algorithmic shortcomings, the MNDT uses a simulation-optimization structure. The feedback from the network simulation engine, which… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  15. arXiv:2310.16765  [pdf, other

    eess.SP

    How to Extend 3D GBSM to Integrated Sensing and Communication Channel with Sharing Feature?

    Authors: Yameng Liu, Jianhua Zhang, Yuxiang Zhang, Huiwen Gong, Tao Jiang, Guangyi Liu

    Abstract: Integrated Sensing and Communication (ISAC) is a promising technology in 6G systems. The existing 3D Geometry-Based Stochastic Model (GBSM), as standardized for 5G systems, addresses solely communication channels and lacks consideration of the integration with sensing channel. Therefore, this letter extends 3D GBSM to support ISAC research, with a particular focus on capturing the sharing feature… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  16. arXiv:2310.13160  [pdf, other

    eess.SP cs.IT

    Active Sensing for Localization with Reconfigurable Intelligent Surface

    Authors: Zhongze Zhang, Tao Jiang, Wei Yu

    Abstract: This paper addresses an uplink localization problem in which the base station (BS) aims to locate a remote user with the aid of reconfigurable intelligent surface (RIS). This paper proposes a strategy in which the user transmits pilots over multiple time frames, and the BS adaptively adjusts the RIS reflection coefficients based on the observations already received so far in order to produce an ac… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted in IEEE International Conference on Communications (ICC) 2023

  17. arXiv:2310.11044  [pdf, ps, other

    cs.IT eess.SP

    A Tutorial on Near-Field XL-MIMO Communications Towards 6G

    Authors: Haiquan Lu, Yong Zeng, Changsheng You, Yu Han, Jiayi Zhang, Zhe Wang, Zhenjun Dong, Shi **, Cheng-Xiang Wang, Tao Jiang, Xiaohu You, Rui Zhang

    Abstract: Extremely large-scale multiple-input multiple-output (XL-MIMO) is a promising technology for the sixth-generation (6G) mobile communication networks. By significantly boosting the antenna number or size to at least an order of magnitude beyond current massive MIMO systems, XL-MIMO is expected to unprecedentedly enhance the spectral efficiency and spatial resolution for wireless communication. The… ▽ More

    Submitted 3 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 42 pages

  18. arXiv:2309.11977  [pdf, other

    cs.SD eess.AS

    Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts

    Authors: Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu, Xixin Wu, Shiyin Kang, Tao Jiang, Yahui Zhou, Yuxing Han, Helen Meng

    Abstract: Zero-shot text-to-speech (TTS) synthesis aims to clone any unseen speaker's voice without adaptation parameters. By quantizing speech waveform into discrete acoustic tokens and modeling these tokens with the language model, recent language model-based TTS models show zero-shot speaker adaptation capabilities with only a 3-second acoustic prompt of an unseen speaker. However, they are limited by th… ▽ More

    Submitted 9 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted bt ICASSP 2024

  19. arXiv:2308.13575  [pdf

    eess.SP cs.AI physics.optics

    FrFT based estimation of linear and nonlinear impairments using Vision Transformer

    Authors: Ting Jiang, Zheng Gao, Yizhao Chen, Zihe Hu, Ming Tang

    Abstract: To comprehensively assess optical fiber communication system conditions, it is essential to implement joint estimation of the following four critical impairments: nonlinear signal-to-noise ratio (SNRNL), optical signal-to-noise ratio (OSNR), chromatic dispersion (CD) and differential group delay (DGD). However, current studies only achieve identifying a limited number of impairments within a narro… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 15 pages, 10 figures

  20. arXiv:2307.04455  [pdf, other

    cs.CV eess.IV

    SAM-IQA: Can Segment Anything Boost Image Quality Assessment?

    Authors: Xinpeng Li, Ting Jiang, Haoqiang Fan, Shuaicheng Liu

    Abstract: Image Quality Assessment (IQA) is a challenging task that requires training on massive datasets to achieve accurate predictions. However, due to the lack of IQA data, deep learning-based IQA methods typically rely on pre-trained networks trained on massive datasets as feature extractors to enhance their generalization ability, such as the ResNet network trained on ImageNet. In this paper, we utili… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  21. arXiv:2306.08337  [pdf, other

    eess.SY cs.NI

    Carbon emissions and sustainability of launching 5G mobile networks in China

    Authors: Tong Li, Li Yu, Yibo Ma, Tong Duan, Wenzhen Huang, Yan Zhou, Depeng **, Yong Li, Tao Jiang

    Abstract: Since 2021, China has deployed more than 2.1 million 5G base stations to increase the network capacity and provide ubiquitous digital connectivity for mobile terminals. However, the launch of 5G networks also exacerbates the misalignment between cellular traffic and energy consumption, which reduces carbon efficiency - the amount of network traffic that can be delivered for each unit of carbon emi… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  22. arXiv:2305.14022  [pdf, other

    cs.CV eess.IV

    Realistic Noise Synthesis with Diffusion Models

    Authors: Qi Wu, Mingyan Han, Ting Jiang, Haoqiang Fan, Bing Zeng, Shuaicheng Liu

    Abstract: Deep image denoising models often rely on large amount of training data for the high quality performance. However, it is challenging to obtain sufficient amount of data under real-world scenarios for the supervised training. As such, synthesizing realistic noise becomes an important solution. However, existing techniques have limitations in modeling complex noise distributions, resulting in residu… ▽ More

    Submitted 3 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  23. arXiv:2305.07130  [pdf, other

    cs.IT eess.SP

    Active Sensing for Two-Sided Beam Alignment and Reflection Design Using **-Pong Pilots

    Authors: Tao Jiang, Foad Sohrabi, Wei Yu

    Abstract: Beam alignment is an important task for millimeter-wave (mmWave) communication, because constructing aligned narrow beams both at the transmitter (Tx) and the receiver (Rx) is crucial in terms of compensating the significant path loss in very high-frequency bands. However, beam alignment is also a highly nontrivial task because large antenna arrays typically have a limited number of radio-frequenc… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: This paper is accepted in IEEE Journal on Selected Areas in Information Theory

  24. arXiv:2305.05899  [pdf, other

    cs.CV cs.MM eess.IV

    Mobile Image Restoration via Prior Quantization

    Authors: Shiqi Chen, **wen Zhou, Menghao Li, Yueting Chen, Tingting Jiang

    Abstract: In digital images, the performance of optical aberration is a multivariate degradation, where the spectral of the scene, the lens imperfections, and the field of view together contribute to the results. Besides eliminating it at the hardware level, the post-processing system, which utilizes various prior information, is significant for correction. However, due to the content differences among prio… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: Submitted to Elsevier PRL. 5 pages, 5figures

  25. arXiv:2304.07018  [pdf, other

    cs.CV cs.LG eess.IV

    DIPNet: Efficiency Distillation and Iterative Pruning for Image Super-Resolution

    Authors: Lei Yu, Xinpeng Li, Youwei Li, Ting Jiang, Qi Wu, Haoqiang Fan, Shuaicheng Liu

    Abstract: Efficient deep learning-based approaches have achieved remarkable performance in single image super-resolution. However, recent studies on efficient super-resolution have mainly focused on reducing the number of parameters and floating-point operations through various network designs. Although these methods can decrease the number of parameters and floating-point operations, they may not necessari… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  26. arXiv:2303.17959  [pdf, other

    cs.CV eess.IV

    Diffusion Action Segmentation

    Authors: Daochang Liu, Qiyue Li, AnhDung Dinh, Tingting Jiang, Mubarak Shah, Chang Xu

    Abstract: Temporal action segmentation is crucial for understanding long-form videos. Previous works on this task commonly adopt an iterative refinement paradigm by using multi-stage models. We propose a novel framework via denoising diffusion models, which nonetheless shares the same inherent spirit of such iterative refinement. In this framework, action predictions are iteratively generated from random no… ▽ More

    Submitted 11 August, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  27. arXiv:2211.07201  [pdf, other

    eess.AS cs.SD

    Towards A Unified Conformer Structure: from ASR to ASV Task

    Authors: Dexin Liao, Tao Jiang, Feng Wang, Lin Li, Qingyang Hong

    Abstract: Transformer has achieved extraordinary performance in Natural Language Processing and Computer Vision tasks thanks to its powerful self-attention mechanism, and its variant Conformer has become a state-of-the-art architecture in the field of Automatic Speech Recognition (ASR). However, the main-stream architecture for Automatic Speaker Verification (ASV) is convolutional Neural Networks, and there… ▽ More

    Submitted 15 January, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  28. arXiv:2210.02596  [pdf, other

    cs.IT eess.SP

    Role of Deep Learning in Wireless Communications

    Authors: Wei Yu, Foad Sohrabi, Tao Jiang

    Abstract: Traditional communication system design has always been based on the paradigm of first establishing a mathematical model of the communication channel, then designing and optimizing the system according to the model. The advent of modern machine learning techniques, specifically deep neural networks, has opened up opportunities for data-driven system design and optimization. This article draws exam… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: 13 pages, 12 figures, To appear in IEEE BITS the Information Theory Magazine

  29. arXiv:2205.12633  [pdf, other

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, ** Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang , et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  30. Learning Based User Scheduling in Reconfigurable Intelligent Surface Assisted Multiuser Downlink

    Authors: Zhongze Zhang, Tao Jiang, Wei Yu

    Abstract: Reconfigurable intelligent surface (RIS) is capable of intelligently manipulating the phases of the incident electromagnetic wave to improve the wireless propagation environment between the base-station (BS) and the users. This paper addresses the joint user scheduling, RIS configuration, and BS beamforming problem in an RIS-assisted downlink network with limited pilot overhead. We show that graph… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted in IEEE Journal of Selected Topics in Signal Processing

  31. arXiv:2202.09020  [pdf, other

    cs.CV eess.IV q-bio.QM

    A Comprehensive Survey with Quantitative Comparison of Image Analysis Methods for Microorganism Biovolume Measurements

    Authors: Jiawei Zhang, Chen Li, Md Mamunur Rahaman, Yudong Yao, **li Ma, **ghua Zhang, Xin Zhao, Tao Jiang, Marcin Grzegorzek

    Abstract: With the acceleration of urbanization and living standards, microorganisms play increasingly important roles in industrial production, bio-technique, and food safety testing. Microorganism biovolume measurements are one of the essential parts of microbial analysis. However, traditional manual measurement methods are time-consuming and challenging to measure the characteristics precisely. With the… ▽ More

    Submitted 2 May, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  32. arXiv:2202.07820  [pdf, other

    eess.IV cs.CV

    A Survey of Semen Quality Evaluation in Microscopic Videos Using Computer Assisted Sperm Analysis

    Authors: Wenwei Zhao, **li Ma, Chen Li, Xiaoning Bu, Shuojia Zou, Tao Jiang, Marcin Grzegorzek

    Abstract: The Computer Assisted Sperm Analysis (CASA) plays a crucial role in male reproductive health diagnosis and Infertility treatment. With the development of the computer industry in recent years, a great of accurate algorithms are proposed. With the assistance of those novel algorithms, it is possible for CASA to achieve a faster and higher quality result. Since image processing is the technical basi… ▽ More

    Submitted 17 February, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  33. arXiv:2202.06948  [pdf

    cs.NE cs.HC cs.LG eess.SP

    Towards Best Practice of Interpreting Deep Learning Models for EEG-based Brain Computer Interfaces

    Authors: Jian Cui, Liqiang Yuan, Zhaoxiang Wang, Ruilin Li, Tianzi Jiang

    Abstract: As deep learning has achieved state-of-the-art performance for many tasks of EEG-based BCI, many efforts have been made in recent years trying to understand what have been learned by the models. This is commonly done by generating a heatmap indicating to which extent each pixel of the input contributes to the final classification for a trained model. Despite the wide use, it is not yet understood… ▽ More

    Submitted 17 April, 2023; v1 submitted 12 February, 2022; originally announced February 2022.

  34. arXiv:2202.06465  [pdf, other

    eess.IV cs.CV

    A State-of-the-art Survey of U-Net in Microscopic Image Analysis: from Simple Usage to Structure Mortification

    Authors: Jian Wu, Wanli Liu, Chen Li, Tao Jiang, Islam Mohammad Shariful, Hongzan Sun, Xiaoqi Li, Xintong Li, Xinyu Huang, Marcin Grzegorzek

    Abstract: Image analysis technology is used to solve the inadvertences of artificial traditional methods in disease, wastewater treatment, environmental change monitoring analysis and convolutional neural networks (CNN) play an important role in microscopic image analysis. An important step in detection, tracking, monitoring, feature extraction, modeling and analysis is image segmentation, in which U-Net ha… ▽ More

    Submitted 23 April, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

  35. arXiv:2112.13261  [pdf, other

    cs.IT eess.SP

    Interference Nulling Using Reconfigurable Intelligent Surface

    Authors: Tao Jiang, Wei Yu

    Abstract: This paper investigates the interference nulling capability of reconfigurable intelligent surface (RIS) in a multiuser environment where multiple single-antenna transceivers communicate simultaneously in a shared spectrum. From a theoretical perspective, we show that when the channels between the RIS and the transceivers have line-of-sight and the direct paths are blocked, it is possible to adjust… ▽ More

    Submitted 27 January, 2022; v1 submitted 25 December, 2021; originally announced December 2021.

    Comments: This paper is accepted in IEEE Journal on Selected Areas in Communications

  36. arXiv:2110.09121  [pdf, ps, other

    cs.SD eess.AS

    KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke

    Authors: Xiaobin Zhuang, Huiran Yu, Weifeng Zhao, Tao Jiang, Peng Hu

    Abstract: An automatic pitch correction system typically includes several stages, such as pitch extraction, deviation estimation, pitch shift processing, and cross-fade smoothing. However, designing these components with strategies often requires domain expertise and they are likely to fail on corner cases. In this paper, we present KaraTuner, an end-to-end neural architecture that predicts pitch curve and… ▽ More

    Submitted 26 June, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: To be published in Proc. Interspeech 2022, Incheon, South Korea

  37. arXiv:2107.11617  [pdf, other

    cs.CV eess.IV

    LAConv: Local Adaptive Convolution for Image Fusion

    Authors: Zi-Rong **, Liang-Jian Deng, Tai-Xiang Jiang, Tian-**g Zhang

    Abstract: The convolution operation is a powerful tool for feature extraction and plays a prominent role in the field of computer vision. However, when targeting the pixel-wise tasks like image fusion, it would not fully perceive the particularity of each pixel in the image if the uniform convolution kernel is used on different patches. In this paper, we propose a local adaptive convolution (LAConv), which… ▽ More

    Submitted 24 July, 2021; originally announced July 2021.

  38. arXiv:2106.12470  [pdf, ps, other

    eess.SY

    Bilateral Control of Teleoperators with Closed Architecture and Time-Varying Delay

    Authors: Hanlei Wang, Yipeng Li, Tiantian Jiang

    Abstract: This paper investigates bilateral control of teleoperators with closed architecture and subjected to arbitrary bounded time-varying delay. A prominent challenge for bilateral control of such teleoperators lies in the closed architecture, especially in the context not involving interaction force/torque measurement. This yields the long-standing situation that most bilateral control rigorously devel… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: This version is prepared with the consideration of the reviewers' and AE's comments

  39. Self-Supervised Nonlinear Transform-Based Tensor Nuclear Norm for Multi-Dimensional Image Recovery

    Authors: Yi-Si Luo, Xi-Le Zhao, Tai-Xiang Jiang, Yi Chang, Michael K. Ng, Chao Li

    Abstract: In this paper, we study multi-dimensional image recovery. Recently, transform-based tensor nuclear norm minimization methods are considered to capture low-rank tensor structures to recover third-order tensors in multi-dimensional image processing applications. The main characteristic of such methods is to perform the linear transform along the third mode of third-order tensors, and then compute te… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

  40. arXiv:2104.06243  [pdf, other

    eess.IV cs.CV cs.LG

    A State-of-the-art Survey of Artificial Neural Networks for Whole-slide Image Analysis:from Popular Convolutional Neural Networks to Potential Visual Transformers

    Authors: Xintong Li, Weiming Hu, Chen Li, Tao Jiang, Hongzan Sun, Xiaoyan Li, Xinyu Huang, Marcin Grzegorzek

    Abstract: To increase the objectivity and accuracy of pathologists' work, artificial neural network(ANN) methods have been generally needed in the segmentation, classification, and detection of histopathological WSI. In this paper, WSI analysis methods based on ANN are reviewed. Firstly, the development status of WSI and ANN methods is introduced. Secondly, we summarize the common ANN methods. Next, we disc… ▽ More

    Submitted 26 February, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: 22 pages, 38 figures. arXiv admin note: substantial text overlap with arXiv:2102.10553

  41. arXiv:2103.13625  [pdf, other

    eess.IV q-bio.QM

    A Comprehensive Review of Image Analysis Methods for Microorganism Counting: From Classical Image Processing to Deep Learning Approaches

    Authors: Jiawei Zhang, Chen Li, Md Mamunur Rahaman, Yudong Yao, **li Ma, **ghua Zhang, Xin Zhao, Tao Jiang, Marcin Grzegorzek

    Abstract: Microorganisms such as bacteria and fungi play essential roles in many application fields, like biotechnique, medical technique and industrial domain. Microorganism counting techniques are crucial in microorganism analysis, hel** biologists and related researchers quantitatively analyze the microorganisms and calculate their characteristics, such as biomass concentration and biological activity.… ▽ More

    Submitted 29 September, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  42. arXiv:2012.05720  [pdf, other

    eess.SP cs.HC

    Peer-to-Peer Localization for Single-Antenna Devices

    Authors: Xianan Zhang, Wei Wang, Xuedou Xiao, Hang Yang, Xinyu Zhang, Tao Jiang

    Abstract: Some important indoor localization applications, such as localizing a lost kid in a shop** mall, call for a new peer-to-peer localization technique that can localize an individual's smartphone or wearables by directly using another's on-body devices in unknown indoor environments. However, current localization solutions either require pre-deployed infrastructures or multiple antennas in both tra… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

  43. arXiv:2011.04263  [pdf, other

    cs.CV cs.MM eess.IV

    Unified Quality Assessment of In-the-Wild Videos with Mixed Datasets Training

    Authors: Dingquan Li, Tingting Jiang, Ming Jiang

    Abstract: Video quality assessment (VQA) is an important problem in computer vision. The videos in computer vision applications are usually captured in the wild. We focus on automatically assessing the quality of in-the-wild videos, which is a challenging problem due to the absence of reference videos, the complexity of distortions, and the diversity of video contents. Moreover, the video contents and disto… ▽ More

    Submitted 15 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: 20 pages, 12 figures, 7 tables, accepted by IJCV. This is the version provided to IJCV office

  44. arXiv:2009.14404  [pdf, other

    eess.SP cs.IT

    Learning to Reflect and to Beamform for Intelligent Reflecting Surface with Implicit Channel Estimation

    Authors: Tao Jiang, Hei Victor Cheng, Wei Yu

    Abstract: Intelligent reflecting surface (IRS), which consists of a large number of tunable reflective elements, is capable of enhancing the wireless propagation environment in a cellular network by intelligently reflecting the electromagnetic waves from the base-station (BS) toward the users. The optimal tuning of the phase shifters at the IRS is, however, a challenging problem, because due to the passive… ▽ More

    Submitted 8 June, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: To appear in IEEE Journal of Selected Areas in Communications

  45. A Review of Deep Reinforcement Learning for Smart Building Energy Management

    Authors: Liang Yu, Shuqi Qin, Meng Zhang, Chao Shen, Tao Jiang, Xiaohong Guan

    Abstract: Global buildings account for about 30% of the total energy consumption and carbon emission, raising severe energy and environmental concerns. Therefore, it is significant and urgent to develop novel smart building energy management (SBEM) technologies for the advance of energy-efficient and green buildings. However, it is a nontrivial task due to the following challenges. Firstly, it is generally… ▽ More

    Submitted 22 September, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: 21 pages, 12 figures

    Journal ref: IEEE Internet of Things Journal, vol. 8, no. 15, pp. 12046-12063, 2021

  46. arXiv:2008.03889  [pdf, other

    eess.IV cs.CV cs.MM

    Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment

    Authors: Dingquan Li, Tingting Jiang, Ming Jiang

    Abstract: Currently, most image quality assessment (IQA) models are supervised by the MAE or MSE loss with empirically slow convergence. It is well-known that normalization can facilitate fast convergence. Therefore, we explore normalization in the design of loss functions for IQA. Specifically, we first normalize the predicted quality scores and the corresponding subjective quality scores. Then, the loss i… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: Accepted by ACM MM 2020, + supplemental materials

  47. arXiv:2007.14137  [pdf, other

    cs.CV eess.IV math.NA

    Nonnegative Low Rank Tensor Approximation and its Application to Multi-dimensional Images

    Authors: Tai-Xiang Jiang, Michael K. Ng, Junjun Pan, Guang**g Song

    Abstract: The main aim of this paper is to develop a new algorithm for computing nonnegative low rank tensor approximation for nonnegative tensors that arise in many multi-dimensional imaging applications. Nonnegativity is one of the important property as each pixel value refers to nonzero light intensity in image data acquisition. Our approach is different from classical nonnegative tensor factorization (N… ▽ More

    Submitted 26 September, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

  48. arXiv:2006.14156  [pdf, ps, other

    eess.SY cs.LG

    Multi-Agent Deep Reinforcement Learning for HVAC Control in Commercial Buildings

    Authors: Liang Yu, Yi Sun, Zhanbo Xu, Chao Shen, Dong Yue, Tao Jiang, Xiaohong Guan

    Abstract: In commercial buildings, about 40%-50% of the total electricity consumption is attributed to Heating, Ventilation, and Air Conditioning (HVAC) systems, which places an economic burden on building operators. In this paper, we intend to minimize the energy cost of an HVAC system in a multi-zone commercial building under dynamic pricing with the consideration of random zone occupancy, thermal comfort… ▽ More

    Submitted 22 July, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 14 pages, 21 figures, accepted by IEEE Transactions on Smart Grid

  49. arXiv:2005.14400  [pdf, other

    eess.IV cs.CV

    Hyperspectral Image Super-resolution via Deep Spatio-spectral Convolutional Neural Networks

    Authors: **-Fan Hu, Ting-Zhu Huang, Liang-Jian Deng, Tai-Xiang Jiang, Gemine Vivone, Jocelyn Chanussot

    Abstract: Hyperspectral images are of crucial importance in order to better understand features of different materials. To reach this goal, they leverage on a high number of spectral bands. However, this interesting characteristic is often paid by a reduced spatial resolution compared with traditional multispectral image systems. In order to alleviate this issue, in this work, we propose a simple and effici… ▽ More

    Submitted 29 May, 2020; originally announced May 2020.

  50. arXiv:2005.13534  [pdf, ps, other

    eess.SP cs.RO

    Robot-assisted Backscatter Localization for IoT Applications

    Authors: Shengkai Zhang, Wei Wang, Sheyang Tang, Shi **, Tao Jiang

    Abstract: Recent years have witnessed the rapid proliferation of backscatter technologies that realize the ubiquitous and long-term connectivity to empower smart cities and smart homes. Localizing such backscatter tags is crucial for IoT-based smart applications. However, current backscatter localization systems require prior knowledge of the site, either a map or landmarks with known positions, which is la… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: To appear in IEEE Transactions on Wireless Communications. arXiv admin note: substantial text overlap with arXiv:1908.03297