Showing 1–50 of 58 results for author: Shen, J

  1. arXiv:2406.18993  [pdf, ps, other]

    eess.SP

    Interference Cancellation Based Neural Receiver for Superimposed Pilot in Multi-Layer Transmission

    Authors: Han Xiao, Wenqiang Tian, Shi Jin, Wendong Liu, Jia Shen, Zhihua Shi, Zhi Zhang

    Abstract: In this paper, an interference cancellation based neural receiver for superimposed pilot (SIP) in multi-layer transmission is proposed, where the data and pilot are non-orthogonally superimposed in the same time-frequency resource. Specifically, to deal with the intra-layer and inter-layer interference of SIP under multi-layer transmission, the interference cancellation with superimposed symbol ai…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2405.14411  [pdf, other]

    cs.AI eess.SY

    Large Language Models for Explainable Decisions in Dynamic Digital Twins

    Authors: Nan Zhang, Christian Vergara-Marcillo, Georgios Diamantopoulos, Jingran Shen, Nikos Tziritas, Rami Bahsoon, Georgios Theodoropoulos

    Abstract: Dynamic data-driven Digital Twins (DDTs) can enable informed decision-making and provide an optimisation platform for the underlying system. By leveraging principles of Dynamic Data-Driven Applications Systems (DDDAS), DDTs can formulate computational modalities for feedback loops, model updates and decision-making, including autonomous ones. However, understanding autonomous decision-making often…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 8 pages, 3 figures, under review

  3. arXiv:2404.17554  [pdf]

    cs.HC eess.SP eess.SY stat.AP

    A Novel Context driven Critical Integrative Levels (CIL) Approach: Advancing Human-Centric and Integrative Lighting Asset Management in Public Libraries with Practical Thresholds

    Authors: Jing Lin, Nina Mylly, Per Olof Hedekvist, Jingchun Shen

    Abstract: This paper proposes the context driven Critical Integrative Levels (CIL), a novel approach to lighting asset management in public libraries that aligns with the transformative vision of human-centric and integrative lighting. This approach encompasses not only the visual aspects of lighting performance but also prioritizes the physiological and psychological well-being of library users. Incorporat…

    Submitted 26 April, 2024; originally announced April 2024.

  4. arXiv:2404.15312  [pdf, other]

    eess.SP cs.CV

    Realtime Person Identification via Gait Analysis

    Authors: Shanmuga Venkatachalam, Harideep Nair, Prabhu Vellaisamy, Yongqi Zhou, Ziad Youssfi, John Paul Shen

    Abstract: Each person has a unique gait, i.e., walking style, that can be used as a biometric for personal identification. Recent works have demonstrated effective gait recognition using deep neural networks; however, most of these works focus on classification accuracy rather than model efficiency. In order to perform gait recognition using wearable devices on the edge, it is imperative to dev…

    Submitted 2 April, 2024; originally announced April 2024.

  5. arXiv:2403.08948  [pdf, ps, other]

    eess.SY cs.GT

    Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning

    Authors: Jiajun Shen, Fengjun Li, Morteza Hashemi, Huazhen Fang

    Abstract: In the swift evolution of Cyber-Physical Systems (CPSs) within intelligent environments, especially in the industrial domain shaped by Industry 4.0, the surge in development brings forth unprecedented security challenges. This paper explores the intricate security issues of Industrial CPSs (ICPSs), with a specific focus on the unique threats presented by intelligent attackers capable of directly c…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 8 pages

  6. arXiv:2402.02889  [pdf, other]

    cs.SD cs.CV cs.LG eess.AS

    Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding

    Authors: Yasar Abbas Ur Rehman, Kin Wai Lau, Yuyang Xie, Lan Ma, Jiajun Shen

    Abstract: The integration of Federated Learning (FL) and Self-supervised Learning (SSL) offers a unique and synergetic combination to exploit audio data for general-purpose audio understanding without compromising user data privacy. However, few efforts have been made to investigate SSL models in the FL regime for general-purpose audio understanding, especially when the training data is generated…

    Submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2402.02724  [pdf, other]

    eess.IV cs.CV cs.LG

    FDNet: Frequency Domain Denoising Network For Cell Segmentation in Astrocytes Derived From Induced Pluripotent Stem Cells

    Authors: Haoran Li, Jiahua Shi, Huaming Chen, Bo Du, Simon Maksour, Gabrielle Phillips, Mirella Dottori, Jun Shen

    Abstract: Artificially generated induced pluripotent stem cells (iPSCs) from somatic cells play an important role for disease modeling and drug screening of neurodegenerative diseases. Astrocytes differentiated from iPSCs are important targets to investigate neuronal metabolism. The astrocyte differentiation progress can be monitored through the variations of morphology observed from microscopy images at di…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by The IEEE International Symposium on Biomedical Imaging (ISBI) 2024

  8. arXiv:2310.15548  [pdf, ps, other]

    eess.SP

    Knowledge-driven Meta-learning for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Jiajia Guo, Zhi Zhang, Shi Jin, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output systems. Recently, deep learning (DL) has been introduced for CSI feedback enhancement through massive collected training data and lengthy training time, which is quite costly and impractical for realistic deployment. In this article, a knowledge-driven meta-learning a…

    Submitted 25 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.13475

  9. arXiv:2309.09423  [pdf, other]

    cs.RO eess.SY

    Two Degree of Freedom Adaptive Control for Hysteresis Compensation of Pneumatic Continuum Bending Actuator

    Authors: Junyi Shen, Tetsuro Miyazaki, Shingo Ohno, Maina Sogabe, Kenji Kawashima

    Abstract: Soft robotics, with their inherent flexibility and infinite degrees of freedom (DoF), offer promising advancements in human-machine interfaces. Particularly, pneumatic artificial muscles (PAMs) and pneumatic bending actuators have been fundamental in driving this evolution, capitalizing on their mimetic nature to natural muscle movements. However, with the versatility of these actuators comes the…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE Conference on Robotics and Automation (ICRA 2024), Under Review

  10. arXiv:2308.13849  [pdf, other]

    cs.LG cs.AI eess.SY

    Effectively Heterogeneous Federated Learning: A Pairing and Split Learning Based Approach

    Authors: Jinglong Shen, Xiucheng Wang, Nan Cheng, Longfei Ma, Conghao Zhou, Yuan Zhang

    Abstract: As a promising paradigm, federated learning (FL) is widely used in privacy-preserving machine learning, allowing distributed devices to collaboratively train a model while avoiding data transmission among clients. Despite its immense potential, FL suffers from bottlenecks in training speed due to client heterogeneity, leading to escalated training latency and straggling server aggregation.…

    Submitted 26 August, 2023; originally announced August 2023.

  11. Trajectory Tracking Control of Dual-PAM Soft Actuator with Hysteresis Compensator

    Authors: Junyi Shen, Tetsuro Miyazaki, Shingo Ohno, Maina Sogabe, Kenji Kawashima

    Abstract: Soft robotics is a swiftly evolving field. Pneumatic actuators are suitable for driving soft robots because of their superior performance. However, their control is challenging due to the hysteresis characteristics. In response to this challenge, we propose an adaptive control method to compensate for the hysteresis of soft actuators. Employing a novel dual pneumatic artificial muscle (PAM) bendin…

    Submitted 18 November, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: This paper has been published in the IEEE Robotics and Automation Letters, DOI 10.1109/LRA.2023.3334098; copyright has been transferred to the IEEE. The final version is available at IEEE Xplore

  12. arXiv:2308.04605  [pdf, other]

    eess.IV cs.CV cs.GR cs.LG

    PSRFlow: Probabilistic Super Resolution with Flow-Based Models for Scientific Data

    Authors: Jingyi Shen, Han-Wei Shen

    Abstract: Although many deep-learning-based super-resolution approaches have been proposed in recent years, because no ground truth is available in the inference stage, few can quantify the errors and uncertainties of the super-resolved results. For scientific visualization applications, however, conveying uncertainties of the results to scientists is crucial to avoid generating misleading or incorrect info…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: To be published in Proc. IEEE VIS 2023

  13. arXiv:2307.02002  [pdf, other]

    eess.SY

    Interpretable and Secure Trajectory Optimization for UAV-Assisted Communication

    Authors: Yunhao Quan, Nan Cheng, Xiucheng Wang, Jinglong Shen, Longfei Ma, Zhisheng Yin

    Abstract: Unmanned aerial vehicles (UAVs) have gained popularity due to their flexible mobility, on-demand deployment, and the ability to establish high probability line-of-sight wireless communication. As a result, UAVs have been extensively used as aerial base stations (ABSs) to supplement ground-based cellular networks for various applications. However, existing UAV-assisted communication schemes mainly…

    Submitted 4 July, 2023; originally announced July 2023.

  14. arXiv:2306.04970  [pdf, other]

    cs.RO eess.SY

    Motion Planning for Aerial Pick-and-Place based on Geometric Feasibility Constraints

    Authors: Huazi Cao, Jiahao Shen, Cunjia Liu, Bo Zhu, Shiyu Zhao

    Abstract: This paper studies the motion planning problem of the pick-and-place of an aerial manipulator that consists of a quadcopter flying base and a Delta arm. We propose a novel partially decoupled motion planning framework to solve this problem. Compared to the state-of-the-art approaches, the proposed one has two novel features. First, it does not suffer from increased computation in high-dimensional…

    Submitted 8 June, 2023; originally announced June 2023.

  15. arXiv:2305.10009  [pdf, other]

    eess.SP

    A Modular and High-Resolution Time-Frequency Post-Processing Technique

    Authors: Jinshun Shen, Deyun Wei

    Abstract: In this letter, based on the variational model, we propose a novel time-frequency post-processing technique to approximate the ideal time-frequency representation. Our method has the advantage of modularity, enabling "plug and play" use independent of the performance of the specific time-frequency analysis tool. Therefore, it can be easily generalized to the fractional Fourier domain and the linear canon…

    Submitted 17 May, 2023; originally announced May 2023.

  16. arXiv:2303.15161  [pdf, other]

    cs.SD eess.AS

    Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-k Selection Discriminator

    Authors: Yunhao Chen, Yunjie Zhu, Zihui Yan, Jianlu Shen, Zhen Ren, Yifan Huang

    Abstract: Despite consistent advancement in powerful deep learning techniques in recent years, large amounts of training data are still necessary for the models to avoid overfitting. Synthetic datasets using generative adversarial networks (GAN) have recently been generated to overcome this problem. Nevertheless, despite advancements, GAN-based methods are usually hard to train or fail to generate high-qual…

    Submitted 4 April, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  17. arXiv:2303.12693  [pdf, other]

    eess.SY cs.AI

    Resilient Output Containment Control of Heterogeneous Multiagent Systems Against Composite Attacks: A Digital Twin Approach

    Authors: Yukang Cui, Lingbo Cao, Michael V. Basin, Jun Shen, Tingwen Huang, Xin Gong

    Abstract: This paper studies the distributed resilient output containment control of heterogeneous multiagent systems against composite attacks, including denial-of-services (DoS) attacks, false-data injection (FDI) attacks, camouflage attacks, and actuation attacks. Inspired by digital twins, a twin layer (TL) with higher security and privacy is used to decouple the above problem into two tasks: defense pr…

    Submitted 22 March, 2023; originally announced March 2023.

  18. arXiv:2303.08856  [pdf, other]

    cs.LG eess.SY

    On the Benefits of Leveraging Structural Information in Planning Over the Learned Model

    Authors: Jiajun Shen, Kananart Kuwaranancharoen, Raid Ayoub, Pietro Mercati, Shreyas Sundaram

    Abstract: Model-based Reinforcement Learning (RL) integrates learning and planning and has received increasing attention in recent years. However, learning the model can incur a significant cost (in terms of sample complexity), due to the need to obtain a sufficient number of samples for each state-action pair. In this paper, we investigate the benefits of leveraging structural information about the system…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 9 pages, 5 figures

  19. arXiv:2301.13475  [pdf, ps, other]

    eess.SP

    A Knowledge-Driven Meta-Learning Method for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Zhi Zhang, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output (MIMO) systems. Recently, deep learning (DL) has been introduced to enhance CSI feedback in massive MIMO application, where the massive collected training data and lengthy training time are costly and impractical for realistic deployment. In this paper, a knowledge-dri…

    Submitted 31 January, 2023; originally announced January 2023.

  20. arXiv:2301.02243  [pdf, other]

    cs.LG eess.SP stat.AP

    Machine Fault Classification using Hamiltonian Neural Networks

    Authors: Jeremy Shen, Jawad Chowdhury, Sourav Banerjee, Gabriel Terejanu

    Abstract: A new approach is introduced to classify faults in rotating machinery based on the total energy signature estimated from sensor measurements. The overall goal is to go beyond using black-box models and incorporate additional physical constraints that govern the behavior of mechanical systems. Observational data is used to train Hamiltonian neural networks that describe the conserved energy of the…

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: ICPRAM 2023

  21. arXiv:2211.02940  [pdf, other]

    cs.SD cs.AI eess.AS

    Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block

    Authors: Yunhao Chen, Yunjie Zhu, Zihui Yan, Yifan Huang, Zhen Ren, Jianlu Shen, Lifang Chen

    Abstract: Recently, massive architectures based on Convolutional Neural Network (CNN) and self-attention mechanisms have become necessary for audio classification. While these techniques are state-of-the-art, these works' effectiveness can only be guaranteed with huge computational costs and parameters, large amounts of data augmentation, transfer from large datasets and some other tricks. By utilizing the…

    Submitted 30 May, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

  22. arXiv:2210.03402  [pdf]

    eess.SY cs.LG nlin.AO

    Research on Self-adaptive Online Vehicle Velocity Prediction Strategy Considering Traffic Information Fusion

    Authors: Ziyan Zhang, Junhao Shen, Dongwei Yao, Feng Wu

    Abstract: In order to increase the prediction accuracy of the online vehicle velocity prediction (VVP) strategy, a self-adaptive velocity prediction algorithm fused with traffic information was presented for the multiple scenarios. Initially, traffic scenarios were established inside the co-simulation environment. In addition, the algorithm of a general regressive neural network (GRNN) paired with datasets…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures

  23. arXiv:2209.05482  [pdf, ps, other]

    eess.SY

    Improved Fuzzy $H_{\infty}$ Filter Design Method for Nonlinear Systems with Time-Varing Delay

    Authors: Qianqian Ma, Li Li, Junhui Shen, Haowei Guan, Guangcheng Ma, Hongwei Xia

    Abstract: This paper investigates the fuzzy $H_{\infty}$ filter design issue for nonlinear systems with time-varying delay. In order to obtain a less conservative fuzzy $H_{\infty}$ filter design method, a novel integral inequality is employed to replace the conventional Leibniz-Newton formula to analyze the stability conditions of the filtering error system. Besides, the information of the membership function…

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: This paper was published in 2017 IEEE SMC. arXiv admin note: text overlap with arXiv:2209.04989

  24. arXiv:2208.13183  [pdf, other]

    cs.SD eess.AS

    Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks

    Authors: Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang, Yonghui Wu, Rob Clark

    Abstract: Transfer tasks in text-to-speech (TTS) synthesis, where one or more aspects of the speech of one set of speakers are transferred to another set of speakers that do not feature these aspects originally, remain challenging. One of the challenges is that models that have high-quality transfer capabilities can have issues in stability, making them impractical for user-facing critical tasks. T…

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: To be published in Interspeech 2022

  25. arXiv:2206.07949  [pdf, other]

    eess.SP

    AI Enlightens Wireless Communication: A Transformer Backbone for CSI Feedback

    Authors: Han Xiao, Zhiqin Wang, Dexin Li, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang

    Abstract: This paper is based on the background of the 2nd Wireless Communication Artificial Intelligence (AI) Competition (WAIC), which is hosted by IMT-2020(5G) Promotion Group 5G+AI Work Group, where the framework of the eigenvector-based channel state information (CSI) feedback problem is first provided. Then a basic Transformer backbone for CSI feedback, referred to as EVCsiNet-T, is proposed. Moreover, a s…

    Submitted 16 June, 2022; originally announced June 2022.

  26. arXiv:2205.08391  [pdf, other]

    cs.ET eess.SY

    A High-Voltage Characterisation Platform For Emerging Resistive Switching Technologies

    Authors: Jiawei Shen, Andrea Mifsud, Lijie Xie, Abdulaziz Alshaya, Christos Papavassiliou

    Abstract: Emerging memristor-based array architectures have been effectively employed in non-volatile memories and neuromorphic computing systems due to their density, scalability and capability of storing information. Nonetheless, to demonstrate a practical on-chip memristor-based system, it is essential to have the ability to apply large programming voltage ranges during the characterisation procedures fo…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 5 pages. To be published in ISCAS 2022 and made available on IEEE Xplore

  27. arXiv:2205.08381  [pdf, other]

    cs.ET eess.SY

    A Wide Dynamic Range Read-out System For Resistive Switching Technology

    Authors: Lijie Xie, Jiawei Shen, Andrea Mifsud, Chaohan Wang, Abdulaziz Alshaya, Christos Papavassiliou

    Abstract: The memristor, because of its controllability over a wide dynamic range of resistance, has emerged as a promising device for data storage and analog computation. A major challenge is the accurate measurement of memristance over a wide dynamic range. In this paper, a novel read-out circuit with feedback adjustment is proposed to measure and digitise input current in the range between 20nA and 2mA.…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 5 pages, To be published in ISCAS 2022 and made available on IEEE Xplore

  28. arXiv:2205.08379  [pdf, other]

    cs.ET eess.SY

    A CMOS-based Characterisation Platform for Emerging RRAM Technologies

    Authors: Andrea Mifsud, Jiawei Shen, Peilong Feng, Lijie Xie, Chaohan Wang, Yihan Pan, Sachin Maheshwari, Shady Agwa, Spyros Stathopoulos, Shiwei Wang, Alexander Serb, Christos Papavassiliou, Themis Prodromakis, Timothy G. Constandinou

    Abstract: Mass characterisation of emerging memory devices is an essential step in modelling their behaviour for integration within a standard design flow for existing integrated circuit designers. This work develops a novel characterisation platform for emerging resistive devices with a capacity of up to 1 million devices on-chip. Split into four independent sub-arrays, it contains on-chip column-parallel…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 5 pages. To be published in ISCAS 2022 and made available on IEEE Xplore

  29. arXiv:2203.04042  [pdf, other]

    eess.IV cs.CV

    Abandoning the Bayer-Filter to See in the Dark

    Authors: Xingbo Dong, Wanyan Xu, Zhihui Miao, Lan Ma, Chao Zhang, Jiewen Yang, Zhe Jin, Andrew Beng Jin Teoh, Jiajun Shen

    Abstract: Low-light image enhancement, a pervasive but challenging problem, plays a central role in enhancing the visibility of an image captured in a poor illumination environment. Because not all photons can pass the Bayer-Filter on the sensor of the color camera, in this work, we first present a De-Bayer-Filter simulator based on deep neural networks to generate a monochrome raw image from…

    Submitted 22 March, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

  30. arXiv:2201.01449  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia

    Authors: Jon Braatz, Pranav Rajpurkar, Stephanie Zhang, Andrew Y. Ng, Jeanne Shen

    Abstract: In recent years, deep learning has successfully been applied to automate a wide variety of tasks in diagnostic histopathology. However, fast and reliable localization of small-scale regions-of-interest (ROI) has remained a key challenge, as discriminative morphologic features often occupy only a small fraction of a gigapixel-scale whole-slide image (WSI). In this paper, we propose a sparse WSI ana…

    Submitted 4 January, 2022; originally announced January 2022.

  31. arXiv:2112.10107  [pdf, other]

    cs.AI cs.LG eess.SP

    Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control

    Authors: Liang Zhang, Qiang Wu, Jun Shen, Linyuan Lü, Bo Du, Jianqing Wu

    Abstract: Many studies confirmed that a proper traffic state representation is more important than complex algorithms for the classical traffic signal control (TSC) problem. In this paper, we (1) present a novel, flexible and efficient method, namely advanced max pressure (Advanced-MP), taking both running and queuing vehicles into consideration to decide whether to change current signal phase; (2) inventiv…

    Submitted 9 August, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: 10 pages, 5 figures

    ACM Class: J.4; J.6

  32. arXiv:2107.04174  [pdf, other]

    cs.SD cs.CV cs.LG eess.AS eess.SP

    EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments

    Authors: Jacob Donley, Vladimir Tourbabin, Jung-Suk Lee, Mark Broyles, Hao Jiang, Jie Shen, Maja Pantic, Vamsi Krishna Ithapu, Ravish Mehra

    Abstract: Augmented Reality (AR) as a platform has the potential to facilitate the reduction of the cocktail party effect. Future AR headsets could potentially leverage information from an array of sensors spanning many different modalities. Training and testing signal processing and machine learning algorithms on tasks such as beam-forming and speech enhancement require high quality representative data. To…

    Submitted 18 October, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: Dataset is available at: https://github.com/facebookresearch/EasyComDataset

  33. arXiv:2106.06759  [pdf, ps, other]

    eess.SP

    AI Enlightens Wireless Communication: Analyses, Solutions and Opportunities on CSI Feedback

    Authors: Han Xiao, Zhiqin Wang, Wenqiang Tian, Xiaofeng Liu, Wendong Liu, Shi Jin, Jia Shen, Zhi Zhang, Ning Yang

    Abstract: In this paper, we give a systematic description of the 1st Wireless Communication Artificial Intelligence (AI) Competition (WAIC) which is hosted by IMT-2020(5G) Promotion Group 5G+AI Work Group. Firstly, the framework of full channel state information (F-CSI) feedback problem and its corresponding channel dataset are provided. Then the enhancing schemes for DL-based F-CSI feedback including i) ch…

    Submitted 14 June, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

  34. arXiv:2105.07146  [pdf, other]

    eess.IV cs.CV

    GCN-MIF: Graph Convolutional Network with Multi-Information Fusion for Low-dose CT Denoising

    Authors: Kecheng Chen, Jiayu Sun, Jiang Shen, Jixiang Luo, Xinyu Zhang, Xuelin Pan, Dongsheng Wu, Yue Zhao, Miguel Bento, Yazhou Ren, Xiaorong Pu

    Abstract: Because it involves low-level radiation exposure and is less harmful to health, low-dose computed tomography (LDCT) has been widely adopted in the early screening of lung cancer and COVID-19. LDCT images inevitably suffer from the degradation problem caused by complex noises. It was reported that deep learning (DL)-based LDCT denoising methods using convolutional neural network (CNN) achieved impressive denoising…

    Submitted 16 April, 2022; v1 submitted 15 May, 2021; originally announced May 2021.

    Comments: Submitted to TMI; under review

  35. arXiv:2103.15060  [pdf, other]

    cs.CL cs.SD eess.AS

    PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS

    Authors: Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang, Yonghui Wu

    Abstract: This paper introduces PnG BERT, a new encoder model for neural TTS. This model is augmented from the original BERT model, by taking both phoneme and grapheme representations of text as input, as well as the word-level alignment between them. It can be pre-trained on a large text corpus in a self-supervised manner, and fine-tuned in a TTS task. Experimental results show that a neural TTS model usin…

    Submitted 7 June, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

    Comments: Accepted to Interspeech 2021

  36. arXiv:2103.14574  [pdf, other]

    cs.SD eess.AS

    Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

    Authors: Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, RJ Skerry-Ryan, Yonghui Wu

    Abstract: This paper introduces Parallel Tacotron 2, a non-autoregressive neural text-to-speech model with a fully differentiable duration model which does not require supervised duration signals. The duration model is based on a novel attention mechanism and an iterative reconstruction loss based on Soft Dynamic Time Warping; this model can learn token-frame alignments as well as token durations automatica…

    Submitted 29 August, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Submitted to INTERSPEECH 2021

  37. arXiv:2103.00345  [pdf, other]

    cs.RO cs.CR cs.LG eess.SY

    End-to-end Uncertainty-based Mitigation of Adversarial Attacks to Automated Lane Centering

    Authors: Ruochen Jiao, Hengyi Liang, Takami Sato, Junjie Shen, Qi Alfred Chen, Qi Zhu

    Abstract: In the development of advanced driver-assistance systems (ADAS) and autonomous vehicles, machine learning techniques that are based on deep neural networks (DNNs) have been widely used for vehicle perception. These techniques offer significant improvement in average perception accuracy over traditional methods, but have been shown to be susceptible to adversarial attacks, where small perturba…

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: 8 pages for conference

  38. arXiv:2102.01678  [pdf, other]

    eess.IV cs.CV cs.LG

    Learning domain-agnostic visual representation for computational pathology using medically-irrelevant style transfer augmentation

    Authors: Rikiya Yamashita, Jin Long, Snikitha Banda, Jeanne Shen, Daniel L. Rubin

    Abstract: Suboptimal generalization of machine learning models on unseen data is a key challenge which hampers the clinical applicability of such models to medical imaging. Although various methods such as domain adaptation and domain generalization have evolved to combat this challenge, learning robust and generalizable representations is core to medical image understanding, and continues to be a problem.…

    Submitted 3 June, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

  39. arXiv:2012.02776  [pdf, other]

    cs.CV cs.LG eess.IV

    Learning to Fuse Asymmetric Feature Maps in Siamese Trackers

    Authors: Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

    Abstract: Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, ha…

    Submitted 30 March, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: Accepted by CVPR2021

  40. arXiv:2010.11439  [pdf, other]

    cs.SD eess.AS

    Parallel Tacotron: Non-Autoregressive and Controllable TTS

    Authors: Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, Ron Weiss, Yonghui Wu

    Abstract: Although neural end-to-end text-to-speech models can synthesize highly natural speech, there is still room for improvements to its efficiency and naturalness. This paper proposes a non-autoregressive neural text-to-speech model augmented with a variational autoencoder-based residual encoder. This model, called \emph{Parallel Tacotron}, is highly parallelizable during both training and inference, a…

    Submitted 22 October, 2020; originally announced October 2020.

  41. arXiv:2010.02414  [pdf, other]

    eess.IV cs.CV

    ASDN: A Deep Convolutional Network for Arbitrary Scale Image Super-Resolution

    Authors: Jialiang Shen, Yucheng Wang, Jian Zhang

    Abstract: Deep convolutional neural networks have significantly improved the peak signal-to-noise ratio of Super-Resolution (SR). However, image viewer applications commonly allow users to zoom the images to arbitrary magnification scales, thus far imposing a large number of required training scales at a tremendous computational cost. To obtain a more computationally efficient model for arbitrary scale SR, t…

    Submitted 5 October, 2020; originally announced October 2020.

  42. arXiv:2008.08243  [pdf, other]

    cs.RO eess.SP

    Enabling Remote Whole-Body Control with 5G Edge Computing

    Authors: Huaijiang Zhu, Manali Sharma, Kai Pfeiffer, Marco Mezzavilla, Jia Shen, Sundeep Rangan, Ludovic Righetti

    Abstract: Real-world applications require light-weight, energy-efficient, fully autonomous robots. Yet, increasing autonomy is oftentimes synonymous with escalating computational requirements. It might thus be desirable to offload intensive computation--not only sensing and planning, but also low-level whole-body control--to remote servers in order to reduce on-board computational needs. Fifth Generation (5…

    Submitted 18 August, 2020; originally announced August 2020.

  43. arXiv:2006.11392  [pdf, other]

    eess.IV cs.CV

    PraNet: Parallel Reverse Attention Network for Polyp Segmentation

    Authors: Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Colonoscopy is an effective technique for detecting colorectal polyps, which are highly related to colorectal cancer. In clinical practice, segmenting polyps from colonoscopy images is of great importance since it provides valuable information for diagnosis and surgery. However, accurate polyp segmentation is a challenging task, for two major reasons: (i) the same type of polyps has a diversity of…

    Submitted 3 July, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Accepted to MICCAI 2020

  44. arXiv:2006.10135  [pdf, other]

    eess.IV cs.CV cs.LG

    M2Net: Multi-modal Multi-channel Network for Overall Survival Time Prediction of Brain Tumor Patients

    Authors: Tao Zhou, Huazhu Fu, Yu Zhang, Changqing Zhang, Xiankai Lu, Jianbing Shen, Ling Shao

    Abstract: Early and accurate prediction of overall survival (OS) time can help to obtain better treatment planning for brain tumor patients. Although many OS time prediction methods have been developed and obtain promising results, there are still several issues. First, conventional prediction methods rely on radiomic features at the local lesion area of a magnetic resonance (MR) volume, which may not repre…

    Submitted 14 July, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted by MICCAI'20

  45. arXiv:2005.05594  [pdf, other]

    eess.IV cs.CV

    Modeling and Enhancing Low-quality Retinal Fundus Images

    Authors: Ziyi Shen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Retinal fundus images are widely used for the clinical screening and diagnosis of eye diseases. However, fundus images captured by operators with various levels of experience have a large variation in quality. Low-quality fundus images increase uncertainty in clinical observation and lead to the risk of misdiagnosis. However, due to the special optical beam of fundus imaging and structure of the r…

    Submitted 9 December, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

  46. arXiv:2004.14133  [pdf, other]

    eess.IV cs.CV cs.LG

    Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images

    Authors: Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Coronavirus Disease 2019 (COVID-19) spread globally in early 2020, causing the world to face an existential health crisis. Automated detection of lung infections from computed tomography (CT) images offers a great potential to augment the traditional healthcare strategy for tackling COVID-19. However, segmenting infected regions from CT slices faces several challenges, including high variation in…

    Submitted 21 May, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: To appear in IEEE TMI. The code is released at: https://github.com/DengPingFan/Inf-Net

  47. arXiv:2004.10987  [pdf, other]

    eess.IV cs.CV cs.LG

    COVID-19 Chest CT Image Segmentation -- A Deep Convolutional Neural Network Solution

    Authors: Qingsen Yan, Bo Wang, Dong Gong, Chuan Luo, Wei Zhao, Jianhu Shen, Qinfeng Shi, Shuo Jin, Liang Zhang, Zheng You

    Abstract: A novel coronavirus disease 2019 (COVID-19) was detected and has spread rapidly across various countries around the world since the end of the year 2019. Computed Tomography (CT) images have been used as a crucial alternative to the time-consuming RT-PCR test. However, pure manual segmentation of CT images faces a serious challenge with the increase of suspected cases, resulting in urgent requirem…

    Submitted 25 April, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

  48. arXiv:2003.04262  [pdf, other]

    cs.CV cs.LG eess.IV

    Cascaded Human-Object Interaction Recognition

    Authors: Tianfei Zhou, Wenguan Wang, Siyuan Qi, Haibin Ling, Jianbing Shen

    Abstract: Rapid progress has been witnessed for human-object interaction (HOI) recognition, but most existing models are confined to single-stage reasoning pipelines. Considering the intrinsic complexity of the task, we introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding. At each stage, an instance localization network progressively refines HOI proposals and feeds them into…

    Submitted 11 March, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR 2020. Winner of the ICCV-2019 PIC Challenge on both HOIW and PIC tracks. Code: https://github.com/tfzhou/C-HOI

  49. arXiv:2002.09175  [pdf]

    eess.SP

    Depression Detection using Resting State Three-channel EEG Signal

    Authors: Qiuxia Shi, Ang Liu, Rongyan Chen, Jian Shen, Qinglin Zhao, Bin Hu

    Abstract: In a universal environment, a patient-friendly, inexpensive method is needed to realize the early diagnosis of depression, which is believed to be an effective way to reduce the mortality of depression. The purpose of this study is to collect EEG signals from only three electrodes (Fp1, Fpz and Fp2) and then use the linear and nonlinear features of the EEG to classify depression patients and healthy controls.…

    Submitted 26 February, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: 12 pages, 2 figures, 1 table

  50. arXiv:2002.05000  [pdf, other]

    cs.CV eess.IV

    Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis

    Authors: Tao Zhou, Huazhu Fu, Geng Chen, Jianbing Shen, Ling Shao

    Abstract: Magnetic resonance imaging (MRI) is a widely used neuroimaging technique that can provide images of different contrasts (i.e., modalities). Fusing this multi-modal data has proven particularly effective for boosting model performance in many tasks. However, due to poor data quality and frequent patient dropout, collecting all modalities for every patient remains a challenge. Medical image synthesi…

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: has been accepted by IEEE TMI