Skip to main content

Showing 1–50 of 54 results for author: Xu, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.06253  [pdf, other

    eess.SY cs.PL

    PretVM: Predictable, Efficient Virtual Machine for Real-Time Concurrency

    Authors: Shaokai Lin, Erling Jellum, Mirco Theile, Tassilo Tanneberger, Binqi Sun, Chadlia Jerad, Ruomu Xu, Guangyu Feng, Christian Menard, Marten Lohstroh, Jeronimo Castrillon, Sanjit Seshia, Edward Lee

    Abstract: This paper introduces the Precision-Timed Virtual Machine (PretVM), an intermediate platform facilitating the execution of quasi-static schedules compiled from a subset of programs written in the Lingua Franca (LF) coordination language. The subset consists of those programs that in principle should have statically verifiable and predictable timing behavior. The PretVM provides a schedule with wel… ▽ More

    Submitted 25 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2404.15533  [pdf, other

    eess.SY

    Designing, simulating, and performing the 100-AV field test for the CIRCLES consortium: Methodology and Implementation of the Largest mobile traffic control experiment to date

    Authors: Mostafa Ameli, Sean Mcquade, Jonathan W. Lee, Matthew Bunting, Matthew Nice, Han Wang, William Barbour, Ryan Weightman, Chris Denaro, Ryan Delorenzo, Sharon Hornstein, Jon F. Davis, Dan Timsit, Riley Wagner, Rita Xu, Malaika Mahmood, Mikail Mahmood, Maria Laura Delle Monache, Benjamin Seibold, Daniel B. Work, Jonathan Sprinkle, Benedetto Piccoli, Alexandre M. Bayen

    Abstract: Previous controlled experiments on single-lane ring roads have shown that a single partially autonomous vehicle (AV) can effectively mitigate traffic waves. This naturally prompts the question of how these findings can be generalized to field operational, high-density traffic conditions. To address this question, the Congestion Impacts Reduction via CAV-in-the-loop Lagrangian Energy Smoothing (CIR… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  3. arXiv:2404.12705  [pdf, other

    eess.SP

    Integrated Sensing and Communication enabled Multiple Base Stations Cooperative UAV Detection

    Authors: Xi Lu, Zhiqing Wei, Ruizhong Xu, Lin Wang, Bohao Lu, **ghui Piao

    Abstract: Integrated sensing and communication (ISAC) exhibits notable potential for sensing the unmanned aerial vehicles (UAVs), facilitating real-time monitoring of UAVs for security insurance. Due to the low sensing accuracy of single base stations (BSs), a cooperative UAV sensing method by multi-BS is proposed in this paper to achieve high-accuracy sensing. Specifically, a multiple signal classification… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  4. arXiv:2404.00628  [pdf, other

    cs.IT eess.SP

    Fluid Antenna Relay Assisted Communication Systems Through Antenna Location Optimization

    Authors: Ruopeng Xu, Yixuan Chen, Jiawen Kang, Minrui Xu, Zhaohui Yang, Chongwen Huang, Dusit Niyato

    Abstract: In this paper, we investigate the problem of resource allocation for fluid antenna relay (FAR) system with antenna location optimization. In the considered model, each user transmits information to a base station (BS) with help of FAR. The antenna location of the FAR is flexible and can be adapted to dynamic location distribution of the users. We formulate a sum rate maximization problem through j… ▽ More

    Submitted 27 June, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  5. arXiv:2404.00612  [pdf, other

    cs.IT eess.SP

    Resource Allocation for Green Probabilistic Semantic Communication with Rate Splitting

    Authors: Ruopeng Xu, Zhaohui Yang, Zhouxiang Zhao, Qianqian Yang, Zhaoyang Zhang

    Abstract: In this paper, the energy efficient design for probabilistic semantic communication (PSC) system with rate splitting multiple access (RSMA) is investigated. Basic principles are first reviewed to show how the PSC system works to extract, compress and transmit the semantic information in a task-oriented transmission. Subsequently, the process of how multiuser semantic information can be represented… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  6. arXiv:2402.17043  [pdf, other

    eess.SY

    Traffic Control via Connected and Automated Vehicles: An Open-Road Field Experiment with 100 CAVs

    Authors: Jonathan W. Lee, Han Wang, Kathy Jang, Amaury Hayat, Matthew Bunting, Arwa Alanqary, William Barbour, Zhe Fu, Xiaoqian Gong, George Gunter, Sharon Hornstein, Abdul Rahman Kreidieh, Nathan Lichtlé, Matthew W. Nice, William A. Richardson, Adit Shah, Eugene Vinitsky, Fangyu Wu, Shengquan Xiang, Sulaiman Almatrudi, Fahd Althukair, Rahul Bhadani, Joy Carpio, Raphael Chekroun, Eric Cheng , et al. (39 additional authors not shown)

    Abstract: The CIRCLES project aims to reduce instabilities in traffic flow, which are naturally occurring phenomena due to human driving behavior. These "phantom jams" or "stop-and-go waves,"are a significant source of wasted energy. Toward this goal, the CIRCLES project designed a control system referred to as the MegaController by the CIRCLES team, that could be deployed in real traffic. Our field experim… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  7. arXiv:2401.10561  [pdf, other

    eess.IV cs.CV

    MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images

    Authors: Rui Xu, Yunke Wang, Bo Du

    Abstract: Unsupervised anomaly detection has gained significant attention in the field of medical imaging due to its capability of relieving the costly pixel-level annotation. To achieve this, modern approaches usually utilize generative models to produce healthy references of the diseased images and then identify the abnormalities by comparing the healthy references and the original diseased images. Recent… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  8. arXiv:2312.04418  [pdf, other

    cs.NI eess.SY

    MIST: An Efficient Approach for Software-Defined Multicast in Wireless Mesh Networks

    Authors: Rupei Xu, Yuming Jiang, Jason P. Jue

    Abstract: Multicasting is a vital information dissemination technique in Software-Defined Networking (SDN). With SDN, a multicast service can incorporate network functions implemented at different nodes, which is referred to as software-defined multicast. Emerging ubiquitous wireless networks for 5G and Beyond (B5G) inherently support multicast. However, the broadcast nature of wireless channels, especially… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  9. arXiv:2311.08425  [pdf

    cs.SD eess.AS math.NA physics.ao-ph physics.app-ph

    Research and experimental verification on low-frequency long-range underwater sound propagation dispersion characteristics under dual-channel sound speed profiles in the Chukchi Plateau

    Authors: **bao Weng, Yubo Qi, Yanming Yang, Hongtao Wen, Hongtao Zhou, Ruichao Xue

    Abstract: The dual-channel sound speed profiles of the Chukchi Plateau and the Canadian Basin have become current research hotspots due to their excellent low-frequency sound signal propagation ability. Previous research has mainly focused on using sound propagation theory to explain the changes in sound signal energy. This article is mainly based on the theory of normal modes to study the fine structure of… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 30 pages, 18 figures

  10. arXiv:2310.20427  [pdf, other

    eess.IV cs.CV cs.LG

    Assessing and Enhancing Robustness of Deep Learning Models with Corruption Emulation in Digital Pathology

    Authors: Peixiang Huang, Songtao Zhang, Yulu Gan, Rui Xu, Rongqi Zhu, Wenkang Qin, Limei Guo, Shan Jiang, Lin Luo

    Abstract: Deep learning in digital pathology brings intelligence and automation as substantial enhancements to pathological analysis, the gold standard of clinical diagnosis. However, multiple steps from tissue preparation to slide imaging introduce various image corruptions, making it difficult for deep neural network (DNN) models to achieve stable diagnostic results for clinical use. In order to assess an… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  11. arXiv:2310.13882  [pdf

    eess.SP

    NMR Spectra Denoising with Vandermonde Constraints

    Authors: Di Guo, Runmin Xu, **yu Wu, Mei** Lin, Xiaofeng Du, Xiaobo Qu

    Abstract: Nuclear magnetic resonance (NMR) spectroscopy serves as an important tool to analyze chemicals and proteins in bioengineering. However, NMR signals are easily contaminated by noise during the data acquisition, which can affect subsequent quantitative analysis. Therefore, denoising NMR signals has been a long-time concern. In this work, we propose an optimization model-based iterative denoising met… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 10 pages, 9 figures

  12. arXiv:2310.07180  [pdf, other

    cs.NI eess.SP

    Integrated Sensing and Communication enabled Multiple Base Stations Cooperative Sensing Towards 6G

    Authors: Zhiqing Wei, Wangjun Jiang, Zhiyong Feng, Huici Wu, Ning Zhang, Kaifeng Han, Ruizhong Xu, ** Zhang

    Abstract: Driven by the intelligent applications of sixth-generation (6G) mobile communication systems such as smart city and autonomous driving, which connect the physical and cyber space, the integrated sensing and communication (ISAC) brings a revolutionary change to the base stations (BSs) of 6G by integrating radar sensing and communication in the same hardware and wireless resource. However, with the… ▽ More

    Submitted 24 November, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 11 pages 6 figures

    Journal ref: IEEE NetWork 2023

  13. arXiv:2309.11015   

    eess.IV cs.LG

    3D-U-SAM Network For Few-shot Tooth Segmentation in CBCT Images

    Authors: Yifu Zhang, Zuozhu Liu, Yang Feng, Ren**g Xu

    Abstract: Accurate representation of tooth position is extremely important in treatment. 3D dental image segmentation is a widely used method, however labelled 3D dental datasets are a scarce resource, leading to the problem of small samples that this task faces in many cases. To this end, we address this problem with a pretrained SAM and propose a novel 3D-U-SAM network for 3D dental image segmentation. Sp… ▽ More

    Submitted 27 February, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: The paper needs to be updated

  14. Symbol-level Integrated Sensing and Communication enabled Multiple Base Stations Cooperative Sensing

    Authors: Zhiqing Wei, Ruizhong Xu, Zhiyong Feng, Huici Wu, Ning Zhang, Wangjun Jiang, Xiaoyu Yang

    Abstract: With the support of integrated sensing and communication (ISAC) technology, mobile communication system will integrate the function of wireless sensing, thereby facilitating new intelligent applications such as smart city and intelligent transportation. Due to the limited sensing accuracy and sensing range of single base station (BS), multi-BS cooperative sensing can be applied to realize high-acc… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 15 pages, 17 figures, 2 tables

  15. arXiv:2307.14588  [pdf

    eess.IV cs.CV cs.LG

    MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation

    Authors: Liang Xu, Mingxiao Chen, Yi Cheng, Pengfei Shao, Shuwei Shen, Peng Yao, Ronald X. Xu

    Abstract: The UNet architecture, based on Convolutional Neural Networks (CNN), has demonstrated its remarkable performance in medical image analysis. However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome t… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  16. arXiv:2303.05338  [pdf, other

    cs.SD cs.MM eess.AS

    MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning

    Authors: Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu

    Abstract: Audio-visual learning helps to comprehensively understand the world by fusing practical information from multiple modalities. However, recent studies show that the imbalanced optimization of uni-modal encoders in a joint-learning model is a bottleneck to enhancing the model's performance. We further find that the up-to-date imbalance-mitigating methods fail on some audio-visual fine-grained tasks,… ▽ More

    Submitted 11 March, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  17. arXiv:2303.03625  [pdf, other

    eess.IV cs.CV

    SGDA: Towards 3D Universal Pulmonary Nodule Detection via Slice Grouped Domain Attention

    Authors: Rui Xu, Zhi Liu, Yong Luo, Han Hu, Li Shen, Bo Du, Kaiming Kuang, Jiancheng Yang

    Abstract: Lung cancer is the leading cause of cancer death worldwide. The best solution for lung cancer is to diagnose the pulmonary nodules in the early stage, which is usually accomplished with the aid of thoracic computed tomography (CT). As deep learning thrives, convolutional neural networks (CNNs) have been introduced into pulmonary nodule detection to help doctors in this labor-intensive task and dem… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE/ACM Transactions on Computational Biology and Bioinformatics

  18. arXiv:2303.02939  [pdf, other

    eess.AS cs.SD

    FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model

    Authors: Ruiqing Xue, Yanqing Liu, Lei He, Xu Tan, Linquan Liu, Edward Lin, Sheng Zhao

    Abstract: Neural text-to-speech (TTS) generally consists of cascaded architecture with separately optimized acoustic model and vocoder, or end-to-end architecture with continuous mel-spectrograms or self-extracted speech frames as the intermediate representations to bridge acoustic model and vocoder, which suffers from two limitations: 1) the continuous acoustic frames are hard to predict with phoneme only,… ▽ More

    Submitted 7 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  19. arXiv:2301.07272  [pdf, other

    cs.LG eess.SP

    A variational autoencoder-based nonnegative matrix factorisation model for deep dictionary learning

    Authors: Hong-Bo Xie, Caoyuan Li, Shuliang Wang, Richard Yi Da Xu, Kerrie Mengersen

    Abstract: Construction of dictionaries using nonnegative matrix factorisation (NMF) has extensive applications in signal processing and machine learning. With the advances in deep learning, training compact and robust dictionaries using deep neural networks, i.e., dictionaries of deep features, has been proposed. In this study, we propose a probabilistic generative model which employs a variational autoenco… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: 7 pages, 2 figures

  20. arXiv:2212.09247  [pdf, other

    cs.CV cs.LG eess.IV

    ColoristaNet for Photorealistic Video Style Transfer

    Authors: Xiaowen Qiu, Ruize Xu, Boan He, Yingtao Zhang, Wenqiang Zhang, Weifeng Ge

    Abstract: Photorealistic style transfer aims to transfer the artistic style of an image onto an input image or video while kee** photorealism. In this paper, we think it's the summary statistics matching scheme in existing algorithms that leads to unrealistic stylization. To avoid employing the popular Gram loss, we propose a self-supervised style transfer framework, which contains a style removal part an… ▽ More

    Submitted 21 December, 2022; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: 30 pages, 29 figures

  21. arXiv:2211.00261  [pdf, other

    q-bio.NC cs.LG cs.NE eess.IV

    Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks

    Authors: Yue Yu, Xuan Kan, Hejie Cui, Ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, Carl Yang

    Abstract: Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROI), which are noisy and agnostic to the downs… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: Work in progress

  22. arXiv:2208.12573  [pdf, other

    cs.CV eess.IV

    Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention

    Authors: Ruixiang Xue, Jianqiang Wang, Zhan Ma

    Abstract: Although convolutional representation of multiscale sparse tensor demonstrated its superior efficiency to accurately model the occupancy probability for the compression of geometry component of dense object point clouds, its capacity for representing sparse LiDAR point cloud geometry (PCG) was largely limited. This is because 1) fixed receptive field of the convolution cannot characterize extremel… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

  23. arXiv:2208.02122  [pdf, other

    eess.IV cs.CV

    LSSANet: A Long Short Slice-Aware Network for Pulmonary Nodule Detection

    Authors: Rui Xu, Yong Luo, Bo Du, Kaiming Kuang, Jiancheng Yang

    Abstract: Convolutional neural networks (CNNs) have been demonstrated to be highly effective in the field of pulmonary nodule detection. However, existing CNN based pulmonary nodule detection methods lack the ability to capture long-range dependencies, which is vital for global information extraction. In computer vision tasks, non-local operations have been widely utilized, but the computational cost could… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: MICCAI 2022

  24. arXiv:2207.13385  [pdf, other

    cs.IT eess.SP

    Symbol Rate and Carries Estimation in OFDM Framework: A high Accuracy Technique under Low SNR

    Authors: Zetian Qin, Yubai Li, Benye Niu, Qingyao Li, Renhao Xue

    Abstract: Under a low Signal-to-Noise Ratio (SNR), the Orthogonal Frequency-Division Multiplexing (OFDM) signal symbol rate is limited. Existing carrier number estimation algorithms lack adequate methods to deal with low SNR. This paper proposes an algorithm with a low error rate under low SNR by correlating the signal and applying a Fast Fourier Transform (FFT) operation. By improving existing algorithms,… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  25. arXiv:2207.13070  [pdf, other

    cs.CR cs.CV cs.LG eess.IV

    DeFakePro: Decentralized DeepFake Attacks Detection using ENF Authentication

    Authors: Deeraj Nagothu, Ronghua Xu, Yu Chen, Erik Blasch, Alexander Aved

    Abstract: Advancements in generative models, like Deepfake allows users to imitate a targeted person and manipulate online interactions. It has been recognized that disinformation may cause disturbance in society and ruin the foundation of trust. This article presents DeFakePro, a decentralized consensus mechanism-based Deepfake detection technique in online video conferencing tools. Leveraging Electrical N… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Journal ref: the IEEE IT Professional, Special Issue on Information Hygiene and the Fight against the Misinformation Info-demic, 2022

  26. arXiv:2207.04646  [pdf, other

    cs.SD eess.AS eess.SP

    DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders

    Authors: Yanqing Liu, Ruiqing Xue, Lei He, Xu Tan, Sheng Zhao

    Abstract: Current text to speech (TTS) systems usually leverage a cascaded acoustic model and vocoder pipeline with mel-spectrograms as the intermediate representations, which suffer from two limitations: 1) the acoustic model and vocoder are separately trained instead of jointly optimized, which incurs cascaded errors; 2) the intermediate speech representations (e.g., mel-spectrogram) are pre-designed and… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: To appear in Interspeech 2022

  27. arXiv:2204.10461  [pdf, other

    cs.CL cs.SD eess.AS

    WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment

    Authors: Lin Yao, Jianfei Song, Ruizhuo Xu, Yingfang Yang, Zijian Chen, Yafeng Deng

    Abstract: Historically lower-level tasks such as automatic speech recognition (ASR) and speaker identification are the main focus in the speech field. Interest has been growing in higher-level spoken language understanding (SLU) tasks recently, like sentiment analysis (SA). However, improving performances on SLU tasks remains a big challenge. Basically, there are two main methods for SLU tasks: (1) Two-stag… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  28. arXiv:2202.02606   

    eess.IV cs.CV cs.LG

    ROMNet: Renovate the Old Memories

    Authors: Runsheng Xu, Zhengzhong Tu, Yuanqi Du, Xiaoyu Dong, **long Li, Zibo Meng, Jiaqi Ma, Hongkai Yu

    Abstract: Renovating the memories in old photos is an intriguing research topic in computer vision fields. These legacy images often suffer from severe and commingled degradations such as cracks, noise, and color-fading, while lack of large-scale paired old photo datasets makes this restoration task very challenging. In this work, we present a novel reference-based end-to-end learning framework that can joi… ▽ More

    Submitted 27 April, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

    Comments: Paper major revision

  29. Stochastic optimal scheduling of demand response-enabled microgrids with renewable generations: An analytical-heuristic approach

    Authors: Yang Li, Kang Li, Zhen Yang, Yang Yu, Runnan Xu, Miaosen Yang

    Abstract: In the context of transition towards cleaner and sustainable energy production, microgrids have become an effective way for tackling environmental pollution and energy crisis issues. With the increasing penetration of renewables, how to coordinate demand response and renewable generations is a critical and challenging issue in the field of microgrid scheduling. To this end, a bi-level scheduling m… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted by Journal of Cleaner Production

    Journal ref: Journal of Cleaner Production 330 (2022) 129840

  30. arXiv:2109.14754  [pdf, other

    eess.IV cs.CV

    MetaHistoSeg: A Python Framework for Meta Learning in Histopathology Image Segmentation

    Authors: Zheng Yuan, Andre Esteva, Ran Xu

    Abstract: Few-shot learning is a standard practice in most deep learning based histopathology image segmentation, given the relatively low number of digitized slides that are generally available. While many models have been developed for domain specific histopathology image segmentation, cross-domain generalization remains a key challenge for properly validating models. Here, tooling and datasets to benchma… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

  31. arXiv:2107.13200  [pdf

    eess.IV cs.CV cs.LG

    An explainable two-dimensional single model deep learning approach for Alzheimer's disease diagnosis and brain atrophy localization

    Authors: Fan Zhang, Bo Pan, Pengfei Shao, Peng Liu, Shuwei Shen, Peng Yao, Ronald X. Xu

    Abstract: Early and accurate diagnosis of Alzheimer's disease (AD) and its prodromal period mild cognitive impairment (MCI) is essential for the delayed disease progression and the improved quality of patients'life. The emerging computer-aided diagnostic methods that combine deep learning with structural magnetic resonance imaging (sMRI) have achieved encouraging results, but some of them are limit of issue… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  32. arXiv:2105.11905  [pdf, other

    cs.CL cs.SD eess.AS

    Exploiting Adapters for Cross-lingual Low-resource Speech Recognition

    Authors: Wenxin Hou, Han Zhu, Yidong Wang, **dong Wang, Tao Qin, Renjun Xu, Takahiro Shinozaki

    Abstract: Cross-lingual speech adaptation aims to solve the problem of leveraging multiple rich-resource languages to build models for a low-resource target language. Since the low-resource language has limited training data, speech recognition models can easily overfit. In this paper, we propose to use adapters to investigate the performance of multiple adapters for parameter-efficient cross-lingual speech… ▽ More

    Submitted 17 December, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: Accepted by IEEE Transactions on Audio, Speech, and Language Processing (TASLP) as a full paper; 12 pages; code at https://github.com/**dongwang/transferlearning/tree/master/code/ASR/Adapter

  33. arXiv:2105.09683  [pdf, other

    eess.IV cs.CV

    DPN-SENet:A self-attention mechanism neural network for detection and diagnosis of COVID-19 from chest x-ray images

    Authors: Bo Cheng, Ruhui Xue, Hang Yang, Laili Zhu, Wei Xiang

    Abstract: Background and Objective: The new type of coronavirus is also called COVID-19. It began to spread at the end of 2019 and has now spread across the world. Until October 2020, It has infected around 37 million people and claimed about 1 million lives. We propose a deep learning model that can help radiologists and clinicians use chest X-rays to diagnose COVID-19 cases and show the diagnostic feature… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: 11 pages, 7 figures

  34. arXiv:2103.05114  [pdf, other

    eess.IV cs.CV cs.LG

    Learning Invariant Representations across Domains and Tasks

    Authors: **dong Wang, Wenjie Feng, Chang Liu, Chaohui Yu, Mingxuan Du, Renjun Xu, Tao Qin, Tie-Yan Liu

    Abstract: Being expensive and time-consuming to collect massive COVID-19 image samples to train deep classification models, transfer learning is a promising approach by transferring knowledge from the abundant typical pneumonia datasets for COVID-19 image classification. However, negative transfer may deteriorate the performance due to the feature distribution divergence between two datasets and task semant… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Technical report, 12 pages

  35. Deep Learning for Short-Term Voltage Stability Assessment of Power Systems

    Authors: Meng Zhang, Jiazheng Li, Yang Li, Runnan Xu

    Abstract: To fully learn the latent temporal dependencies from post-disturbance system dynamic trajectories, deep learning is utilized for short-term voltage stability (STVS) assessment of power systems in this paper. First of all, a semi-supervised cluster algorithm is performed to obtain class labels of STVS instances due to the unavailability of reliable quantitative criteria. Secondly, a long short-term… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: Accepted by IEEE Access

    Journal ref: IEEE Access 9 (2021) 29711-29718

  36. arXiv:2010.12013  [pdf, other

    cs.SD cs.LG eess.AS

    Listening to Sounds of Silence for Speech Denoising

    Authors: Ruilin Xu, Rundi Wu, Yuko Ishiwaka, Carl Vondrick, Changxi Zheng

    Abstract: We introduce a deep learning model for speech denoising, a long-standing challenge in audio analysis arising in numerous applications. Our approach is based on a key observation about human speech: there is often a short pause between each sentence or word. In a recorded speech signal, those pauses introduce a series of time periods during which only noise is present. We leverage these incidental… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: 9 pages, 6 figures, accepted in NeurIPS 2020; Sound examples can be found at http://www.cs.columbia.edu/cg/listen_to_the_silence/

  37. VolumeNet: A Lightweight Parallel Network for Super-Resolution of Medical Volumetric Data

    Authors: Yinhao Li, Yutaro Iwamoto, Lanfen Lin, Rui Xu, Yen-Wei Chen

    Abstract: Deep learning-based super-resolution (SR) techniques have generally achieved excellent performance in the computer vision field. Recently, it has been proven that three-dimensional (3D) SR for medical volumetric data delivers better visual results than conventional two-dimensional (2D) processing. However, deepening and widening 3D networks increases training difficulty significantly due to the la… ▽ More

    Submitted 24 October, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

  38. arXiv:2009.13240  [pdf, other

    cs.CV cs.LG eess.IV

    Texture Memory-Augmented Deep Patch-Based Image Inpainting

    Authors: Rui Xu, Minghao Guo, Jiaqi Wang, Xiaoxiao Li, Bolei Zhou, Chen Change Loy

    Abstract: Patch-based methods and deep networks have been employed to tackle image inpainting problem, with their own strengths and weaknesses. Patch-based methods are capable of restoring a missing region with high-quality texture through searching nearest neighbor patches from the unmasked regions. However, these methods bring problematic contents when recovering large missing regions. Deep networks, on t… ▽ More

    Submitted 4 November, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Published on TIP. Project Page: https://nbei.github.io/tmad.html

  39. arXiv:2007.14556  [pdf, other

    eess.IV cs.CV

    Accurate Lung Nodules Segmentation with Detailed Representation Transfer and Soft Mask Supervision

    Authors: Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

    Abstract: Accurate lung lesion segmentation from Computed Tomography (CT) images is crucial to the analysis and diagnosis of lung diseases such as COVID-19 and lung cancer. However, the smallness and variety of lung nodules and the lack of high-quality labeling make the accurate lung nodule segmentation difficult. To address these issues, we first introduce a novel segmentation mask named Soft Mask which ha… ▽ More

    Submitted 14 April, 2022; v1 submitted 28 July, 2020; originally announced July 2020.

  40. arXiv:2007.08005  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Xiaomingbot: A Multilingual Robot News Reporter

    Authors: Runxin Xu, Jun Cao, Mingxuan Wang, Jiaze Chen, Hao Zhou, Ying Zeng, Yu** Wang, Li Chen, Xiang Yin, Xi** Zhang, Songcheng Jiang, Yuxuan Wang, Lei Li

    Abstract: This paper proposes the building of Xiaomingbot, an intelligent, multilingual and multimodal software robot equipped with four integral capabilities: news generation, news translation, news reading and avatar animation. Its system summarizes Chinese news that it automatically generates from data tables. Next, it translates the summary or the full article into multiple languages, and reads the mult… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted to ACL 2020 - system demonstration

  41. arXiv:2004.12776  [pdf, other

    eess.IV cs.CV

    Boosting Connectivity in Retinal Vessel Segmentation via a Recursive Semantics-Guided Network

    Authors: Rui Xu, Tiantian Liu, Xinchen Ye, Yen-Wei Chen

    Abstract: Many deep learning based methods have been proposed for retinal vessel segmentation, however few of them focus on the connectivity of segmented vessels, which is quite important for a practical computer-aided diagnosis system on retinal images. In this paper, we propose an efficient network to address this problem. A U-shape network is enhanced by introducing a semantics-guided module, which integ… ▽ More

    Submitted 24 April, 2020; originally announced April 2020.

  42. arXiv:2004.04306  [pdf, other

    eess.IV cs.CV cs.LG

    Physics-enhanced machine learning for virtual fluorescence microscopy

    Authors: Colin L. Cooke, Fanjie Kong, Amey Chaware, Kevin C. Zhou, Kanghyun Kim, Rong Xu, D. Michael Ando, Samuel J. Yang, Pavan Chandra Konda, Roarke Horstmeyer

    Abstract: This paper introduces a new method of data-driven microscope design for virtual fluorescence microscopy. Our results show that by including a model of illumination within the first layers of a deep convolutional neural network, it is possible to learn task-specific LED patterns that substantially improve the ability to infer fluorescence image information from unstained transmission microscopy ima… ▽ More

    Submitted 21 April, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: 12 pages, 13 figures

  43. arXiv:2003.07080  [pdf, other

    cs.CV cs.LG eess.IV

    PS-RCNN: Detecting Secondary Human Instances in a Crowd via Primary Object Suppression

    Authors: Zheng Ge, Zequn Jie, Xin Huang, Rong Xu, Osamu Yoshie

    Abstract: Detecting human bodies in highly crowded scenes is a challenging problem. Two main reasons result in such a problem: 1). weak visual cues of heavily occluded instances can hardly provide sufficient information for accurate detection; 2). heavily occluded instances are easier to be suppressed by Non-Maximum-Suppression (NMS). To address these two issues, we introduce a variant of two-stage detector… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 6pages, accepted by ICME2020

  44. arXiv:2002.05857  [pdf

    eess.SP eess.SY

    Design and Implementation of a High-Accuracy Positioning System Using RTK on Smartphones

    Authors: Geng Shi, Ziqiang Ying, Rongtao Xu, Kan Zheng

    Abstract: In recent years, with the development of the Global Navigation Satellite System (GNSS), the satellite navigation technology has played a crucial role in smartphone navigation. To solve the problem of the low positioning accuracy in the smartphones based on GNSS, this paper proposes to apply real-time dynamic carrier phase difference technique (RTK) in the smartphones, and a real-time positioning s… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  45. arXiv:1911.11502  [pdf, other

    cs.CV cs.LG eess.AS

    Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers

    Authors: Ya Zhao, Rui Xu, Xinchao Wang, Peng Hou, Haihong Tang, Mingli Song

    Abstract: Lip reading has witnessed unparalleled development in recent years thanks to deep learning and the availability of large-scale datasets. Despite the encouraging results achieved, the performance of lip reading, unfortunately, remains inferior to the one of its counterpart speech recognition, due to the ambiguous nature of its actuations that makes it challenging to extract discriminant features fr… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: AAAI 2020

  46. arXiv:1911.10531  [pdf, other

    cs.CV cs.MM eess.IV

    A Proposal-based Approach for Activity Image-to-Video Retrieval

    Authors: Ruicong Xu, Li Niu, Jianfu Zhang, Liqing Zhang

    Abstract: Activity image-to-video retrieval task aims to retrieve videos containing the similar activity as the query image, which is a challenging task because videos generally have many background segments irrelevant to the activity. In this paper, we utilize R-C3D model to represent a video by a bag of activity proposals, which can filter out background segments to some extent. However, there are still n… ▽ More

    Submitted 24 November, 2019; originally announced November 2019.

    Comments: The Thirty-Fourth AAAI Conference on Artificial Intelligence

  47. arXiv:1911.01581  [pdf

    eess.SP

    LIFTED: Household Appliance-level Load Dataset and Data Compression with Lossless Coding considering Precision

    Authors: Lei Yan, Jiayu Han, Runnan Xu, Zuyi Li

    Abstract: The issue of estimating the detailed appliance level load consumption has received considerable attention. This paper first presents a Labelled hIgh-Frequency daTaset for Electricity Disaggregation (LIFTED), which can be used for research on nonintrusive load monitoring (NILM). This dataset consists of one-week detailed appliance-level electricity usage information including voltage, current, acti… ▽ More

    Submitted 6 November, 2019; v1 submitted 4 November, 2019; originally announced November 2019.

  48. arXiv:1910.10638  [pdf, other

    cs.DC eess.SP

    Blockchain Methods for Trusted Avionics Systems

    Authors: Erik Blasch, Ronghua Xu, Yu Chen, Genshe Chen, Dan Shen

    Abstract: Blockchain is a popular method to ensure security for trusted systems. The benefits include an auditable method to provide decentralized security without a trusted third party, but the drawback is the large computational resources needed to process and store the ever-expanding chain of security blocks. The promise of blockchain for edge devices (e.g., internet of things) poses a variety of challen… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Accepted and presented at 2019 IEEE NAECON Conference. arXiv admin note: text overlap with arXiv:1902.10567

  49. arXiv:1909.04142  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    DaTscan SPECT Image Classification for Parkinson's Disease

    Authors: Justin Quan, Lin Xu, Rene Xu, Tyrael Tong, Jean Su

    Abstract: Parkinson's Disease (PD) is a neurodegenerative disease that currently does not have a cure. In order to facilitate disease management and reduce the speed of symptom progression, early diagnosis is essential. The current clinical, diagnostic approach is to have radiologists perform human visual analysis of the degeneration of dopaminergic neurons in the substantia nigra region of the brain. Clini… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

  50. arXiv:1909.02068  [pdf, other

    cs.CV eess.IV

    ApproxNet: Content and Contention-Aware Video Analytics System for Embedded Clients

    Authors: Ran Xu, Rakesh Kumar, Pengcheng Wang, Peter Bai, Ganga Meghanath, Somali Chaterji, Subrata Mitra, Saurabh Bagchi

    Abstract: Videos take a lot of time to transport over the network, hence running analytics on the live video on embedded or mobile devices has become an important system driver. Considering that such devices, e.g., surveillance cameras or AR/VR gadgets, are resource constrained, creating lightweight deep neural networks (DNNs) for embedded devices is crucial. None of the current approximation techniques for… ▽ More

    Submitted 14 July, 2021; v1 submitted 28 August, 2019; originally announced September 2019.

    Comments: This paper has been accepted to appear in ACM Transactions on Sensor Networks in 2021