Skip to main content

Showing 1–50 of 59 results for author: Cao, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.00951  [pdf, other

    eess.SY

    Effective Management of Airport Security Queues with Passenger Reassignment

    Authors: Shangqing Cao, Aparimit Kasliwal, Masoud Reihanifar, Francesc Robuste, Mark Hansen

    Abstract: Airport security queues often suffer from inefficiencies that result in long wait times and decreased throughput, especially at peak departure time, affecting both passengers and airlines. This work addresses the problem of reassigning passengers to specific time slots for crossing security, aiming to mitigate these inefficiencies. We frame this problem as a Minimum Cost Network Flow (MCNF) proble… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.00947  [pdf, other

    eess.SY

    Fleet Size and Spill for UAM Operation under Uncertain Demand

    Authors: Shangqing Cao, Xuan Jiang, Emin Burak Onat, Bo Zou, Mark Hansen, Raja Sengupta, Anjan Chakrabarty

    Abstract: Variation and imbalance in demand poses significant challenges to Urban Air Mobility (UAM) operations, affecting strategic decisions such as fleet sizing. To study the implications of demand variation on UAM fleet operations, we propose a stochastic passenger arrival time generation model that uses real-world data to infer demand distributions, and two integer programs that compute the zero-spill… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2405.11118  [pdf, other

    eess.SY

    A Simulation-Optimization Framework for Develo** Wind-Resilient AAM Networks

    Authors: Emin Burak Onat, Shangqing Cao, Raiyan Rizwan, Xuan Jiang, Mark Hansen, Raja Sengupta, Anjan Chakrabarty

    Abstract: Environmental factors pose a significant challenge to the operational efficiency and safety of advanced air mobility (AAM) networks. This paper presents a simulation-optimization framework that dynamically integrates wind variability into AAM operations. We employ a nonlinear charging model within a multi-vertiport environment to optimize fleet size and scheduling. Our framework assesses the impac… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted to ICRAT 2024

  4. arXiv:2403.14135  [pdf, other

    eess.IV cs.CV

    Powerful Lossy Compression for Noisy Images

    Authors: Shilv Cai, Xiaoguo Liang, Shuning Cao, Luxin Yan, Sheng Zhong, Liqun Chen, Xu Zou

    Abstract: Image compression and denoising represent fundamental challenges in image processing with many real-world applications. To address practical demands, current solutions can be categorized into two main strategies: 1) sequential method; and 2) joint method. However, sequential methods have the disadvantage of error accumulation as there is information loss between multiple individual models. Recentl… ▽ More

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by ICME 2024

  5. arXiv:2402.18070  [pdf, other

    cs.AR eess.SP

    A Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband Processing

    Authors: Limin Jiang, Yi Shi, Haiqin Hu, Qingyu Deng, Siyi Xu, Yintao Liu, Feng Yuan, Si Wang, Yihao Shen, Fangfang Ye, Shan Cao, Zhiyuan Jiang

    Abstract: Wireless baseband processing (WBP) is a key element of wireless communications, with a series of signal processing modules to improve data throughput and counter channel fading. Conventional hardware solutions, such as digital signal processors (DSPs) and more recently, graphic processing units (GPUs), provide various degrees of parallelism, yet they both fail to take into account the cyclical and… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 7 figures, conference

  6. arXiv:2312.06969  [pdf, ps, other

    cs.IT eess.SP

    Channel Estimation for Movable Antenna Communication Systems: A Framework Based on Compressed Sensing

    Authors: Zhenyu Xiao, Songqi Cao, Lipeng Zhu, Yanming Liu, Xiang-Gen Xia, Rui Zhang

    Abstract: Movable antenna (MA) is a new technology with great potential to improve communication performance by enabling local movement of antennas for pursuing better channel conditions. In particular, the acquisition of complete channel state information (CSI) between the transmitter (Tx) and receiver (Rx) regions is an essential problem for MA systems to reap performance gains. In this paper, we propose… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  7. arXiv:2310.09078  [pdf, other

    cs.NI eess.SP

    DNFS-VNE: Deep Neuro Fuzzy System Driven Virtual Network Embedding

    Authors: Ailing Xiao, Ning Chen, Sheng Wu, Peiying Zhang, Suzhi Cao, Chunxiao Jiang

    Abstract: By decoupling substrate resources, network virtualization (NV) is a promising solution for meeting diverse demands and ensuring differentiated quality of service (QoS). In particular, virtual network embedding (VNE) is a critical enabling technology that enhances the flexibility and scalability of network deployment by addressing the coupling of Internet processes and services. However, in the exi… ▽ More

    Submitted 7 December, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

  8. arXiv:2309.02959  [pdf, other

    eess.IV cs.CV

    A Non-Invasive Interpretable NAFLD Diagnostic Method Combining TCM Tongue Features

    Authors: Shan Cao, Qunsheng Ruan, Qingfeng Wu, Weiqiang Lin

    Abstract: Non-alcoholic fatty liver disease (NAFLD) is a clinicopathological syndrome characterized by hepatic steatosis resulting from the exclusion of alcohol and other identifiable liver-damaging factors. It has emerged as a leading cause of chronic liver disease worldwide. Currently, the conventional methods for NAFLD detection are expensive and not suitable for users to perform daily diagnostics. To ad… ▽ More

    Submitted 5 December, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  9. arXiv:2309.00787  [pdf, other

    cs.RO eess.IV eess.SP eess.SY

    Online Targetless Radar-Camera Extrinsic Calibration Based on the Common Features of Radar and Camera

    Authors: Lei Cheng, Siyang Cao

    Abstract: Sensor fusion is essential for autonomous driving and autonomous robots, and radar-camera fusion systems have gained popularity due to their complementary sensing capabilities. However, accurate calibration between these two sensors is crucial to ensure effective fusion and improve overall system performance. Calibration involves intrinsic and extrinsic calibration, with the latter being particula… ▽ More

    Submitted 24 January, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

  10. arXiv:2308.07077  [pdf

    eess.SP

    Distributed UAV Swarm Augmented Wideband Spectrum Sensing Using Nyquist Folding Receiver

    Authors: Kaili Jiang, Kailun Tian, Hancong Feng, Yuxin Zhao, Dechang Wang, Sen Cao, Jian Gao, Xuying Zhang, Yanfei Li, Junyu Yuan, Ying Xiong, Bin Tang

    Abstract: Distributed unmanned aerial vehicle (UAV) swarms are formed by multiple UAVs with increased portability, higher levels of sensing capabilities, and more powerful autonomy. These features make them attractive for many recent applica-tions, potentially increasing the shortage of spectrum resources. In this paper, wideband spectrum sensing augmented technology is discussed for distributed UAV swarms… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  11. arXiv:2308.07075  [pdf, other

    eess.SP

    Wideband Power Spectrum Sensing: a Fast Practical Solution for Nyquist Folding Receiver

    Authors: Kaili Jiang, Dechang Wang, Kailun Tian, Hancong Feng, Yuxin Zhao, Sen Cao, Jian Gao, Xuying Zhang, Yanfei Li, Junyu Yuan, Ying Xiong, Bin Tang

    Abstract: The limited availability of spectrum resources has been growing into a critical problem in wireless communications, remote sensing, and electronic surveillance, etc. To address the high-speed sampling bottleneck of wideband spectrum sensing, a fast and practical solution of power spectrum estimation for Nyquist folding receiver (NYFR) is proposed in this paper. The NYFR architectures is can theore… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  12. arXiv:2307.15264  [pdf, other

    cs.RO eess.SP eess.SY

    3D Radar and Camera Co-Calibration: A Flexible and Accurate Method for Target-based Extrinsic Calibration

    Authors: Lei Cheng, Arindam Sengupta, Siyang Cao

    Abstract: Advances in autonomous driving are inseparable from sensor fusion. Heterogeneous sensors are widely used for sensor fusion due to their complementary properties, with radar and camera being the most equipped sensors. Intrinsic and extrinsic calibration are essential steps in sensor fusion. The extrinsic calibration, independent of the sensor's own parameters, and performed after the sensors are in… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  13. arXiv:2304.04774  [pdf, other

    cs.CV cs.AI eess.IV

    DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion

    Authors: ZiHan Cao, ShiQi Cao, Xiao Wu, JunMing Hou, Ran Ran, Liang-Jian Deng

    Abstract: Denosing diffusion model, as a generative model, has received a lot of attention in the field of image generation recently, thanks to its powerful generation capability. However, diffusion models have not yet received sufficient research in the field of image fusion. In this article, we introduce diffusion model to the image fusion field, treating the image fusion task as image-to-image translatio… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  14. arXiv:2303.09278  [pdf, other

    eess.AS cs.SD

    DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model

    Authors: Yanzhe Fu, Yueteng Kang, Songjun Cao, Long Ma

    Abstract: Wav2vec 2.0 (W2V2) has shown impressive performance in automatic speech recognition (ASR). However, the large model size and the non-streaming architecture make it hard to be used under low-resource or streaming scenarios. In this work, we propose a two-stage knowledge distillation method to solve these two problems: the first step is to make the big and non-streaming teacher model smaller, and th… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  15. arXiv:2301.02069  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning for Breast MRI Style Transfer with Limited Training Data

    Authors: Shixing Cao, Nicholas Konz, James Duncan, Maciej A. Mazurowski

    Abstract: In this work we introduce a novel medical image style transfer method, StyleMapper, that can transfer medical scans to an unseen style with access to limited training data. This is made possible by training our model on unlimited possibilities of simulated random medical imaging styles on the training set, making our work more computationally efficient when compared with other style transfer metho… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

    Comments: preprint version, accepted in the Journal of Digital Imaging (JDIM). 16 pages (+ author names + references + supplementary), 6 figures

    Journal ref: J Digit Imaging (2022)

  16. Group Testing with Side Information via Generalized Approximate Message Passing

    Authors: Shu-Jie Cao, Ritesh Goenka, Chau-Wai Wong, Ajit Rajwade, Dror Baron

    Abstract: Group testing can help maintain a widespread testing program using fewer resources amid a pandemic. In a group testing setup, we are given n samples, one per individual. Each individual is either infected or uninfected. These samples are arranged into m < n pooled samples, where each pool is obtained by mixing a subset of the n individual samples. Infected individuals are then identified using a g… ▽ More

    Submitted 16 June, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: To appear in IEEE Trans. Signal Processing. arXiv admin note: substantial text overlap with arXiv:2106.02699, arXiv:2011.14186

  17. arXiv:2208.09785  [pdf, ps, other

    eess.SY

    High-Performance Transmission Mechanism Design of Multi-Stream Carrier Aggregation for 5G Non-Standalone Network

    Authors: Jun Yu, Shunqing Zhang, Jiayun Sun, Shugong Xu, Shan Cao

    Abstract: Multi-stream carrier aggregation is a key technology to expand bandwidth and improve the throughput of the fifth-generation wireless communication systems. However, due to the diversified propagation properties of different frequency bands, the traffic migration task is much more challenging, especially in hybrid sub-6 GHz and millimeter wave bands scenario. Existing schemes either neglected to co… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

    Comments: 17 pages, 7 figures

  18. Auto-Encoder-Extreme Learning Machine Model for Boiler NOx Emission Concentration Prediction

    Authors: Zhenhao Tang, Shikui Wang, Xiangying Chai, Shengxian Cao, Tinghui Ouyang, Yang Li

    Abstract: An automatic encoder (AE) extreme learning machine (ELM)-AE-ELM model is proposed to predict the NOx emission concentration based on the combination of mutual information algorithm (MI), AE, and ELM. First, the importance of practical variables is computed by the MI algorithm, and the mechanism is analyzed to determine the variables related to the NOx emission concentration. Then, the time delay c… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted by Energy

    Journal ref: Energy 256 (2022) 124552

  19. arXiv:2206.08189  [pdf, other

    cs.SD eess.AS

    Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training

    Authors: Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki

    Abstract: Recent studies have shown that the benefits provided by self-supervised pre-training and self-training (pseudo-labeling) are complementary. Semi-supervised fine-tuning strategies under the pre-training framework, however, remain insufficiently studied. Besides, modern semi-supervised speech recognition algorithms either treat unlabeled data indiscriminately or filter out noisy samples with a confi… ▽ More

    Submitted 27 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

  20. arXiv:2204.11769  [pdf, ps, other

    eess.IV cs.AI

    Multi-scale reconstruction of undersampled spectral-spatial OCT data for coronary imaging using deep learning

    Authors: Xueshen Li, Shengting Cao, Hongshan Liu, Xinwen Yao, Brigitta C. Brott, Silvio H. Litovsky, Xiaoyu Song, Yuye Ling, Yu Gan

    Abstract: Coronary artery disease (CAD) is a cardiovascular condition with high morbidity and mortality. Intravascular optical coherence tomography (IVOCT) has been considered as an optimal imagining system for the diagnosis and treatment of CAD. Constrained by Nyquist theorem, dense sampling in IVOCT attains high resolving power to delineate cellular structures/ features. There is a trade-off between high… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 11 pages, 8 figures, reviewed by IEEE trans BME

  21. arXiv:2203.04767  [pdf, other

    eess.AS cs.SD

    A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling

    Authors: Yike Zhang, Xiaobing Feng, Yi Liu, Songjun Cao, Long Ma

    Abstract: Automatic speech recognition (ASR) systems used on smart phones or vehicles are usually required to process speech queries from very different domains. In such situations, a vanilla ASR system usually fails to perform well on every domain. This paper proposes a multi-domain ASR framework for Tencent Map, a navigation app used on smart phones and in-vehicle infotainment systems. The proposed framew… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 7 pages, 1 figure

  22. Conquering Data Variations in Resolution: A Slice-Aware Multi-Branch Decoder Network

    Authors: Shuxin Wang, Shilei Cao, Zhizhong Chai, Dong Wei, Kai Ma, Liansheng Wang, Yefeng Zheng

    Abstract: Fully convolutional neural networks have made promising progress in joint liver and liver tumor segmentation. Instead of following the debates over 2D versus 3D networks (for example, pursuing the balance between large-scale 2D pretraining and 3D context), in this paper, we novelly identify the wide variation in the ratio between intra- and inter-slice resolutions as a crucial obstacle to the perf… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Published by IEEE TMI

  23. arXiv:2203.03582  [pdf, other

    cs.CL cs.SD eess.AS

    Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

    Authors: Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

    Abstract: Recently, end-to-end automatic speech recognition models based on connectionist temporal classification (CTC) have achieved impressive results, especially when fine-tuned from wav2vec2.0 models. Due to the conditional independence assumption, CTC-based models are always weaker than attention-based encoder-decoder models and require the assistance of external language models (LMs). To solve this is… ▽ More

    Submitted 22 February, 2022; originally announced March 2022.

    Comments: ICASSP 2022

  24. arXiv:2202.05430  [pdf

    eess.SY eess.SP

    Wind power ramp prediction algorithm based on wavelet deep belief network

    Authors: Zhenhao Tang, Qingyu Meng, Shengxian Cao, Yang Li, Zhongha Mu, Xiaoya Pang

    Abstract: The wind power ramp events threaten the power grid safety significantly. To improve the ramp prediction accuracy, a hybrid wavelet deep belief network algorithm with adaptive feature selection (WDBNAFS) is proposed. First, the wind power characteristic is analyzed. Then, wavelet decomposition is addressed to the time series, and an adaptive feature selection algorithm is proposed to select the inp… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: in Chinese language

    Journal ref: ACTA Energiae Solaris Sinica 40 (2019) 3213-3220

  25. arXiv:2112.07254  [pdf, other

    eess.AS cs.CL cs.SD

    Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

    Authors: Keqi Deng, Songjun Cao, Yike Zhang, Long Ma

    Abstract: Recently, self-supervised pretraining has achieved impressive results in end-to-end (E2E) automatic speech recognition (ASR). However, the dominant sequence-to-sequence (S2S) E2E model is still hard to fully utilize the self-supervised pre-training methods because its decoder is conditioned on acoustic representation thus cannot be pretrained separately. In this paper, we propose a pretrained Tran… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: ASRU2021

  26. arXiv:2109.07349  [pdf, other

    eess.AS cs.SD

    Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning

    Authors: Keqi Deng, Songjun Cao, Long Ma

    Abstract: Recently, self-supervised pre-training has gained success in automatic speech recognition (ASR). However, considering the difference between speech accents in real scenarios, how to identify accents and use accent features to improve ASR is still challenging. In this paper, we employ the self-supervised pre-training method for both accent identification and accented speech recognition tasks. For t… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: INTERSPEECH2021

  27. arXiv:2109.07327  [pdf, ps, other

    eess.AS cs.SD

    Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning

    Authors: Songjun Cao, Yueteng Kang, Yanzhe Fu, Xiaoshuo Xu, Sining Sun, Yike Zhang, Long Ma

    Abstract: Recently self-supervised learning has emerged as an effective approach to improve the performance of automatic speech recognition (ASR). Under such a framework, the neural network is usually pre-trained with massive unlabeled data and then fine-tuned with limited labeled data. However, the non-streaming architecture like bidirectional transformer is usually adopted by the neural network to achieve… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: INTERSPEECH2021

  28. Dynamic Prediction Model for NOx Emission of SCR System Based on Hybrid Data-driven Algorithms

    Authors: Zhenhao Tang, Shikui Wang, Shengxian Cao, Yang Li, Tao Shen

    Abstract: Aiming at the problem that delay time is difficult to determine and prediction accuracy is low in building prediction model of SCR system, a dynamic modeling scheme based on a hybrid of multiple data-driven algorithms was proposed. First, processed abnormal values and normalized the data. To improve the relevance of the input data, used MIC to estimate delay time and reconstructed production data.… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: in Chinese language, Accepted by Proceedings of the CSEE

    Journal ref: Proceedings of the CSEE 42 (2022) 3295-3306

  29. arXiv:2107.10327  [pdf, other

    eess.SP cs.CV

    mmPose-NLP: A Natural Language Processing Approach to Precise Skeletal Pose Estimation using mmWave Radars

    Authors: Arindam Sengupta, Siyang Cao

    Abstract: In this paper we presented mmPose-NLP, a novel Natural Language Processing (NLP) inspired Sequence-to-Sequence (Seq2Seq) skeletal key-point estimator using millimeter-wave (mmWave) radar data. To the best of the author's knowledge, this is the first method to precisely estimate upto 25 skeletal key-points using mmWave radar data alone. Skeletal pose estimation is critical in several applications r… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Submitted to IEEE Transactions

  30. arXiv:2107.03165  [pdf, other

    eess.AS cs.SD

    Improving Speech Recognition Accuracy of Local POI Using Geographical Models

    Authors: Songjun Cao, Yike Zhang, Xiaobing Feng, Long Ma

    Abstract: Nowadays voice search for points of interest (POI) is becoming increasingly popular. However, speech recognition for local POI has remained to be a challenge due to multi-dialect and massive POI. This paper improves speech recognition accuracy for local POI from two aspects. Firstly, a geographic acoustic model (Geo-AM) is proposed. The Geo-AM deals with multi-dialect problem using dialect-specifi… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted by SLT 2021

  31. arXiv:2010.09586  [pdf, other

    eess.IV cs.CV

    Brain Atlas Guided Attention U-Net for White Matter Hyperintensity Segmentation

    Authors: Zicong Zhang, Kimerly Powell, Changchang Yin, Shilei Cao, Dani Gonzalez, Yousef Hannawi, ** Zhang

    Abstract: White Matter Hyperintensities (WMH) are the most common manifestation of cerebral small vessel disease (cSVD) on the brain MRI. Accurate WMH segmentation algorithms are important to determine cSVD burden and its clinical consequences. Most of existing WMH segmentation algorithms require both fluid attenuated inversion recovery (FLAIR) images and T1-weighted images as inputs. However, T1-weighted i… ▽ More

    Submitted 21 December, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted by AMIA 2021 Virtual Informatics Summit

  32. arXiv:2007.12756  [pdf

    cs.SI eess.SY

    Detecting Dynamic States of Temporal Networks Using Connection Series Tensors

    Authors: Shun Cao, Hiroki Sayama

    Abstract: Many temporal networks exhibit multiple system states, such as weekday and weekend patterns in social contact networks. The detection of such distinct states in temporal network data has recently been explored as it helps reveal underlying dynamical processes. A commonly used method is network aggregation over a time window, which aggregates a subsequence of multiple network snapshots into one sta… ▽ More

    Submitted 19 August, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: 18 pages, 9 figures, 3 tables

  33. arXiv:2007.07241  [pdf, other

    cs.SD eess.AS

    Learning Frame Level Attention for Environmental Sound Classification

    Authors: Zhichao Zhang, Shugong Xu, Shunqing Zhang, Tianhao Qiao, Shan Cao

    Abstract: Environmental sound classification (ESC) is a challenging problem due to the complexity of sounds. The classification performance is heavily dependent on the effectiveness of representative features extracted from the environmental sounds. However, ESC often suffers from the semantically irrelevant frames and silent frames. In order to deal with this, we employ a frame-level attention model to foc… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1907.02230

  34. arXiv:2005.00205  [pdf, other

    cs.CL cs.SD eess.AS

    Multi-head Monotonic Chunkwise Attention For Online Speech Recognition

    Authors: Baiji Liu, Songjun Cao, Sining Sun, Weibin Zhang, Long Ma

    Abstract: The attention mechanism of the Listen, Attend and Spell (LAS) model requires the whole input sequence to calculate the attention context and thus is not suitable for online speech recognition. To deal with this problem, we propose multi-head monotonic chunk-wise attention (MTH-MoChA), an improved version of MoChA. MTH-MoChA splits the input sequence into small chunks and computes multi-head attent… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

  35. arXiv:2004.02872  [pdf, other

    eess.IV cs.CV cs.LG

    Lossless Image Compression through Super-Resolution

    Authors: Sheng Cao, Chao-Yuan Wu, Philipp Krähenbühl

    Abstract: We introduce a simple and efficient lossless image compression algorithm. We store a low resolution version of an image as raw pixels, followed by several iterations of lossless super-resolution. For lossless super-resolution, we predict the probability of a high-resolution image, conditioned on the low-resolution input, and use entropy coding to compress this super-resolution operator. Super-Reso… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: Tech report

  36. arXiv:2003.02386  [pdf, ps, other

    cs.LG eess.SP stat.ML

    mmFall: Fall Detection using 4D MmWave Radar and a Hybrid Variational RNN AutoEncoder

    Authors: Feng **, Arindam Sengupta, Siyang Cao

    Abstract: In this paper we propose mmFall - a novel fall detection system, which comprises of (i) the emerging millimeter-wave (mmWave) radar sensor to collect the human body's point cloud along with the body centroid, and (ii) a variational recurrent autoencoder (VRAE) to compute the anomaly level of the body motion based on the acquired point cloud. A fall is claimed to have occurred when the spike in ano… ▽ More

    Submitted 28 July, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Preprint version

  37. arXiv:2002.01527  [pdf

    eess.SY

    Prediction of Component Shifts in Pick and Place Process of Surface Mount Technology Using Support Vector Regression

    Authors: Shun Cao, Irandokht Parviziomran, Haeyong Yang, Seungbae Park, Daehan Won

    Abstract: In pick and place (P&P) process of surface mount technology (SMT) the placed component can shift from its ideal (or designed) position on the wet solder paste. The solder paste with some fluid properties could slump and the unbalance between different sides of solder paste can lead to other forces on the components as well. Though the shifts are usually considered to be negligible and can be made… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: 8 pages, 8 figures, 5 tables, 25th International Conference on Production Research Manufacturing Innovation: Cyber Physical Manufacturing August 9-14, 2019 | Chicago, Illinois (USA)

  38. arXiv:2002.01255  [pdf, other

    cs.IT cs.NI eess.SP

    Revealing Much While Saying Less: Predictive Wireless for Status Update

    Authors: Zhiyuan Jiang, Zixu Cao, Siyu Fu, Fei Peng, Shan Cao, Shunqing Zhang, Shugong Xu

    Abstract: Wireless communications for status update are becoming increasingly important, especially for machine-type control applications. Existing work has been mainly focused on Age of Information (AoI) optimizations. In this paper, a status-aware predictive wireless interface design, networking and implementation are presented which aim to minimize the status recovery error of a wireless networked system… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: To appear in IEEE INFOCOM 2020

  39. arXiv:2001.09619  [pdf

    eess.SY cs.LG stat.AP stat.ML

    Data-Driven Prediction Model of Components Shift during Reflow Process in Surface Mount Technology

    Authors: Irandokht Parviziomran, Shun Cao, Krishnaswami Srihari, Daehan Won

    Abstract: In surface mount technology (SMT), mounted components on soldered pads are subject to move during reflow process. This capability is known as self-alignment and is the result of fluid dynamic behaviour of molten solder paste. This capability is critical in SMT because inaccurate self-alignment causes defects such as overhanging, tombstoning, etc. while on the other side, it can enable components t… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  40. arXiv:2001.09612  [pdf

    math.OC cs.LG eess.SY stat.ML

    Optimization of Passive Chip Components Placement with Self-Alignment Effect for Advanced Surface Mounting Technology

    Authors: Irandokht Parviziomran, Shun Cao, Haeyong Yang, Seungbae Park, Daehan Won

    Abstract: Surface mount technology (SMT) is an enhanced method in electronic packaging in which electronic components are placed directly on soldered printing circuit board (PCB) and are permanently attached on PCB with the aim of reflow soldering process. During reflow process, once deposited solder pastes start melting, electronic components move in a direction that achieve their highest symmetry. This mo… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  41. arXiv:2001.00068  [pdf, other

    stat.AP eess.IV

    Asymptotic convergence rate of the longest run in an inflating Bernoulli net

    Authors: Kai Ni, Shanshan Cao, Xiaoming Huo

    Abstract: In image detection, one problem is to test whether the set, though mostly consisting of uniformly scattered points, also contains a small fraction of points sampled from some (a priori unknown) curve, for example, a curve with $C^α$-norm bounded by $β$. One approach is to analyze the data by counting membership in multiscale multianisotropic strips, which involves an algorithm that delves into the… ▽ More

    Submitted 31 December, 2019; originally announced January 2020.

  42. arXiv:1911.09592  [pdf, other

    eess.SP cs.LG stat.ML

    mm-Pose: Real-Time Human Skeletal Posture Estimation using mmWave Radars and CNNs

    Authors: Arindam Sengupta, Feng **, Renyuan Zhang, Siyang Cao

    Abstract: In this paper, mm-Pose, a novel approach to detect and track human skeletons in real-time using an mmWave radar, is proposed. To the best of the authors' knowledge, this is the first method to detect >15 distinct skeletal joints using mmWave radar reflection signals. The proposed method would find several applications in traffic monitoring systems, autonomous vehicles, patient monitoring systems a… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: Submitted to IEEE Sensors Journal

  43. Automotive Radar Interference Mitigation Using Adaptive Noise Canceller

    Authors: Feng **, Siyang Cao

    Abstract: Interference among frequency modulated continues wave automotive radars can either increase the noise floor, which occurs in the most cases, or generate a ghost target in rare situations. To address the increment of noise floor due to interference, we proposed a low calculation cost method using adaptive noise canceller to increase the signal-to-interference ratio. In a quadrature receiver, the in… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: This paper has been submitted to IEEE Transactions on Vehicular Technology

    Journal ref: in IEEE Transactions on Vehicular Technology, vol. 68, no. 4, pp. 3747-3754, April 2019

  44. arXiv:1911.06364  [pdf, ps, other

    eess.SP cs.LG stat.ML

    MmWave Radar Point Cloud Segmentation using GMM in Multimodal Traffic Monitoring

    Authors: Feng **, Arindam Sengupta, Siyang Cao, Yao-Jan Wu

    Abstract: In multimodal traffic monitoring, we gather traffic statistics for distinct transportation modes, such as pedestrians, cars and bicycles, in order to analyze and improve people's daily mobility in terms of safety and convenience. On account of its robustness to bad light and adverse weather conditions, and inherent speed measurement ability, the radar sensor is a suitable option for this applicati… ▽ More

    Submitted 31 January, 2020; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: This paper has been accepted by the IEEE International Radar Conference 2020

  45. arXiv:1911.06363  [pdf, ps, other

    eess.SP cs.LG stat.ML

    Multiple Patients Behavior Detection in Real-time using mmWave Radar and Deep CNNs

    Authors: Feng **, Renyuan Zhang, Arindam Sengupta, Siyang Cao, Salim Hariri, Nimit K. Agarwal, Sumit K. Agarwal

    Abstract: To address potential gaps noted in patient monitoring in the hospital, a novel patient behavior detection system using mmWave radar and deep convolution neural network (CNN), which supports the simultaneous recognition of multiple patients' behaviors in real-time, is proposed. In this study, we use an mmWave radar to track multiple patients and detect the scattering point cloud of each one. For ea… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: This paper has been submitted to IEEE Radar Conference 2019

  46. arXiv:1908.05863  [pdf, other

    cs.SD cs.LG eess.AS

    Sub-Spectrogram Segmentation for Environmental Sound Classification via Convolutional Recurrent Neural Network and Score Level Fusion

    Authors: Tianhao Qiao, Shunqing Zhang, Zhichao Zhang, Shan Cao, Shugong Xu

    Abstract: Environmental Sound Classification (ESC) is an important and challenging problem, and feature representation is a critical and even decisive factor in ESC. Feature representation ability directly affects the accuracy of sound classification. Therefore, the ESC performance is heavily dependent on the effectiveness of representative features extracted from the environmental sounds. In this paper, we… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: accepted in the 2019 IEEE International Workshop on Signal Processing Systems (SiPS2019)

  47. arXiv:1908.02334  [pdf

    q-bio.QM cs.LG eess.IV physics.med-ph stat.AP stat.ML

    Predicted disease compositions of human gliomas estimated from multiparametric MRI can predict endothelial proliferation, tumor grade, and overall survival

    Authors: Emily E Diller, Sha Cao, Beth Ey, Robert Lober, Jason G Parker

    Abstract: Background and Purpose: Biopsy is the main determinants of glioma clinical management, but require invasive sampling that fail to detect relevant features because of tumor heterogeneity. The purpose of this study was to evaluate the accuracy of a voxel-wise, multiparametric MRI radiomic method to predict features and develop a minimally invasive method to objectively assess neoplasms. Methods: M… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: 13 pages, 3 figures, 5 tables

  48. arXiv:1907.02230  [pdf, other

    cs.SD cs.LG eess.AS

    Attention based Convolutional Recurrent Neural Network for Environmental Sound Classification

    Authors: Zhichao Zhang, Shugong Xu, Tianhao Qiao, Shunqing Zhang, Shan Cao

    Abstract: Environmental sound classification (ESC) is a challenging problem due to the complexity of sounds. The ESC performance is heavily dependent on the effectiveness of representative features extracted from the environmental sounds. However, ESC often suffers from the semantically irrelevant frames and silent frames. In order to deal with this, we employ a frame-level attention model to focus on the s… ▽ More

    Submitted 4 July, 2019; originally announced July 2019.

    Comments: Accepted to Chinese Conference on Pattern Recognition and Computer Vision (PRCV) 2019

  49. arXiv:1907.00594  [pdf, other

    eess.SP eess.SY

    Fingerprint-based Localization using Commercial LTE Signals: A Field-Trial Study

    Authors: Heng Zhang, Zhichao Zhang, Shunqing Zhang, Shugong Xu, Shan Cao

    Abstract: Wireless localization for mobile device has attracted more and more interests by increasing the demand for location based services. Fingerprint-based localization is promising, especially in non-Line-of-Sight (NLoS) or rich scattering environments, such as urban areas and indoor scenarios. In this paper, we propose a novel fingerprint-based localization technique based on deep learning framework u… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 5 pages, 7 figures, conference

  50. arXiv:1905.05953  [pdf

    eess.IV cs.AI

    Learning-based Single-step Quantitative Susceptibility Map** Reconstruction Without Brain Extraction

    Authors: Hongjiang Wei, Steven Cao, Yuyao Zhang, Xiaojun Guan, Fuhua Yan, Kristen W. Yeom, Chunlei Liu

    Abstract: Quantitative susceptibility map** (QSM) estimates the underlying tissue magnetic susceptibility from MRI gradient-echo phase signal and typically requires several processing steps. These steps involve phase unwrap**, brain volume extraction, background phase removal and solving an ill-posed inverse problem. The resulting susceptibility map is known to suffer from inaccuracy near the edges of t… ▽ More

    Submitted 15 May, 2019; originally announced May 2019.

    Comments: 26 pages