Showing 1–48 of 48 results for author: Gaofeng

  1. arXiv:2406.15222  [pdf]

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, **gyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan, et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed…

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: under peer review

  2. arXiv:2406.06842  [pdf, ps, other]

    cs.IT eess.SP

    Aerial Relay to Achieve Covertness and Security

    Authors: Jiacheng Jiang, Hongjiang Lei, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: In this work, a delay-tolerant unmanned aerial vehicle (UAV) relayed covert and secure communication framework is investigated. In this framework, a legitimate UAV serves as an aerial relay to realize communication when the direct link between the terrestrial transmitter and receiver is blocked and also acts as a friendly jammer to suppress the malicious nodes presented on the ground. Subsequently…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, submitted to IEEE Journal for review

  3. arXiv:2406.01313  [pdf, ps, other]

    cs.IT eess.SP

    3D Trajectory Design for Energy-constrained Aerial CRNs Under Probabilistic LoS Channel

    Authors: Hongjiang Lei, Xiaqiu Wu, Ki-Hong Park, Gaofeng Pan

    Abstract: Unmanned aerial vehicles (UAVs) have been attracting significant attention because there is a high probability of line-of-sight links being obtained between them and terrestrial nodes in high-rise urban areas. In this work, we investigate cognitive radio networks (CRNs) by jointly designing three-dimensional (3D) trajectory, the transmit power of the UAV, and user scheduling. Considering the UAV's…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures, submitted to the IEEE journal for review

  4. arXiv:2405.15717  [pdf, other]

    eess.SY

    Integrated Design for Wave Energy Converter Farms: Assessing Plant, Control, Layout, and Site Selection Coupling in the Presence of Irregular Waves

    Authors: Saeed Azad, Suraj Khanal, Daniel R. Herber, Gaofeng Jia

    Abstract: A promising direction towards reducing the levelized cost of energy for wave energy converter (WEC) farms is to improve their performance. WEC design studies generally focus on a single design domain (e.g., geometry, control, or layout) to improve the farm's performance under simplifying assumptions, such as regular waves. This strategy, however, has resulted in design recommendations that are imp…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 12 pages and 7 figures

  5. arXiv:2405.10691  [pdf, other]

    eess.IV cs.CV

    LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion

    Authors: Zihao Zhu, Tianli Tao, Yitian Tao, Haowen Deng, Xinyi Cai, Gaofeng Wu, Kaidong Wang, Haifeng Tang, Lixuan Zhu, Zhuoyang Gu, Jiawei Huang, Dinggang Shen, Han Zhang

    Abstract: The infant brain undergoes rapid development in the first few years after birth. Compared to cross-sectional studies, longitudinal studies can depict the trajectories of infant brain development with higher accuracy, statistical power and flexibility. However, the collection of infant longitudinal magnetic resonance (MR) data suffers a notorious dropout problem, resulting in incomplete datasets wit…

    Submitted 17 May, 2024; originally announced May 2024.

  6. arXiv:2405.08306  [pdf, other]

    math.OC eess.SY

    Flight Path Optimization with Optimal Control Method

    Authors: Gaofeng Su, Xi Cheng, Siyuan Feng, Ke Liu, Jilin Song, Jianan Chen, Chen Zhu, Hui Lin

    Abstract: This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control (CFTOC). The first step is to d…

    Submitted 14 May, 2024; originally announced May 2024.

  7. arXiv:2405.06794  [pdf, other]

    eess.SY

    Site-dependent Solutions of Wave Energy Converter Farms with Surrogate Models, Control Co-design, and Layout Optimization

    Authors: Saeed Azad, Daniel R. Herber, Suraj Khanal, Gaofeng Jia

    Abstract: Design of wave energy converter farms entails multiple domains that are coupled, and thus, their concurrent representation and consideration in early-stage design optimization has the potential to offer new insights and promising solutions with improved performance. Concurrent optimization of physical attributes (e.g., plant) and the control system design is often known as control co-design or CCD…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 9 pages, 9 figures

  8. arXiv:2403.12467  [pdf, other]

    eess.SP

    Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications

    Authors: Heng Wang, Jianhua Zhang, Gaofeng Nie, Li Yu, Zhiqiang Yuan, Tongjie Li, Jialin Wang, Guangyi Liu

    Abstract: Digital twin channel (DTC) is the real-time mapping of a wireless channel from the physical world to the digital world, which is expected to provide significant performance enhancements for the sixth-generation (6G) air-interface design. In this work, we first define five evolution levels of channel twins with the progression of wireless communication. The fifth level, autonomous DTC, is elaborate…

    Submitted 31 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, 15 references. It is submitted to IEEE journal

  9. arXiv:2402.14099  [pdf, other]

    eess.IV cs.CV physics.med-ph

    EXACT-Net: EHR-guided lung tumor auto-segmentation for non-small cell lung cancer radiotherapy

    Authors: Hamed Hooshangnejad, Xue Feng, Gaofeng Huang, Rui Zhang, Quan Chen, Kai Ding

    Abstract: Lung cancer is a devastating disease with the highest mortality rate among cancer types. Over 60% of non-small cell lung cancer (NSCLC) patients, which accounts for 87% of diagnoses, require radiation therapy. Rapid treatment initiation significantly increases the patient's survival rate and reduces the mortality rate. Accurate tumor segmentation is a critical step in the diagnosis and treatment o…

    Submitted 21 February, 2024; originally announced February 2024.

  10. arXiv:2312.10287  [pdf, other]

    eess.SP

    Towards 6G Digital Twin Channel Using Radio Environment Knowledge Pool

    Authors: Jialin Wang, Jianhua Zhang, Yuxiang Zhang, Yutong Sun, Gaofeng Nie, Lianzheng Shi, ** Zhang, Guangyi Liu

    Abstract: The digital twin channel (DTC) is crucial for 6G wireless autonomous networks as it replicates the wireless channel fading states in 6G air interface transmissions. It is well known that the physical environment influences channels. A key task for accurately twinning channels in complex 6G scenarios is establishing precise relationships between the environment and the channels. In this article, th…

    Submitted 26 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  11. arXiv:2311.06825  [pdf, ps, other]

    cs.IT eess.SP

    Secure Rate-Splitting Multiple Access Transmissions in LMS Systems

    Authors: Minjue He, Hui Zhao, Xiaqing Miao, Shuai Wang, Gaofeng Pan

    Abstract: This letter investigates the secure delivery performance of the rate-splitting multiple access scheme in land mobile satellite (LMS) systems, considering that the private messages intended by a terminal can be eavesdropped by any others from the broadcast signals. Specifically, the considered system has an N-antenna satellite and numerous single-antenna land users. Maximum ratio transmission (MRT)…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, 1 table

  12. arXiv:2310.13932  [pdf, ps, other]

    cs.IT eess.SP

    Trajectory and Power Design for Aerial Multi-User Covert Communications

    Authors: Hongjiang Lei, Jiacheng Jiang, Imran Shafique Ansari, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) can provide wireless access to terrestrial users, regardless of geographical constraints, and will be an important part of future communication systems. In this paper, a multi-user downlink dual-UAVs enabled covert communication system was investigated, in which a UAV transmits secure information to ground users in the presence of multiple wardens as well as a frien…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 30 pages, 9 figures, submitted to the IEEE journal for review

  13. arXiv:2310.13931  [pdf, ps, other]

    cs.IT eess.SP

    Trajectory and power design for aerial CRNs with colluding eavesdroppers

    Authors: Hongjiang Lei, Jiacheng Jiang, Haosi Yang, Ki-Hong Park, Imran Shafique Ansari, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) can provide wireless access services to terrestrial users without geographical limitations and will become an essential part of the future communication system. However, the openness of wireless channels and the mobility of UAVs make the security of UAV-based communication systems particularly challenging. This work investigates the security of aerial cognitive radi…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 10 pages, 7 figures, submitted to the IEEE journal for review

  14. Convergence Analysis and Latency Minimization for Semi-Federated Learning in Massive IoT Networks

    Authors: Jianyang Ren, Wanli Ni, Hui Tian, Gaofeng Nie

    Abstract: As the number of sensors becomes massive in Internet of Things (IoT) networks, the amount of data is humongous. To process data in real-time while protecting user privacy, federated learning (FL) has been regarded as an enabling technique to push edge intelligence into IoT networks with massive devices. However, FL latency increases dramatically due to the increase of the number of parameters in d…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted by IEEE Transactions on Green Communications and Networking

  15. arXiv:2308.06547  [pdf, other]

    eess.AS cs.CL cs.SD

    Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition

    Authors: Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan

    Abstract: When labeled data is insufficient, semi-supervised learning with the pseudo-labeling technique can significantly improve the performance of automatic speech recognition. However, pseudo-labels are often noisy, containing numerous incorrect tokens. Taking noisy labels as ground-truth in the loss function results in suboptimal performance. Previous works attempted to mitigate this issue by either fi…

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2023

  16. Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture

    Authors: Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

    Abstract: Recently, there has been increasing progress in end-to-end automatic speech recognition (ASR) architecture, which transcribes speech to text without any pre-trained alignments. One popular end-to-end approach is the hybrid Connectionist Temporal Classification (CTC) and attention (CTC/attention) based ASR architecture. However, how to deploy hybrid CTC/attention systems for online speech recogniti…

    Submitted 5 July, 2023; originally announced July 2023.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 28, 2020, Pages 1452 - 1465

  17. arXiv:2302.13222  [pdf, other]

    cs.CL cs.SD eess.AS

    Speech Corpora Divergence Based Unsupervised Data Selection for ASR

    Authors: Changfeng Gao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

    Abstract: Selecting application scenarios matching data is important for the automatic speech recognition (ASR) training, but it is difficult to measure the matching degree of the training corpus. This study proposes an unsupervised target-aware data selection method based on speech corpora divergence (SCD), which can measure the similarity between two speech corpora. We first use the self-supervised Hubert…

    Submitted 25 February, 2023; originally announced February 2023.

  18. arXiv:2210.06091  [pdf]

    cs.CL cs.SD eess.AS

    Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

    Authors: Shuhao Deng, Chengfei Li, **feng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

    Abstract: Code-switching automatic speech recognition becomes one of the most challenging and the most valuable scenarios of automatic speech recognition, due to the code-switching phenomenon between multilingual language and the frequent occurrence of code-switching phenomenon in daily life. The ISCSLP 2022 Chinese-English Code-Switching Automatic Speech Recognition (CSASR) Challenge aims to promote the de…

    Submitted 13 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: accepted by ISCSLP 2022

  19. arXiv:2210.05868  [pdf, ps, other]

    eess.SP

    On Secure Uplink Transmission in Hybrid RF-FSO Cooperative Satellite-Aerial-Terrestrial Networks

    Authors: Yuanyuan Ma, Tiejun Lv, Gaofeng Pan, Yunfei Chen, Mohamed-Slim Alouini

    Abstract: This work investigates the secrecy outage performance of the uplink transmission of a radio-frequency (RF)-free-space optical (FSO) hybrid cooperative satellite-aerial-terrestrial network (SATN). Specifically, in the considered cooperative SATN, a terrestrial source (S) transmits its information to a satellite receiver (D) via the help of a cache-enabled aerial relay (R) terminal with the most pop…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: 14 pages, 9 figures, accepted by IEEE Transactions on Communications

  20. arXiv:2208.08042  [pdf, other]

    cs.CL cs.SD eess.AS

    The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

    Authors: Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan

    Abstract: The conversation scenario is one of the most important and most challenging scenarios for speech processing technologies because people in conversation respond to each other in a casual style. Detecting the speech activities of each person in a conversation is vital to downstream tasks, like natural language processing, machine translation, etc. People refer to the detection technology of "who spe…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:2203.16844

  21. arXiv:2207.02495  [pdf, other]

    eess.AS cs.SD

    Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

    Authors: Zehan Li, Haoran Miao, Keqi Deng, Gaofeng Cheng, Sanli Tian, Ta Li, Yonghong Yan

    Abstract: There is often a trade-off between performance and latency in streaming automatic speech recognition (ASR). Traditional methods, such as look-ahead and chunk-based methods, usually require information from future frames to advance recognition accuracy, which incurs inevitable latency even if the computation is fast enough. A causal model that computes without any future frames can avoid this latenc…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted by Interspeech 2022

  22. arXiv:2206.13760  [pdf, other]

    eess.AS cs.MM

    Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

    Authors: Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

    Abstract: For online speaker diarization, samples arrive incrementally, and the overall distribution of the samples is invisible. Moreover, in most existing clustering-based methods, the training objective of the embedding extractor is not designed specially for clustering. To improve online speaker diarization performance, we propose a unified online clustering framework, which provides an interactive mann…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Accepted by Interspeech 2022

  23. arXiv:2206.09783  [pdf, other]

    eess.AS cs.CL cs.SD

    Boosting Cross-Domain Speech Recognition with Self-Supervision

    Authors: Han Zhu, Gaofeng Cheng, **dong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan

    Abstract: The cross-domain performance of automatic speech recognition (ASR) could be severely hampered due to the mismatch between training and testing distributions. Since the target domain usually lacks labeled data, and domain shifts exist at acoustic and linguistic levels, it is challenging to perform unsupervised domain adaptation (UDA) for ASR. Previous work has shown that self-supervised learning (S…

    Submitted 30 July, 2023; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2023

  24. arXiv:2206.09102  [pdf, other]

    eess.AS cs.CL cs.DC cs.SD

    Decoupled Federated Learning for ASR with Non-IID Data

    Authors: Han Zhu, **dong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

    Abstract: Automatic speech recognition (ASR) with federated learning (FL) makes it possible to leverage data from multiple clients without compromising privacy. The quality of FL-based ASR could be measured by recognition performance, communication and computation costs. When data among different clients are not independently and identically distributed (non-IID), the performance could degrade significantly…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted by Interspeech 2022

  25. Effect of Strong Time-Varying Transmission Distance on LEO Satellite-Terrestrial Deliveries

    Authors: Yuanyuan Ma, Tiejun Lv, Tingting Li, Gaofeng Pan, Yunfei Chen, Mohamed-Slim Alouini

    Abstract: In this paper, we investigate the effect of the strong time-varying transmission distance on the performance of the low-earth orbit (LEO) satellite-terrestrial transmission (STT) system. We propose a new analytical framework using finite-state Markov channel (FSMC) model and time discretization method. Moreover, to demonstrate the applications of the proposed framework, the performances of two ada…

    Submitted 11 June, 2022; originally announced June 2022.

    Comments: 13 pages, 10 figures, Accepted by IEEE Transactions on Vehicular Technology

  26. arXiv:2203.16844  [pdf, ps, other]

    cs.CL eess.AS

    Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational (RAMC) Speech Dataset

    Authors: Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui **, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan

    Abstract: This paper introduces a high-quality rich annotated Mandarin conversational (RAMC) speech dataset called MagicData-RAMC. The MagicData-RAMC corpus contains 180 hours of conversational speech data recorded from native speakers of Mandarin Chinese over mobile phones with a sampling rate of 16 kHz. The dialogs in MagicData-RAMC are classified into 15 diversified domains and tagged with topic labels,…

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: Paper on submission to Interspeech 2022

  27. arXiv:2203.09294  [pdf, other]

    cs.CV eess.IV

    A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

    Authors: Shi Guo, Xi Yang, Jianqi Ma, Gaofeng Ren, Lei Zhang

    Abstract: Denoising and demosaicking are two essential steps to reconstruct a clean full-color image from the raw data. Recently, joint denoising and demosaicking (JDD) for burst images, namely JDD-B, has attracted much attention by using multiple raw images captured in a short time to reconstruct a single high-quality image. One key challenge of JDD-B lies in the robust alignment of image frames. State-of-…

    Submitted 17 March, 2022; originally announced March 2022.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition 2022

  28. arXiv:2203.03582  [pdf, other]

    cs.CL cs.SD eess.AS

    Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

    Authors: Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

    Abstract: Recently, end-to-end automatic speech recognition models based on connectionist temporal classification (CTC) have achieved impressive results, especially when fine-tuned from wav2vec2.0 models. Due to the conditional independence assumption, CTC-based models are always weaker than attention-based encoder-decoder models and require the assistance of external language models (LMs). To solve this is…

    Submitted 22 February, 2022; originally announced March 2022.

    Comments: ICASSP 2022

  29. arXiv:2201.10103  [pdf, other]

    eess.AS cs.SD

    Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models

    Authors: Keqi Deng, Zehui Yang, Shinji Watanabe, Yosuke Higuchi, Gaofeng Cheng, Pengyuan Zhang

    Abstract: While Transformers have achieved promising results in end-to-end (E2E) automatic speech recognition (ASR), their autoregressive (AR) structure becomes a bottleneck for speeding up the decoding process. For real-world deployment, ASR systems are desired to be highly accurate while achieving fast inference. Non-autoregressive (NAR) models have become a popular alternative due to their fast inference…

    Submitted 26 January, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Accepted by ICASSP 2022

  30. arXiv:2201.03063  [pdf, other]

    eess.SP

    Low Earth Orbit Satellite Security and Reliability: Issues, Solutions, and the Road Ahead

    Authors: **yue Yue, Jian** An, Jiankang Zhang, Jia Ye, Gaofeng Pan, Shuai Wang, Pei Xiao, Lajos Hanzo

    Abstract: Low Earth Orbit (LEO) satellites undergo a period of rapid development driven by ever-increasing user demands, reduced costs, and technological progress. Since there is a paucity of literature on the security and reliability issues of LEO Satellite Communication Systems (SCSs), we aim to fill this knowledge gap. Specifically, we critically appraise the inherent characteristics of LEO SCSs and elab…

    Submitted 18 July, 2023; v1 submitted 9 January, 2022; originally announced January 2022.

  31. arXiv:2112.12522  [pdf, other]

    cs.SD cs.CL eess.AS

    Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition

    Authors: Changfeng Gao, Gaofeng Cheng, Pengyuan Zhang

    Abstract: Automatic speech recognition (ASR) has shown rapid advances in recent years but still degrades significantly in far-field and noisy environments. The recent development of self-supervised learning (SSL) technology can improve the ASR performance by pre-training the model with additional unlabeled speech and the SSL pre-trained model has achieved the state-of-the-art result on several speech benchm…

    Submitted 4 May, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: 6 pages, 3 figures

  32. arXiv:2110.04484  [pdf, other]

    eess.AS cs.CL cs.SD

    Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR

    Authors: Han Zhu, Li Wang, **dong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

    Abstract: Self-supervised pre-training could effectively improve the performance of low-resource automatic speech recognition (ASR). However, existing self-supervised pre-training methods are task-agnostic, i.e., could be applied to various downstream tasks. Although it enlarges the scope of its application, the capacity of the pre-trained model is not fully utilized for the ASR task, and the learned representation…

    Submitted 17 June, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted by Interspeech 2022

  33. arXiv:2108.03799  [pdf, other]

    eess.IV cs.CV

    COVID-view: Diagnosis of COVID-19 using Chest CT

    Authors: Shreeraj Jadhav, Gaofeng Deng, Marlene Zawin, Arie E. Kaufman

    Abstract: Significant work has been done towards deep learning (DL) models for automatic lung and lesion segmentation and classification of COVID-19 on chest CT data. However, comprehensive visualization systems focused on supporting the dual visual+DL diagnosis of COVID-19 are non-existent. We present COVID-view, a visualization application specially tailored for radiologists to diagnose COVID-19 from ches…

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 11 pages, 10 figures, accepted to IEEE VIS 2021 conference and IEEE Transactions on Visualization and Computer Graphics

  34. arXiv:2107.07907  [pdf, other]

    eess.IV cs.CV cs.MM

    Lightness Modulated Deep Inverse Tone Mapping

    Authors: Kanglin Liu, Gaofeng Cao, Jiang Duan, Guo** Qiu

    Abstract: Single-image HDR reconstruction or inverse tone mapping (iTM) is a challenging task. In particular, recovering information in over-exposed regions is extremely difficult because details in such regions are almost completely lost. In this paper, we present a deep learning based iTM method that takes advantage of the feature extraction and mapping power of deep convolutional neural networks (CNNs) a…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 11 pages, 10 figures

  35. Online Offloading Scheduling for NOMA-Aided MEC Under Partial Device Knowledge

    Authors: Meihui Hua, Hui Tian, Xinchen Lyu, Wanli Ni, Gaofeng Nie

    Abstract: By exploiting the superiority of non-orthogonal multiple access (NOMA), NOMA-aided mobile edge computing (MEC) can provide scalable and low-latency computing services for the Internet of Things. However, given the prevalent stochasticity of wireless networks and sophisticated signal processing of NOMA, it is critical but challenging to design an efficient task offloading algorithm for NOMA-aided M…

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: 15 pages, 6 figures. Accepted for publication in IEEE Internet of Things Journal

  36. arXiv:2104.09177  [pdf, ps, other]

    eess.SP

    Research on Resource Allocation for Efficient Federated Learning

    Authors: Jianyang Ren, Wanli Ni, Gaofeng Nie, Hui Tian

    Abstract: As a promising solution to achieve efficient learning among isolated data owners and solve data privacy issues, federated learning is receiving wide attention. Using the edge server as an intermediary can effectively collect sensor data, perform local model training, and upload model parameters for global aggregation. So this paper proposes a new framework for resource allocation in a hierarchical…

    Submitted 12 September, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: 14 pages, 13 figures

  37. arXiv:2010.12875  [pdf, ps, other]

    cs.IT eess.SP

    Stochastic Analysis of Cooperative Satellite-UAV Communications

    Authors: Yu Tian, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: In this paper, a dual-hop cooperative satellite-unmanned aerial vehicle (UAV) communication system including a satellite (S), a group of cluster headers (CHs), which are respectively with a group of uniformly distributed UAVs, is considered. Specifically, these CHs serve as aerial decode-and-forward relays to forward the information transmitted by S to UAVs. Moreover, free-space optical (FSO) and…

    Submitted 18 March, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

  38. arXiv:2009.07221  [pdf, ps, other]

    eess.SP

    On NOMA-Based mmWave Communications

    Authors: Yu Tian, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Non-orthogonal multiple access (NOMA) and millimeter-wave (mmWave) communication are two promising techniques to increase the system capacity in the fifth-generation (5G) mobile network. The former can achieve high spectral efficiency by modulating the information in power domain and the latter can provide extremely large spectrum resources. Fluctuating two-ray (FTR) channel model has already been…

    Submitted 15 September, 2020; originally announced September 2020.

  39. arXiv:2006.11854  [pdf, other]

    eess.SP

    Performance Analysis and Optimization of Cooperative Satellite-Aerial-Terrestrial Systems

    Authors: Gaofeng Pan, Jia Ye, Yongqiang Zhang, Mohamed-Slim Alouini

    Abstract: Aerial relays have been regarded as an alternative and promising solution to extend and improve satellite-terrestrial communications, as the probability of line-of-sight transmissions increases compared with adopting terrestrial relays. In this paper, a cooperative satellite-aerial-terrestrial system including a satellite transmitter (S), a group of terrestrial receivers (D), and an aerial relay (…

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: 15 pages, 17 figures

  40. On the Secrecy of UAV Systems With Linear Trajectory

    Authors: Gaofeng Pan, Hongjiang Lei, Jian** An, Shuo Zhang, Mohamed-Slim Alouini

    Abstract: By observing the fact that moving in a straight line is a common flying behavior of unmanned aerial vehicles (UAVs) in normal applications, e.g., power line inspections, and air patrols along with highway/streets/borders, in this paper we investigate the secrecy outage performance of a UAV system with linear trajectory, where a UAV ($S$) flies in a straight line and transmits its information over…

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: 27 pages, 13 figures

  41. arXiv:2006.05782  [pdf, ps, other]

    eess.SP cs.CV cs.LG

    Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges

    Authors: Yu Tian, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Deep learning (DL) has seen great success in the computer vision (CV) field, and related techniques have been used in security, healthcare, remote sensing, and many other fields. As a parallel development, visual data has become universal in daily life, easily generated by ubiquitous low-cost cameras. Therefore, exploring DL-based CV may yield useful information about objects, such as their number…

    Submitted 2 December, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

  42. arXiv:2005.12561  [pdf, other]

    eess.SP

    When Full-Duplex Transmission Meets Intelligent Reflecting Surface: Opportunities and Challenges

    Authors: Gaofeng Pan, Jia Ye, Jianping An, Mohamed-Slim Alouini

    Abstract: Full-duplex (FD) transmission has already been regarded and developed as a promising method to improve the utilization efficiency of the limited spectrum resource, as transmitting and receiving are allowed to simultaneously occur on the same frequency band. Nowadays, benefiting from the recent development of intelligent reflecting surface (IRS), some unique electromagnetic (EM) functionalities, li…

    Submitted 26 May, 2020; originally announced May 2020.

  43. arXiv:2005.00832  [pdf, other]

    eess.SY

    Flying Car Transportation System: Advances, Techniques, and Challenges

    Authors: Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Since the development of transport systems, humans have exploited ground-level, below-ground, and high-altitude spaces for transportation purposes. However, with the increasing burden of expanding populations and rapid urbanization in recent decades, public transportation systems and freight traffic are suffering huge pressure, plaguing local governments and straining economies. Engineers and rese…

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: 15 pages, 15 figures

  44. arXiv:2001.08290  [pdf, other]

    eess.AS cs.LG cs.NE cs.SD stat.ML

    Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture

    Authors: Haoran Miao, Gaofeng Cheng, Changfeng Gao, Pengyuan Zhang, Yonghong Yan

    Abstract: Recently, Transformer has gained success in automatic speech recognition (ASR) field. However, it is challenging to deploy a Transformer-based end-to-end (E2E) model for online speech recognition. In this paper, we propose the Transformer-based online CTC/attention E2E ASR architecture, which contains the chunk self-attention encoder (chunk-SAE) and the monotonic truncated attention (MTA) based se…

    Submitted 11 February, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: Accepted by ICASSP 2020

  45. arXiv:1912.11613  [pdf, other]

    cs.SD cs.LG eess.AS

    Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation

    Authors: Lu Huang, Gaofeng Cheng, Pengyuan Zhang, Yi Yang, Shumin Xu, Jiasong Sun

    Abstract: Utterance-level permutation invariant training (uPIT) has achieved promising progress on single-channel multi-talker speech separation task. Long short-term memory (LSTM) and bidirectional LSTM (BLSTM) are widely used as the separation networks of uPIT, i.e. uPIT-LSTM and uPIT-BLSTM. uPIT-LSTM has lower latency but worse performance, while uPIT-BLSTM has better performance but higher latency. In t…

    Submitted 25 December, 2019; originally announced December 2019.

    Comments: Proceedings of APSIPA Annual Summit and Conference 2019, 18-21 November 2019, Lanzhou, China

  46. Secrecy Outage Analysis Over Fluctuating Two-Ray Fading Channels

    Authors: Hui Zhao, Liang Yang, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: In this letter, we analyze the secrecy outage probability (SOP) over fluctuating two-ray fading channels but with a different definition from the one adopted in [5]. Following the new defined SOP, we derive an analytical closed-form expression for our proposed SOP, as well as an asymptotic formula valid in the high signal-to-noise ratio region of the source to destination link. In the numerical re…

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 2 pages, 2 figures

  47. Secure Analysis Over Generalized-K Channels

    Authors: Luyao Zhang, Hui Zhao, Gaofeng Pan, Liang Yang, Jiawei Chen

    Abstract: In this letter, we adopt the SOP definition in [4] and the simplified model of [8], and derive a closed-form expression for the proposed SOP over GK fading channels. To simplify this expression and obtain additional insights, we also perform an asymptotic analysis of the main link in the high SNR region.

    Submitted 18 May, 2019; originally announced May 2019.

    Comments: 1 figure, 3 pages

  48. Secure mmWave Communications in Cognitive Radio Networks

    Authors: Hui Zhao, Jiayi Zhang, Liang Yang, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: In this letter, the secrecy performance in cognitive radio networks (CRNs) over fluctuating two-ray (FTR) channels, which is used to model the millimetre wave channel, is investigated in terms of the secrecy outage probability (SOP). Specifically, we consider the case where a source (S) transmits confidential messages to a destination (D), and an eavesdropper wants to wiretap the information from…

    Submitted 12 April, 2019; originally announced April 2019.

    Comments: 4 pages, 3 figures