Showing 1–9 of 9 results for author: Shi, E

Searching in archive eess.
  1. Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems

    Authors: Yiyang Zhu, Enyu Shi, Ziheng Liu, Jiayi Zhang, Bo Ai

    Abstract: Cell-free (CF) massive multiple-input multiple-output (mMIMO) is a promising technique for achieving high spectral efficiency (SE) using multiple distributed access points (APs). However, harsh propagation environments often lead to significant communication performance degradation due to high penetration loss. To overcome this issue, we introduce the reconfigurable intelligent surface (RIS) into…

    Submitted 22 April, 2024; originally announced April 2024.

  2. arXiv:2401.01572  [pdf, other]

    cs.CL cs.SD eess.AS

    Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models

    Authors: Rita Frieske, Bertram E. Shi

    Abstract: Hallucinations are a type of output error produced by deep neural networks. While this has been studied in natural language processing, they have not been researched previously in automatic speech recognition. Here, we define hallucinations in ASR as transcriptions generated by a model that are semantically unrelated to the source utterance, yet still fluent and coherent. The similarity of halluci…

    Submitted 3 January, 2024; originally announced January 2024.

  3. arXiv:2310.00263  [pdf, ps, other]

    cs.IT eess.SP

    RIS-Aided Cell-Free Massive MIMO Systems for 6G: Fundamentals, System Design, and Applications

    Authors: Enyu Shi, Jiayi Zhang, Hongyang Du, Bo Ai, Chau Yuen, Dusit Niyato, Khaled B. Letaief, Xuemin Shen

    Abstract: An introduction of intelligent interconnectivity for people and things has posed higher demands and more challenges for sixth-generation (6G) networks, such as high spectral efficiency and energy efficiency, ultra-low latency, and ultra-high reliability. Cell-free (CF) massive multiple-input multiple-output (mMIMO) and reconfigurable intelligent surface (RIS), also called intelligent reflecting su…

    Submitted 22 May, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: Proceedings of the IEEE, Accept, 2024

  4. arXiv:2306.08278  [pdf, ps, other]

    cs.IT eess.SP

    Uplink Performance of RIS-aided Cell-Free Massive MIMO System with Electromagnetic Interference

    Authors: Enyu Shi, Jiayi Zhang, Derrick Wing Kwan Ng, Bo Ai

    Abstract: Cell-free (CF) massive multiple-input multiple-output (MIMO) and reconfigurable intelligent surface (RIS) are two promising technologies for realizing future beyond-fifth generation (B5G) networks. In this paper, we consider a practical spatially correlated RIS-aided CF massive MIMO system with multi-antenna access points (APs) over spatially correlated fading channels. Different from previous wor…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: to appear in IEEE Journal on Selected Areas in Communications

  5. arXiv:2209.13845  [pdf, ps, other]

    cs.IT eess.SP

    Uplink Performance of RIS-aided Cell-Free Massive MIMO System Over Spatially Correlated Channels

    Authors: Enyu Shi, Jiayi Zhang, Zhe Wang, Derrick Wing Kwan Ng, Bo Ai

    Abstract: We consider a practical spatially correlated reconfigurable intelligent surface (RIS)-aided cell-free (CF) massive multiple-input-multiple-output (mMIMO) system with multi-antenna access points (APs) over spatially correlated Rician fading channels. The minimum mean square error (MMSE) channel estimator is adopted to estimate the aggregated RIS channels. Then, we investigate the uplink spectral ef…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 6 pages, 5 figures

    Journal ref: early access, Globecom 2022

  6. arXiv:2201.09622  [pdf, ps, other]

    cs.IT eess.SP

    Uplink Performance of High-Mobility Cell-Free Massive MIMO-OFDM Systems

    Authors: Jiakang Zheng, Jiayi Zhang, Enyu Shi, Jing Jiang, Bo Ai

    Abstract: High-speed train (HST) communications with orthogonal frequency division multiplexing (OFDM) techniques have received significant attention in recent years. Besides, cell-free (CF) massive multiple-input multiple-output (MIMO) is considered a promising technology to achieve the ultimate performance limit. In this paper, we focus on the performance of CF massive MIMO-OFDM systems with both matched…

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: Accepted in IEEE ICC 2022

  7. arXiv:2201.02419  [pdf, other]

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset

    Authors: Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

    Abstract: Automatic speech recognition (ASR) on low resource languages improves the access of linguistic minorities to technological advantages provided by artificial intelligence (AI). In this paper, we address the problem of data scarcity for the Hong Kong Cantonese language by creating a new Cantonese dataset. Our dataset, Multi-Domain Cantonese Corpus (MDCC), consists of 73.6 hours of clean read speech…

    Submitted 17 January, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  8. arXiv:2002.03557  [pdf, other]

    cs.CV cs.MM eess.AS

    Multitask Emotion Recognition with Incomplete Labels

    Authors: Didan Deng, Zhaokang Chen, Bertram E. Shi

    Abstract: We train a unified model to perform three tasks: facial action unit detection, expression classification, and valence-arousal estimation. We address two main challenges of learning the three tasks. First, most existing datasets are highly imbalanced. Second, most existing datasets do not contain labels for all three tasks. To tackle the first challenge, we apply data balancing techniques to experi…

    Submitted 10 March, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: Accepted by FG2020

  9. arXiv:1805.00625  [pdf, other]

    eess.IV cs.CL cs.CV

    Multimodal Utterance-level Affect Analysis using Visual, Audio and Text Features

    Authors: Didan Deng, Yuqian Zhou, Jimin Pi, Bertram E. Shi

    Abstract: The integration of information across multiple modalities and across time is a promising way to enhance the emotion recognition performance of affective systems. Much previous work has focused on instantaneous emotion recognition. The 2018 One-Minute Gradual-Emotion Recognition (OMG-Emotion) challenge, which was held in conjunction with the IEEE World Congress on Computational Intelligence, encour…

    Submitted 4 May, 2018; v1 submitted 2 May, 2018; originally announced May 2018.

    Comments: 5 pages, 1 figure, subject to the 2018 IJCNN challenge on One-Minute Gradual-Emotion Recognition