Skip to main content

Showing 1–50 of 66 results for author: Ma, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.12323  [pdf, other

    eess.SP

    Hybrid Beamforming Design for Near-Field ISAC with Modular XL-MIMO

    Authors: Chunwei Meng, Dingyou Ma, Zhaolin Wang, Yuanwei Liu, Zhiqing Wei, Zhiyong Feng

    Abstract: A novel modular extremely large-scale multiple-input-multiple-output (XL-MIMO) integrated sensing and communication (ISAC) framework is proposed in this paper. We consider a downlink ISAC scenario and exploit the modular array architecture to enhance the communication spectral efficiency and sensing resolution while reducing the channel modeling complexity by employing the hybrid spherical and pla… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2405.19338  [pdf, other

    eess.SP cs.AI cs.CV

    Accurate Patient Alignment without Unnecessary Imaging Dose via Synthesizing Patient-specific 3D CT Images from 2D kV Images

    Authors: Yuzhen Ding, Jason M. Holmes, Hongying Feng, Baoxin Li, Lisa A. McGee, Jean-Claude M. Rwigema, Sujay A. Vora, Daniel J. Ma, Robert L. Foote, Samir H. Patel, Wei Liu

    Abstract: In radiotherapy, 2D orthogonally projected kV images are used for patient alignment when 3D-on-board imaging(OBI) unavailable. But tumor visibility is constrained due to the projection of patient's anatomy onto a 2D plane, potentially leading to substantial setup errors. In treatment room with 3D-OBI such as cone beam CT(CBCT), the field of view(FOV) of CBCT is limited with unnecessarily high imag… ▽ More

    Submitted 1 April, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures and tables

  3. Multi-Objective Optimization-based Transmit Beamforming for Multi-Target and Multi-User MIMO-ISAC Systems

    Authors: Chunwei Meng, Zhiqing Wei, Dingyou Ma, Wanli Ni, Liyan Su, Zhiyong Feng

    Abstract: Integrated sensing and communication (ISAC) is an enabling technology for the sixth-generation mobile communications, which equips the wireless communication networks with sensing capabilities. In this paper, we investigate transmit beamforming design for multiple-input and multiple-output (MIMO)-ISAC systems in scenarios with multiple radar targets and communication users. A general form of multi… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  4. Cramer-Rao Bounds for Near-Field Sensing: A Generic Modular Architecture

    Authors: Chunwei Meng, Dingyou Ma, Xu Chen, Zhiyong Feng, Yuanwei Liu

    Abstract: A generic modular array architecture is proposed, featuring uniform/non-uniform subarray layouts that allows for flexible deployment. The bistatic near-field sensing system is considered, where the target is located in the near-field of the whole modular array and the far-field of each subarray. Then, the closed-form expressions of Cramer-Rao bounds (CRBs) for range and angle estimations are deriv… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  5. arXiv:2402.15725  [pdf, other

    eess.AS

    Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks

    Authors: Duo Ma, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li

    Abstract: Human language can be expressed in either written or spoken form, i.e. text or speech. Humans can acquire knowledge from text to improve speaking and listening. However, the quest for speech pre-trained models to leverage unpaired text has just started. In this paper, we investigate a new way to pre-train such a joint speech-text model to learn enhanced speech representations and benefit various s… ▽ More

    Submitted 28 February, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: 5 pages, 1 figures,5 tables, submit to IEEE Signal Processing Letters(SPL)

  6. arXiv:2311.10416  [pdf, other

    eess.SP

    Meta-DSP: A Meta-Learning Approach for Data-Driven Nonlinear Compensation in High-Speed Optical Fiber Systems

    Authors: Xinyu Xiao, Zhennan Zhou, Bin Dong, Dingjiong Ma, Li Zhou, Jie Sun

    Abstract: Non-linear effects in long-haul, high-speed optical fiber systems significantly hinder channel capacity. While the Digital Backward Propagation algorithm (DBP) with adaptive filter (ADF) can mitigate these effects, it suffers from an overwhelming computational complexity. Recent solutions have incorporated deep neural networks in a data-driven strategy to alleviate this complexity in the DBP model… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  7. arXiv:2309.09627  [pdf, other

    cs.SD eess.AS

    Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders

    Authors: Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: We propose a novel framework for electrolaryngeal speech intelligibility enhancement through the use of robust linguistic encoders. Pretraining and fine-tuning approaches have proven to work well in this task, but in most cases, various mismatches, such as the speech type mismatch (electrolaryngeal vs. typical) or a speaker mismatch between the datasets used in each stage, can deteriorate the conv… ▽ More

    Submitted 20 January, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024. Demo page: lesterphillip.github.io/icassp2024_el_sie

  8. arXiv:2308.08313  [pdf, other

    eess.IV cs.CV

    ECPC-IDS:A benchmark endometrail cancer PET/CT image dataset for evaluation of semantic segmentation and detection of hypermetabolic regions

    Authors: Dechao Tang, Tianming Du, Deguo Ma, Zhiyu Ma, Hongzan Sun, Marcin Grzegorzek, Huiyan Jiang, Chen Li

    Abstract: Endometrial cancer is one of the most common tumors in the female reproductive system and is the third most common gynecological malignancy that causes death after ovarian and cervical cancer. Early diagnosis can significantly improve the 5-year survival rate of patients. With the development of artificial intelligence, computer-assisted diagnosis plays an increasingly important role in improving… ▽ More

    Submitted 11 October, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: 14 pages,6 figures

  9. arXiv:2308.08172  [pdf, other

    eess.IV cs.CV cs.LG

    AATCT-IDS: A Benchmark Abdominal Adipose Tissue CT Image Dataset for Image Denoising, Semantic Segmentation, and Radiomics Evaluation

    Authors: Zhiyu Ma, Chen Li, Tianming Du, Le Zhang, Dechao Tang, Deguo Ma, Shanchuan Huang, Yan Liu, Yihao Sun, Zhihao Chen, ** Yuan, Qianqing Nie, Marcin Grzegorzek, Hongzan Sun

    Abstract: Methods: In this study, a benchmark \emph{Abdominal Adipose Tissue CT Image Dataset} (AATTCT-IDS) containing 300 subjects is prepared and published. AATTCT-IDS publics 13,732 raw CT slices, and the researchers individually annotate the subcutaneous and visceral adipose tissue regions of 3,213 of those slices that have the same slice distance to validate denoising methods, train semantic segmentati… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 17 pages, 7 figures

  10. SAR Target Image Generation Method Using Azimuth-Controllable Generative Adversarial Network

    Authors: Chenwei Wang, Jifang Pei, Xiaoyu Liu, Yulin Huang, Deqing Mao, Yin Zhang, Jianyu Yang

    Abstract: Sufficient synthetic aperture radar (SAR) target images are very important for the development of researches. However, available SAR target images are often limited in practice, which hinders the progress of SAR application. In this paper, we propose an azimuth-controllable generative adversarial network to generate precise SAR target images with an intermediate azimuth between two given SAR image… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  11. arXiv:2305.15636  [pdf

    eess.SP

    Channelized analog microwave short-time Fourier transform in the optical domain with improved measurement performance

    Authors: Xiaowei Li, Taixia Shi, Dong Ma, Yang Chen

    Abstract: In this article, analog microwave short-time Fourier transform (STFT) with improved measurement performance is implemented in the optical domain by employing stimulated Brillouin scattering (SBS) and channelization. By jointly using three optical frequency combs and filter- and SBS-based frequency-to-time map** (FTTM), the time-frequency information of the signal under test (SUT) in different fr… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 18 pages, 9 figures, 1 table

  12. arXiv:2301.00504  [pdf

    eess.IV cs.AI cs.CV eess.SP

    Spectral Bandwidth Recovery of Optical Coherence Tomography Images using Deep Learning

    Authors: Timothy T. Yu, Da Ma, Jayden Cole, Myeong ** Ju, Mirza F. Beg, Marinko V. Sarunic

    Abstract: Optical coherence tomography (OCT) captures cross-sectional data and is used for the screening, monitoring, and treatment planning of retinal diseases. Technological developments to increase the speed of acquisition often results in systems with a narrower spectral bandwidth, and hence a lower axial resolution. Traditionally, image-processing-based techniques have been utilized to reconstruct subs… ▽ More

    Submitted 1 January, 2023; originally announced January 2023.

  13. arXiv:2212.00532  [pdf, other

    eess.IV cs.CV

    EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks

    Authors: Liyu Shi, Xiaoyan Li, Weiming Hu, Haoyuan Chen, **g Chen, Zizhen Fan, Minghe Gao, Yujie **g, Guotao Lu, Deguo Ma, Zhiyu Ma, Qingtao Meng, Dechao Tang, Hongzan Sun, Marcin Grzegorzek, Shouliang Qi, Yueyang Teng, Chen Li

    Abstract: Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers the assessment accuracy when comp… ▽ More

    Submitted 6 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

  14. arXiv:2211.01079  [pdf, other

    cs.SD eess.AS

    Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

    Authors: Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

    Abstract: Research on automatic speech recognition (ASR) systems for electrolaryngeal speakers has been relatively unexplored due to small datasets. When training data is lacking in ASR, a large-scale pretraining and fine tuning framework is often sufficient to achieve high recognition rates; however, in electrolaryngeal speech, the domain shift between the pretraining and fine-tuning data is too large to o… ▽ More

    Submitted 30 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted to ICASSP 2023

  15. arXiv:2210.10314  [pdf, other

    cs.SD eess.AS

    Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion

    Authors: Ding Ma, Lester Phillip Violeta, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: Sequence-to-sequence (seq2seq) voice conversion (VC) models have greater potential in converting electrolaryngeal (EL) speech to normal speech (EL2SP) compared to conventional VC models. However, EL2SP based on seq2seq VC requires a sufficiently large amount of parallel data for the model training and it suffers from significant performance degradation when the amount of training data is insuffici… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to SLT 2022

  16. arXiv:2208.14635  [pdf, other

    eess.IV cs.CV cs.LG

    Segmentation-guided Domain Adaptation and Data Harmonization of Multi-device Retinal Optical Coherence Tomography using Cycle-Consistent Generative Adversarial Networks

    Authors: Shuo Chen, Da Ma, Sieun Lee, Timothy T. L. Yu, Gavin Xu, Donghuan Lu, Karteek Popuri, Myeong ** Ju, Marinko V. Sarunic, Mirza Faisal Beg

    Abstract: Optical Coherence Tomography(OCT) is a non-invasive technique capturing cross-sectional area of the retina in micro-meter resolutions. It has been widely used as a auxiliary imaging reference to detect eye-related pathology and predict longitudinal progression of the disease characteristics. Retina layer segmentation is one of the crucial feature extraction techniques, where the variations of reti… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: 16 pages, 10 figures

  17. arXiv:2208.09143  [pdf

    physics.optics eess.SP

    Photonics-enabled wavelet-like transform via nonlinear optical frequency swee** and stimulated Brillouin scattering-based frequency-to-time map**

    Authors: Pengcheng Zuo, Dong Ma, Yang Chen

    Abstract: A photonics-enabled wavelet-like transform system, characterized by multi-resolution time-frequency analysis, is proposed based on a typical stimulated Brillouin scattering (SBS) pump-probe setup using an optical nonlinear frequency-sweep signal. In the pump path, a continuous-wave optical signal is injected into an SBS medium to generate an SBS gain. In the probe path, a periodic nonlinear freque… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: 9 pages, 6 figures

  18. arXiv:2208.04871  [pdf

    eess.SP

    Breaking the accuracy and resolution limitation of filter- and frequency-to-time map**-based time and frequency acquisition methods by broadening the filter bandwidth

    Authors: Pengcheng Zuo, Dong Ma, Xiaowei Li, Yang Chen

    Abstract: In this paper, the filter- and frequency-to-time map** (FTTM)-based photonics-assisted time and frequency acquisition methods are comprehensively analyzed and the accuracy and resolution limitation in the fast sweep scenario is broken by broadening the filter bandwidth. It is found that when the sweep speed is very fast, the width of the generated pulse via FTTM is mainly determined by the impul… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: 18 pages, 11 figures

  19. arXiv:2207.01175  [pdf

    physics.optics eess.SP

    Photonics-based short-time Fourier transform without high-frequency electronic devices and equipment

    Authors: Pengcheng Zuo, Dong Ma, Yang Chen

    Abstract: A photonics-based short-time Fourier transform (STFT) system is proposed and experimentally demonstrated based on stimulated Brillouin scattering (SBS) without using high-frequency electronic devices and equipment. The wavelength of a distributed feedback laser diode is periodically swept by using a low-speed periodic sawtooth/triangular driving current. The periodic frequency-sweep optical signal… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: 8 pages, 5 figures

  20. arXiv:2204.04579  [pdf, other

    cs.SD eess.AS

    Inferring Pitch from Coarse Spectral Features

    Authors: Danni Ma, Neville Ryant, Mark Liberman

    Abstract: Fundamental frequency (F0) has long been treated as the physical definition of "pitch" in phonetic analysis. But there have been many demonstrations that F0 is at best an approximation to pitch, both in production and in perception: pitch is not F0, and F0 is not pitch. Changes in the pitch involve many articulatory and acoustic covariates; pitch perception often deviates from what F0 analysis pre… ▽ More

    Submitted 26 August, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

  21. arXiv:2203.05707  [pdf

    cs.LG cs.AI eess.IV q-bio.GN

    Machine Learning Based Multimodal Neuroimaging Genomics Dementia Score for Predicting Future Conversion to Alzheimer's Disease

    Authors: Ghazal Mirabnahrazam, Da Ma, Sieun Lee, Karteek Popuri, Hyunwoo Lee, Jiguo Cao, Lei Wang, James E Galvin, Mirza Faisal Beg, the Alzheimer's Disease Neuroimaging Initiative

    Abstract: Background: The increasing availability of databases containing both magnetic resonance imaging (MRI) and genetic data allows researchers to utilize multimodal data to better understand the characteristics of dementia of Alzheimer's type (DAT). Objective: The goal of this study was to develop and analyze novel biomarkers that can help predict the development and progression of DAT. Methods: We use… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Journal ref: J Alzheimers Dis 1 Jan. (2022) 1-21

  22. arXiv:2202.09954  [pdf, other

    eess.SP cs.IT cs.LG

    Theoretical Analysis of Deep Neural Networks in Physical Layer Communication

    Authors: Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

    Abstract: Recently, deep neural network (DNN)-based physical layer communication techniques have attracted considerable interest. Although their potential to enhance communication systems and superb performance have been validated by simulation experiments, little attention has been paid to the theoretical analysis. Specifically, most studies in the physical layer have tended to focus on the application of… ▽ More

    Submitted 26 August, 2022; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: 15 pages, 13 figures, has been accepted for publication in IEEE Transactions on Communications. arXiv admin note: substantial text overlap with arXiv:2106.01124

    Journal ref: IEEE Transactions on Communications, 2022

  23. Time-varying microwave photonic filter for arbitrary waveform signal-to-noise ratio improvement

    Authors: Dong Ma, Yang Chen

    Abstract: A time-varying microwave photonic filter (TV-MPF) based on stimulated Brillouin scattering (SBS) is proposed and utilized to suppress the in-band noise of broadband arbitrary microwave waveforms, thereby improving the signal-to-noise ratio (SNR). The filter-controlling signal is designed according to the signal to be filtered and drives the TV-MPF so that the passband of the filter is always align… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 8 pages, 5 figures

  24. Improving Across-Dataset Brain Tissue Segmentation Using Transformer

    Authors: Vishwanatha M. Rao, Zihan Wan, Soroush Arabshahi, David J. Ma, Pin-Yu Lee, Ye Tian, Xuzhe Zhang, Andrew F. Laine, Jia Guo

    Abstract: Brain tissue segmentation has demonstrated great utility in quantifying MRI data through Voxel-Based Morphometry and highlighting subtle structural changes associated with various conditions within the brain. However, manual segmentation is highly labor-intensive, and automated approaches have struggled due to properties inherent to MRI acquisition, leaving a great need for an effective segmentati… ▽ More

    Submitted 31 January, 2023; v1 submitted 21 January, 2022; originally announced January 2022.

    ACM Class: I.4.6

  25. arXiv:2201.07438  [pdf, other

    cs.SD eess.AS

    MHTTS: Fast multi-head text-to-speech for spontaneous speech with imperfect transcription

    Authors: Dabiao Ma, Yitong Zhang, Meng Li, Feng Ye

    Abstract: Neural network based end-to-end Text-to-Speech (TTS) has greatly improved the quality of synthesized speech. While how to use massive spontaneous speech without transcription efficiently still remains an open problem. In this paper, we propose MHTTS, a fast multi-speaker TTS system that is robust to transcription errors and speaking style speech data. Specifically, we introduce a multi-head model… ▽ More

    Submitted 4 February, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

  26. Short-time Fourier transform based on stimulated Brillouin scattering

    Authors: Pengcheng Zuo, Dong Ma, Yang Chen

    Abstract: In this paper, all-optical short-time Fourier transform (STFT) based on stimulated Brillouin scattering (SBS) is proposed and further used for real-time time-frequency analysis of different radio frequency (RF) signals. In the proposed all-optical STFT system, SBS not only provides a band-pass filter for implementing the window function in conjunction with a periodic frequency-sweep optical signal… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: 18 pages, 9 figures, 1 table

  27. Physics Assisted Deep Learning for Indoor Imaging using Phaseless Wi-Fi Measurements

    Authors: Samruddhi Deshmukh, Amartansh Dubey, Dingfei Ma, Qifeng Chen, Ross Murch

    Abstract: A physics assisted deep learning framework to perform accurate indoor imaging using phaseless Wi-Fi measurements is proposed. It is able to image objects that are large (compared to wavelength) and have high permittivity values, that existing radio frequency (RF) inverse scattering techniques find very challenging, making it suitable for indoor RF imaging. The technique utilizes a Rytov based inve… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 14 pages, 10 figures. This work has been submitted to IEEE for possible publication

  28. arXiv:2110.12857  [pdf

    physics.app-ph eess.SP physics.optics

    Photonics-assisted microwave pulse detection and frequency measurement based on pulse replication and frequency-to-time map**

    Authors: Pengcheng Zuo, Dong Ma, Qingbo Liu, Lizhong Jiang, Yang Chen

    Abstract: A photonics-assisted microwave pulse detection and frequency measurement scheme is proposed. The unknown microwave pulse is converted to the optical domain and then injected into a fiber loop for pulse replication, which makes it easier to identify the microwave pulse with large pulse repetition interval (PRI), whereas stimulated Brillouin scattering-based frequency-to-time map** (FTTM) is utili… ▽ More

    Submitted 25 September, 2021; originally announced October 2021.

    Comments: 13 pages, 8 figures

  29. arXiv:2109.05627  [pdf, other

    eess.IV cs.CV

    Differential Diagnosis of Frontotemporal Dementia and Alzheimer's Disease using Generative Adversarial Network

    Authors: Da Ma, Donghuan Lu, Karteek Popuri, Mirza Faisal Beg

    Abstract: Frontotemporal dementia and Alzheimer's disease are two common forms of dementia and are easily misdiagnosed as each other due to their similar pattern of clinical symptoms. Differentiating between the two dementia types is crucial for determining disease-specific intervention and treatment. Recent development of Deep-learning-based approaches in the field of medical image computing are delivering… ▽ More

    Submitted 29 September, 2021; v1 submitted 12 September, 2021; originally announced September 2021.

  30. arXiv:2109.03904  [pdf

    eess.SP physics.optics

    Time-frequency analysis of microwave signals based on stimulated Brillouin scattering

    Authors: Dong Ma, Pengcheng Zuo, Yang Chen

    Abstract: A novel photonic approach to the time-frequency analysis of microwave signals is proposed based on the stimulated Brillouin scattering (SBS)-assisted frequency-to-time map** (FTTM). Two types of time-frequency analysis links, namely parallel SBS link and time-division SBS link are proposed. The parallel SBS link can be utilized to perform real-time time-frequency analysis of microwave signal, wh… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: 17 pages, 10 figures, 1 table

  31. arXiv:2107.10701  [pdf, other

    eess.AS cs.SD

    Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech

    Authors: Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng

    Abstract: To realize robust end-to-end Automatic Speech Recognition(E2E ASR) under radio communication condition, we propose a multitask-based method to joint train a Speech Enhancement (SE) module as the front-end and an E2E ASR model as the back-end in this paper. One of the advantage of the proposed method is that the entire system can be trained from scratch. Different from prior works, either component… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 7pages,3figures,Submitted to APSIPA2021

  32. arXiv:2107.02345  [pdf, other

    eess.IV cs.CV cs.LG

    Domain Adaptation via CycleGAN for Retina Segmentation in Optical Coherence Tomography

    Authors: Ricky Chen, Timothy T. Yu, Gavin Xu, Da Ma, Marinko V. Sarunic, Mirza Faisal Beg

    Abstract: With the FDA approval of Artificial Intelligence (AI) for point-of-care clinical diagnoses, model generalizability is of the utmost importance as clinical decision-making must be domain-agnostic. A method of tackling the problem is to increase the dataset to include images from a multitude of domains; while this technique is ideal, the security requirements of medical data is a major limitation. A… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 10 pages, 6 figures, 1 table

    ACM Class: I.4.0

  33. FRaC: FMCW-Based Joint Radar-Communications System via Index Modulation

    Authors: Dingyou Ma, Nir Shlezinger, Tianyao Huang, Yimin Liu, Yonina C. Eldar

    Abstract: Dual function radar communications (DFRC) systems are attractive technologies for autonomous vehicles, which utilize electromagnetic waves to constantly sense the environment while simultaneously communicating with neighbouring devices. An emerging approach to implement DFRC systems is to embed information in radar waveforms via index modulation (IM). Implementation of DFRC schemes in vehicular sy… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 16 pages

  34. arXiv:2106.08147  [pdf, other

    eess.IV cs.CV cs.LG

    Perceptually-inspired super-resolution of compressed videos

    Authors: Di Ma, Mariana Afonso, Fan Zhang, David R. Bull

    Abstract: Spatial resolution adaptation is a technique which has often been employed in video compression to enhance coding efficiency. This approach encodes a lower resolution version of the input video and reconstructs the original resolution during decoding. Instead of using conventional up-sampling filters, recent work has employed advanced super-resolution methods based on convolutional neural networks… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

  35. arXiv:2106.01124  [pdf, other

    eess.SP cs.IT cs.LG

    Opening the Black Box of Deep Neural Networks in Physical Layer Communication

    Authors: Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

    Abstract: Deep Neural Network (DNN)-based physical layer techniques are attracting considerable interest due to their potential to enhance communication systems. However, most studies in the physical layer have tended to focus on the application of DNN models to wireless communication problems but not to theoretically understand how does a DNN work in a communication system. In this paper, we aim to quantit… ▽ More

    Submitted 18 February, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: 6 pages, 5 figures, to be presented in the IEEE Wireless Communications and Networking Conference (WCNC) 2022 Workshop on Machine Learning for Communications: Future Large Scale MIMO and AI-Native Air-Interface

  36. arXiv:2105.11594  [pdf

    eess.IV

    A Fast MR Fingerprinting Simulator for Direct Error Estimation and Sequence Optimization

    Authors: Siyuan Hu, Stephen Jordan, Rasim Boyacioglu, Ignacio Rozada, Matthias Troyer, Mark Griswold, Debra McGivney, Dan Ma

    Abstract: MR Fingerprinting is a novel quantitative MR technique that could simultaneously provide multiple tissue property maps. When optimizing MRF scans, modeling undersampling errors and field imperfections in cost functions will make the optimization results more practical and robust. However, this process is computationally expensive and impractical for sequence optimization algorithms when MRF signal… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: 10 pages, 7 figures

  37. arXiv:2103.16051  [pdf, ps, other

    eess.SY

    Reduced Dynamics and Control for an Autonomous Bicycle

    Authors: Jiaming Xiong, Bo Li, Ruihan Yu, Daolin Ma, Wei Wang, Caishan Liu

    Abstract: In this paper, we propose the reduced model for the full dynamics of a bicycle and analyze its nonlinear behavior under a proportional control law for steering. Based on the Gibbs-Appell equations for the Whipple bicycle, we obtain a second-order nonlinear ordinary differential equation (ODE) that governs the bicycle's controlled motion. Two types of equilibrium points for the governing equation a… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Journal ref: ICRA 2021

  38. A Subjective Study on Videos at Various Bit Depths

    Authors: Alex Mackin, Di Ma, Fan Zhang, David Bull

    Abstract: Bit depth adaptation, where the bit depth of a video sequence is reduced before transmission and up-sampled during display, can potentially reduce data rates with limited impact on perceptual quality. In this context, we conducted a subjective study on a UHD video database, BVI-BD, to explore the relationship between bit depth and visual quality. In this work, three bit depth adaptation methods ar… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: 5 pages; 7 figures; 1 table

  39. arXiv:2101.04538  [pdf, ps, other

    eess.IV physics.optics

    Polarized hyperspectral imaging with single fiber bundle via incoherent light transmission matrix approach

    Authors: Yitong Li, Zhengbo Zhu, Ze Li, Donglin Ma

    Abstract: The scattering of multispectral incoherent light is a common and unfavorable signal scrambling in natural scenes. However, the blurred light spot due to scattering still holds lots of information remaining to be explored. Former methods failed to recover the polarized hyperspectral information from scattered incoherent light or relied on additional dispersion elements. Here we put forward the tran… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

  40. arXiv:2011.09190  [pdf, other

    eess.IV cs.CV

    CVEGAN: A Perceptually-inspired GAN for Compressed Video Enhancement

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: We propose a new Generative Adversarial Network for Compressed Video quality Enhancement (CVEGAN). The CVEGAN generator benefits from the use of a novel Mul2Res block (with multiple levels of residual learning branches), an enhanced residual non-local block (ERNB) and an enhanced convolutional block attention module (ECBAM). The ERNB has also been employed in the discriminator to improve the repre… ▽ More

    Submitted 26 November, 2020; v1 submitted 18 November, 2020; originally announced November 2020.

  41. arXiv:2010.13007  [pdf, other

    eess.AS cs.SD

    Probing Acoustic Representations for Phonetic Properties

    Authors: Danni Ma, Neville Ryant, Mark Liberman

    Abstract: Pre-trained acoustic representations such as wav2vec and DeCoAR have attained impressive word error rates (WER) for speech recognition benchmarks, particularly when labeled data is limited. But little is known about what phonetic properties these various representations acquire, and how well they encode transferable features of speech. We compare features from two conventional and four pre-trained… ▽ More

    Submitted 14 February, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

  42. Video Compression with CNN-based Post Processing

    Authors: Fan Zhang, Di Ma, Chen Feng, David R. Bull

    Abstract: In recent years, video compression techniques have been significantly challenged by the rapidly increased demands associated with high quality and immersive video content. Among various compression tools, post-processing can be applied on reconstructed video content to mitigate visible compression artefacts and to enhance overall perceptual quality. Inspired by advances in deep learning, we propos… ▽ More

    Submitted 14 January, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

  43. arXiv:2009.02752  [pdf, other

    eess.SP cs.HC cs.LG

    Simultaneous Energy Harvesting and Gait Recognition using Piezoelectric Energy Harvester

    Authors: Dong Ma, Guohao Lan, Weitao Xu, Mahbub Hassan, Wen Hu

    Abstract: Piezoelectric energy harvester, which generates electricity from stress or vibrations, is gaining increasing attention as a viable solution to extend battery life in wearables. Recent research further reveals that, besides generating energy, PEH can also serve as a passive sensor to detect human gait power-efficiently because its stress or vibration patterns are significantly influenced by the gai… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: 13 pages, 17 figures, and 2 tables

  44. arXiv:2007.14726  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Video compression with low complexity CNN-based spatial resolution adaptation

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: It has recently been demonstrated that spatial resolution adaptation can be integrated within video compression to improve overall coding performance by spatially down-sampling before encoding and super-resolving at the decoder. Significant improvements have been reported when convolutional neural networks (CNNs) were used to perform the resolution up-sampling. However, this approach suffers from… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

  45. Fine Timing and Frequency Synchronization for MIMO-OFDM: An Extreme Learning Approach

    Authors: Jun Liu, Kai Mei, Xiaochen Zhang, Des McLernon, Dongtang Ma, Jibo Wei, Syed Ali Raza Zaidi

    Abstract: Multiple-input multiple-output orthogonal frequency-division multiplexing (MIMO-OFDM) is a key technology component in the evolution towards cognitive radio (CR) in next-generation communication in which the accuracy of timing and frequency synchronization significantly impacts the overall system performance. In this paper, we propose a novel scheme leveraging extreme learning machine (ELM) to ach… ▽ More

    Submitted 1 June, 2022; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: 13 pages, 12 figures, has been accepted for publication in IEEE Transactions on Cognitive Communications and Networking

    Journal ref: IEEE Transactions on Cognitive Communications and Networking, 2021

  46. arXiv:2007.07099  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    MFRNet: A New CNN Architecture for Post-Processing and In-loop Filtering

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: In this paper, we propose a novel convolutional neural network (CNN) architecture, MFRNet, for post-processing (PP) and in-loop filtering (ILF) in the context of video compression. This network consists of four Multi-level Feature review Residual dense Blocks (MFRBs), which are connected using a cascading structure. Each MFRB extracts features from multiple convolutional layers using dense connect… ▽ More

    Submitted 11 December, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

  47. arXiv:2005.06101  [pdf

    eess.SP

    A Cyber Physical System Framework for UAV Communications

    Authors: Haijun Wang, Haitao Zhao, Dongtang Ma, Jibo Wei

    Abstract: Diverse applications have witnessed the prevalence of unmanned aerial vehicles (UAVs) due to their agility and versatility. Compared with computation and control, the communication tends to be the bottleneck of the whole UAV system. Cyber physical system (CPS), which achieves the integration of the cyber and physical domains, can inspire us to deal with the communication problems through a cross-d… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: 7 pages, 5 figures, 1 table, 15 references

  48. arXiv:2004.02270  [pdf, ps, other

    eess.IV cs.CV cs.LG physics.med-ph

    Game of Learning Bloch Equation Simulations for MR Fingerprinting

    Authors: Mingrui Yang, Yun Jiang, Dan Ma, Bhairav B. Mehta, Mark A. Griswold

    Abstract: Purpose: This work proposes a novel approach to efficiently generate MR fingerprints for MR fingerprinting (MRF) problems based on the unsupervised deep learning model generative adversarial networks (GAN). Methods: The GAN model is adopted and modified for better convergence and performance, resulting in an MRF specific model named GAN-MRF. The GAN-MRF model is trained, validated, and tested usin… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

  49. BVI-DVC: A Training Database for Deep Video Compression

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: Deep learning methods are increasingly being applied in the optimisation of video compression algorithms and can achieve significantly enhanced coding gains, compared to conventional approaches. Such approaches often employ Convolutional Neural Networks (CNNs) which are trained on databases with relatively limited content coverage. In this paper, a new extensive and representative video database,… ▽ More

    Submitted 8 October, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

  50. Spatial Modulation for Joint Radar-Communications Systems: Design, Analysis, and Hardware Prototype

    Authors: Dingyou Ma, Nir Shlezinger, Tianyao Huang, Yariv Shavit, Moshe Namer, Yimin Liu, Yonina C. Eldar

    Abstract: Dual-function radar-communications (DFRC) systems implement radar and communication functionalities on a single platform. Jointly designing these subsystems can lead to substantial gains in performance as well as size, cost, and power consumption. In this paper, we propose a DFRC system, which utilizes generalized spatial modulation (GSM) to realize coexisting radar and communications waveforms. O… ▽ More

    Submitted 15 July, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: 14pages

    Journal ref: IEEE Transactions on Vehicular Technology ( Volume: 70, Issue: 3, March 2021)