Showing 1–21 of 21 results for author: Dai, Z

Searching in archive eess. Search in all archives.
  1. arXiv:2406.11462  [pdf, other]

    cs.MM cs.GR cs.SD eess.AS

    MusicScore: A Dataset for Music Score Modeling and Generation

    Authors: Yuheng Lin, Zheqi Dai, Qiuqiang Kong

    Abstract: Music scores are written representations of music and contain rich information about musical components. The visual information on music scores includes notes, rests, staff lines, clefs, dynamics, and articulations. This visual information in music scores contains more semantic information than audio and symbolic representations of music. Previous music score datasets have limited sizes and are ma…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Dataset paper, dataset link: https://huggingface.co/datasets/ZheqiDAI/MusicScore

  2. arXiv:2406.01922  [pdf, ps, other]

    eess.SP cs.IT

    Performance Analysis of Hybrid Cellular and Cell-free MIMO Network

    Authors: Zhuoyin Dai, Jingran Xu, Xiaoli Xu, Ruoguang Li, Yong Zeng

    Abstract: Cell-free wireless communication is envisioned as one of the most promising network architectures, which can achieve stable and uniform communication performance while improving the system energy and spectrum efficiency. The deployment of cell-free networks is envisioned to be a long-term evolutionary process, in which cell-free access points (APs) will be gradually introduced into the communicatio…

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2405.18251  [pdf, other]

    cs.RO eess.SY math.OC

    Sensor-Based Distributionally Robust Control for Safe Robot Navigation in Dynamic Environments

    Authors: Kehan Long, Yinzhuang Yi, Zhirui Dai, Sylvia Herbert, Jorge Cortés, Nikolay Atanasov

    Abstract: We introduce a novel method for safe mobile robot navigation in dynamic, unknown environments, utilizing onboard sensing to impose safety constraints without the need for accurate map reconstruction. Traditional methods typically rely on detailed map information to synthesize safe stabilizing controls for mobile robots, which can be computationally demanding and less effective, particularly in dyn…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Submitted to the International Journal of Robotics Research (IJRR). Project page: https://existentialrobotics.org/DR_Safe_Navigation_Webpage

  4. arXiv:2403.08200  [pdf, ps, other]

    eess.SY eess.SP

    Prototyping and Experimental Results for Environment-Aware Millimeter Wave Beam Alignment via Channel Knowledge Map

    Authors: Zhuoyin Dai, Di Wu, Zhenjun Dong, Kun Li, Dingyang Ding, Sihan Wang, Yong Zeng

    Abstract: Channel knowledge map (CKM), which aims to directly reflect the intrinsic channel properties of the local wireless environment, is a novel technique for achieving environment-aware communication. In this paper, to alleviate the large training overhead in millimeter wave (mmWave) beam alignment, an environment-aware and training-free beam alignment prototype is established based on a typical CKM, te…

    Submitted 12 March, 2024; originally announced March 2024.

  5. arXiv:2312.16149  [pdf, other]

    cs.SD eess.AS

    SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network

    Authors: Yuhang He, Zhuangzhuang Dai, Long Chen, Niki Trigoni, Andrew Markham

    Abstract: In this paper, we study an underexplored, yet important and challenging problem: counting the number of distinct sounds in raw audio characterized by a high degree of polyphonicity. We do so by systematically proposing a novel end-to-end trainable neural network (which we call DyDecNet, consisting of a dyadic decomposition front-end and backbone network), and quantifying the difficulty level of co…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: AAAI2024 Paper

  6. arXiv:2310.04440  [pdf, other]

    eess.SY cs.AI

    Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction

    Authors: Linyu Liu, Zhen Dai, Shiji Song, Xiaocheng Li, Guanting Chen

    Abstract: Electrifying heavy-duty trucks offers a substantial opportunity to curtail carbon emissions, advancing toward a carbon-neutral future. However, the inherent challenges of limited battery energy and the sheer weight of heavy-duty trucks lead to reduced mileage and prolonged charging durations. Consequently, battery-swapping services emerge as an attractive solution for these trucks. This paper empl…

    Submitted 23 May, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: 9 pages, 6 figures

    MSC Class: 90B06; 68T07

  7. arXiv:2301.00592  [pdf, other]

    cs.CV eess.IV

    Edge Enhanced Image Style Transfer via Transformers

    Authors: Chiyu Zhang, Jun Yang, Zaiyan Dai, Peng Cao

    Abstract: In recent years, arbitrary image style transfer has attracted more and more attention. Given a pair of content and style images, a stylized one is hoped that retains the content from the former while catching style patterns from the latter. However, it is difficult to simultaneously keep well the trade-off between the content details and the style features. To stylize the image with sufficient sty…

    Submitted 2 January, 2023; originally announced January 2023.

  8. arXiv:2206.13203  [pdf, ps, other]

    eess.SP

    MIMO Symbiotic Radio with Massive Passive Devices: Asymptotic Analysis and Precoding Optimization

    Authors: Jingran Xu, Zhuoyin Dai, Yong Zeng

    Abstract: Symbiotic radio has emerged as a promising technology for spectrum- and energy-efficient wireless communications, where the passive secondary backscatter devices (BDs) reuse not only the spectrum but also the power of the active primary users to transmit their own information. In return, the primary communication links can be enhanced by the additional multipaths created by the BDs. This is known…

    Submitted 27 June, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 12 pages, 7 figures. arXiv admin note: text overlap with arXiv:2106.05789

  9. arXiv:2206.07956  [pdf, other]

    cs.SD cs.CL eess.AS

    Automatic Prosody Annotation with Pre-Trained Text-Speech Model

    Authors: Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu

    Abstract: Prosodic boundary plays an important role in text-to-speech synthesis (TTS) in terms of naturalness and readability. However, the acquisition of prosodic boundary labels relies on manual annotation, which is costly and time-consuming. In this paper, we propose to automatically extract prosodic boundary labels from text-audio data via a neural text-speech model with pre-trained audio encoders. This…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: accepted by INTERSPEECH2022

  10. arXiv:2205.08220  [pdf, ps, other]

    eess.SY eess.SP

    Rate-Region Characterization and Channel Estimation for Cell-Free Symbiotic Radio Communications

    Authors: Zhuoyin Dai, Ruoguang Li, Jingran Xu, Yong Zeng, Shi Jin

    Abstract: Cell-free massive MIMO and symbiotic radio communication have been recently proposed as the promising beyond fifth-generation (B5G) networking architecture and transmission technology, respectively. To reap the benefits of both, this paper studies cell-free symbiotic radio communication systems, where a number of cell-free access points (APs) cooperatively send primary information to a receiver, a…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2106.06148

  11. arXiv:2112.05665  [pdf]

    cs.RO eess.SY

    Deep Odometry Systems on Edge with EKF-LoRa Backend for Real-Time Positioning in Adverse Environment

    Authors: Zhuangzhuang Dai, Muhamad Risqi U. Saputra, Chris Xiaoxuan Lu, Andrew Markham, Niki Trigoni

    Abstract: Ubiquitous positioning for pedestrian in adverse environment has served a long standing challenge. Despite dramatic progress made by Deep Learning, multi-sensor deep odometry systems yet pose a high computational cost and suffer from cumulative drifting errors over time. Thanks to the increasing computational power of edge devices, we propose a novel ubiquitous positioning solution by integrating…

    Submitted 10 December, 2021; originally announced December 2021.

  12. arXiv:2112.00695  [pdf, other]

    eess.SP cs.LG cs.RO

    DeepAoANet: Learning Angle of Arrival from Software Defined Radios with Deep Neural Networks

    Authors: Zhuangzhuang Dai, Yuhang He, Tran Vu, Niki Trigoni, Andrew Markham

    Abstract: Direction finding and positioning systems based on RF signals are significantly impacted by multipath propagation, particularly in indoor environments. Existing algorithms (e.g., MUSIC) perform poorly in resolving Angle of Arrival (AoA) in the presence of multipath or when operating in a weak signal regime. We note that digitally sampled RF frontends allow for the easy analysis of signals, and their…

    Submitted 9 December, 2021; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Angle-of-arrival estimation from Software Defined Radios, Benchmark and Baseline

  13. arXiv:2106.06148  [pdf, ps, other]

    cs.IT eess.SP

    Cell-Free Symbiotic Radio: Channel Estimation Method and Achievable Rate Analysis

    Authors: Zhuoyin Dai, Ruoguang Li, Jingran Xu, Yong Zeng, Shi Jin

    Abstract: Cell-free massive MIMO and symbiotic radio are promising beyond 5G (B5G) networking architecture and transmission technology, respectively. This paper studies cell-free symbiotic radio systems, where a number of distributed access points (APs) cooperatively send primary information to a receiver, and simultaneously support the backscattering communication of the secondary backscatter device (BD).…

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 6 pages, 3 figures, conference

  14. arXiv:2106.05789  [pdf, ps, other]

    eess.SP

    Enabling Full Mutualism for Symbiotic Radio with Massive Backscatter Devices

    Authors: Jingran Xu, Zhuoyin Dai, Yong Zeng

    Abstract: Symbiotic radio is a promising technology to achieve spectrum- and energy-efficient wireless communications, where the secondary backscatter device (BD) leverages not only the spectrum but also the power of the primary signals for its own information transmission. In return, the primary communication link can be enhanced by the additional multipaths created by the BD. This is known as the mutualis…

    Submitted 10 June, 2021; originally announced June 2021.

  15. arXiv:2101.04979  [pdf, other]

    cs.SD eess.AS

    Deep Attention-based Representation Learning for Heart Sound Classification

    Authors: Zhao Ren, Kun Qian, Fengquan Dong, Zhenyu Dai, Yoshiharu Yamamoto, Björn W. Schuller

    Abstract: Cardiovascular diseases are the leading cause of deaths and severely threaten human health in daily life. On the one hand, there have been dramatically increasing demands from both the clinical practice and the smart home application for monitoring the heart status of subjects suffering from chronic cardiovascular diseases. On the other hand, experienced physicians who can perform an efficient aus…

    Submitted 13 January, 2021; originally announced January 2021.

  16. arXiv:2011.09566  [pdf, other]

    eess.SY eess.SP

    Line Outage Identification Based on AC Power Flow and Synchronized Measurements

    Authors: Zhen Dai, Joseph Euzebe Tate

    Abstract: This paper proposes a method of identifying single line outages in power systems based on phasor measurement unit (PMU) measurements and ac power flow models. In addition to the main identification algorithm, a rejection filter is introduced so that the preliminary identified results can be further processed and categorized into three types: correctly identified, misidentified and inconclusive (in…

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 5 pages, 5 figures, accepted by 2020 IEEE PES General Meeting

  17. arXiv:2010.15233  [pdf]

    eess.IV cs.CV cs.LG

    Accurate Prostate Cancer Detection and Segmentation on Biparametric MRI using Non-local Mask R-CNN with Histopathological Ground Truth

    Authors: Zhenzhen Dai, Ivan Jambor, Pekka Taimen, Milan Pantelic, Mohamed Elshaikh, Craig Rogers, Otto Ettala, Peter Boström, Hannu Aronen, Harri Merisaari, Ning Wen

    Abstract: Purpose: We aimed to develop deep machine learning (DL) models to improve the detection and segmentation of intraprostatic lesions (IL) on bp-MRI by using whole amount prostatectomy specimen-based delineations. We also aimed to investigate whether transfer learning and self-training would improve results with small amount labelled data. Methods: 158 patients had suspicious lesions delineated on…

    Submitted 28 October, 2020; originally announced October 2020.

  18. arXiv:2010.08682  [pdf, other]

    cs.CV cs.LG eess.IV

    MeshMVS: Multi-View Stereo Guided Mesh Reconstruction

    Authors: Rakesh Shrestha, Zhiwen Fan, Qingkun Su, Zuozhuo Dai, Siyu Zhu, Ping Tan

    Abstract: Deep learning based 3D shape generation methods generally utilize latent features extracted from color images to encode the semantics of objects and guide the shape generation process. These color image semantics only implicitly encode 3D information, potentially limiting the accuracy of the generated shapes. In this paper we propose a multi-view mesh generation method which incorporates geometry…

    Submitted 11 April, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

  19. arXiv:2003.12933  [pdf, other]

    eess.SP

    Weak Radio Frequency Signal Detection Based on Piezo-Opto-Electro-Mechanical System: Architecture Design and Sensitivity Prediction

    Authors: Shanchi Wu, Chen Gong, Chengjie Zuo, Shangbin Li, Junyu Zhang, Zhongbin Dai, Kai Yang, Ming Zhao, Rui Ni, Zhengyuan Xu, Jinkang Zhu

    Abstract: We propose a novel radio-frequency (RF) receiving architecture based on micro-electro-mechanical system (MEMS) and optical coherent detection module. The architecture converts the received electrical signal into mechanical vibration through the piezoelectric effect and adopts an optical detection module to detect the mechanical vibration. We analyze the response function of piezoelectric film to a…

    Submitted 8 October, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

    Comments: 15 pages, 16 figures, 6 tables

  20. arXiv:1910.00696  [pdf]

    eess.IV cs.LG stat.ML

    Improvement of Multiparametric MR Image Segmentation by Augmenting the Data with Generative Adversarial Networks for Glioma Patients

    Authors: Eric Carver, Zhenzhen Dai, Evan Liang, James Snyder, Ning Wen

    Abstract: Every year thousands of patients are diagnosed with a glioma, a type of malignant brain tumor. Physicians use MR images as a key tool in the diagnosis and treatment of these patients. Neural networks show great potential to aid physicians in the medical image analysis. This study investigates the use of varying amounts of synthetic brain T1-weighted (T1), post-contrast T1-weighted (T1Gd), T2-weigh…

    Submitted 1 October, 2019; originally announced October 2019.

  21. arXiv:1908.06843  [pdf, ps, other]

    eess.SP cs.LG stat.ML

    ProSper -- A Python Library for Probabilistic Sparse Coding with Non-Standard Priors and Superpositions

    Authors: Georgios Exarchakis, Jörg Bornschein, Abdul-Saboor Sheikh, Zhenwen Dai, Marc Henniges, Jakob Drefs, Jörg Lücke

    Abstract: ProSper is a python library containing probabilistic algorithms to learn dictionaries. Given a set of data points, the implemented algorithms seek to learn the elementary components that have generated the data. The library widens the scope of dictionary learning approaches beyond implementations of standard approaches such as ICA, NMF or standard L1 sparse coding. The implemented algorithms are e…

    Submitted 1 August, 2019; originally announced August 2019.