Showing 1–22 of 22 results for author: song, D

Searching in archive eess.
  1. arXiv:2407.00717  [pdf, other]

    cs.LG cs.AI eess.SY

    Learning System Dynamics without Forgetting

    Authors: Xikun Zhang, Dongjin Song, Yushan Jiang, Yixin Chen, Dacheng Tao

    Abstract: Predicting the trajectories of systems with unknown dynamics (i.e., the governing rules) is crucial in various research fields, including physics and biology. This challenge has gathered significant attention from diverse communities. Most existing works focus on learning fixed system dynamics within one single system. However, real-world applications often involve multiple systems with di…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2405.07260  [pdf]

    cs.LG cs.AI eess.SP

    A Supervised Information Enhanced Multi-Granularity Contrastive Learning Framework for EEG Based Emotion Recognition

    Authors: Xiang Li, Jian Song, Zhigang Zhao, Chunxiao Wang, Dawei Song, Bin Hu

    Abstract: This study introduces a novel Supervised Info-enhanced Contrastive Learning framework for EEG based Emotion Recognition (SI-CLEER). SI-CLEER employs multi-granularity contrastive learning to create robust EEG contextual representations, potentially improving emotion recognition effectiveness. Unlike existing methods solely guided by classification loss, we propose a joint learning model combining…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures, 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  3. arXiv:2312.03900  [pdf, other]

    eess.SP

    Community Detection in High-Dimensional Graph Ensembles

    Authors: Robert Malinas, Dogyoon Song, Alfred O. Hero III

    Abstract: Detecting communities in high-dimensional graphs can be achieved by applying random matrix theory where the adjacency matrix of the graph is modeled by a Stochastic Block Model (SBM). However, the SBM makes an unrealistic assumption that the edge probabilities are homogeneous within communities, i.e., the edges occur with the same probabilities. The Degree-Corrected SBM is a generalization of the…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 8 pages, 3 figures

  4. A Message Passing Detection based Affine Frequency Division Multiplexing Communication System

    Authors: Lifan Wu, Shan Luo, Dongxiao Song, Fan Yang, Rongping Lin

    Abstract: The next generation of wireless communication technology is anticipated to address the communication reliability challenges encountered in high-speed mobile communication scenarios. An Orthogonal Time Frequency Space (OTFS) system has been introduced as a solution that effectively mitigates these issues. However, OTFS is associated with relatively high pilot overhead and multiuser multiplexing ove…

    Submitted 30 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

    Comments: 8 pages, 7 figures

  5. arXiv:2306.10125  [pdf, other]

    cs.LG cs.AI eess.SP stat.AP

    Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects

    Authors: Kexin Zhang, Qingsong Wen, Chaoli Zhang, Rongyao Cai, Ming Jin, Yong Liu, James Zhang, Yuxuan Liang, Guansong Pang, Dongjin Song, Shirui Pan

    Abstract: Self-supervised learning (SSL) has recently achieved impressive performance on various time series tasks. The most prominent advantage of SSL is that it reduces the dependence on labeled data. Based on the pre-training and fine-tuning strategy, even a small amount of labeled data can achieve high performance. Compared with many published self-supervised surveys on computer vision and natural langu…

    Submitted 8 April, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI); 26 pages, 200+ references; the first work to comprehensively and systematically summarize self-supervised learning for time series analysis (SSL4TS). The GitHub repository is https://github.com/qingsongedu/Awesome-SSL4TS

  6. arXiv:2306.08998  [pdf, other]

    cs.SD cs.CV eess.AS

    Team AcieLee: Technical Report for EPIC-SOUNDS Audio-Based Interaction Recognition Challenge 2023

    Authors: Yuqi Li, Yizhi Luo, Xiaoshuai Hao, Chuanguang Yang, Zhulin An, Dantong Song, Wei Yi

    Abstract: In this report, we describe the technical details of our submission to the EPIC-SOUNDS Audio-Based Interaction Recognition Challenge 2023, by Team "AcieLee" (username: Yuqi_Li). The task is to classify the audio caused by interactions between objects, or from events of the camera wearer. We conducted exhaustive experiments and found learning rate step decay, backbone frozen, label smoothing and f…

    Submitted 15 June, 2023; originally announced June 2023.

  7. arXiv:2306.01232  [pdf, other]

    eess.IV cs.CV

    Deep Reinforcement Learning Framework for Thoracic Diseases Classification via Prior Knowledge Guidance

    Authors: Weizhi Nie, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai, Keliang Xie, Anan Liu

    Abstract: The chest X-ray is often utilized for diagnosing common thoracic diseases. In recent years, many approaches have been proposed to handle the problem of automatic diagnosis based on chest X-rays. However, the scarcity of labeled data for related diseases still poses a huge challenge to an accurate diagnosis. In this paper, we focus on the thorax disease diagnostic problem and propose a novel deep r…

    Submitted 1 June, 2023; originally announced June 2023.

  8. arXiv:2305.12072  [pdf, other]

    eess.IV cs.CV

    Chest X-ray Image Classification: A Causal Perspective

    Authors: Weizhi Nie, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai, Keliang Xie, Anan Liu

    Abstract: The chest X-ray (CXR) is one of the most common and easy-to-get medical tests used to diagnose common diseases of the chest. Recently, many deep learning-based methods have been proposed that are capable of effectively classifying CXRs. Even though these techniques have worked quite well, it is difficult to establish whether what these algorithms actually learn is the cause-and-effect link between…

    Submitted 19 May, 2023; originally announced May 2023.

  9. arXiv:2305.12070  [pdf, other]

    eess.IV cs.CV

    Instrumental Variable Learning for Chest X-ray Classification

    Authors: Weizhi Nie, Chen Zhang, Dan Song, Yunpeng Bai, Keliang Xie, Anan Liu

    Abstract: The chest X-ray (CXR) is commonly employed to diagnose thoracic illnesses, but the challenge of achieving accurate automatic diagnosis through this method persists due to the complex relationship between pathology. In recent years, various deep learning-based approaches have been suggested to tackle this problem but confounding factors such as image resolution or noise problems often damage model…

    Submitted 19 May, 2023; originally announced May 2023.

  10. arXiv:2304.05340  [pdf, other]

    cs.CV eess.IV

    Unified Multi-Modal Image Synthesis for Missing Modality Imputation

    Authors: Yue Zhang, Chengtao Peng, Qiuli Wang, Dan Song, Kaiyan Li, S. Kevin Zhou

    Abstract: Multi-modal medical images provide complementary soft-tissue characteristics that aid in the screening and diagnosis of diseases. However, limited scanning time, image corruption and various imaging protocols often result in incomplete multi-modal images, thus limiting the usage of multi-modal data for clinical purposes. To address this issue, in this paper, we propose a novel unified multi-modal…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 10 pages, 9 figures

  11. arXiv:2208.01643  [pdf, other]

    eess.IV cs.AI cs.CV

    CTooth+: A Large-scale Dental Cone Beam Computed Tomography Dataset and Benchmark for Tooth Volume Segmentation

    Authors: Weiwei Cui, Yaqi Wang, Yilong Li, Dan Song, Xingyong Zuo, Jiaojiao Wang, Yifan Zhang, Huiyu Zhou, Bung san Chong, Liaoyuan Zeng, Qianni Zhang

    Abstract: Accurate tooth volume segmentation is a prerequisite for computer-aided dental analysis. Deep learning-based tooth segmentation methods have achieved satisfying performances but require a large quantity of tooth data with ground truth. The dental data publicly available is limited meaning the existing methods can not be reproduced, evaluated and applied in clinical practice. In this paper, we esta…

    Submitted 2 August, 2022; originally announced August 2022.

  12. arXiv:2203.11279  [pdf, ps, other]

    eess.SP cs.AI cs.LG

    EEG based Emotion Recognition: A Tutorial and Review

    Authors: Xiang Li, Yazhou Zhang, Prayag Tiwari, Dawei Song, Bin Hu, Meihong Yang, Zhigang Zhao, Neeraj Kumar, Pekka Marttinen

    Abstract: Emotion recognition technology through analyzing the EEG signal is currently an essential concept in Artificial Intelligence and holds great potential in emotional health care, human-computer interaction, multimedia content recommendation, etc. Though there have been several works devoted to reviewing EEG-based emotion recognition, the content of these reviews needs to be updated. In addition, tho…

    Submitted 16 March, 2022; originally announced March 2022.

  13. arXiv:2202.04878  [pdf, ps, other]

    cs.IT eess.SP

    Space-Time Adaptive Processing Using Random Matrix Theory Under Limited Training Samples

    Authors: Di Song, Shengyao Chen, Feng Xi, Zhong Liu

    Abstract: Space-time adaptive processing (STAP) is one of the most effective approaches to suppressing ground clutters in airborne radar systems. It basically takes two forms, i.e., full-dimension STAP (FD-STAP) and reduced-dimension STAP (RD-STAP). When the numbers of clutter training samples are less than two times their respective system degrees-of-freedom (DOF), the performances of both FD-STAP and RD-S…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 24 pages, 5 figures

  14. arXiv:2111.08756  [pdf, other]

    cs.IT eess.SP

    Achieving Short-Blocklength RCU bound via CRC List Decoding of TCM with Probabilistic Shaping

    Authors: Linfang Wang, Dan Song, Felipe Areces, Richard D. Wesel

    Abstract: This paper applies probabilistic amplitude shaping (PAS) to a cyclic redundancy check (CRC) aided trellis coded modulation (TCM) to achieve the short-blocklength random coding union (RCU) bound. In the transmitter, the equally likely message bits are first encoded by distribution matcher to generate amplitude symbols with the desired distribution. The binary representations of the distribution mat…

    Submitted 16 November, 2021; originally announced November 2021.

  15. arXiv:2109.12634  [pdf, other]

    eess.IV cs.CV

    A Novel Hybrid Convolutional Neural Network for Accurate Organ Segmentation in 3D Head and Neck CT Images

    Authors: Zijie Chen, Cheng Li, Junjun He, Jin Ye, Diping Song, Shanshan Wang, Lixu Gu, Yu Qiao

    Abstract: Radiation therapy (RT) is widely employed in the clinic for the treatment of head and neck (HaN) cancers. An essential step of RT planning is the accurate segmentation of various organs-at-risks (OARs) in HaN CT images. Nevertheless, segmenting OARs manually is time-consuming, tedious, and error-prone considering that typical HaN CT images contain tens to hundreds of slices. Automated segmentation…

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 10 pages, 2 figures

  16. arXiv:2109.12629  [pdf, ps, other]

    eess.IV cs.CV

    Group Shift Pointwise Convolution for Volumetric Medical Image Segmentation

    Authors: Junjun He, Jin Ye, Cheng Li, Diping Song, Wanli Chen, Shanshan Wang, Lixu Gu, Yu Qiao

    Abstract: Recent studies have witnessed the effectiveness of 3D convolutions on segmenting volumetric medical images. Compared with the 2D counterparts, 3D convolutions can capture the spatial context in three dimensions. Nevertheless, models employing 3D convolutions introduce more trainable parameters and are more computationally complex, which may lead easily to model overfitting especially for medical a…

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 10 pages, 2 figures

  17. arXiv:2103.08357  [pdf, other]

    eess.IV cs.CV

    Learning Frequency-aware Dynamic Network for Efficient Super-Resolution

    Authors: Wenbin Xie, Dehua Song, Chang Xu, Chunjing Xu, Hui Zhang, Yunhe Wang

    Abstract: Deep learning based methods, especially convolutional neural networks (CNNs) have been successfully applied in the field of single image super-resolution (SISR). To obtain better fidelity and visual quality, most of existing networks are of heavy design with massive computation. However, the computation resources of modern mobile devices are limited, which cannot easily support the expensive cost.…

    Submitted 16 August, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

  18. arXiv:2009.08891  [pdf, other]

    eess.IV cs.CV

    AdderSR: Towards Energy Efficient Image Super-Resolution

    Authors: Dehua Song, Yunhe Wang, Hanting Chen, Chang Xu, Chunjing Xu, Dacheng Tao

    Abstract: This paper studies the single image super-resolution problem using adder neural networks (AdderNet). Compared with convolutional neural networks, AdderNet utilizing additions to calculate the output features thus avoid massive energy consumptions of conventional multiplications. However, it is very hard to directly inherit the existing success of AdderNet on large-scale image classification to the…

    Submitted 4 May, 2021; v1 submitted 18 September, 2020; originally announced September 2020.

  19. arXiv:2008.04481  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Transformer with Bidirectional Decoder for Speech Recognition

    Authors: Xi Chen, Songyang Zhang, Dandan Song, Peng Ouyang, Shouyi Yin

    Abstract: Attention-based models have made tremendous progress on end-to-end automatic speech recognition (ASR) recently. However, the conventional transformer-based approaches usually generate the sequence results token by token from left to right, leaving the right-to-left contexts unexploited. In this work, we introduce a bidirectional speech transformer to utilize the different directional contexts simul…

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: Accepted by InterSpeech 2020

  20. arXiv:1912.11585  [pdf, other]

    cs.SD cs.CL eess.AS

    THUEE system description for NIST 2019 SRE CTS Challenge

    Authors: Yi Liu, Tianyu Liang, Can Xu, Xianwei Zhang, Xianhong Chen, Wei-Qiang Zhang, Liang He, Dandan Song, Ruyun Li, Yangcheng Wu, Peng Ouyang, Shouyi Yin

    Abstract: This paper describes the systems submitted by the department of electronic engineering, institute of microelectronics of Tsinghua university and TsingMicro Co. Ltd. (THUEE) to the NIST 2019 speaker recognition evaluation CTS challenge. Six subsystems, including etdnn/ams, ftdnn/as, eftdnn/ams, resnet, multitask and c-vector are developed in this evaluation.

    Submitted 24 December, 2019; originally announced December 2019.

    Comments: This is the system description of THUEE submitted to NIST SRE 2019

  21. arXiv:1912.05124  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Small-footprint Keyword Spotting with Graph Convolutional Network

    Authors: Xi Chen, Shouyi Yin, Dandan Song, Peng Ouyang, Leibo Liu, Shaojun Wei

    Abstract: Despite the recent successes of deep neural networks, it remains challenging to achieve high precision keyword spotting task (KWS) on resource-constrained devices. In this study, we propose a novel context-aware and compact architecture for keyword spotting task. Based on residual connection and bottleneck structure, we design a compact and efficient network for KWS task. To leverage the long rang…

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Accepted by the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2019)

  22. arXiv:1809.10875  [pdf, other]

    cs.LG cs.AI cs.CR cs.SD eess.AS stat.ML

    Characterizing Audio Adversarial Examples Using Temporal Dependency

    Authors: Zhuolin Yang, Bo Li, Pin-Yu Chen, Dawn Song

    Abstract: Recent studies have highlighted adversarial examples as a ubiquitous threat to different neural network models and many downstream applications. Nonetheless, as unique data properties have inspired distinct and powerful learning principles, this paper aims to explore their potentials towards mitigating adversarial inputs. In particular, our results reveal the importance of using the temporal depen…

    Submitted 5 June, 2019; v1 submitted 28 September, 2018; originally announced September 2018.