Skip to main content

Showing 1–26 of 26 results for author: Zeng, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.13789  [pdf, other

    cs.SD cs.AI cs.IR cs.MM eess.AS

    Anchor-aware Deep Metric Learning for Audio-visual Retrieval

    Authors: Donghuo Zeng, Yanan Wang, Kazushi Ikeda, Yi Yu

    Abstract: Metric learning minimizes the gap between similar (positive) pairs of data points and increases the separation of dissimilar (negative) pairs, aiming at capturing the underlying data structure and enhancing the performance of tasks like audio-visual cross-modal retrieval (AV-CMR). Recent works employ sampling methods to select impactful data points from the embedding space during training. However… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures. Accepted by ACM ICMR 2024

  2. arXiv:2403.02307  [pdf, other

    eess.IV cs.CV

    Harnessing Intra-group Variations Via a Population-Level Context for Pathology Detection

    Authors: P. Bilha Githinji, Xi Yuan, Zhenglin Chen, Ijaz Gul, Dingqi Shang, Wen Liang, Jianming Deng, Dan Zeng, Dongmei yu, Chenggang Yan, Peiwu Qin

    Abstract: Realizing sufficient separability between the distributions of healthy and pathological samples is a critical obstacle for pathology detection convolutional models. Moreover, these models exhibit a bias for contrast-based images, with diminished performance on texture-based medical images. This study introduces the notion of a population-level context for pathology detection and employs a graph th… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2402.11274  [pdf, other

    eess.IV cs.CV cs.LG

    TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method

    Authors: Chenyan Zhang, Yifei Chen, Zhenxiong Fan, Yiyu Huang, Wenchao Weng, Ruiquan Ge, Dong Zeng, Changmiao Wang

    Abstract: Recently, diffusion models have gained significant attention as a novel set of deep learning-based generative methods. These models attempt to sample data from a Gaussian distribution that adheres to a target distribution, and have been successfully adapted to the reconstruction of MRI data. However, as an unconditional generative model, the diffusion model typically disrupts image coordination be… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 5 pages, 2 figures, accept ISBI2024

    Journal ref: ISBI 2024

  4. arXiv:2312.13611  [pdf, other

    cs.LG cs.NI eess.SP

    Topology Learning for Heterogeneous Decentralized Federated Learning over Unreliable D2D Networks

    Authors: Zheshun Wu, Zenglin Xu, Dun Zeng, Junfan Li, Jie Liu

    Abstract: With the proliferation of intelligent mobile devices in wireless device-to-device (D2D) networks, decentralized federated learning (DFL) has attracted significant interest. Compared to centralized federated learning (CFL), DFL mitigates the risk of central server failures due to communication bottlenecks. However, DFL faces several challenges, such as the severe heterogeneity of data distributions… ▽ More

    Submitted 10 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in IEEE Transactions on Vehicular Technology

  5. arXiv:2310.13451  [pdf, other

    cs.SD cs.CV cs.IR cs.MM eess.AS

    Two-Stage Triplet Loss Training with Curriculum Augmentation for Audio-Visual Retrieval

    Authors: Donghuo Zeng, Kazushi Ikeda

    Abstract: The cross-modal retrieval model leverages the potential of triple loss optimization to learn robust embedding spaces. However, existing methods often train these models in a singular pass, overlooking the distinction between semi-hard and hard triples in the optimization process. The oversight of not distinguishing between semi-hard and hard triples leads to suboptimal model performance. In this p… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 8 pages, 6 figures

  6. arXiv:2306.02037  [pdf, other

    eess.IV

    A Peer-to-peer Federated Continual Learning Network for Improving CT Imaging from Multiple Institutions

    Authors: Hao Wang, Ruihong He, Xiaoyu Zhang, Zhaoying Bian, Dong Zeng, Jianhua Ma

    Abstract: Deep learning techniques have been widely used in computed tomography (CT) but require large data sets to train networks. Moreover, data sharing among multiple institutions is limited due to data privacy constraints, which hinders the development of high-performance DL-based CT imaging models from multi-institutional collaborations. Federated learning (FL) strategy is an alternative way to train t… ▽ More

    Submitted 11 July, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  7. arXiv:2211.00996  [pdf, other

    cs.SD eess.AS

    Singing Voice Synthesis with Vibrato Modeling and Latent Energy Representation

    Authors: Yingjie Song, Wei Song, Wei Zhang, Zhengchen Zhang, Dan Zeng, Zhi Liu, Yang Yu

    Abstract: This paper proposes an expressive singing voice synthesis system by introducing explicit vibrato modeling and latent energy representation. Vibrato is essential to the naturalness of synthesized sound, due to the inherent characteristics of human singing. Hence, a deep learning-based vibrato model is introduced in this paper to control the vibrato's likeliness, rate, depth and phase in singing, wh… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  8. arXiv:2208.11278  [pdf, other

    cs.LG cs.CR eess.IV

    Federated Self-Supervised Contrastive Learning and Masked Autoencoder for Dermatological Disease Diagnosis

    Authors: Yawen Wu, Dewen Zeng, Zhepeng Wang, Yi Sheng, Lei Yang, Alaina J. James, Yiyu Shi, **gtong Hu

    Abstract: In dermatological disease diagnosis, the private data collected by mobile dermatology assistants exist on distributed mobile devices of patients. Federated learning (FL) can use decentralized data to train models while kee** data local. Existing FL methods assume all the data have labels. However, medical data often comes without full labels due to high labeling costs. Self-supervised learning (… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.07470

  9. arXiv:2208.03808  [pdf, other

    eess.IV cs.CV cs.LG

    Distributed Contrastive Learning for Medical Image Segmentation

    Authors: Yawen Wu, Dewen Zeng, Zhepeng Wang, Yiyu Shi, **gtong Hu

    Abstract: Supervised deep learning needs a large amount of labeled data to achieve high performance. However, in medical imaging analysis, each site may only have a limited amount of data and labels, which makes learning ineffective. Federated learning (FL) can learn a shared model from decentralized data. But traditional FL requires fully-labeled data for training, which is very expensive to obtain. Self-s… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.10983

  10. arXiv:2207.13882  [pdf, other

    eess.IV cs.CV

    SuperVessel: Segmenting High-resolution Vessel from Low-resolution Retinal Image

    Authors: Yan Hu, Zhongxi Qiu, Dan Zeng, Li Jiang, Chen Lin, Jiang Liu

    Abstract: Vascular segmentation extracts blood vessels from images and serves as the basis for diagnosing various diseases, like ophthalmic diseases. Ophthalmologists often require high-resolution segmentation results for analysis, which leads to super-computational load by most existing methods. If based on low-resolution input, they easily ignore tiny vessels or cause discontinuity of segmented vessels. T… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: Accepted by PRCV2022

  11. Federated Contrastive Learning for Volumetric Medical Image Segmentation

    Authors: Yawen Wu, Dewen Zeng, Zhepeng Wang, Yiyu Shi, **gtong Hu

    Abstract: Supervised deep learning needs a large amount of labeled data to achieve high performance. However, in medical imaging analysis, each site may only have a limited amount of data and labels, which makes learning ineffective. Federated learning (FL) can help in this regard by learning a shared model while kee** training data local for privacy. Traditional FL requires fully-labeled data for trainin… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Cham, 2021

  12. arXiv:2203.02110  [pdf, other

    eess.IV cs.CV cs.LG

    FairPrune: Achieving Fairness Through Pruning for Dermatological Disease Diagnosis

    Authors: Yawen Wu, Dewen Zeng, Xiaowei Xu, Yiyu Shi, **gtong Hu

    Abstract: Many works have shown that deep learning-based medical image classification models can exhibit bias toward certain demographic attributes like race, gender, and age. Existing bias mitigation methods primarily focus on learning debiased models, which may not necessarily guarantee all sensitive information can be removed and usually comes with considerable accuracy degradation on both privileged and… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  13. arXiv:2202.13566  [pdf

    cs.AI cs.IR cs.LG eess.SY

    Learning Parameters for a Generalized Vidale-Wolfe Response Model with Flexible Ad Elasticity and Word-of-Mouth

    Authors: Yanwu Yang, Baozhu Feng, Daniel Zeng

    Abstract: In this research, we investigate a generalized form of Vidale-Wolfe (GVW) model. One key element of our modeling work is that the GVW model contains two useful indexes representing advertiser's elasticity and the word-of-mouth (WoM) effect, respectively. Moreover, we discuss some desirable properties of the GVW model, and present a deep neural network (DNN)-based estimation method to learn its par… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: 20 pages, 8 figures, 1 table

    MSC Class: 68Txx ACM Class: I.2.6

    Journal ref: IEEE Intelligent Systems, 36(5), 69-79 (2021)

  14. arXiv:2202.07470  [pdf, other

    cs.LG cs.AI cs.CV eess.IV

    Federated Contrastive Learning for Dermatological Disease Diagnosis via On-device Learning

    Authors: Yawen Wu, Dewen Zeng, Zhepeng Wang, Yi Sheng, Lei Yang, Alaina J. James, Yiyu Shi, **gtong Hu

    Abstract: Deep learning models have been deployed in an increasing number of edge and mobile devices to provide healthcare. These models rely on training with a tremendous amount of labeled data to achieve high accuracy. However, for medical applications such as dermatological disease diagnosis, the private data collected by mobile dermatology assistants exist on distributed mobile devices of patients, and… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

  15. arXiv:2111.00666  [pdf, other

    eess.IV cs.CV

    Self-Verification in Image Denoising

    Authors: Huangxing Lin, Yihong Zhuang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley

    Abstract: We devise a new regularization, called self-verification, for image denoising. This regularization is formulated using a deep image prior learned by the network, rather than a traditional predefined prior. Specifically, we treat the output of the network as a ``prior'' that we denoise again after ``re-noising''. The comparison between the again denoised image and its prior can be interpreted as a… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  16. arXiv:2110.13720  [pdf

    eess.IV cond-mat.mtrl-sci cs.CV

    Deep DIC: Deep Learning-Based Digital Image Correlation for End-to-End Displacement and Strain Measurement

    Authors: Ru Yang, Yang Li, Danielle Zeng, ** Guo

    Abstract: Digital image correlation (DIC) has become an industry standard to retrieve accurate displacement and strain measurement in tensile testing and other material characterization. Though traditional DIC offers a high precision estimation of deformation for general tensile testing cases, the prediction becomes unstable at large deformation or when the speckle patterns start to tear. In addition, tradi… ▽ More

    Submitted 6 January, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: 39 pages, 19 figures

    Journal ref: Journal of Materials Processing Technology (2021): 117474

  17. arXiv:2109.08909  [pdf, other

    cs.CV eess.IV math.NA

    Measuring the rogue wave pattern triggered from Gaussian perturbations by deep learning

    Authors: Liwen Zou, XinHang Luo, Delu Zeng, Liming Ling, Li-Chen Zhao

    Abstract: Weak Gaussian perturbations on a plane wave background could trigger lots of rogue waves, due to modulational instability. Numerical simulations showed that these rogue waves seemed to have similar unit structure. However, to the best of our knowledge, there is no relative result to prove that these rogue waves have the similar patterns for different perturbations, partly due to that it is hard to… ▽ More

    Submitted 9 October, 2021; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: 8 pages, 6 figures

  18. arXiv:2109.06909  [pdf, other

    eess.IV cs.CV physics.med-ph

    Hardware-aware Real-time Myocardial Segmentation Quality Control in Contrast Echocardiography

    Authors: Dewen Zeng, Yukun Ding, Haiyun Yuan, Mei** Huang, Xiaowei Xu, Jian Zhuang, **gtong Hu, Yiyu Shi

    Abstract: Automatic myocardial segmentation of contrast echocardiography has shown great potential in the quantification of myocardial perfusion parameters. Segmentation quality control is an important step to ensure the accuracy of segmentation results for quality research as well as its clinical application. Usually, the segmentation quality control happens after the data acquisition. At the data acquisit… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: 4 pages, DAC'21 invited paper

  19. arXiv:2109.03233  [pdf, other

    eess.IV

    Contrastive Learning with Temporal Correlated Medical Images: A Case Study using Lung Segmentation in Chest X-Rays

    Authors: Dewen Zeng, John N. Kheir, Peng Zeng, Yiyu Shi

    Abstract: Contrastive learning has been proved to be a promising technique for image-level representation learning from unlabeled data. Many existing works have demonstrated improved results by applying contrastive learning in classification and object detection tasks for either natural images or medical images. However, its application to medical image segmentation tasks has been limited. In this work, we… ▽ More

    Submitted 16 September, 2021; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: 7 pages, submitted to ICCAD'21 special session

  20. Myocardial Segmentation of Cardiac MRI Sequences with Temporal Consistency for Coronary Artery Disease Diagnosis

    Authors: Yutian Chen, Xiaowei Xu, Dewen Zeng, Yiyu Shi, Haiyun Yuan, Jian Zhuang, Yuhao Dong, Qianjun Jia, Mei** Huang

    Abstract: Coronary artery disease (CAD) is the most common cause of death globally, and its diagnosis is usually based on manual myocardial segmentation of Magnetic Resonance Imaging (MRI) sequences. As the manual segmentation is tedious, time-consuming and with low applicability, automatic myocardial segmentation using machine learning techniques has been widely explored recently. However, almost all the e… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

    Comments: 9 pages, 9 figures

  21. arXiv:2012.00290  [pdf, other

    cs.SD cs.DB cs.IR cs.MM eess.AS

    MusicTM-Dataset for Joint Representation Learning among Sheet Music, Lyrics, and Musical Audio

    Authors: Donghuo Zeng, Yi Yu, Keizo Oyama

    Abstract: This work present a music dataset named MusicTM-Dataset, which is utilized in improving the representation learning ability of different types of cross-modal retrieval (CMR). Little large music dataset including three modalities is available for learning representations for CMR. To collect a music dataset, we expand the original musical notation to synthesize audio and generated sheet-music image,… ▽ More

    Submitted 7 May, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: 12 pages, 5 figures, 2 tables

    Journal ref: CSMT2020

  22. arXiv:2008.07071  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Cardiac Intervention Assistance: Hardware-aware Neural Architecture Exploration for Real-Time 3D Cardiac Cine MRI Segmentation

    Authors: Dewen Zeng, Weiwen Jiang, Tianchen Wang, Xiaowei Xu, Haiyun Yuan, Mei** Huang, Jian Zhuang, **gtong Hu, Yiyu Shi

    Abstract: Real-time cardiac magnetic resonance imaging (MRI) plays an increasingly important role in guiding various cardiac interventions. In order to provide better visual assistance, the cine MRI frames need to be segmented on-the-fly to avoid noticeable visual lag. In addition, considering reliability and patient data privacy, the computation is preferably done on local hardware. State-of-the-art MRI se… ▽ More

    Submitted 13 December, 2020; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: 8 pages, conference

  23. arXiv:2007.14856  [pdf, other

    eess.AS cs.IR cs.MM cs.SD

    Unsupervised Generative Adversarial Alignment Representation for Sheet music, Audio and Lyrics

    Authors: Donghuo Zeng, Yi Yu, Keizo Oyama

    Abstract: Sheet music, audio, and lyrics are three main modalities during writing a song. In this paper, we propose an unsupervised generative adversarial alignment representation (UGAAR) model to learn deep discriminative representations shared across three major musical modalities: sheet music, lyrics, and audio, where a deep neural network based architecture on three branches is jointly trained. In parti… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: 5 pages, 2 figures, 2 tables

  24. arXiv:1910.06154  [pdf

    eess.IV cs.CV physics.med-ph

    Direct Energy-resolving CT Imaging via Energy-integrating CT images using a Unified Generative Adversarial Network

    Authors: Lisha Yao, Sui Li, Manman Zhu, Dong Zeng, Zhaoying Bian, Jianhua Ma

    Abstract: Energy-resolving computed tomography (ErCT) has the ability to acquire energy-dependent measurements simultaneously and quantitative material information with improved contrast-to-noise ratio. Meanwhile, ErCT imaging system is usually equipped with an advanced photon counting detector, which is expensive and technically complex. Therefore, clinical ErCT scanners are not yet commercially available,… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: 5 pages, 3 figures, Accepted by MIC/NSS 2019

  25. arXiv:1907.05273  [pdf, other

    eess.IV cs.CV

    Accurate Congenital Heart Disease Model Generation for 3D Printing

    Authors: Xiaowei Xu, Tianchen Wang, Dewen Zeng, Yiyu Shi, Qianjun Jia, Haiyun Yuan, Mei** Huang, Jian Zhuang

    Abstract: 3D printing has been widely adopted for clinical decision making and interventional planning of Congenital heart disease (CHD), while whole heart and great vessel segmentation is the most significant but time-consuming step in the model generation for 3D printing. While various automatic whole heart and great vessel segmentation frameworks have been developed in the literature, they are ineffectiv… ▽ More

    Submitted 11 July, 2019; v1 submitted 6 July, 2019; originally announced July 2019.

    Comments: 6 figures, 2 tables, accepted by the IEEE International Workshop on Signal Processing Systems

  26. arXiv:1809.00502  [pdf, other

    cs.SD cs.MM eess.AS

    Deep Learning of Human Perception in Audio Event Classification

    Authors: Yi Yu, Samuel Beuret, Donghuo Zeng, Keizo Oyama

    Abstract: In this paper, we introduce our recent studies on human perception in audio event classification by different deep learning models. In particular, the pre-trained model VGGish is used as feature extractor to process audio data, and DenseNet is trained by and used as feature extractor for our electroencephalography (EEG) data. The correlation between audio stimuli and EEG is learned in a shared spa… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.