Skip to main content

Showing 1–12 of 12 results for author: Hung, C

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.09956  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

    Authors: Navonil Majumder, Chia-Yu Hung, Deepanway Ghosal, Wei-Ning Hsu, Rada Mihalcea, Soujanya Poria

    Abstract: Generative multimodal content is increasingly prevalent in much of the content creation arena, as it has the potential to allow artists and media personnel to create pre-production mockups by quickly bringing their ideas to life. The generation of audio from text prompts is an important aspect of such processes in the music and film industry. Many of the recent diffusion-based text-to-audio models… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: https://github.com/declare-lab/tango

  2. arXiv:2401.11095  [pdf, other

    cs.HC cs.SD eess.AS

    SoundShift: Exploring Sound Manipulations for Accessible Mixed-Reality Awareness

    Authors: Ruei-Che Chang, Chia-Sheng Hung, Bing-Yu Chen, Dhruv Jain, Anhong Guo

    Abstract: Mixed-reality (MR) soundscapes blend real-world sound with virtual audio from hearing devices, presenting intricate auditory information that is hard to discern and differentiate. This is particularly challenging for blind or visually impaired individuals, who rely on sounds and descriptions in their everyday lives. To understand how complex audio information is consumed, we analyzed online forum… ▽ More

    Submitted 26 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: DIS 2024

  3. arXiv:2305.05139   

    cs.SD cs.MM eess.AS

    Temporal Convolution Network Based Onset Detection and Query by Humming System Design

    Authors: Yu Cheng Hung, Jian-Jiun Ding

    Abstract: Onsets are a key factor to split audio into several notes. In this paper, we ensemble multiple temporal convolution network (TCN) based model and utilize a restricted frequency range spectrogram to achieve more robust onset detection. Different from the present onset detection of QBH system which is only available in a clean scenario, our proposal of onset detection and speech enhancement can prev… ▽ More

    Submitted 7 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: This paper has been withdrawn by the author due to a crucial definition of probability threshold and several grammer and vocabulary mistakes

  4. arXiv:2305.03982  [pdf

    cs.SD cs.MM eess.AS

    Pitch Estimation by Denoising Preprocessor and Hybrid Estimation Model

    Authors: Yu Cheng Hung, ** Hung Chen, Jian Jiun Ding

    Abstract: Pitch estimation is to estimate the fundamental frequency and the midi number and plays a critical role in music signal analysis and vocal signal processing. In this work, we proposed a new architecture based on a learning-based enhancement preprocessor and a combination of several traditional and deep learning pitch estimation methods to achieve better pitch estimation performance in both noisy a… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: From ICCE-Taiwan

  5. arXiv:2211.14986  [pdf

    eess.IV cs.CV

    An Unpaired Cross-modality Segmentation Framework Using Data Augmentation and Hybrid Convolutional Networks for Segmenting Vestibular Schwannoma and Cochlea

    Authors: Yuzhou Zhuang, Hong Liu, Enmin Song, Coskun Cetinkaya, Chih-Cheng Hung

    Abstract: The crossMoDA challenge aims to automatically segment the vestibular schwannoma (VS) tumor and cochlea regions of unlabeled high-resolution T2 scans by leveraging labeled contrast-enhanced T1 scans. The 2022 edition extends the segmentation task by including multi-institutional scans. In this work, we proposed an unpaired cross-modality segmentation framework using data augmentation and hybrid con… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted by BrainLes MICCAI proceedings

  6. arXiv:2011.05755  [pdf, other

    q-bio.QM cs.DC eess.IV

    Cryo-RALib -- a modular library for accelerating alignment in cryo-EM

    Authors: Szu-Chi Chung, Cheng-Yu Hung, Huei-Lun Siao, Hung-Yi Wu, Wei-Hau Chang, I-** Tu

    Abstract: Thanks to automated cryo-EM and GPU-accelerated processing, single-particle cryo-EM has become a rapid structure determination method that permits capture of dynamical structures of molecules in solution, which has been recently demonstrated by the determination of COVID-19 spike protein in March, shortly after its breakout in late January 2020. This rapidity is critical for vaccine development in… ▽ More

    Submitted 25 February, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

  7. arXiv:2003.12175  [pdf, other

    cs.LG cs.SD eess.AS

    Incremental Learning Algorithm for Sound Event Detection

    Authors: Eunjeong Koh, Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Erik Visser

    Abstract: This paper presents a new learning strategy for the Sound Event Detection (SED) system to tackle the issues of i) knowledge migration from a pre-trained model to a new target model and ii) learning new sound events without forgetting the previously learned ones without re-training from scratch. In order to migrate the previously learned knowledge from the source model to the target one, a neural a… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

    Comments: IEEE ICME 2020 Camera Ready Version

    Journal ref: IEEE ICME 2020

  8. arXiv:1905.08413  [pdf

    cs.CV eess.IV

    Dual-branch residual network for lung nodule segmentation

    Authors: Haichao Cao, Hong Liu, Enmin Song, Chih-Cheng Hung, Guangzhi Ma, Xiangyang Xu, Renchao **, Jianguo Lu

    Abstract: An accurate segmentation of lung nodules in computed tomography (CT) images is critical to lung cancer analysis and diagnosis. However, due to the variety of lung nodules and the similarity of visual characteristics between nodules and their surroundings, a robust segmentation of nodules becomes a challenging problem. In this study, we propose the Dual-branch Residual Network (DB-ResNet) which is… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: 24 pages, 6 figures

  9. arXiv:1905.03445  [pdf

    cs.CV eess.IV

    Two-Stage Convolutional Neural Network Architecture for Lung Nodule Detection

    Authors: Haichao Cao, Hong Liu, Enmin Song, Guangzhi Ma, Xiangyang Xu, Renchao **, Tengying Liu, Chih-Cheng Hung

    Abstract: Early detection of lung cancer is an effective way to improve the survival rate of patients. It is a critical step to have accurate detection of lung nodules in computed tomography (CT) images for the diagnosis of lung cancer. However, due to the heterogeneity of the lung nodules and the complexity of the surrounding environment, robust nodule detection has been a challenging task. In this study,… ▽ More

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: 29 pages, 10 figures

  10. arXiv:1903.07164  [pdf, ps, other

    eess.SP cs.IR math.OC

    Linearly Constrained Smoothing Group Sparsity Solvers in Off-grid Model

    Authors: Cheng-Yu Hung, Mostafa Kaveh

    Abstract: In compressed sensing, the sensing matrix is assumed perfectly known. However, there exists perturbation in the sensing matrix in reality due to sensor offsets or noise disturbance. Directions-of-arrival (DoA) estimation with off-grid effect satisfies this situation, and can be formulated into a (non)convex optimization problem with linear inequalities constraints, which can be solved by the inter… ▽ More

    Submitted 3 June, 2019; v1 submitted 17 March, 2019; originally announced March 2019.

  11. arXiv:1903.07158  [pdf, ps, other

    eess.SP cs.IR math.OC

    Joint Block Low Rank and Sparse Matrix Recovery in Array Self-Calibration Off-Grid DoA Estimation

    Authors: Cheng-Yu Hung, Mostafa Kaveh

    Abstract: This letter addresses the estimation of directions-of-arrival (DoA) by a sensor array using a sparse model in the presence of array calibration errors and off-grid directions. The received signal utilizes previously used models for unknown errors in calibration and structured linear representation of the off-grid effect. A convex optimization problem is formulated with an objective function to pro… ▽ More

    Submitted 3 June, 2019; v1 submitted 17 March, 2019; originally announced March 2019.

  12. arXiv:1712.05890  [pdf, ps, other

    eess.SP cs.IT

    Low Rank Matrix Recovery for Joint Array Self-Calibration and Sparse Model DoA Estimation

    Authors: Cheng-Yu Hung, Mostafa Kaveh

    Abstract: In this work, combined calibration and DoA estimation is approached as an extension of the formulation for the Single Measurement Vector (SMV) model of self-calibration to the Multiple Measurement Model (MMV) case. By taking advantage of multiple snapshots, a modified nuclear norm minimization problem is proposed to recover a low-rank larger dimension matrix. We also give the definition of a linea… ▽ More

    Submitted 15 December, 2017; originally announced December 2017.