Showing 1–18 of 18 results for author: Choi, M

  1. arXiv:2406.12688  [pdf, other]

    eess.AS eess.SP

    Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation

    Authors: Miseul Kim, Soo-Whan Chung, Youna Ji, Hong-Goo Kang, Min-Seok Choi

    Abstract: This paper introduces a novel task in generative speech processing, Acoustic Scene Transfer (AST), which aims to transfer acoustic scenes of speech signals to diverse environments. AST promises an immersive experience in speech perception by adapting the acoustic scene behind speech signals to desired environments. We propose AST-LDM for the AST task, which generates speech signals accompanied by…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  2. arXiv:2406.06534  [pdf, other]

    cs.CV eess.IV physics.optics

    Compressed Meta-Optical Encoder for Image Classification

    Authors: Anna Wirth-Singh, Jinlin Xiang, Minho Choi, Johannes E. Fröch, Luocheng Huang, Shane Colburn, Eli Shlizerman, Arka Majumdar

    Abstract: Optical and hybrid convolutional neural networks (CNNs) recently have become of increasing interest to achieve low-latency, low-power image classification and computer vision tasks. However, implementing optical nonlinearity is challenging, and omitting the nonlinear layers in a standard CNN comes at a significant reduction in accuracy. In this work, we use knowledge distillation to compress modif…

    Submitted 14 June, 2024; v1 submitted 22 April, 2024; originally announced June 2024.

  3. arXiv:2312.05548  [pdf, other]

    eess.IV cs.CV cs.LG

    A Unified Multi-Phase CT Synthesis and Classification Framework for Kidney Cancer Diagnosis with Incomplete Data

    Authors: Kwang-Hyun Uhm, Seung-Won Jung, Moon Hyung Choi, Sung-Hoo Hong, Sung-Jea Ko

    Abstract: Multi-phase CT is widely adopted for the diagnosis of kidney cancer due to the complementary information among phases. However, the complete set of multi-phase CT is often not available in practical clinical applications. In recent years, there have been some studies to generate the missing modality image from the available data. Nevertheless, the generated images are not guaranteed to be effectiv…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: This article has been accepted for publication in IEEE Journal of Biomedical and Health Informatics

    Journal ref: JBHI, 2022

  4. arXiv:2312.05334  [pdf, other]

    eess.IV cs.CV

    ProsDectNet: Bridging the Gap in Prostate Cancer Detection via Transrectal B-mode Ultrasound Imaging

    Authors: Sulaiman Vesal, Indrani Bhattacharya, Hassan Jahanandish, Xinran Li, Zachary Kornberg, Steve Ran Zhou, Elijah Richard Sommer, Moon Hyung Choi, Richard E. Fan, Geoffrey A. Sonn, Mirabela Rusu

    Abstract: Interpreting traditional B-mode ultrasound images can be challenging due to image artifacts (e.g., shadowing, speckle), leading to low sensitivity and limited diagnostic accuracy. While Magnetic Resonance Imaging (MRI) has been proposed as a solution, it is expensive and not widely available. Furthermore, most biopsies are guided by Transrectal Ultrasound (TRUS) alone and can miss up to 52% cancer…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted in NeurIPS 2023 (Medical Imaging meets NeurIPS Workshop)

  5. arXiv:2311.18505  [pdf, other]

    cs.SD eess.AS eess.SP

    String Sound Synthesizer on GPU-accelerated Finite Difference Scheme

    Authors: Jin Woo Lee, Min Jun Choi, Kyogu Lee

    Abstract: This paper introduces a nonlinear string sound synthesizer, based on a finite difference simulation of the dynamic behavior of strings under various excitations. The presented synthesizer features a versatile string simulation engine capable of stochastic parameterization, encompassing fundamental frequency modulation, stiffness, tension, frequency-dependent loss, and excitation control. This open…

    Submitted 8 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: To appear in ICASSP 2024

  6. An empirical study on speech restoration guided by self supervised speech representation

    Authors: Jaeuk Byun, Youna Ji, Soo Whan Chung, Soyeon Choe, Min Seok Choi

    Abstract: Enhancing speech quality is an indispensable yet difficult task as it is often complicated by a range of degradation factors. In addition to additive noise, reverberation, clipping, and speech attenuation can all adversely affect speech quality. Speech restoration aims to recover speech components from these distortions. This paper focuses on exploring the impact of self-supervised speech represen…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: To be presented at ICASSP 2023

  7. Cross-domain Denoising for Low-dose Multi-frame Spiral Computed Tomography

    Authors: Yucheng Lu, Zhixin Xu, Moon Hyung Choi, Jimin Kim, Seung-Won Jung

    Abstract: Computed tomography (CT) has been used worldwide as a non-invasive test to assist in diagnosis. However, the ionizing nature of X-ray exposure raises concerns about potential health risks such as cancer. The desire for lower radiation doses has driven researchers to improve reconstruction quality. Although previous studies on low-dose computed tomography (LDCT) denoising have demonstrated the effe…

    Submitted 28 June, 2024; v1 submitted 21 April, 2023; originally announced April 2023.

    Journal ref: IEEE Transactions on Medical Imaging (2024)

  8. arXiv:2301.07853  [pdf]

    cs.RO cs.HC eess.SY

    DECISIVE Benchmarking Data Report: sUAS Performance Results from Phase I

    Authors: Adam Norton, Reza Ahmadzadeh, Kshitij Jerath, Paul Robinette, Jay Weitzen, Thanuka Wickramarathne, Holly Yanco, Minseop Choi, Ryan Donald, Brendan Donoghue, Christian Dumas, Peter Gavriel, Alden Giedraitis, Brendan Hertel, Jack Houle, Nathan Letteri, Edwin Meriaux, Zahra Rezaei Khavas, Rakshith Singh, Gregg Willcox, Naye Yoni

    Abstract: This report reviews all results derived from performance benchmarking conducted during Phase I of the Development and Execution of Comprehensive and Integrated Subterranean Intelligent Vehicle Evaluations (DECISIVE) project by the University of Massachusetts Lowell, using the test methods specified in the DECISIVE Test Methods Handbook v1.1 for evaluating small unmanned aerial systems (sUAS) perfo…

    Submitted 20 January, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Approved for public release: PAO #PR2023_74172; arXiv admin note: substantial text overlap with arXiv:2211.01801

  9. arXiv:2211.01801  [pdf]

    cs.RO cs.HC eess.SY

    DECISIVE Test Methods Handbook: Test Methods for Evaluating sUAS in Subterranean and Constrained Indoor Environments, Version 1.1

    Authors: Adam Norton, Reza Ahmadzadeh, Kshitij Jerath, Paul Robinette, Jay Weitzen, Thanuka Wickramarathne, Holly Yanco, Minseop Choi, Ryan Donald, Brendan Donoghue, Christian Dumas, Peter Gavriel, Alden Giedraitis, Brendan Hertel, Jack Houle, Nathan Letteri, Edwin Meriaux, Zahra Rezaei Khavas, Rakshith Singh, Gregg Willcox, Naye Yoni

    Abstract: This handbook outlines all test methods developed under the Development and Execution of Comprehensive and Integrated Subterranean Intelligent Vehicle Evaluations (DECISIVE) project by the University of Massachusetts Lowell for evaluating small unmanned aerial systems (sUAS) performance in subterranean and constrained indoor environments, spanning communications, field readiness, interface, obstac…

    Submitted 20 January, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Approved for public release: PAO #PR2022_47058

  10. arXiv:2210.17327  [pdf, other]

    eess.AS cs.LG cs.SD

    Diffusion-based Generative Speech Source Separation

    Authors: Robin Scheibler, Youna Ji, Soo-Whan Chung, Jaeuk Byun, Soyeon Choe, Min-Seok Choi

    Abstract: We propose DiffSep, a new single channel source separation method based on score-matching of a stochastic differential equation (SDE). We craft a tailored continuous time diffusion-mixing process starting from the separated sources and converging to a Gaussian distribution centered on their mixture. This formulation lets us apply the machinery of score-based generative modelling. First, we train a…

    Submitted 2 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures, 2 tables. Submitted to ICASSP 2023

  11. arXiv:2203.09769  [pdf, other]

    cs.IT eess.SP

    SWIPT-enabled NOMA in Distributed Antenna System with Imperfect Channel State Information for Max-Sum-Rate and Max-Min Fairness

    Authors: Dongjae Kim, Minseok Choi, Dong-Wook Seo

    Abstract: Motivated by the fact that the data rate of non-orthogonal multiple access (NOMA) can be greatly increased with the help of the distributed antenna system (DAS), we present a framework in which the DAS contributes not only to the data rate but also to the energy harvesting of simultaneous wireless information and power transfer (SWIPT) enabled NOMA. This study considers the sum-rate maximization pro…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: 5 pages, 5 figures

  12. arXiv:2111.09425  [pdf, other]

    cs.NI eess.SY

    Quality-Aware Deep Reinforcement Learning for Streaming in Infrastructure-Assisted Connected Vehicles

    Authors: Won Joon Yun, Dohyun Kwon, Minseok Choi, Joongheon Kim, Giuseppe Caire, Andreas F. Molisch

    Abstract: This paper proposes a deep reinforcement learning-based video streaming scheme for mobility-aware vehicular networks, e.g., vehicles on the highway. We consider infrastructure-assisted and mmWave-based scenarios in which the macro base station (MBS) cannot directly provide the streaming service to vehicles due to the short range of mmWave beams so that small mmWave base stations (mBSs) along the r…

    Submitted 12 October, 2021; originally announced November 2021.

    Comments: 15 pages, 8 figures, Submitted to IEEE Transactions on Vehicular Technology

  13. arXiv:2106.14203  [pdf, other]

    eess.SY

    Joint Mobile Charging and Coverage-Time Extension for Unmanned Aerial Vehicles

    Authors: Soohyun Park, Won-Yong Shin, Minseok Choi, Joongheon Kim

    Abstract: In modern networks, the use of drones as mobile base stations (MBSs) has been discussed for coverage flexibility. However, the realization of drone-based networks raises several issues. One of the critical issues is that drones are extremely power-hungry. To overcome this, we need to characterize a new type of drone, the so-called charging drone, which can deliver energy to MBS drones. Motivated by the f…

    Submitted 27 June, 2021; originally announced June 2021.

  14. arXiv:2101.09566  [pdf, other]

    physics.app-ph eess.SY

    Isogeometric Configuration Design Optimization of Three-dimensional Curved Beam Structures for Maximal Fundamental Frequency

    Authors: Myung-Jin Choi, Jae-Hyun Kim, Bonyong Koo, Seonho Cho

    Abstract: This paper presents a configuration design optimization method for three-dimensional curved beam built-up structures having maximized fundamental eigenfrequency. We develop the method of computation of design velocity field and optimal design of beam structures constrained on a curved surface, where both designs of the embedded beams and the curved surface are simultaneously varied during the opti…

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: This document is the personal version of an article whose final publication is available at https://doi.org/10.1007/s00158-020-02803-0

    Journal ref: Structural and Multidisciplinary Optimization, 2021

  15. arXiv:2008.10267  [pdf, other]

    eess.AS cs.IR

    A Computational Analysis of Real-World DJ Mixes using Mix-To-Track Subsequence Alignment

    Authors: Taejun Kim, Minsuk Choi, Evan Sacks, Yi-Hsuan Yang, Juhan Nam

    Abstract: A DJ mix is a sequence of music tracks concatenated seamlessly, typically rendered for audiences in a live setting by a DJ on stage. As a DJ mix is produced in a studio or the live version is recorded for music streaming services, computational methods to analyze DJ mixes, for example, extracting track information or understanding DJ techniques, have drawn research interest. Many of previous work…

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Accepted for publication at 21st International Society for Music Information Retrieval Conference (ISMIR 2020)

  16. arXiv:1911.13010  [pdf, other]

    eess.SY cs.NI

    Joint Distributed Link Scheduling and Power Allocation for Content Delivery in Wireless Caching Networks

    Authors: Minseok Choi, Andreas F. Molisch, Joongheon Kim

    Abstract: In wireless caching networks, the design of the content delivery method must consider random user requests, caching states, network topology, and interference management. In this paper, we establish a general framework for content delivery in wireless caching networks without stringent assumptions that restrict the network structure, delivery link, and interference model. Based on the framework, w…

    Submitted 29 November, 2019; originally announced November 2019.

    Comments: 30 pages, 13 figures

  17. arXiv:1907.09184  [pdf, other]

    physics.data-an eess.IV physics.plasm-ph

    Spectral data analysis methods for the two-dimensional imaging diagnostics

    Authors: Minjun J. Choi

    Abstract: Some spectral data analysis methods that are useful for the two-dimensional imaging diagnostics data are introduced. It is shown that the frequency spectrum, the local dispersion relation, the flow shear, and the nonlinear energy transfer rates can be estimated using the proper analysis methods.

    Submitted 27 August, 2019; v1 submitted 22 July, 2019; originally announced July 2019.

  18. Dynamic Power Allocation and User Scheduling for Power-Efficient and Low-Latency Communications

    Authors: Minseok Choi, Joongheon Kim, Jaekyun Moon

    Abstract: In this paper, we propose a joint dynamic power control and user pairing algorithm for power-efficient and low-latency hybrid multiple access systems. In a hybrid multiple access system, user pairing determines whether the transmitter should serve a certain user by orthogonal multiple access (OMA) or non-orthogonal multiple access (NOMA). The proposed optimization framework minimizes the long-term…

    Submitted 28 June, 2018; originally announced July 2018.

    Comments: 30 pages, 10 figures, Submission to IEEE Journal on Selected Areas in Communication

    Journal ref: IEEE Transactions on Wireless Communications, 26 July, 2019