Showing 1–30 of 30 results for author: Ahn, S

Searching in archive eess.
  1. arXiv:2406.18508

    eess.IV

    Assessment of Clonal Hematopoiesis of Indeterminate Potential from Cardiac Magnetic Resonance Imaging using Deep Learning in a Cardio-oncology Population

    Authors: Sangeon Ryu, Shawn Ahn, Jeacy Espinoza, Alokkumar Jha, Stephanie Halene, James S. Duncan, Jennifer M Kwan, Nicha C. Dvornek

    Abstract: Background: We propose a novel method to identify who may likely have clonal hematopoiesis of indeterminate potential (CHIP), a condition characterized by the presence of somatic mutations in hematopoietic stem cells without detectable hematologic malignancy, using deep learning techniques. Methods: We developed a convolutional neural network (CNN) to predict CHIP status using 4 different views fr…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2405.04752

    eess.AS cs.SD

    HILCodec: High Fidelity and Lightweight Neural Audio Codec

    Authors: Sunghwan Ahn, Beom Jun Woo, Min Hyun Han, Chanyeong Moon, Nam Soo Kim

    Abstract: The recent advancement of end-to-end neural audio codecs enables compressing audio at very low bitrates while reconstructing the output audio with high fidelity. Nonetheless, such improvements often come at the cost of increased model complexity. In this paper, we identify and address the problems of existing neural audio codecs. We show that the performance of Wave-U-Net does not increase consist…

    Submitted 7 May, 2024; originally announced May 2024.

  3. arXiv:2404.05832

    cs.HC eess.SY

    Human-Machine Interaction in Automated Vehicles: Reducing Voluntary Driver Intervention

    Authors: Xinzhi Zhong, Yang Zhou, Varshini Kamaraj, Zhenhao Zhou, Wissam Kontar, Dan Negrut, John D. Lee, Soyoung Ahn

    Abstract: This paper develops a novel car-following control method to reduce voluntary driver interventions and improve traffic stability in Automated Vehicles (AVs). Through a combination of experimental and empirical analysis, we show how voluntary driver interventions can instigate substantial traffic disturbances that are amplified along the traffic upstream. Motivated by these findings, we present a fr…

    Submitted 8 April, 2024; originally announced April 2024.

  4. arXiv:2402.12412

    cs.HC cs.AI cs.MM eess.SP

    Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same

    Authors: Sungjun Ahn, Hyun-Jeong Yim, Youngwan Lee, Sung-Ik Park

    Abstract: This paper introduces a media service model that exploits artificial intelligence (AI) video generators at the receive end. This proposal deviates from the traditional multimedia ecosystem, completely relying on in-house production, by shifting part of the content creation onto the receiver. We bring a semantic process into the framework, allowing the distribution network to provide service elemen…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 13 pages, 7 figures

  5. arXiv:2402.05965

    cs.LG eess.SP

    Hybrid Neural Representations for Spherical Data

    Authors: Hyomin Kim, Yunhui Jang, Jaeho Lee, Sungsoo Ahn

    Abstract: In this paper, we study hybrid neural representations for spherical data, a domain of increasing relevance in scientific research. In particular, our work focuses on weather and climate data as well as cosmic microwave background (CMB) data. Although previous studies have delved into coordinate-based neural representations for spherical signals, they often fail to capture the intricate details of h…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 13 pages, 8 figures

  6. arXiv:2312.00836

    eess.IV cs.CV

    Heteroscedastic Uncertainty Estimation for Probabilistic Unsupervised Registration of Noisy Medical Images

    Authors: Xiaoran Zhang, Daniel H. Pak, Shawn S. Ahn, Xiaoxiao Li, Chenyu You, Lawrence Staib, Albert J. Sinusas, Alex Wong, James S. Duncan

    Abstract: This paper proposes a heteroscedastic uncertainty estimation framework for unsupervised medical image registration. Existing methods rely on objectives (e.g. mean-squared error) that assume a uniform noise level across the image, disregarding the heteroscedastic and input-dependent characteristics of noise distribution in real-world medical images. This further introduces noisy gradients due to un…

    Submitted 30 November, 2023; originally announced December 2023.

  7. arXiv:2308.16870

    cs.RO cs.AI eess.SY

    Learning Driver Models for Automated Vehicles via Knowledge Sharing and Personalization

    Authors: Wissam Kontar, Xinzhi Zhong, Soyoung Ahn

    Abstract: This paper describes a framework for learning Automated Vehicles (AVs) driver models via knowledge sharing between vehicles and personalization. The innate variability in the transportation system makes it exceptionally challenging to expose AVs to all possible driving scenarios during empirical experimentation or testing. Consequently, AVs could be blind to certain encounters that are deemed detr…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 10 pages, 8 figures

  8. arXiv:2307.02784

    cs.IT cs.NI eess.SP

    On the Spatial-Wideband Effects in Millimeter-Wave Cell-Free Massive MIMO

    Authors: Seyoung Ahn, Soohyeong Kim, Yongseok Kwon, Joohan Park, Jiseung Youn, Sunghyun Cho

    Abstract: In this paper, we investigate the spatial-wideband effects in cell-free massive MIMO (CF-mMIMO) systems in mmWave bands. The utilization of mmWave frequencies brings challenges such as signal attenuation and the need for denser networks like ultra-dense networks (UDN) to maintain communication performance. CF-mMIMO is introduced as a solution, where distributed access points (APs) transmit signals…

    Submitted 6 July, 2023; originally announced July 2023.

  9. arXiv:2306.10058

    cs.LG cs.CL eess.AS

    EM-Network: Oracle Guided Self-distillation for Sequence Learning

    Authors: Ji Won Yoon, Sunghwan Ahn, Hyeonseung Lee, Minchan Kim, Seok Min Kim, Nam Soo Kim

    Abstract: We introduce EM-Network, a novel self-distillation approach that effectively leverages target information for supervised sequence-to-sequence (seq2seq) learning. In contrast to conventional methods, it is trained with oracle guidance, which is derived from the target sequence. Since the oracle guidance compactly represents the target-side context that can assist the sequence model in solving the t…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  10. arXiv:2304.14496

    physics.ins-det cs.LG eess.SP nucl-ex

    Restoring Original Signal From Pile-up Signal using Deep Learning

    Authors: C. H. Kim, S. Ahn, K. Y. Chae, J. Hooker, G. V. Rogachev

    Abstract: Pile-up signals are frequently produced in experimental physics. They create inaccurate physics data with high uncertainty and cause various problems. Therefore, the correction to pile-up signals is crucially required. In this study, we implemented a deep learning method to restore the original signals from the pile-up signals. We showed that a deep learning model could accurately reconstruct the…

    Submitted 24 April, 2023; originally announced April 2023.

  11. Multimodal Speech Recognition for Language-Guided Embodied Agents

    Authors: Allen Chang, Xiaoyuan Zhu, Aarav Monga, Seoho Ahn, Tejas Srinivasan, Jesse Thomason

    Abstract: Benchmarks for language-guided embodied agents typically assume text-based instructions, but deployed agents will encounter spoken instructions. While Automatic Speech Recognition (ASR) models can bridge the input gap, erroneous ASR transcripts can hurt the agents' ability to complete tasks. In this work, we propose training a multimodal ASR model to reduce errors in transcribing spoken instructio…

    Submitted 9 October, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 5 pages, 5 figures

    Journal ref: Proceedings of Interspeech 2023, 1608-1612

  12. arXiv:2211.15075

    eess.AS cs.SD

    Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition

    Authors: Ji Won Yoon, Beom Jun Woo, Sunghwan Ahn, Hyeonseung Lee, Nam Soo Kim

    Abstract: Recently, the advance in deep learning has brought a considerable improvement in the end-to-end speech recognition field, simplifying the traditional pipeline while producing promising results. Among the end-to-end models, the connectionist temporal classification (CTC)-based model has attracted research interest due to its non-autoregressive nature. However, such CTC models require a heavy comput…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted by 2022 SLT Workshop

  13. arXiv:2210.13683

    eess.SY cs.LG cs.RO

    Bayesian Methods in Automated Vehicle's Car-following Uncertainties: Enabling Strategic Decision Making

    Authors: Wissam Kontar, Soyoung Ahn

    Abstract: This paper proposes a methodology to estimate uncertainty in automated vehicle (AV) dynamics in real time via Bayesian inference. Based on the estimated uncertainty, the method aims to continuously monitor the car-following (CF) performance of the AV to support strategic actions to maintain a desired performance. Our methodology consists of three sequential components: (i) the Stochastic Gradient…

    Submitted 24 October, 2022; originally announced October 2022.

  14. arXiv:2209.00726

    eess.IV cs.CV

    Learning correspondences of cardiac motion from images using biomechanics-informed modeling

    Authors: Xiaoran Zhang, Chenyu You, Shawn Ahn, Juntang Zhuang, Lawrence Staib, James Duncan

    Abstract: Learning spatial-temporal correspondences in cardiac motion from images is important for understanding the underlying dynamics of cardiac anatomical structures. Many methods explicitly impose smoothness constraints such as the $\mathcal{L}_2$ norm on the displacement vector field (DVF), while usually ignoring biomechanical feasibility in the transformation. Other geometric constraints either regul…

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: Accepted by MICCAI-STACOM 2022 as an oral presentation

  15. Federated Learning Enables Big Data for Rare Cancer Boundary Detection

    Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer, et al. (254 additional authors not shown)

    Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc…

    Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

  16. Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus

    Authors: Minchan Kim, Myeonghun Jeong, Byoung Jin Choi, Sunghwan Ahn, Joun Yeop Lee, Nam Soo Kim

    Abstract: Training a text-to-speech (TTS) model requires a large-scale text-labeled speech corpus, which is troublesome to collect. In this paper, we propose a transfer learning framework for TTS that utilizes a large unlabeled speech dataset for pre-training. By leveraging wav2vec2.0 representation, unlabeled speech can substantially improve performance, especially when labeled speech is scarce. We also…

    Submitted 6 October, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted by Interspeech2022

  17. arXiv:2203.04674

    eess.IV cs.LG physics.med-ph

    Deep learning-based reconstruction of highly accelerated 3D MRI

    Authors: Sangtae Ahn, Uri Wollner, Graeme McKinnon, Isabelle Heukensfeldt Jansen, Rafi Brada, Dan Rettmann, Ty A. Cashen, John Huston, J. Kevin DeMarco, Robert Y. Shih, Joshua D. Trzasko, Christopher J. Hardy, Thomas K. F. Foo

    Abstract: Purpose: To accelerate brain 3D MRI scans by using a deep learning method for reconstructing images from highly-undersampled multi-coil k-space data. Methods: DL-Speed, an unrolled optimization architecture with dense skip-layer connections, was trained on 3D T1-weighted brain scan data to reconstruct complex-valued images from highly-undersampled k-space data. The trained model was evaluated on…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 8 pages, 8 figures

    ACM Class: I.2.6; J.2

  18. arXiv:2201.10747

    eess.IV cs.CV cs.LG

    Learning Multiple Probabilistic Degradation Generators for Unsupervised Real World Image Super Resolution

    Authors: Sangyun Lee, Sewoong Ahn, Kwangjin Yoon

    Abstract: Unsupervised real world super resolution (USR) aims to restore high-resolution (HR) images given low-resolution (LR) inputs, and its difficulty stems from the absence of a paired dataset. One of the most common approaches is synthesizing noisy LR images using GANs (i.e., degradation generators) and utilizing a synthetic dataset to train the model in a supervised manner. Although the goal of training…

    Submitted 21 August, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Accepted to ECCVW 2022

  19. arXiv:2111.09051

    eess.SY

    Implementation of Noise-Shaped Signaling System through Software-Defined Radio

    Authors: Junsung Choi, Dongryul Park, Suil Kim, Seungyoung Ahn

    Abstract: With the development of electromagnetic weapons, Electronic Warfare (EW) has been emerging as the future form of war. Especially in wireless communications, high-security defense systems, such as Low Probability of Detection (LPD), Low Probability of Interception (LPI), or Low Probability of Exploitation (LPE) communication algorithms, are studied to prevent the loss of military forces. One of the LPD,…

    Submitted 17 November, 2021; originally announced November 2021.

  20. arXiv:2111.03664

    cs.LG eess.AS eess.IV

    Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models

    Authors: Ji Won Yoon, Hyung Yong Kim, Hyeonseung Lee, Sunghwan Ahn, Nam Soo Kim

    Abstract: Knowledge distillation (KD), best known as an effective method for model compression, aims at transferring the knowledge of a bigger network (teacher) to a much smaller network (student). Conventional KD methods usually employ the teacher model trained in a supervised manner, where output labels are treated only as targets. Extending this supervised scheme further, we introduce a new type of teach…

    Submitted 11 August, 2023; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing

  21. arXiv:2108.10397

    cs.LG eess.SY

    Predicting Vehicles' Longitudinal Trajectories and Lane Changes on Highway On-Ramps

    Authors: Nachuan Li, Riley Fischer, Wissam Kontar, Soyoung Ahn

    Abstract: Vehicles on highway on-ramps are one of the leading contributors to congestion. In this paper, we propose a prediction framework that predicts the longitudinal trajectories and lane changes (LCs) of vehicles on highway on-ramps and tapers. Specifically, our framework adopts a combination of prediction models that take a 4-second trajectory segment as input and output a forecast of the longitudinal…

    Submitted 23 August, 2021; originally announced August 2021.

  22. arXiv:2105.03072

    eess.IV cs.CV

    NTIRE 2021 Challenge on Perceptual Image Quality Assessment

    Authors: Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Yu Qiao, Shuhang Gu, Radu Timofte, Manri Cheon, Sungjun Yoon, Byungyeon Kang, Junwoo Lee, Qing Zhang, Haiyang Guo, Yi Bin, Yuqing Hou, Hengliang Luo, Jingyu Guo, Zirui Wang, Hai Wang, Wenming Yang, Qingyan Bai, Shuwei Shi, Weihao Xia, Mingdeng Cao, Jiahao Wang, et al. (25 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2021 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement (NTIRE) workshop at CVPR 2021. As a new type of image processing technology, perceptual image processing algorithms based on Generative Adversarial Networks (GAN) have produced images with more realistic textures. These o…

    Submitted 28 June, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

  23. arXiv:2104.01889

    eess.IV cs.CV

    Adaptive Gradient Balancing for Undersampled MRI Reconstruction and Image-to-Image Translation

    Authors: Itzik Malkiel, Sangtae Ahn, Valentina Taviani, Anne Menini, Lior Wolf, Christopher J. Hardy

    Abstract: Recent accelerated MRI reconstruction models have used Deep Neural Networks (DNNs) to reconstruct relatively high-quality images from highly undersampled k-space data, enabling much faster MRI scanning. However, these techniques sometimes struggle to reconstruct sharp images that preserve fine detail while maintaining a natural appearance. In this work, we enhance the image quality by using a Cond…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:1905.00985

  24. arXiv:2104.01727

    eess.SY

    DSRC-Enabled Train Safety Communication System at Unmanned Crossings

    Authors: Junsung Choi, Vuk Marojevic, Carl B. Dietrich, Seungyoung Ahn

    Abstract: Although wireless technology is available for safety-critical applications, few applications have been used to improve train crossing safety. To prevent potential collisions between trains and vehicles, we present a Dedicated Short-Range Communication (DSRC)-enabled train safety communication system intended for deployment at unmanned crossings. Since our application's purpose is preventing collisi…

    Submitted 4 April, 2021; originally announced April 2021.

  25. arXiv:2102.00375

    eess.SY

    Real-time Monitoring of Autonomous Vehicle's Time Gap Variations: A Bayesian Framework

    Authors: Wissam Kontar, Soyoung Ahn

    Abstract: This paper proposes a novel monitoring methodology for car-following control of automated vehicles that uses real-time measurements of spacing and velocity obtained through vehicle sensors. This study focuses on monitoring the time gap, a key parameter that dictates the desired following spacing of the controlled vehicle. The goal is to monitor deviations in actual time gap from a desired setting…

    Submitted 30 January, 2021; originally announced February 2021.

    Comments: This paper was accepted to the 99th Annual Meeting of the Transportation Research Board, Washington, D.C., United States, 2020

  26. Survey of Spectrum Regulation for Intelligent Transportation Systems

    Authors: Junsung Choi, Vuk Marojevic, Carl B. Dietrich, Jeffrey H. Reed, Seungyoung Ahn

    Abstract: As 5G communication technology develops, vehicular communications that require high reliability, low latency, and massive connectivity are drawing increasing interest from those in academia and industry. Due to these developing technologies, vehicular communication is not limited to vehicle components in the forms of Vehicle-to-Vehicle (V2V) or Vehicle-to-Infrastructure (V2I) networks, but has als…

    Submitted 3 August, 2020; v1 submitted 26 June, 2020; originally announced August 2020.

  27. arXiv:2006.13804

    eess.IV cs.CV

    A Novel Approach for Correcting Multiple Discrete Rigid In-Plane Motions Artefacts in MRI Scans

    Authors: Michael Rotman, Rafi Brada, Israel Beniaminy, Sangtae Ahn, Christopher J. Hardy, Lior Wolf

    Abstract: Motion artefacts created by patient motion during an MRI scan occur frequently in practice, often rendering the scans clinically unusable and requiring a re-scan. While many methods have been employed to ameliorate the effects of patient motion, these often fall short in practice. In this paper we propose a novel method for removing motion artefacts using a deep neural network with two input branc…

    Submitted 29 June, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

  28. arXiv:2001.03414

    physics.med-ph eess.IV

    Attenuation Coefficient Estimation for PET/MRI With Bayesian Deep Learning pseudo-CT and Maximum Likelihood Estimation of Activity and Attenuation

    Authors: Andrew P. Leynes, Sangtae P. Ahn, Kristen A. Wangerin, Sandeep S. Kaushik, Florian Wiesinger, Thomas A. Hope, Peder E. Z. Larson

    Abstract: A major remaining challenge for magnetic resonance-based attenuation correction methods (MRAC) is their susceptibility to sources of MRI artifacts (e.g. implants, motion) and uncertainties due to the limitations of MRI contrast (e.g. accurate bone delineation and density, and separation of air/bone). We propose using a Bayesian deep convolutional neural network that, in addition to generating an i…

    Submitted 13 October, 2021; v1 submitted 10 January, 2020; originally announced January 2020.

    Comments: Accepted to the IEEE Transactions on Radiation and Plasma Medical Sciences on October 3, 2021. To be published under open access Creative Commons Attribution License (CC BY)

    Journal ref: IEEE Transactions on Radiation and Plasma Medical Sciences, Early access, 2021

  29. arXiv:2001.02407

    cs.LG cs.CV eess.IV stat.ML

    SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition

    Authors: Zhixuan Lin, Yi-Fu Wu, Skand Vishwanath Peri, Weihao Sun, Gautam Singh, Fei Deng, Jindong Jiang, Sungjin Ahn

    Abstract: The ability to decompose complex multi-object scenes into meaningful abstractions like objects is fundamental to achieve higher-level cognition. Previous approaches for unsupervised object-oriented scene representation learning are either based on spatial-attention or scene-mixture approaches and limited in scalability which is a main obstacle towards modeling real-world scenes. In this paper, we…

    Submitted 15 March, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: Accepted in ICLR 2020

  30. arXiv:1905.00985

    cs.LG eess.IV stat.ML

    Conditional WGANs with Adaptive Gradient Balancing for Sparse MRI Reconstruction

    Authors: Itzik Malkiel, Sangtae Ahn, Valentina Taviani, Anne Menini, Lior Wolf, Christopher J. Hardy

    Abstract: Recent sparse MRI reconstruction models have used Deep Neural Networks (DNNs) to reconstruct relatively high-quality images from highly undersampled k-space data, enabling much faster MRI scanning. However, these techniques sometimes struggle to reconstruct sharp images that preserve fine detail while maintaining a natural appearance. In this work, we enhance the image quality by using a Condition…

    Submitted 2 May, 2019; originally announced May 2019.