
Showing 1–30 of 30 results for author: Park, C

Searching in archive eess.
  1. arXiv:2404.16743  [pdf, other]

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition System-Independent Word Error Rate Estimation

    Authors: Chanho Park, Mingjie Chen, Thomas Hain

    Abstract: Word error rate (WER) is a metric used to evaluate the quality of transcriptions produced by Automatic Speech Recognition (ASR) systems. In many applications, it is of interest to estimate WER given a pair of a speech utterance and a transcript. Previous work on WER estimation focused on building models that are trained with a specific ASR system in mind (referred to as ASR system-dependent). Thes…

    Submitted 26 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 (long)

  2. arXiv:2403.16372  [pdf, other]

    cs.LG cs.DC eess.SP

    SignSGD with Federated Voting

    Authors: Chanho Park, H. Vincent Poor, Namyoon Lee

    Abstract: Distributed learning is commonly used for accelerating model training by harnessing the computational capabilities of multiple-edge devices. However, in practical applications, the communication delay emerges as a bottleneck due to the substantial information exchange required between workers and a central parameter server. SignSGD with majority voting (signSGD-MV) is an effective distributed lear…

    Submitted 24 March, 2024; originally announced March 2024.

  3. arXiv:2402.01340  [pdf, ps, other]

    cs.LG cs.CR eess.SP

    SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding

    Authors: Chanho Park, Namyoon Lee

    Abstract: Distributed learning is an effective approach to accelerate model training using multiple workers. However, substantial communication delays emerge between workers and a parameter server due to massive costs associated with communicating gradients. SignSGD with majority voting (signSGD-MV) is a simple yet effective optimizer that reduces communication costs through one-bit quantization, yet the co…

    Submitted 2 February, 2024; originally announced February 2024.

  4. arXiv:2401.13936  [pdf, ps, other]

    eess.SY

    Learning-based sensing and computing decision for data freshness in edge computing-enabled networks

    Authors: Sinwoong Yun, Dongsun Kim, Chanwon Park, Jemin Lee

    Abstract: As the demand on artificial intelligence (AI)-based applications increases, the freshness of sensed data becomes crucial in the wireless sensor networks. Since those applications require a large amount of computation for processing the sensed data, it is essential to offload the computation load to the edge computing (EC) server. In this paper, we propose the sensing and computing decision (SCD) a…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 15 pages

  5. arXiv:2312.15386  [pdf, other]

    physics.data-an astro-ph.EP eess.IV physics.ao-ph

    Hyperspectral shadow removal with Iterative Logistic Regression and latent Parametric Linear Combination of Gaussians

    Authors: Core Francisco Park, Maya Nasr, Manuel Pérez-Carrasco, Eleanor Walker, Douglas Finkbeiner, Cecilia Garraffo

    Abstract: Shadow detection and removal is a challenging problem in the analysis of hyperspectral images. Yet, this step is crucial for analyzing data for remote sensing applications like methane detection. In this work, we develop a shadow detection and removal method only based on the spectrum of each pixel and the overall distribution of spectral values. We first introduce Iterative Logistic Regression (I…

    Submitted 23 December, 2023; originally announced December 2023.

  6. arXiv:2312.04846  [pdf, other]

    cs.SD eess.AS

    Sound Source Localization for a Source inside a Structure using Ac-CycleGAN

    Authors: Shunsuke Kita, Choong Sik Park, Yoshinobu Kajikawa

    Abstract: We propose a method for sound source localization (SSL) for a source inside a structure using Ac-CycleGAN under unpaired data conditions. The proposed method utilizes a large amount of simulated data and a small amount of actual experimental data to locate a sound source inside a structure in a real environment. An Ac-CycleGAN generator contributes to the transformation of simulated data into real…

    Submitted 8 December, 2023; originally announced December 2023.

  7. arXiv:2311.04468  [pdf]

    eess.IV q-bio.NC

    A human brain atlas of chi-separation for normative iron and myelin distributions

    Authors: Kyeongseon Min, Beomseok Sohn, Woo Jung Kim, Chae Jung Park, Soohwa Song, Dong Hoon Shin, Kyung Won Chang, Na-Young Shin, Minjun Kim, Hyeong-Geol Shin, Phil Hyu Lee, Jongho Lee

    Abstract: Iron and myelin are primary susceptibility sources in the human brain. These substances are essential for healthy brain, and their abnormalities are often related to various neurological disorders. Recently, an advanced susceptibility mapping technique, which is referred to as chi-separation, has been proposed, successfully disentangling paramagnetic iron from diamagnetic myelin. This method opene…

    Submitted 2 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 19 pages, 9 figures

  8. arXiv:2311.02003  [pdf, other]

    eess.IV cs.CV

    A Structured Pruning Algorithm for Model-based Deep Learning

    Authors: Chicago Park, Weijie Gan, Zihao Zou, Yuyang Hu, Zhixin Sun, Ulugbek S. Kamilov

    Abstract: There is a growing interest in model-based deep learning (MBDL) for solving imaging inverse problems. MBDL networks can be seen as iterative algorithms that estimate the desired image using a physical measurement model and a learned image prior specified using a convolutional neural net (CNNs). The iterative nature of MBDL networks increases the test-time computational complexity, which limits the…

    Submitted 3 November, 2023; originally announced November 2023.

  9. arXiv:2310.08225  [pdf, other]

    eess.AS cs.CL cs.SD

    Fast Word Error Rate Estimation Using Self-Supervised Representations For Speech And Text

    Authors: Chanho Park, Chengsong Lu, Mingjie Chen, Thomas Hain

    Abstract: The quality of automatic speech recognition (ASR) is typically measured by word error rate (WER). WER estimation is a task aiming to predict the WER of an ASR system, given a speech utterance and a transcription. This task has gained increasing attention while advanced ASR systems are trained on large amounts of data. In this case, WER estimation becomes necessary in many scenarios, for example, s…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 5 pages

  10. Improving Image Classification of Knee Radiographs: An Automated Image Labeling Approach

    Authors: Jikai Zhang, Carlos Santos, Christine Park, Maciej Mazurowski, Roy Colglazier

    Abstract: Large numbers of radiographic images are available in knee radiology practices which could be used for training of deep learning models for diagnosis of knee abnormalities. However, those images do not typically contain readily available labels due to limitations of human annotations. The purpose of our study was to develop an automated labeling approach that improves the image classification mode…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: This is the preprint version

  11. arXiv:2306.04137  [pdf, other]

    cs.MA eess.SY

    Multi-Agent Reinforcement Learning for Cooperative Air Transportation Services in City-Wide Autonomous Urban Air Mobility

    Authors: Chanyoung Park, Gyu Seon Kim, Soohyun Park, Soyi Jung, Joongheon Kim

    Abstract: The development of urban-air-mobility (UAM) is rapidly progressing with spurs, and the demand for efficient transportation management systems is a rising need due to the multifaceted environmental uncertainties. Thus, this paper proposes a novel air transportation service management algorithm based on multi-agent deep reinforcement learning (MADRL) to address the challenges of multi-UAM cooperatio…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 15 pages, 14 figures

  12. arXiv:2302.13827  [pdf, other]

    eess.SY eess.SP math.ST

    Efficient Point Mass Predictor for Continuous and Discrete Models with Linear Dynamics

    Authors: Jakub Matousek, Jindrich Dunik, Marek Brandner, Chan Gook Park, Yeongkwon Choe

    Abstract: This paper deals with state estimation of stochastic models with linear state dynamics, continuous or discrete in time. The emphasis is laid on a numerical solution to the state prediction by the time-update step of the grid-point-based point-mass filter (PMF), which is the most computationally demanding part of the PMF algorithm. A novel way of manipulating the grid, leading to the time-update in…

    Submitted 17 April, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted for IFAC 2023

  13. Unsupervised data selection for Speech Recognition with contrastive loss ratios

    Authors: Chanho Park, Rehan Ahmad, Thomas Hain

    Abstract: This paper proposes an unsupervised data selection method by using a submodular function based on contrastive loss ratios of target and training data sets. A model using a contrastive loss function is trained on both sets. Then the ratio of frame-level losses for each model is used by a submodular function. By using the submodular function, a training set for automatic speech recognition matching…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 5 pages, accepted by ICASSP 2022

    Journal ref: IEEE Int. Conf. Acoust. Speech Signal Process. (2022) 8587-8591

  14. arXiv:2205.12633  [pdf, other]

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, Jin Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang, et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  15. arXiv:2204.11669  [pdf]

    eess.IV cs.AI physics.med-ph

    Deep-learning-enabled Brain Hemodynamic Mapping Using Resting-state fMRI

    Authors: Xirui Hou, Pengfei Guo, Puyang Wang, Peiying Liu, Doris D. M. Lin, Hongli Fan, Yang Li, Zhiliang Wei, Zixuan Lin, Dengrong Jiang, Jin Jin, Catherine Kelly, Jay J. Pillai, Judy Huang, Marco C. Pinho, Binu P. Thomas, Babu G. Welch, Denise C. Park, Vishal M. Patel, Argye E. Hillis, Hanzhang Lu

    Abstract: Cerebrovascular disease is a leading cause of death globally. Prevention and early intervention are known to be the most effective forms of its management. Non-invasive imaging methods hold great promises for early stratification, but at present lack the sensitivity for personalized prognosis. Resting-state functional magnetic resonance imaging (rs-fMRI), a powerful tool previously used for mappin…

    Submitted 25 April, 2022; originally announced April 2022.

    Journal ref: npj Digital Medicine (2023) 116

  16. arXiv:2203.15015  [pdf, other]

    eess.IV cs.CV

    Deep Interactive Learning-based ovarian cancer segmentation of H&E-stained whole slide images to study morphological patterns of BRCA mutation

    Authors: David Joon Ho, M. Herman Chui, Chad M. Vanderbilt, Jiwon Jung, Mark E. Robson, Chan-Sik Park, Jin Roh, Thomas J. Fuchs

    Abstract: Deep learning has been widely used to analyze digitized hematoxylin and eosin (H&E)-stained histopathology whole slide images. Automated cancer segmentation using deep learning can be used to diagnose malignancy and to find novel morphological patterns to predict molecular subtypes. To train pixel-wise cancer segmentation models, manual annotation from pathologists is generally a bottleneck due to…

    Submitted 28 March, 2022; originally announced March 2022.

  17. arXiv:2203.10827  [pdf, other]

    eess.AS

    Separating Content from Speaker Identity in Speech for the Assessment of Cognitive Impairments

    Authors: Dongseok Heo, Cheul Young Park, Jaemin Cheun, Myung Jin Ko

    Abstract: Deep speaker embeddings have been shown effective for assessing cognitive impairments aside from their original purpose of speaker verification. However, the research found that speaker embeddings encode speaker identity and an array of information, including speaker demographics, such as sex and age, and speech contents to an extent, which are known confounders in the assessment of cognitive impa…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 5 pages, submitted to INTERSPEECH 2022

  18. arXiv:2203.08914  [pdf, other]

    eess.IV cs.CV cs.LG

    Knee arthritis severity measurement using deep learning: a publicly available algorithm with a multi-institutional validation showing radiologist-level performance

    Authors: Hanxue Gu, Keyu Li, Roy J. Colglazier, Jichen Yang, Michael Lebhar, Jonathan O'Donnell, William A. Jiranek, Richard C. Mather, Rob J. French, Nicholas Said, Jikai Zhang, Christine Park, Maciej A. Mazurowski

    Abstract: The assessment of knee osteoarthritis (KOA) severity on knee X-rays is a central criteria for the use of total knee arthroplasty. However, this assessment suffers from imprecise standards and a remarkably high inter-reader variability. An algorithmic, automated assessment of KOA severity could improve overall outcomes of knee replacement procedures by increasing the appropriateness of its use. We…

    Submitted 21 July, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

  19. DXM-TransFuse U-net: Dual Cross-Modal Transformer Fusion U-net for Automated Nerve Identification

    Authors: Baijun Xie, Gary Milam, Bo Ning, Jaepyeong Cha, Chung Hyuk Park

    Abstract: Accurate nerve identification is critical during surgical procedures for preventing any damages to nerve tissues. Nerve injuries can lead to long-term detrimental effects for patients as well as financial overburdens. In this study, we develop a deep-learning network framework using the U-Net architecture with a Transformer block based fusion module at the bottleneck to identify nerve tissues from…

    Submitted 27 February, 2022; originally announced February 2022.

    Journal ref: Computerized Medical Imaging and Graphics, 2022-07-01, Volume 99, Article 102090

  20. arXiv:2202.06431  [pdf, other]

    eess.IV cs.CV cs.LG

    AI can evolve without labels: self-evolving vision transformer for chest X-ray diagnosis through knowledge distillation

    Authors: Sangjoon Park, Gwanghyun Kim, Yujin Oh, Joon Beom Seo, Sang Min Lee, Jin Hwan Kim, Sungjun Moon, Jae-Kwang Lim, Chang Min Park, Jong Chul Ye

    Abstract: Although deep learning-based computer-aided diagnosis systems have recently achieved expert-level performance, developing a robust deep learning model requires large, high-quality data with manual annotation, which is expensive to obtain. This situation poses the problem that the chest x-rays collected annually in hospitals cannot be used due to the lack of manual labeling by experts, especially i…

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: 24 pages

  21. arXiv:2109.06579  [pdf, other]

    eess.SP cs.LG

    Bayesian AirComp with Sign-Alignment Precoding for Wireless Federated Learning

    Authors: Chanho Park, Seunghoon Lee, Namyoon Lee

    Abstract: In this paper, we consider the problem of wireless federated learning based on sign stochastic gradient descent (signSGD) algorithm via a multiple access channel. When sending locally computed gradient's sign information, each mobile device requires to apply precoding to circumvent wireless fading effects. In practice, however, acquiring perfect knowledge of channel state information (CSI) at all…

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: This paper is 8 pages long, and has 4 figures. This paper is the extended version of the conference paper which is accepted in 2021 IEEE GlobeCom

  22. arXiv:2108.08016  [pdf, ps, other]

    cs.IT eess.SP

    Low-Complexity Algorithm for Outage Optimal Resource Allocation in Energy Harvesting-Based UAV Identification Networks

    Authors: Jae Cheol Park, Kyu-Min Kang, Junil Choi

    Abstract: We study an unmanned aerial vehicle (UAV) identification network equipped with an energy harvesting (EH) technique. In the network, the UAVs harvest energy through radio frequency (RF) signals transmitted from ground control stations (GCSs) and then transmit their identification information to the ground receiver station (GRS). Specifically, we first derive a closed-form expression of the outage p…

    Submitted 21 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 5 pages, 4 figures, accepted to IEEE Communications Letters, Aug. 2021

  23. arXiv:2104.11401  [pdf]

    cs.LG cs.CV eess.IV

    Intentional Deep Overfit Learning (IDOL): A Novel Deep Learning Strategy for Adaptive Radiation Therapy

    Authors: Jaehee Chun, Justin C. Park, Sven Olberg, You Zhang, Dan Nguyen, Jing Wang, Jin Sung Kim, Steve Jiang

    Abstract: In this study, we propose a tailored DL framework for patient-specific performance that leverages the behavior of a model intentionally overfitted to a patient-specific training dataset augmented from the prior information available in an ART workflow - an approach we term Intentional Deep Overfit Learning (IDOL). Implementing the IDOL framework in any task in radiotherapy consists of two training…

    Submitted 22 April, 2021; originally announced April 2021.

  24. arXiv:2103.06620  [pdf, other]

    cs.SD cs.CG eess.AS

    Topological Data Analysis of Korean Music in Jeongganbo: A Cycle Structure

    Authors: Mai Lan Tran, Changbom Park, Jae-Hun Jung

    Abstract: Jeongganbo is a unique music representation invented by Sejong the Great. Contrary to the western music notation, the pitch of each note is encrypted and the length is visualized directly in a matrix form in Jeongganbo. We use topological data analysis (TDA) to analyze the Korean music written in Jeongganbo for Suyeonjang, Songuyeo, and Taryong, those well-known pieces played at the palace and amo…

    Submitted 30 June, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

  25. arXiv:2012.15486  [pdf, other]

    eess.SP cs.LG

    Bayesian Federated Learning over Wireless Networks

    Authors: Seunghoon Lee, Chanho Park, Song-Nam Hong, Yonina C. Eldar, Namyoon Lee

    Abstract: Federated learning is a privacy-preserving and distributed training method using heterogeneous data sets stored at local devices. Federated learning over wireless networks requires aggregating locally computed gradients at a server where the mobile devices send statistically distinct gradient information over heterogenous communication links. This paper proposes a Bayesian federated learning (BFL)…

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 30 pages, 7 figures, submitted to IEEE Journal on Selected Areas in Communications

  26. arXiv:2011.07012  [pdf, other]

    eess.SP

    Ensuring Data Freshness for Blockchain-enabled Monitoring Networks

    Authors: Minsu Kim, Sungho Lee, Chanwon Park, Jemin Lee, Walid Saad

    Abstract: The age of information (AoI) is a recently proposed metric for quantifying data freshness in real-time status monitoring systems where timeliness is of importance. In this paper, the problem of characterizing and controlling the AoI is studied in the context of blockchain-enabled monitoring networks (BeMN). In BeMN, status updates from sources are transmitted and recorded in a blockchain. To inves…

    Submitted 19 May, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 13 pages, 10 figures. arXiv admin note: text overlap with arXiv:2010.14783

  27. arXiv:2010.14783  [pdf, other]

    eess.SP

    Age of Information Analysis in Hyperledger Fabric Blockchain-enabled Monitoring Networks

    Authors: Minsu Kim, Sungho Lee, Chanwon Park, Jemin Lee

    Abstract: Age of information (AoI) is a recently proposed metric for quantifying data freshness in real-time status monitoring systems where timeliness is of importance. In this paper, we explore the data freshness in the Hyperledger Fabric Blockchain-enabled monitoring network (HeMN) by leveraging the AoI metric. In HeMN, status updates from sources are transmitted through an uplink and recorded in a Hyper…

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 6 pages, 5 figures; This paper is submitted to IEEE International Conference on Communications (ICC) 2021

  28. arXiv:2004.05830  [pdf, other]

    eess.AS cs.CV cs.LG cs.SD eess.IV

    From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech

    Authors: Hyeong-Seok Choi, Changdae Park, Kyogu Lee

    Abstract: This work seeks the possibility of generating the human face from voice solely based on the audio-visual data without any human-labeled annotations. To this end, we propose a multi-modal learning framework that links the inference stage and generation stage. First, the inference networks are trained to match the speaker identity between the two different modalities. Then the trained inference netw…

    Submitted 13 April, 2020; originally announced April 2020.

    Comments: 18 pages, 12 figures, Published as a conference paper at International Conference on Learning Representations (ICLR) 2020. (camera-ready version)

  29. ST-GRAT: A Novel Spatio-temporal Graph Attention Network for Accurately Forecasting Dynamically Changing Road Speed

    Authors: Cheonbok Park, Chunggi Lee, Hyojin Bahng, Yunwon Tae, Kihwan Kim, Seungmin Jin, Sungahn Ko, Jaegul Choo

    Abstract: Predicting road traffic speed is a challenging task due to different types of roads, abrupt speed change and spatial dependencies between roads; it requires the modeling of dynamically changing spatial dependencies among roads and temporal patterns over long input sequences. This paper proposes a novel spatio-temporal graph attention (ST-GRAT) that effectively captures the spatio-temporal dynamics…

    Submitted 20 October, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: to be published in CIKM-2020

  30. arXiv:1710.03147  [pdf, ps, other]

    eess.SP physics.atom-ph

    Advanced Satellite-based Frequency Transfer at the 10^{-16} Level

    Authors: M. Fujieda, S-H. Yang, T. Gotoh, S-W. Hwang, H. Hachisu, H. Kim, Y. K. Lee, R. Tabuchi, T. Ido, W-K. Lee, M-S. Heo, C. Y. Park, D-H. Yu, G. Petit

    Abstract: Advanced satellite-based frequency transfers by TWCP and IPPP have been performed between NICT and KRISS. We confirm that the disagreement between them is less than 1x10^{-16} at an averaging time of several days. Additionally, an intercontinental frequency ratio measurement of Sr and Yb optical lattice clocks was directly performed by TWCP. We achieved an uncertainty at the mid-10^{-16} level aft…

    Submitted 6 October, 2017; originally announced October 2017.

    Comments: 9 pages, 5 figures