Showing 1–12 of 12 results for author: Kang, D

Searching in archive eess.

  1. arXiv:2405.00130  [pdf, other]

    eess.IV cs.CV cs.LG

    A Flexible 2.5D Medical Image Segmentation Approach with In-Slice and Cross-Slice Attention

    Authors: Amarjeet Kumar, Hongxu Jiang, Muhammad Imran, Cyndi Valdes, Gabriela Leon, Dahyun Kang, Parvathi Nataraj, Yuyin Zhou, Michael D. Weiss, Wei Shao

    Abstract: Deep learning has become the de facto method for medical image segmentation, with 3D segmentation models excelling in capturing complex 3D structures and 2D models offering high computational efficiency. However, segmenting 2.5D images, which have high in-plane but low through-plane resolution, is a relatively unexplored challenge. While applying 2D models to individual slices of a 2.5D image is f…

    Submitted 30 April, 2024; originally announced May 2024.

  2. arXiv:2403.15803  [pdf, other]

    eess.IV cs.CV

    Innovative Quantitative Analysis for Disease Progression Assessment in Familial Cerebral Cavernous Malformations

    Authors: Ruige Zong, Tao Wang, Chunwang Li, Xinlin Zhang, Yuanbin Chen, Longxuan Zhao, Qixuan Li, Qinquan Gao, Dezhi Kang, Fuxin Lin, Tong Tong

    Abstract: Familial cerebral cavernous malformation (FCCM) is a hereditary disorder characterized by abnormal vascular structures within the central nervous system. The FCCM lesions are often numerous and intricate, making quantitative analysis of the lesions a labor-intensive task. Consequently, clinicians face challenges in quantitatively assessing the severity of lesions and determining whether lesions ha…

    Submitted 23 March, 2024; originally announced March 2024.

  3. arXiv:2403.08187  [pdf, other]

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children

    Authors: Taekyung Ahn, Yeonjung Hong, Younggon Im, Do Hyung Kim, Dayoung Kang, Joo Won Jeong, Jae Won Kim, Min Jung Kim, Ah-ra Cho, Dae-Hyun Jang, Hosung Nam

    Abstract: This study presents a model of automatic speech recognition (ASR) designed to diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace manual transcriptions in clinical procedures. Since ASR models trained for general purposes primarily predict input speech into real words, employing a well-known high-performance ASR model for evaluating pronunciation in children wit…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 12 pages, 2 figures

    ACM Class: I.2.7

  4. arXiv:2307.07123  [pdf, other]

    cs.CV eess.IV

    Improved Flood Insights: Diffusion-Based SAR to EO Image Translation

    Authors: Minseok Seo, Youngtack Oh, Doyi Kim, Dongmin Kang, Yeji Choi

    Abstract: Driven by rapid climate change, the frequency and intensity of flood events are increasing. Electro-Optical (EO) satellite imagery is commonly utilized for rapid response. However, its utilities in flood situations are hampered by issues such as cloud cover and limitations during nighttime, making accurate assessment of damage challenging. Several alternative flood detection techniques utilizing S…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 10 pages, 6 figures

    Report number: 10

  5. arXiv:2305.17842  [pdf, other]

    cs.RO cs.AI eess.SY

    RL + Model-based Control: Using On-demand Optimal Control to Learn Versatile Legged Locomotion

    Authors: Dongho Kang, Jin Cheng, Miguel Zamora, Fatemeh Zargarbashi, Stelian Coros

    Abstract: This paper presents a control framework that combines model-based optimal control and reinforcement learning (RL) to achieve versatile and robust legged locomotion. Our approach enhances the RL training process by incorporating on-demand reference motions generated through finite-horizon optimal control, covering a broad range of velocities and gaits. These reference motions serve as targets for t…

    Submitted 4 September, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: The paper has been accepted for publication in IEEE Robotics and Automation Letters (RA-L). You can find the copyright information on the front page of the paper. The supplementary video is available at https://www.youtube.com/watch?v=qPttVfzGS84

  6. arXiv:2305.10732  [pdf, ps, other]

    eess.IV cs.CV

    BlindHarmony: "Blind" Harmonization for MR Images via Flow model

    Authors: Hwihun Jeong, Heejoon Byun, Dong Un Kang, Jongho Lee

    Abstract: In MRI, images of the same contrast (e.g., T$_1$) from the same subject can exhibit noticeable differences when acquired using different hardware, sequences, or scan parameters. These differences in images create a domain gap that needs to be bridged by a step called image harmonization, to process the images successfully using conventional or deep learning-based image analysis (e.g., segmentation…

    Submitted 16 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to ICCV 2023. 9 pages and 5 figures for the manuscript; supplementary included

  7. arXiv:2212.05662  [pdf, other]

    cs.LG eess.SY

    Optimal Planning of Hybrid Energy Storage Systems using Curtailed Renewable Energy through Deep Reinforcement Learning

    Authors: Dongju Kang, Doeun Kang, Sumin Hwangbo, Haider Niaz, Won Bo Lee, J. Jay Liu, Jonggeol Na

    Abstract: Energy management systems (EMS) are becoming increasingly important in order to utilize the continuously growing curtailed renewable energy. Promising energy storage systems (ESS), such as batteries and green hydrogen should be employed to maximize the efficiency of energy stakeholders. However, optimal decision-making, i.e., planning the leveraging between different strategies, is confronted with…

    Submitted 11 December, 2022; originally announced December 2022.

    Comments: 30 pages, 8 figures

  8. arXiv:2210.14406  [pdf, other]

    eess.AS cs.AI cs.CL cs.SD

    RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech

    Authors: Kyumin Park, Keon Lee, Daeyoung Kim, Dongyeop Kang

    Abstract: Even with recent advances in speech synthesis models, the evaluation of such models is based purely on human judgement as a single naturalness score, such as the Mean Opinion Score (MOS). The score-based metric does not give any further information about which parts of speech are unnatural or why human judges believe they are unnatural. We present a novel speech dataset, RedPen, with human annotat…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Submitted to ICASSP 2023

  9. arXiv:2210.11946  [pdf, other]

    eess.SY cs.CV

    RT-MOT: Confidence-Aware Real-Time Scheduling Framework for Multi-Object Tracking Tasks

    Authors: Donghwa Kang, Seunghoon Lee, Hoon Sung Chwa, Seung-Hwan Bae, Chang Mook Kang, Jinkyu Lee, Hyeongboo Baek

    Abstract: Different from existing MOT (Multi-Object Tracking) techniques that usually aim at improving tracking accuracy and average FPS, real-time systems such as autonomous vehicles necessitate new requirements of MOT under limited computing resources: (R1) guarantee of timely execution and (R2) high tracking accuracy. In this paper, we propose RT-MOT, a novel system design for multiple MOT tasks, which a…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to 2022 Real-Time Systems Symposium (RTSS)

  10. arXiv:2207.01520  [pdf, other]

    eess.IV cs.CV

    Adaptive GLCM sampling for transformer-based COVID-19 detection on CT

    Authors: Okchul Jung, Dong Un Kang, Gwanghyun Kim, Se Young Chun

    Abstract: The world has suffered from COVID-19 (SARS-CoV-2) for the last two years, causing much damage and change in people's daily lives. Thus, automated detection of COVID-19 utilizing deep learning on chest computed tomography (CT) scans became promising, which helps correct diagnosis efficiently. Recently, transformer-based COVID-19 detection method on CT is proposed to utilize 3D information in CT vol…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 6 pages

  11. arXiv:2110.07716  [pdf]

    cs.CV eess.IV

    Adversarial Scene Reconstruction and Object Detection System for Assisting Autonomous Vehicle

    Authors: Md Foysal Haque, Hay-Youn Lim, Dae-Seong Kang

    Abstract: In the current computer vision era classifying scenes through video surveillance systems is a crucial task. Artificial Intelligence (AI) Video Surveillance technologies have been advanced remarkably while artificial intelligence and deep learning ascended into the system. Adopting the superior compounds of deep learning visual classification methods achieved enormous accuracy in classifying visual…

    Submitted 13 October, 2021; originally announced October 2021.

  12. arXiv:1911.07410  [pdf, other]

    eess.IV cs.CV

    Multi-Temporal Recurrent Neural Networks For Progressive Non-Uniform Single Image Deblurring With Incremental Temporal Training

    Authors: Dongwon Park, Dong Un Kang, Jisoo Kim, Se Young Chun

    Abstract: Multi-scale (MS) approaches have been widely investigated for blind single image / video deblurring that sequentially recovers deblurred images in low spatial scale first and then in high spatial scale later with the output of lower scales. MS approaches have been effective especially for severe blurs induced by large motions in high spatial scale since those can be seen as small blurs in low spat…

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: 10 pages, 8 figures, 6 tables, work in progress