Showing 1–50 of 83 results for author: Hwang, H

Searching in archive cs.
  1. arXiv:2407.00626  [pdf, other]

    cs.LG cs.AI

    Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models

    Authors: Sangwoong Yoon, Himchan Hwang, Dohyun Kwon, Yung-Kyun Noh, Frank C. Park

    Abstract: We present a maximum entropy inverse reinforcement learning (IRL) approach for improving the sample quality of diffusion generative models, especially when the number of generation time steps is small. Similar to how IRL trains a policy based on the reward function learned from expert demonstrations, we train (or fine-tune) a diffusion model using the log probability density estimated from trainin… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: Code is released at https://github.com/swyoon/Diffusion-by-MaxEntIRL

  2. arXiv:2406.13144  [pdf, other]

    cs.CL cs.AI

    DialSim: A Real-Time Simulator for Evaluating Long-Term Dialogue Understanding of Conversational Agents

    Authors: Jiho Kim, Woosog Chay, Hyeonji Hwang, Daeun Kyung, Hyunseung Chung, Eunbyeol Cho, Yohan Jo, Edward Choi

    Abstract: Recent advancements in Large Language Models (LLMs) have significantly enhanced the capabilities of conversational agents, making them applicable to various fields (e.g., education). Despite their progress, the evaluation of the agents often overlooks the complexities of real-world conversations, such as real-time interactions, multi-party dialogues, and extended contextual dependencies. To bridge… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.07922  [pdf]

    cs.CL

    Automated Information Extraction from Thyroid Operation Narrative: A Comparative Study of GPT-4 and Fine-tuned KoELECTRA

    Authors: Dongsuk Jang, Hyeryun Park, Jiye Son, Hyeonuk Hwang, Sujin Kim, Jinwook Choi

    Abstract: In the rapidly evolving field of healthcare, the integration of artificial intelligence (AI) has become a pivotal component in the automation of clinical workflows, ushering in a new era of efficiency and accuracy. This study focuses on the transformative capabilities of the fine-tuned KoELECTRA model in comparison to the GPT-4 model, aiming to facilitate automated information extraction from thyr… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures, 3 tables

    Journal ref: AMIA Joint Summits on Translational Science Proceedings, 2024, pp. 249-257

  4. arXiv:2406.05761  [pdf, other]

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  5. arXiv:2405.16082  [pdf]

    cs.CV cs.AI

    Uncertainty Measurement of Deep Learning System based on the Convex Hull of Training Sets

    Authors: Hyekyoung Hwang, Jitae Shin

    Abstract: Deep Learning (DL) has made remarkable achievements in computer vision and adopted in safety critical domains such as medical imaging or autonomous drive. Thus, it is necessary to understand the uncertainty of the model to effectively reduce accidents and losses due to misjudgment of the Deep Neural Networks (DNN). This can start by efficiently selecting data that could potentially malfunction to… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 11 pages

  6. arXiv:2405.12701  [pdf, other]

    cs.CL cs.AI

    OLAPH: Improving Factuality in Biomedical Long-form Question Answering

    Authors: Minbyul Jeong, Hyeon Hwang, Chanwoong Yoon, Taewhoo Lee, Jaewoo Kang

    Abstract: In the medical domain, numerous scenarios necessitate the long-form generation ability of large language models (LLMs). Specifically, when addressing patients' questions, it is essential that the model's response conveys factual claims, highlighting the need for an automated method to evaluate those claims. Thus, we introduce MedLFQA, a benchmark dataset reconstructed using long-form question-answ… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  7. arXiv:2404.14873  [pdf, ps, other]

    stat.ML cs.LG math.NA

    Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data

    Authors: Hyeontae Jo, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found tha… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 pages, 10 figures

    MSC Class: 65L08; 65D17; 68U07

  8. arXiv:2404.10346  [pdf, other]

    cs.CL

    Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards

    Authors: Hyeonbin Hwang, Doyoung Kim, Seungone Kim, Seonghyeon Ye, Minjoon Seo

    Abstract: Training on large amounts of rationales (i.e., CoT Fine-tuning) is effective at improving the reasoning capabilities of large language models (LLMs). However, acquiring human-authored rationales or augmenting rationales from proprietary models is costly and not scalable. In this paper, we study the problem of whether LLMs could self-improve their reasoning capabilities. To this end, we propose Sel… ▽ More

    Submitted 16 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Preprint Under Review

  9. arXiv:2404.00376  [pdf, other]

    cs.CL

    Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks

    Authors: Hyunjae Kim, Hyeon Hwang, Jiwoo Lee, Sihyeon Park, Dain Kim, Taewhoo Lee, Chanwoong Yoon, Jiwoong Sohn, Donghee Choi, Jaewoo Kang

    Abstract: While recent advancements in commercial large language models (LM) have shown promising results in medical tasks, their closed-source nature poses significant privacy and security concerns, hindering their widespread use in the medical field. Despite efforts to create open-source models, their limited parameters often result in insufficient multi-step reasoning capabilities required for solving co… ▽ More

    Submitted 30 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Added new LLaMA-3-based models and experiments on NEJM case challenges

  10. arXiv:2402.09084  [pdf, other]

    cs.LG cs.AI

    Sobolev Training for Operator Learning

    Authors: Namkyeong Cho, Junseung Ryu, Hyung Ju Hwang

    Abstract: This study investigates the impact of Sobolev Training on operator learning frameworks for improving model performance. Our research reveals that integrating derivative information into the loss function enhances the training process, and we propose a novel framework to approximate derivatives on irregular meshes in operator learning. Our findings are supported by both experimental evidence and th… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  11. arXiv:2402.08187  [pdf, other]

    cs.LG math.NA

    Learning time-dependent PDE via graph neural networks and deep operator network for robust accuracy on irregular grids

    Authors: Sung Woong Cho, Jae Yong Lee, Hyung Ju Hwang

    Abstract: Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 25 pages, 11 figures

    MSC Class: 65D17; 68U07

  12. arXiv:2402.06794  [pdf, other]

    cs.CV cs.AI

    Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing

    Authors: Hochul Hwang, Sunjae Kwon, Yekyung Kim, Donghyun Kim

    Abstract: Safely navigating street intersections is a complex challenge for blind and low-vision individuals, as it requires a nuanced understanding of the surrounding context - a task heavily reliant on visual cues. Traditional methods for assisting in this decision-making process often fall short, lacking the ability to provide a comprehensive scene analysis and safety level. This paper introduces an inno… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  13. arXiv:2402.06790  [pdf, other]

    cs.RO cs.HC

    Towards Robotic Companions: Understanding Handler-Guide Dog Interactions for Informed Guide Dog Robot Design

    Authors: Hochul Hwang, Hee-Tae Jung, Nicholas A Giudice, Joydeep Biswas, Sunghoon Ivan Lee, Donghyun Kim

    Abstract: Dog guides are favored by blind and low-vision (BLV) individuals for their ability to enhance independence and confidence by reducing safety concerns and increasing navigation efficiency compared to traditional mobility aids. However, only a relatively small proportion of BLV individuals work with dog guides due to their limited availability and associated maintenance responsibilities. There is co… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  14. Examining Rail Transportation Route of Crude Oil in the United States Using Crowdsourced Social Media Data

    Authors: Yuandong Liu, Majbah Uddin, Shih-Miao Chin, Ho-Ling Hwang, Jiaoli Chen

    Abstract: Safety issues associated with transporting crude oil by rail have been a concern since the boom of US domestic shale oil production in 2012. During the last decade, over 300 crude oil by rail incidents have occurred in the US. Some of them have caused adverse consequences including fire and hazardous materials leakage. However, only limited information on the routes of crude-on-rail and their asso… ▽ More

    Submitted 12 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Journal ref: Transportation Research Record, 2678(1), 218-228 (2024)

  15. Improving the accuracy of freight mode choice models: A case study using the 2017 CFS PUF data set and ensemble learning techniques

    Authors: Diyi Liu, Hyeonsup Lim, Majbah Uddin, Yuandong Liu, Lee D. Han, Ho-ling Hwang, Shih-Miao Chin

    Abstract: The US Census Bureau has collected two rounds of experimental data from the Commodity Flow Survey, providing shipment-level characteristics of nationwide commodity movements, published in 2012 (i.e., Public Use Microdata) and in 2017 (i.e., Public Use File). With this information, data-driven methods have become increasingly valuable for understanding detailed patterns in freight logistics. In thi… ▽ More

    Submitted 12 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Journal ref: Expert Systems with Applications, 240, 122478 (2024)

  16. arXiv:2312.15949  [pdf, other]

    cs.LG math.NA

    HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork

    Authors: Jae Yong Lee, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear mappings between function spaces. However, the DeepONet requires many parameters a… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 26 pages, 13 figures. Published as a conference paper at Eleventh International Conference on Learning Representations (ICLR 2023)

    MSC Class: 65D17; 68U07

  17. arXiv:2312.03397  [pdf, other]

    cs.LG cs.AI

    Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning

    Authors: Sangwoong Yoon, Dohyun Kwon, Himchan Hwang, Yung-Kyun Noh, Frank C. Park

    Abstract: We present Generalized Contrastive Divergence (GCD), a novel objective function for training an energy-based model (EBM) and a sampler simultaneously. GCD generalizes Contrastive Divergence (Hinton, 2002), a celebrated algorithm for training EBM, by replacing Markov Chain Monte Carlo (MCMC) distribution with a trainable sampler, such as a diffusion model. In GCD, the joint training of EBM and a di… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 Workshop on Diffusion Models

  18. arXiv:2312.01015  [pdf, other]

    cs.RO

    Aggressive Trajectory Tracking for Nano Quadrotors Using Embedded Nonlinear Model Predictive Control

    Authors: Muhammad Kazim, Hyunjae Sim, Gihun Shin, Hwancheol Hwang, Kwang-Ki K. Kim

    Abstract: This paper presents an aggressive trajectory tracking method for a small lightweight nano-quadrotor using nonlinear model predictive control (NMPC) based on acados. Controlling a nano quadrotor for accurate trajectory tracking at high speed in dynamic environments is challenging due to complex aerodynamic forces that introduce significant disturbances and large positional tracking errors. These ae… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    MSC Class: 49M37; 65K05; 90C30; 90C53; 90C90

  19. arXiv:2311.10339  [pdf, other]

    cs.CV

    A2XP: Towards Private Domain Generalization

    Authors: Geunhyeok Yu, Hyoseok Hwang

    Abstract: Deep Neural Networks (DNNs) have become pivotal in various fields, especially in computer vision, outperforming previous methodologies. A critical challenge in their deployment is the bias inherent in data across different domains, such as image style and environmental conditions, leading to domain gaps. This necessitates techniques for learning general representations from biased training data, k… ▽ More

    Submitted 17 April, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: Accepted to CVPR 2024. Our code is available at https://github.com/AIRLABkhu/A2XP

  20. An Interpretable Machine Learning Framework to Understand Bikeshare Demand before and during the COVID-19 Pandemic in New York City

    Authors: Majbah Uddin, Ho-Ling Hwang, Md Sami Hasnine

    Abstract: In recent years, bikesharing systems have become increasingly popular as affordable and sustainable micromobility solutions. Advanced mathematical models such as machine learning are required to generate good forecasts for bikeshare demand. To this end, this study proposes a machine learning modeling framework to estimate hourly demand in a large-scale bikesharing system. Two Extreme Gradient Boos… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  21. arXiv:2308.04171  [pdf, other]

    cs.AR cs.NE

    Core interface optimization for multi-core neuromorphic processors

    Authors: Zhe Su, Hyunjung Hwang, Tristan Torchet, Giacomo Indiveri

    Abstract: Hardware implementations of Spiking Neural Networks (SNNs) represent a promising approach to edge-computing for applications that require low-power and low-latency, and which cannot resort to external cloud-based computing services. However, most solutions proposed so far either support only relatively small networks, or take up significant hardware resources, to implement large networks. To reali… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  22. arXiv:2307.10928  [pdf, other]

    cs.CL cs.AI

    FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

    Authors: Seonghyeon Ye, Doyoung Kim, Sungdong Kim, Hyeonbin Hwang, Seungone Kim, Yongrae Jo, James Thorne, Juho Kim, Minjoon Seo

    Abstract: Evaluation of Large Language Models (LLMs) is challenging because instruction-following necessitates alignment with human values and the required set of skills varies depending on the instruction. However, previous studies have mainly focused on coarse-grained evaluation (i.e. overall preference-based evaluation), which limits interpretability since it does not consider the nature of user instruct… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: ICLR 2024 Spotlight

  23. arXiv:2304.12972  [pdf]

    cs.CV physics.chem-ph

    Automated Solubility Analysis System and Method Using Computer Vision and Machine Learning

    Authors: Gahee Kim, Minwoo Jeon, Hyun Do Choi, Jun Ki Cho, Youn-Suk Choi, Hyoseok Hwang

    Abstract: In this study, a novel active solubility sensing device using computer vision is proposed to improve separation purification performance and prevent malfunctions of separation equipment such as preparative liquid chromatographers and evaporators. The proposed device actively measures the solubility by transmitting a solution using a background image. The proposed system is a combination of a devic… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 20 pages, 6 figures, 3 tables

  24. Text2Time: Transformer-based Article Time Period Prediction

    Authors: Karthick Prasad Gunasekaran, B Chase Babrich, Saurabh Shirodkar, Hee Hwang

    Abstract: The task of predicting the publication period of text documents, such as news articles, is an important but less studied problem in the field of natural language processing. Predicting the year of a news article can be useful in various contexts, such as historical research, sentiment analysis, and media monitoring. In this work, we investigate the problem of predicting the publication period of a… ▽ More

    Submitted 23 April, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: 8 Pages

  25. arXiv:2303.14773  [pdf, other]

    cs.CV cs.AI cs.LG

    BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning

    Authors: Changdae Oh, Hyeji Hwang, Hee-young Lee, YongTaek Lim, Geunyoung Jung, Jiyoung Jung, Hosik Choi, Kyungwoo Song

    Abstract: With the surge of large-scale pre-trained models (PTMs), fine-tuning these models to numerous downstream tasks becomes a crucial problem. Consequently, parameter efficient transfer learning (PETL) of large models has grasped huge attention. While recent PETL methods showcase impressive performance, they rely on optimistic assumptions: 1) the entire parameter set of a PTM is available, and 2) a suf… ▽ More

    Submitted 8 July, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023 (v2: citation error was fixed)

  26. arXiv:2303.08622  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer

    Authors: Serin Yang, Hyunmin Hwang, Jong Chul Ye

    Abstract: Diffusion models have shown great promise in text-guided image style transfer, but there is a trade-off between style transformation and content preservation due to their stochastic nature. Existing methods require computationally expensive fine-tuning of diffusion models or additional neural network. To address this, here we propose a zero-shot contrastive loss for diffusion models that doesn't r… ▽ More

    Submitted 12 April, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  27. arXiv:2303.04980  [pdf, other]

    cs.CV

    Decision-BADGE: Decision-based Adversarial Batch Attack with Directional Gradient Estimation

    Authors: Geunhyeok Yu, Minwoo Jeon, Hyoseok Hwang

    Abstract: The susceptibility of deep neural networks (DNNs) to adversarial examples has prompted an increase in the deployment of adversarial attacks. Image-agnostic universal adversarial perturbations (UAPs) are much more threatening, but many limitations exist to implementing UAPs in real-world scenarios where only binary decisions are returned. In this research, we propose Decision-BADGE, a novel method… ▽ More

    Submitted 14 August, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 9 pages (7 pages except for references), 4 figures, 4 tables

  28. arXiv:2302.14691  [pdf, other]

    cs.CL cs.AI

    Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following

    Authors: Seonghyeon Ye, Hyeonbin Hwang, Sohee Yang, Hyeongu Yun, Yireun Kim, Minjoon Seo

    Abstract: In this paper, we present our finding that prepending a Task-Agnostic Prefix Prompt (TAPP) to the input improves the instruction-following ability of various Large Language Models (LLMs) during inference. TAPP is different from canonical prompts for LLMs in that it is a fixed prompt prepended to the beginning of every input regardless of the target task for zero-shot generalization. We observe tha… ▽ More

    Submitted 24 December, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: AAAI 2024

  29. arXiv:2301.07695  [pdf, other]

    cs.CL cs.AI

    EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records

    Authors: Gyubok Lee, Hyeonji Hwang, Seongsu Bae, Yeonsu Kwon, Woncheol Shin, Seongjun Yang, Minjoon Seo, Jong-Yeup Kim, Edward Choi

    Abstract: We present a new text-to-SQL dataset for electronic health records (EHRs). The utterances were collected from 222 hospital staff members, including physicians, nurses, and insurance review and health records teams. To construct the QA dataset on structured EHR data, we conducted a poll at a university hospital and used the responses to create seed questions. We then manually linked these questions… ▽ More

    Submitted 25 December, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: Published as a conference paper at NeurIPS 2022 (Track on Datasets and Benchmarks)

  30. arXiv:2212.04734  [pdf, other]

    cs.LG cs.AI cs.CL

    MED-SE: Medical Entity Definition-based Sentence Embedding

    Authors: Hyeonbin Hwang, Haanju Yoo, Yera Choi

    Abstract: We propose Medical Entity Definition-based Sentence Embedding (MED-SE), a novel unsupervised contrastive learning framework designed for clinical texts, which exploits the definitions of medical entities. To this end, we conduct an extensive analysis of multiple sentence embedding techniques in clinical semantic textual similarity (STS) settings. In the entity-centric setting that we have designed… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: 8 pages, 2 figures, 9 tables

  31. arXiv:2211.15426  [pdf]

    cs.CL

    AI Knows Which Words Will Appear in Next Year's Korean CSAT

    Authors: Byunghyun Ban, Jejong Lee, Hyeonmok Hwang

    Abstract: A text-mining-based word class categorization method and LSTM-based vocabulary pattern prediction method are introduced in this paper. A preprocessing method based on simple text appearance frequency analysis is first described. This method was developed as a data screening tool but showed 4.35 ~ 6.21 times higher than previous works. An LSTM deep learning method is also suggested for vocabulary a… ▽ More

    Submitted 2 August, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: update additional experiment result

  32. arXiv:2211.09385  [pdf, other]

    cs.SD cs.AI cs.MM eess.AS

    ComMU: Dataset for Combinatorial Music Generation

    Authors: Lee Hyun, Taehyun Kim, Hyolim Kang, Minjoo Ki, Hyeonchan Hwang, Kwanho Park, Sharang Han, Seon Joo Kim

    Abstract: Commercial adoption of automatic music composition requires the capability of generating diverse and high-quality music suitable for the desired context (e.g., music for romantic movies, action games, restaurants, etc.). In this paper, we introduce combinatorial music generation, a new task to create varying background music based on given conditions. Combinatorial music generation creates short s… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 19 pages, 12 figures

  33. arXiv:2210.13368  [pdf, other]

    cs.RO cs.AI

    System Configuration and Navigation of a Guide Dog Robot: Toward Animal Guide Dog-Level Guiding Work

    Authors: Hochul Hwang, Tim Xia, Ibrahima Keita, Ken Suzuki, Joydeep Biswas, Sunghoon I. Lee, Donghyun Kim

    Abstract: A robot guide dog has compelling advantages over animal guide dogs for its cost-effectiveness, potential for mass production, and low maintenance burden. However, despite the long history of guide dog robot research, previous studies were conducted with little or no consideration of how the guide dog handler and the guide dog work as a team for navigation. To develop a robotic guiding system that… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally

  34. arXiv:2210.12853  [pdf, other]

    physics.ao-ph cs.AI cs.CV cs.LG

    Deep-Learning-Based Precipitation Nowcasting with Ground Weather Station Data and Radar Data

    Authors: Jihoon Ko, Kyuhan Lee, Hyunjin Hwang, Kijung Shin

    Abstract: Recently, many deep-learning techniques have been applied to various weather-related prediction tasks, including precipitation nowcasting (i.e., predicting precipitation levels and locations in the near future). Most existing deep-learning-based approaches for precipitation nowcasting, however, consider only radar and/or satellite images as inputs, and meteorological observations collected from gr… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: to appear at the 17th International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM-22)

  35. arXiv:2210.10968  [pdf, other]

    cs.DS math.CO

    Identities and periodic oscillations of divide-and-conquer recurrences splitting at half

    Authors: Hsien-Kuei Hwang, Svante Janson, Tsung-Hsi Tsai

    Abstract: We study divide-and-conquer recurrences of the form \begin{equation*} f(n) = \alpha f(\lfloor \tfrac n2\rfloor) + \beta f(\lceil \tfrac n2\rceil) + g(n) \qquad(n\ge2), \end{equation*} with $g(n)$ and $f(1)$ given, where $\alpha,\beta\ge0$ with $\alpha+\beta>0$; such recurrences appear often in analysis of computer algorithms, numeration systems, combinatorial sequences, and related areas. We show that the solution sat… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 69 pages, 13 figures, 13 tables

    MSC Class: 68Q25; 39B12; 11B37; 11B83; 05A15; 05A16; 42A16

    ACM Class: F.2.2; G.2.1; G.2.3

  36. arXiv:2210.03861  [pdf, other]

    cs.CV

    Towards Light Weight Object Detection System

    Authors: Dharma KC, Venkata Ravi Kiran Dayana, Meng-Lin Wu, Venkateswara Rao Cherukuri, Hau Hwang

    Abstract: Transformers are a popular choice for classification tasks and as backbones for object detection tasks. However, their high latency brings challenges in their adaptation to lightweight object detection systems. We present an approximation of the self-attention layers used in the transformer architecture. This approximation reduces the latency of the classification system while incurring minimal lo… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  37. arXiv:2209.10956  [pdf, other]

    cs.LG

    XClusters: Explainability-first Clustering

    Authors: Hyunseung Hwang, Steven Euijong Whang

    Abstract: We study the problem of explainability-first clustering where explainability becomes a first-class citizen for clustering. Previous clustering approaches use decision trees for explanation, but only after the clustering is completed. In contrast, our approach is to perform clustering and decision tree training holistically where the decision tree's performance and size also influence the clusterin… ▽ More

    Submitted 11 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 13 pages

  38. arXiv:2207.10354  [pdf, other]

    cs.CV

    Learning from Data with Noisy Labels Using Temporal Self-Ensemble

    Authors: Jun Ho Lee, Jae Soon Baik, Tae Hwan Hwang, Jun Won Choi

    Abstract: There are inevitably many mislabeled data in real-world datasets. Because deep neural networks (DNNs) have an enormous capacity to memorize noisy labels, a robust training scheme is required to prevent labeling errors from degrading the generalization performance of DNNs. Current state-of-the-art methods present a co-training scheme that trains dual networks using samples associated with small los… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

  39. arXiv:2207.04175  [pdf, other]

    cs.CV

    Direct Handheld Burst Imaging to Simulated Defocus

    Authors: Meng-Lin Wu, Venkata Ravi Kiran Dayana, Hau Hwang

    Abstract: A shallow depth-of-field image keeps the subject in focus, and the foreground and background contexts blurred. This effect requires much larger lens apertures than those of smartphone cameras. Conventional methods acquire RGB-D images and blur image regions based on their depth. However, this approach is not suitable for reflective or transparent surfaces, or finely detailed object silhouettes, wh… ▽ More

    Submitted 13 July, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: ICIP 2022

  40. arXiv:2207.03075  [pdf, other]

    cs.LG cs.AI

    Towards the Practical Utility of Federated Learning in the Medical Domain

    Authors: Seongjun Yang, Hyeonji Hwang, Daeyoung Kim, Radhika Dua, Jong-Yeup Kim, Eunho Yang, Edward Choi

    Abstract: Federated learning (FL) is an active area of research. One of the most suitable areas for adopting FL is the medical domain, where patient privacy must be respected. Previous research, however, does not provide a practical guide to applying FL in the medical domain. We propose empirical benchmarks and experimental settings for three representative medical datasets with different modalities: longit… ▽ More

    Submitted 19 May, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted to the Main conference of CHIL2023

  41. arXiv:2207.01765  [pdf, other]

    math.NA cs.AI cs.LG physics.comp-ph

    opPINN: Physics-Informed Neural Network with operator learning to approximate solutions to the Fokker-Planck-Landau equation

    Authors: Jae Yong Lee, Juhi Jang, Hyung Ju Hwang

    Abstract: We propose a hybrid framework opPINN: physics-informed neural network (PINN) with operator learning for approximating the solution to the Fokker-Planck-Landau (FPL) equation. The opPINN framework is divided into two steps: Step 1 and Step 2. After the operator surrogate models are trained during Step 1, PINN can effectively approximate the solution to the FPL equation during Step 2 by using the pr… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 28 pages, 12 figures

    MSC Class: 68T20; 35Q84; 35B40; 82C40

  42. arXiv:2205.15543  [pdf, other]

    q-bio.QM cs.CV eess.IV

    AI-based automated Meibomian gland segmentation, classification and reflection correction in infrared Meibography

    Authors: Ripon Kumar Saha, A. M. Mahmud Chowdhury, Kyung-Sun Na, Gyu Deok Hwang, Youngsub Eom, Jaeyoung Kim, Hae-Gon Jeon, Ho Sik Hwang, Euiheon Chung

    Abstract: Purpose: Develop a deep learning-based automated method to segment meibomian glands (MG) and eyelids, quantitatively analyze the MG area and MG ratio, estimate the meiboscore, and remove specular reflections from infrared images. Methods: A total of 1600 meibography images were captured in a clinical setting. 1000 images were precisely annotated with multiple revisions by investigators and graded…

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: 11 pages, 13 Figures, 5 Supplementary Figures

  43. arXiv:2205.12429  [pdf, other]

    eess.IV cs.CV

    Interaction of a priori Anatomic Knowledge with Self-Supervised Contrastive Learning in Cardiac Magnetic Resonance Imaging

    Authors: Makiya Nakashima, Inyeop Jang, Ramesh Basnet, Mitchel Benovoy, W. H. Wilson Tang, Christopher Nguyen, Deborah Kwon, Tae Hyun Hwang, David Chen

    Abstract: Training deep learning models on cardiac magnetic resonance imaging (CMR) can be a challenge due to the small amount of expert generated labels and inherent complexity of data source. Self-supervised contrastive learning (SSCL) has recently been shown to boost performance in several medical imaging tasks. However, it is unclear how much the pre-trained representation reflects the primary organ of…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Under review at Machine Learning in Healthcare

  44. arXiv:2205.01059  [pdf, other]

    cs.LG cs.AI math.NA math.OC

    Enhanced Physics-Informed Neural Networks with Augmented Lagrangian Relaxation Method (AL-PINNs)

    Authors: Hwijae Son, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Physics-Informed Neural Networks (PINNs) have become a prominent application of deep learning in scientific computation, as they are powerful approximators of solutions to nonlinear partial differential equations (PDEs). There have been numerous attempts to facilitate the training process of PINNs by adjusting the weight of each component of the loss function, called adaptive loss-balancing algori…

    Submitted 30 May, 2023; v1 submitted 29 April, 2022; originally announced May 2022.

  45. arXiv:2204.06353  [pdf, other]

    cs.LG cs.SI

    AHP: Learning to Negative Sample for Hyperedge Prediction

    Authors: Hyun** Hwang, Seungwoo Lee, Chanyoung Park, Kijung Shin

    Abstract: Hypergraphs (i.e., sets of hyperedges) naturally represent group relations (e.g., researchers co-authoring a paper and ingredients used together in a recipe), each of which corresponds to a hyperedge (i.e., a subset of nodes). Predicting future or missing hyperedges bears significant implications for many applications (e.g., collaboration and recipe recommendation). What makes hyperedge prediction…

    Submitted 15 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: To be published in the Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022)

  46. arXiv:2204.02181  [pdf, other]

    cs.CV

    Vision Transformer Equipped with Neural Resizer on Facial Expression Recognition Task

    Authors: Hyeonbin Hwang, Soyeon Kim, Wei-** Park, Jiho Seo, Kyungtae Ko, Hyeon Yeo

    Abstract: When it comes to wild conditions, Facial Expression Recognition is often challenged with low-quality data and imbalanced, ambiguous labels. This field has much benefited from CNN based approaches; however, CNN models have structural limitation to see the facial regions in distant. As a remedy, Transformer has been introduced to vision fields with global receptive field, but requires adjusting inpu…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted to IEEE ICASSP 2022

  47. Effective Training Strategies for Deep-learning-based Precipitation Nowcasting and Estimation

    Authors: Jihoon Ko, Kyuhan Lee, Hyun** Hwang, Seok-Geun Oh, Seok-Woo Son, Kijung Shin

    Abstract: Deep learning has been successfully applied to precipitation nowcasting. In this work, we propose a pre-training scheme and a new loss function for improving deep-learning-based nowcasting. First, we adapt U-Net, a widely-used deep-learning model, for the two problems of interest here: precipitation nowcasting and precipitation estimation from radar images. We formulate the former as a classificat…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: to appear in Computers & Geosciences

  48. arXiv:2201.11967  [pdf, other]

    cs.LG math.NA

    Pseudo-Differential Neural Operator: Generalized Fourier Neural Operator for Learning Solution Operators of Partial Differential Equations

    Authors: ** Young Shin, Jae Yong Lee, Hyung Ju Hwang

    Abstract: Learning the mapping between two function spaces has garnered considerable research attention. However, learning the solution operator of partial differential equations (PDEs) remains a challenge in scientific computing. Fourier neural operator (FNO) was recently proposed to learn solution operators, and it achieved an excellent performance. In this study, we propose a novel pseudo-differe…

    Submitted 4 March, 2024; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: 23 pages, 13 figures

    MSC Class: 35S05; 47G30; 68U07

  49. arXiv:2111.04941  [pdf, other]

    math.OC cs.AI cs.LG math.NA physics.comp-ph

    Solving PDE-constrained Control Problems Using Operator Learning

    Authors: Rakhoon Hwang, Jae Yong Lee, ** Young Shin, Hyung Ju Hwang

    Abstract: The modeling and control of complex physical systems are essential in real-world problems. We propose a novel framework that is generally applicable to solving PDE-constrained optimal control problems by introducing surrogate models for PDE solution operators with special regularizers. The procedure of the proposed framework is divided into two phases: solution operator learning for PDE constraint…

    Submitted 26 December, 2023; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: 15 pages, 12 figures. Published as a conference paper at Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI 2022)

    MSC Class: 68U07

  50. arXiv:2109.14935  [pdf]

    physics.app-ph cs.ET

    Ionic Sieving Through One-Atom-Thick 2D Material Enables Analog Nonvolatile Memory for Neuromorphic Computing

    Authors: Revannath Dnyandeo Nikam, Jongwon Lee, Wooseok Choi, Writam Banerjee, Myonghoon Kwak, Manoj Yadav, Hyunsang Hwang

    Abstract: The first report on ion transport through atomic sieves of atomically-thin 2D material is provided to solve critical limitations of electrochemical random-access memory (ECRAM) devices.

    Submitted 30 September, 2021; originally announced September 2021.

    Journal ref: Small 2021, 2103543