Skip to main content

Showing 1–50 of 77 results for author: Jaehyun

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05516  [pdf, other

    eess.AS cs.AI cs.SD eess.SP

    Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation

    Authors: ** Woo Lee, Jaehyun Park, Min Jun Choi, Kyogu Lee

    Abstract: While significant advancements have been made in music generation and differentiable sound synthesis within machine learning and computer audition, the simulation of instrument vibration guided by physical laws has been underexplored. To address this gap, we introduce a novel model for simulating the spatio-temporal motion of nonlinear strings, integrating modal synthesis and spectral modeling wit… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2406.16994  [pdf, other

    eess.SP cs.AI

    Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks

    Authors: Gyu Seon Kim, Yeryeong Cho, Jaehyun Chung, Soohyun Park, Soyi Jung, Zhu Han, Joongheon Kim

    Abstract: Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for prov… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 22 figures

  3. arXiv:2406.13633  [pdf, ps, other

    cs.LG math.OC

    Reinforcement Learning for Infinite-Horizon Average-Reward MDPs with Multinomial Logistic Function Approximation

    Authors: Jaehyun Park, Dabeen Lee

    Abstract: We study model-based reinforcement learning with non-linear function approximation where the transition function of the underlying Markov decision process (MDP) is given by a multinomial logistic (MNL) model. In this paper, we develop two algorithms for the infinite-horizon average reward setting. Our first algorithm \texttt{UCRL2-MNL} applies to the class of communicating MDPs and achieves an… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.11128  [pdf, other

    cs.LG cs.RO

    Model Adaptation for Time Constrained Embodied Control

    Authors: Jaehyun Song, Minjong Yoo, Honguk Woo

    Abstract: When adopting a deep learning model for embodied agents, it is required that the model structure be optimized for specific tasks and operational conditions. Such optimization can be static such as model compression or dynamic such as adaptive inference. Yet, these techniques have not been fully investigated for embodied control systems subject to time constraints, which necessitate sequential deci… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 8 pages, 5 figures, Accepted in The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024)

  5. arXiv:2406.08527  [pdf, other

    cs.LG cs.AI

    Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

    Authors: Jaehyun Nam, Kyuyoung Kim, Seunghyuk Oh, Jihoon Tack, Jaehyung Kim, **woo Shin

    Abstract: Learning effective representations from raw data is crucial for the success of deep learning methods. However, in the tabular domain, practitioners often prefer augmenting raw column features over using learned representations, as conventional tree-based algorithms frequently outperform competing approaches. As a result, feature engineering methods that automatically generate candidate features ha… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 18 pages

  6. arXiv:2405.11828  [pdf, other

    cs.LG

    Federated Learning with Incomplete Sensing Modalities

    Authors: Adiba Orzikulova, Jaehyun Kwak, Jaemin Shin, Sung-Ju Lee

    Abstract: Many mobile sensing applications utilize data from various modalities, including motion and physiological sensors in mobile and wearable devices. Federated Learning (FL) is particularly suitable for these applications thanks to its privacy-preserving feature. However, challenges such as limited battery life, poor network conditions, and sensor malfunctions can restrict the use of all available mod… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  7. arXiv:2405.07543  [pdf

    cs.LG cs.RO

    Accelerating the Evolution of Personalized Automated Lane Change through Lesson Learning

    Authors: Jia Hu, Mingyue Lei, Duo Li, Zhenning Li, Jaehyun, So, Haoran Wang

    Abstract: Personalization is crucial for the widespread adoption of advanced driver assistance system. To match up with each user's preference, the online evolution capability is a must. However, conventional evolution methods learn from naturalistic driving data, which requires a lot computing power and cannot be applied online. To address this challenge, this paper proposes a lesson learning approach: lea… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  8. arXiv:2405.02845  [pdf, other

    cs.LG q-bio.MN

    Data-Efficient Molecular Generation with Hierarchical Textual Inversion

    Authors: Seo** Kim, Jaehyun Nam, Sihyun Yu, Younghoon Shin, **woo Shin

    Abstract: Develo** an effective molecular generation framework even with a limited number of molecules is often important for its practical deployment, e.g., drug discovery, since acquiring task-related molecular data requires expensive and time-consuming experimental costs. To tackle this issue, we introduce Hierarchical textual Inversion for Molecular generation (HI-Mol), a novel data-efficient molecula… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  9. arXiv:2405.02499  [pdf, other

    cs.CR cs.AR

    DRAMScope: Uncovering DRAM Microarchitecture and Characteristics by Issuing Memory Commands

    Authors: Hwayong Nam, Seungmin Baek, Minbok Wi, Michael Jaemin Kim, Jaehyun Park, Chihun Song, Nam Sung Kim, Jung Ho Ahn

    Abstract: The demand for precise information on DRAM microarchitectures and error characteristics has surged, driven by the need to explore processing in memory, enhance reliability, and mitigate security vulnerability. Nonetheless, DRAM manufacturers have disclosed only a limited amount of information, making it difficult to find specific information on their DRAM microarchitectures. This paper addresses t… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: To appear at the 51st IEEE/ACM International Symposium on Computer Architecture (ISCA)

  10. arXiv:2404.15305  [pdf, other

    eess.SP cs.LG

    ADAPT^2: Adapting Pre-Trained Sensing Models to End-Users via Self-Supervision Replay

    Authors: Hyungjun Yoon, Jaehyun Kwak, Biniyam Aschalew Tolera, Gaole Dai, Mo Li, Taesik Gong, Kimin Lee, Sung-Ju Lee

    Abstract: Self-supervised learning has emerged as a method for utilizing massive unlabeled data for pre-training models, providing an effective feature extractor for various mobile sensing applications. However, when deployed to end-users, these models encounter significant domain shifts attributed to user diversity. We investigate the performance degradation that occurs when self-supervised models are fine… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  11. An Optimal MPC Algorithm for Subunit-Monge Matrix Multiplication, with Applications to LIS

    Authors: Jaehyun Koo

    Abstract: We present an $O(1)$-round fully-scalable deterministic massively parallel algorithm for computing the min-plus matrix multiplication of unit-Monge matrices. We use this to derive a $O(\log n)$-round fully-scalable massively parallel algorithm for solving the exact longest increasing subsequence (LIS) problem. For a fully-scalable MPC regime, this result substantially improves the previously known… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: To appear in SPAA 2024

  12. arXiv:2404.13081  [pdf, other

    cs.CL cs.AI cs.LG

    SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs

    Authors: Jaehyung Kim, Jaehyun Nam, Sangwoo Mo, Jong** Park, Sang-Woo Lee, Minjoon Seo, Jung-Woo Ha, **woo Shin

    Abstract: Large language models (LLMs) have made significant advancements in various natural language processing tasks, including question answering (QA) tasks. While incorporating new information with the retrieval of relevant passages is a promising way to improve QA with LLMs, the existing methods often require additional fine-tuning which becomes infeasible with recent LLMs. Augmenting retrieved passage… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted at ICLR 2024

  13. Anarchy in the APSP: Algorithm and Hardness for Incorrect Implementation of Floyd-Warshall

    Authors: Jaehyun Koo

    Abstract: The celebrated Floyd-Warshall algorithm efficiently computes the all-pairs shortest path, and its simplicity made it a staple in computer science classes. Frequently, students discover a variant of this Floyd-Warshall algorithm by mixing up the loop order, ending up with the incorrect APSP matrix. This paper considers a computational problem of computing this incorrect APSP matrix. We will propose… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: To appear in FUN 2024

  14. arXiv:2403.04504  [pdf, other

    cs.AI

    Improving Matrix Completion by Exploiting Rating Ordinality in Graph Neural Networks

    Authors: Jaehyun Lee, SeongKu Kang, Hwanjo Yu

    Abstract: Matrix completion is an important area of research in recommender systems. Recent methods view a rating matrix as a user-item bi-partite graph with labeled edges denoting observed ratings and predict the edges between the user and item nodes by using the graph neural network (GNN). Despite their effectiveness, they treat each rating type as an independent relation type and thus cannot sufficiently… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 4 pages, 2 figures, 3 tables

  15. arXiv:2401.12019  [pdf, other

    cs.CV

    Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency

    Authors: Woonghyun Ka, Jae Young Lee, Jaehyun Choi, Junmo Kim

    Abstract: In stereo-matching knowledge distillation methods of the self-supervised monocular depth estimation, the stereo-matching network's knowledge is distilled into a monocular depth network through pseudo-depth maps. In these methods, the learning-based stereo-confidence network is generally utilized to identify errors in the pseudo-depth maps to prevent transferring the errors. However, the learning-b… ▽ More

    Submitted 22 January, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: ICASSP 2024. The first two authors are equally contributed

  16. arXiv:2401.12001  [pdf, other

    cs.CV

    Modeling Stereo-Confidence Out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep

    Authors: Jae Young Lee, Woonghyun Ka, Jaehyun Choi, Junmo Kim

    Abstract: We propose a novel stereo-confidence that can be measured externally to various stereo-matching networks, offering an alternative input modality choice of the cost volume for learning-based approaches, especially in safety-critical systems. Grounded in the foundational concepts of disparity definition and the disparity plane sweep, the proposed stereo-confidence method is built upon the idea that… ▽ More

    Submitted 22 January, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: AAAI 2024. The first two authors contributed equally

  17. arXiv:2312.15288  [pdf, other

    cs.CV stat.ML

    Understanding normalization in contrastive representation learning and out-of-distribution detection

    Authors: Tai Le-Gia, Jaehyun Ahn

    Abstract: Contrastive representation learning has emerged as an outstanding approach for anomaly detection. In this work, we explore the $\ell_2$-norm of contrastive features and its applications in out-of-distribution detection. We propose a simple method based on contrastive learning, which incorporates out-of-distribution data by discriminating against normal samples in the contrastive layer space. Our a… ▽ More

    Submitted 8 April, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

  18. arXiv:2312.04885  [pdf, other

    cs.CV

    VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

    Authors: Hanjung Kim, Jaehyun Kang, Miran Heo, Sukjun Hwang, Seoung Wug Oh, Seon Joo Kim

    Abstract: In recent years, online Video Instance Segmentation (VIS) methods have shown remarkable advancement with their powerful query-based detectors. Utilizing the output queries of the detector at the frame-level, these methods achieve high accuracy on challenging benchmarks. However, our observations demonstrate that these methods heavily rely on location information, which often causes incorrect assoc… ▽ More

    Submitted 8 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Technical report

  19. arXiv:2312.03005  [pdf, other

    cs.LG cs.CV

    Few-Shot Anomaly Detection with Adversarial Loss for Robust Feature Representations

    Authors: Jae Young Lee, Wonjun Lee, Jaehyun Choi, Yongkwi Lee, Young Seog Yoon

    Abstract: Anomaly detection is a critical and challenging task that aims to identify data points deviating from normal patterns and distributions within a dataset. Various methods have been proposed using a one-class-one-model approach, but these techniques often face practical problems such as memory inefficiency and the requirement of sufficient data for training. In particular, few-shot anomaly detection… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: BMVC 2023

  20. arXiv:2311.09585  [pdf, other

    cs.CL

    LifeTox: Unveiling Implicit Toxicity in Life Advice

    Authors: Minbeom Kim, Jahyun Koo, Hwanhee Lee, Joonsuk Park, Hwaran Lee, Kyomin Jung

    Abstract: As large language models become increasingly integrated into daily life, detecting implicit toxicity across diverse contexts is crucial. To this end, we introduce LifeTox, a dataset designed for identifying implicit toxicity within a broad range of advice-seeking scenarios. Unlike existing safety datasets, LifeTox comprises diverse contexts derived from personal experiences through open-ended ques… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, NAACL 2024

  21. arXiv:2311.07223  [pdf, other

    cs.PL

    Wasm SpecTec: Engineering a Formal Language Standard

    Authors: Joachim Breitner, Philippa Gardner, Jaehyun Lee, Sam Lindley, Matija Pretnar, Xiaojia Rao, Andreas Rossberg, Sukyoung Ryu, Wonho Shin, Conrad Watt, Dongjun Youn

    Abstract: WebAssembly (Wasm) is a low-level bytecode language and virtual machine, intended as a compilation target for a wide range of programming languages, which is seeing increasing adoption across diverse ecosystems. As a young technology, Wasm continues to evolve -- it reached version 2.0 last year and another major update is expected soon. For a new feature to be standardised in Wasm, four key arte… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 5 pages, 7 figures

  22. arXiv:2311.03722  [pdf, other

    cs.RO cs.CV

    Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM

    Authors: Seongwook Yoon, Jaehyun Kim, Sanghoon Sull

    Abstract: Visual odometry and Simultaneous Localization And Map** (SLAM) has been studied as one of the most important tasks in the areas of computer vision and robotics, to contribute to autonomous navigation and augmented reality systems. In case of feature-based odometry/SLAM, a moving visual sensor observes a set of 3D points from different viewpoints, correspondences between the projected 2D points i… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 12 pages

  23. arXiv:2311.01018  [pdf, other

    cs.CV

    Expanding Expressiveness of Diffusion Models with Limited Data via Self-Distillation based Fine-Tuning

    Authors: Jiwan Hur, Jaehyun Choi, Gyo** Han, Dong-Jae Lee, Junmo Kim

    Abstract: Training diffusion models on limited datasets poses challenges in terms of limited generation capacity and expressiveness, leading to unsatisfactory results in various downstream tasks utilizing pretrained diffusion models, such as domain translation and text-guided image manipulation. In this paper, we propose Self-Distillation for Fine-Tuning diffusion models (SDFT), a methodology to address the… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: WACV 2024

  24. arXiv:2310.08929  [pdf, other

    cs.CV

    Leveraging Image Augmentation for Object Manipulation: Towards Interpretable Controllability in Object-Centric Learning

    Authors: **woo Kim, Janghyuk Choi, Jaehyun Kang, Changyeon Lee, Ho-** Choi, Seon Joo Kim

    Abstract: The binding problem in artificial neural networks is actively explored with the goal of achieving human-level recognition skills through the comprehension of the world in terms of symbol-like entities. Especially in the field of computer vision, object-centric learning (OCL) is extensively researched to better understand complex scenes by acquiring object representations or slots. While recent stu… ▽ More

    Submitted 1 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  25. arXiv:2310.06765  [pdf, other

    cs.RO

    Efficient Graduated Non-Convexity for Pose Graph Optimization

    Authors: Wonseok Kang, Jaehyun Kim, Jiseong Chung, Seungwon Choi, Tae-wan Kim

    Abstract: We propose a novel approach to Graduated Non-Convexity (GNC) and demonstrate its efficacy through its application in robust pose graph optimization, a key component in SLAM backends. Traditional GNC methods often rely on heuristic methods for GNC schedule, updating control parameter μ for escalating the non-convexity. In contrast, our approach leverages the properties of convex functions and conve… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 6 pages, 6 figures

  26. arXiv:2310.06541  [pdf, other

    cs.AI

    Realizing Stabilized Landing for Computation-Limited Reusable Rockets: A Quantum Reinforcement Learning Approach

    Authors: Gyu Seon Kim, JaeHyun Chung, Soohyun Park

    Abstract: The advent of reusable rockets has heralded a new era in space exploration, reducing the costs of launching satellites by a significant factor. Traditional rockets were disposable, but the design of reusable rockets for repeated use has revolutionized the financial dynamics of space missions. The most critical phase of reusable rockets is the landing stage, which involves managing the tremendous s… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 5 pages, 5 figures

  27. arXiv:2308.13327  [pdf, other

    cs.CV

    3D Face Alignment Through Fusion of Head Pose Information and Features

    Authors: Jaehyun So, Youngjoon Han

    Abstract: The ability of humans to infer head poses from face shapes, and vice versa, indicates a strong correlation between the two. Accordingly, recent studies on face alignment have employed head pose information to predict facial landmarks in computer vision tasks. In this study, we propose a novel method that employs head pose information to improve face alignment performance by fusing said information… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  28. arXiv:2308.11444  [pdf, other

    cs.RO

    Adaptive Graduated Non-Convexity for Pose Graph Optimization

    Authors: Seungwon Choi, Wonseok Kang, Jiseong Chung, Jaehyun Kim, Tae-wan Kim

    Abstract: We present a novel approach to robust pose graph optimization based on Graduated Non-Convexity (GNC). Unlike traditional GNC-based methods, the proposed approach employs an adaptive shape function using B-spline to optimize the shape of the robust kernel. This aims to reduce GNC iterations, boosting computational speed without compromising accuracy. When integrated with the open-source riSAM algor… ▽ More

    Submitted 23 September, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 4 pages, 3 figures. Accepted for the workshop on Robotic Perception and Map**(ROPEM): Frontier Vision & Learning Techniques, organized at the 2023 International Conference on Intelligent Robots and Systems (IROS)

  29. arXiv:2308.09092  [pdf, other

    cs.CR

    Watch Out! Smartwatches as criminal tool and digital forensic investigations

    Authors: Seungjae Jeon, Jaehyun Chung, Doowon Jeong

    Abstract: In the rapidly advancing technological landscape, smartwatches have materialized as multifunctional devices integral to our daily routines. Smartwatches store a substantial amount of personal information, potentially serving as repositories of digital evidence. Thus, digital forensic researchers have devoted considerable effort to exploring smartwatch forensic techniques. However, it has been obse… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  30. A Forensic Methodology for Detecting Image Manipulations

    Authors: Jiwon Lee, Seungjae Jeon, Yunji Park, Jaehyun Chung, Doowon Jeong

    Abstract: By applying artificial intelligence to image editing technology, it has become possible to generate high-quality images with minimal traces of manipulation. However, since these technologies can be misused for criminal activities such as dissemination of false information, destruction of evidence, and denial of facts, it is crucial to implement strong countermeasures. In this study, image file and… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Journal ref: Journal of The Korea Institute of Information Security and Cryptology (2023)

  31. arXiv:2307.08671  [pdf, other

    cs.CR cs.AI

    Deep Cross-Modal Steganography Using Neural Representations

    Authors: Gyo** Han, Dong-Jae Lee, Jiwan Hur, Jaehyun Choi, Junmo Kim

    Abstract: Steganography is the process of embedding secret data into another message or data, in such a way that it is not easily noticeable. With the advancement of deep learning, Deep Neural Networks (DNNs) have recently been utilized in steganography. However, existing deep steganography techniques are limited in scope, as they focus on specific data types and are not effective for cross-modal steganogra… ▽ More

    Submitted 7 October, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: ICIP 2023 Oral

  32. arXiv:2306.11526  [pdf, other

    cs.LG

    Understanding Contrastive Learning Through the Lens of Margins

    Authors: Daniel Rho, TaeSoo Kim, Sooill Park, Jaehyun Park, JaeHan Park

    Abstract: Contrastive learning, along with its variations, has been a highly effective self-supervised learning method across diverse domains. Contrastive learning measures the distance between representations using cosine similarity and uses cross-entropy for representation learning. Within the same framework of cosine-similarity-based representation learning, margins have played a significant role in enha… ▽ More

    Submitted 10 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  33. arXiv:2306.08204  [pdf, other

    cs.AI cs.LG

    Unraveling the ARC Puzzle: Mimicking Human Solutions with Object-Centric Decision Transformer

    Authors: Jaehyun Park, Jaegyun Im, Sanha Hwang, Mintaek Lim, Sabina Ualibekova, Se** Kim, Sundong Kim

    Abstract: In the pursuit of artificial general intelligence (AGI), we tackle Abstraction and Reasoning Corpus (ARC) tasks using a novel two-pronged approach. We employ the Decision Transformer in an imitation learning paradigm to model human problem-solving, and introduce an object detection algorithm, the Push and Pull clustering method. This dual strategy enhances AI's ARC problem-solving skills and provi… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  34. arXiv:2306.04732  [pdf, other

    cs.RO

    Online Multi-Contact Receding Horizon Planning via Value Function Approximation

    Authors: Jiayi Wang, Sanghyun Kim, Teguh Santoso Lembono, Wenqian Du, Jaehyun Shim, Saeid Samadi, Ke Wang, Vladimir Ivan, Sylvain Calinon, Sethu Vijayakumar, Steve Tonneau

    Abstract: Planning multi-contact motions in a receding horizon fashion requires a value function to guide the planning with respect to the future, e.g., building momentum to traverse large obstacles. Traditionally, the value function is approximated by computing trajectories in a prediction horizon (never executed) that foresees the future beyond the execution horizon. However, given the non-convex dynamics… ▽ More

    Submitted 17 April, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  35. X-ray: Discovering DRAM Internal Structure and Error Characteristics by Issuing Memory Commands

    Authors: Hwayong Nam, Seungmin Baek, Minbok Wi, Michael Jaemin Kim, Jaehyun Park, Chihun Song, Nam Sung Kim, Jung Ho Ahn

    Abstract: The demand for accurate information about the internal structure and characteristics of dynamic random-access memory (DRAM) has been on the rise. Recent studies have explored the structure and characteristics of DRAM to improve processing in memory, enhance reliability, and mitigate a vulnerability known as rowhammer. However, DRAM manufacturers only disclose limited information through official d… ▽ More

    Submitted 12 August, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 4 pages, 7 figures, accepted at IEEE Computer Architecture Letters

  36. arXiv:2304.07675  [pdf, other

    cs.CV

    Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging

    Authors: Jielin Qiu, Peide Huang, Makiya Nakashima, Jaehyun Lee, Jiacheng Zhu, Wilson Tang, Pohao Chen, Christopher Nguyen, Byung-Hak Kim, Debbie Kwon, Douglas Weber, Ding Zhao, David Chen

    Abstract: Self-supervised learning is crucial for clinical imaging applications, given the lack of explicit labels in healthcare. However, conventional approaches that rely on precise vision-language alignment are not always feasible in complex clinical imaging modalities, such as cardiac magnetic resonance (CMR). CMR provides a comprehensive visualization of cardiac anatomy, physiology, and microstructure,… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: 24 pages

  37. arXiv:2304.04625  [pdf, other

    cs.LG cs.CR cs.CV

    Reinforcement Learning-Based Black-Box Model Inversion Attacks

    Authors: Gyo** Han, Jaehyun Choi, Haeil Lee, Junmo Kim

    Abstract: Model inversion attacks are a type of privacy attack that reconstructs private data used to train a machine learning model, solely by accessing the model. Recently, white-box model inversion attacks leveraging Generative Adversarial Networks (GANs) to distill knowledge from public datasets have been receiving great attention because of their excellent attack performance. On the other hand, current… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: CVPR 2023, Accepted

  38. arXiv:2304.00715  [pdf, other

    cs.DB cs.CC

    Guaranteeing the Õ(AGM/OUT) Runtime for Uniform Sampling and OUT Size Estimation over Joins

    Authors: Kyoungmin Kim, Jaehyun Ha, George Fletcher, Wook-Shin Han

    Abstract: We propose a new method for estimating the number of answers OUT of a small join query Q in a large database D, and for uniform sampling over joins. Our method is the first to satisfy all the following statements. - Support arbitrary Q, which can be either acyclic or cyclic, and contain binary and non-binary relations. - Guarantee an arbitrary small error with a high probability always in Õ(AGM/OU… ▽ More

    Submitted 9 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 19 pages

  39. arXiv:2303.13726  [pdf, other

    cs.RO

    Topology-Based MPC for Automatic Footstep Placement and Contact Surface Selection

    Authors: Jaehyun Shim, Carlos Mastalli, Thomas Corbères, Steve Tonneau, Vladimir Ivan, Sethu Vijayakumar

    Abstract: State-of-the-art approaches to footstep planning assume reduced-order dynamics when solving the combinatorial problem of selecting contact surfaces in real time. However, in exchange for computational efficiency, these approaches ignore joint torque limits and limb dynamics. In this work, we address these limitations by presenting a topology-based approach that enables model predictive control (MP… ▽ More

    Submitted 29 July, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 7 pages, 6 figures

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2023

  40. arXiv:2303.11545  [pdf, other

    cs.CV cs.AI cs.LG

    Fix the Noise: Disentangling Source Feature for Controllable Domain Translation

    Authors: Dongyeun Lee, Jae Young Lee, Doyeon Kim, Jaehyun Choi, Jaejun Yoo, Junmo Kim

    Abstract: Recent studies show strong generative performance in domain translation especially by using transfer learning techniques on the unconditional generator. However, the control between different domain features using a single model is still challenging. Existing methods often require additional models, which is computationally demanding and leads to unsatisfactory visual quality. In addition, they ha… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023. The code is available at https://github.com/LeeDongYeun/FixNoise. Extended from arXiv:2204.14079 (AICC workshop at CVPR 2022)

  41. arXiv:2303.08610  [pdf, other

    cs.SD eess.AS

    Blind Estimation of Audio Processing Graph

    Authors: Sungho Lee, Jaehyun Park, Seungryeol Paik, Kyogu Lee

    Abstract: Musicians and audio engineers sculpt and transform their sounds by connecting multiple processors, forming an audio processing graph. However, most deep-learning methods overlook this real-world practice and assume fixed graph settings. To bridge this gap, we develop a system that reconstructs the entire graph from a given reference audio. We first generate a realistic graph-reference pair dataset… ▽ More

    Submitted 7 May, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023

  42. arXiv:2303.00918  [pdf, other

    cs.LG cs.AI

    STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables

    Authors: Jaehyun Nam, Jihoon Tack, Kyungmin Lee, Hankook Lee, **woo Shin

    Abstract: Learning with few labeled tabular samples is often an essential requirement for industrial machine learning applications as varieties of tabular data suffer from high annotation costs or have difficulties in collecting new samples for novel tasks. Despite the utter importance, such a problem is quite under-explored in the field of tabular learning, and existing few-shot learning schemes from other… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 (Spotlight)

  43. arXiv:2301.06392  [pdf, other

    cs.CV

    I See-Through You: A Framework for Removing Foreground Occlusion in Both Sparse and Dense Light Field Images

    Authors: Jiwan Hur, Jae Young Lee, Jaehyun Choi, Junmo Kim

    Abstract: Light field (LF) camera captures rich information from a scene. Using the information, the LF de-occlusion (LF-DeOcc) task aims to reconstruct the occlusion-free center view image. Existing LF-DeOcc studies mainly focus on the sparsely sampled (sparse) LF images where most of the occluded regions are visible in other views due to the large disparity. In this paper, we expand LF-DeOcc in more chall… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: WACV 2023

  44. arXiv:2211.15875  [pdf, other

    cs.LG cs.CR cs.CV

    Data Poisoning Attack Aiming the Vulnerability of Continual Learning

    Authors: Gyo** Han, Jaehyun Choi, Hyeong Gwon Hong, Junmo Kim

    Abstract: Generally, regularization-based continual learning models limit access to the previous task data to imitate the real-world constraints related to memory and privacy. However, this introduces a problem in these models by not being able to track the performance on each task. In essence, current continual learning methods are susceptible to attacks on previous tasks. We demonstrate the vulnerability… ▽ More

    Submitted 3 July, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: ICIP 2023 (NeurIPS 2022 ML Safety Workshop accepted paper)

  45. arXiv:2211.02686  [pdf, ps, other

    cs.AR cs.LG

    LightNorm: Area and Energy-Efficient Batch Normalization Hardware for On-Device DNN Training

    Authors: Seock-Hwan Noh, Junsang Park, Dahoon Park, Jahyun Koo, Jeik Choi, Jaeha Kung

    Abstract: When training early-stage deep neural networks (DNNs), generating intermediate features via convolution or linear layers occupied most of the execution time. Accordingly, extensive research has been done to reduce the computational burden of the convolution or linear layers. In recent mobile-friendly DNNs, however, the relative number of operations involved in processing these layers has significa… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: The paper is going to appearin the 40th IEEE International Conference on Computer Design (ICCD), 2022

  46. arXiv:2210.01323  [pdf, other

    cs.CV cs.AI

    ASAP: Accurate semantic segmentation for real time performance

    Authors: Jaehyun Park, Subin Lee, Eon Kim, Byeongjun Moon, Dabeen Yu, Yeonseung Yu, Junghwan Kim

    Abstract: Feature fusion modules from encoder and self-attention module have been adopted in semantic segmentation. However, the computation of these modules is costly and has operational limitations in real-time environments. In addition, segmentation performance is limited in autonomous driving environments with a lot of contextual information perpendicular to the road surface, such as people, buildings,… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures

  47. arXiv:2207.00555  [pdf, other

    eess.AS cs.CL cs.LG

    FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning

    Authors: Yeonghyeon Lee, Kangwook Jang, Jahyun Goo, Youngmoon Jung, Hoirin Kim

    Abstract: Large-scale speech self-supervised learning (SSL) has emerged to the main field of speech processing, however, the problem of computational cost arising from its vast size makes a high entry barrier to academia. In addition, existing distillation techniques of speech SSL models compress the model by reducing layers, which induces performance degradation in linguistic pattern recognition tasks such… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to Interspeech 2022

  48. arXiv:2204.14079  [pdf, other

    cs.CV cs.AI cs.LG

    Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN

    Authors: Dongyeun Lee, Jae Young Lee, Doyeon Kim, Jaehyun Choi, Junmo Kim

    Abstract: Transfer learning of StyleGAN has recently shown great potential to solve diverse tasks, especially in domain translation. Previous methods utilized a source model by swap** or freezing weights during transfer learning, however, they have limitations on visual quality and controlling source features. In other words, they require additional models that are computationally demanding and have restr… ▽ More

    Submitted 21 March, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: Full CVPR 2023 paper is available at arXiv:2303.11545. Best paper of CVPRW AICC 2022 (CVPR 2022 Workshop on AI for Content Creation). The code is available at https://github.com/LeeDongYeun/FixNoise

  49. arXiv:2204.03872  [pdf, other

    cs.LG cs.CV

    Controllable Missingness from Uncontrollable Missingness: Joint Learning Measurement Policy and Imputation

    Authors: Seongwook Yoon, Jaehyun Kim, Heejeong Lim, Sanghoon Sull

    Abstract: Due to the cost or interference of measurement, we need to control measurement system. Assuming that each variable can be measured sequentially, there exists optimal policy choosing next measurement for the former observations. Though optimal measurement policy is actually dependent on the goal of measurement, we mainly focus on retrieving complete data, so called as imputation. Also, we adapt the… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

  50. arXiv:2203.07554  [pdf, other

    cs.RO cs.AI eess.SY

    Agile Maneuvers in Legged Robots: a Predictive Control Approach

    Authors: Carlos Mastalli, Wolfgang Merkt, Guiyang Xin, Jaehyun Shim, Michael Mistry, Ioannis Havoutis, Sethu Vijayakumar

    Abstract: Planning and execution of agile locomotion maneuvers have been a longstanding challenge in legged robotics. It requires to derive motion plans and local feedback policies in real-time to handle the nonholonomy of the kinetic momenta. To achieve so, we propose a hybrid predictive controller that considers the robot's actuation limits and full-body dynamics. It combines the feedback policies with ta… ▽ More

    Submitted 18 July, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: 20 pages, 16 figures