Skip to main content

Showing 1–27 of 27 results for author: Seong, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.18947  [pdf, other

    cs.LG cs.RO

    Self-Supervised Interpretable End-to-End Learning via Latent Functional Modularity

    Authors: Hyunki Seong, David Hyunchul Shim

    Abstract: We introduce MoNet, a novel functionally modular network for self-supervised and interpretable end-to-end learning. By leveraging its functional modularity with a latent-guided contrastive loss function, MoNet efficiently learns task-specific decision-making processes in latent space without requiring task-level supervision. Moreover, our method incorporates an online, post-hoc explainability appr… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 February, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures. Accepted at ICML 2024. Camera-ready version

  2. arXiv:2403.16664  [pdf, other

    cs.RO

    Skill Q-Network: Learning Adaptive Skill Ensemble for Mapless Navigation in Unknown Environments

    Authors: Hyunki Seong, David Hyunchul Shim

    Abstract: This paper focuses on the acquisition of mapless navigation skills within unknown environments. We introduce the Skill Q-Network (SQN), a novel reinforcement learning method featuring an adaptive skill ensemble mechanism. Unlike existing methods, our model concurrently learns a high-level skill decision process alongside multiple low-level navigation skills, all without the need for prior knowledg… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

  3. arXiv:2312.15894  [pdf, other

    cs.CV

    Task-Disruptive Background Suppression for Few-Shot Segmentation

    Authors: Suho Park, SuBeen Lee, Sangeek Hyun, Hyun Seok Seong, Jae-Pil Heo

    Abstract: Few-shot segmentation aims to accurately segment novel target objects within query images using only a limited number of annotated support images. The recent works exploit support background as well as its foreground to precisely compute the dense correlations between query and support. However, they overlook the characteristics of the background that generally contains various types of objects. I… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  4. arXiv:2309.08397  [pdf, other

    cs.RO

    Topological Exploration using Segmented Map with Keyframe Contribution in Subterranean Environments

    Authors: Boseong Kim, Hyunki Seong, D. Hyunchul Shim

    Abstract: Existing exploration algorithms mainly generate frontiers using random sampling or motion primitive methods within a specific sensor range or search space. However, frontiers generated within constrained spaces lead to back-and-forth maneuvers in large-scale environments, thereby diminishing exploration efficiency. To address this issue, we propose a method that utilizes a 3D dense map to generate… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 7 pages, 8 figures

  5. arXiv:2308.03257  [pdf, other

    cs.RO cs.AI

    TempFuser: Learning Agile, Tactical, and Acrobatic Flight Maneuvers Using a Long Short-Term Temporal Fusion Transformer

    Authors: Hyunki Seong, David Hyunchul Shim

    Abstract: Dogfighting is a challenging scenario in aerial applications that requires a comprehensive understanding of both strategic maneuvers and the aerodynamics of agile aircraft. The aerial agent needs to not only understand tactically evolving maneuvers of fighter jets from a long-term perspective but also react to rapidly changing aerodynamics of aircraft from a short-term viewpoint. In this paper, we… ▽ More

    Submitted 5 June, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: 8 pages, 7 figures

  6. arXiv:2308.00093  [pdf, other

    cs.CV

    Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification

    Authors: SuBeen Lee, WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

    Abstract: The difficulty of the fine-grained image classification mainly comes from a shared overall appearance across classes. Thus, recognizing discriminative details, such as eyes and beaks for birds, is a key in the task. However, this is particularly challenging when training data is limited. To address this, we propose Task Discrepancy Maximization (TDM), a task-oriented channel attention method tailo… ▽ More

    Submitted 28 July, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2207.01376

  7. arXiv:2307.04422  [pdf, other

    cs.RO eess.SY

    A Versatile Door Opening System with Mobile Manipulator through Adaptive Position-Force Control and Reinforcement Learning

    Authors: Gyuree Kang, Hyunki Seong, Daegyu Lee, D. Hyunchul Shim

    Abstract: The ability of robots to navigate through doors is crucial for their effective operation in indoor environments. Consequently, extensive research has been conducted to develop robots capable of opening specific doors. However, the diverse combinations of door handles and opening directions necessitate a more versatile door opening system for robots to successfully operate in real-world environment… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  8. arXiv:2303.15014  [pdf, other

    cs.CV

    Leveraging Hidden Positives for Unsupervised Semantic Segmentation

    Authors: Hyun Seok Seong, WonJun Moon, SuBeen Lee, Jae-Pil Heo

    Abstract: Dramatic demand for manpower to label pixel-level annotations triggered the advent of unsupervised semantic segmentation. Although the recent work employing the vision transformer (ViT) backbone shows exceptional performance, there is still a lack of consideration for task-specific training guidance and local semantic consistency. To tackle these issues, we leverage contrastive learning by excavat… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  9. arXiv:2303.09463  [pdf, other

    cs.RO eess.SY

    An Autonomous System for Head-to-Head Race: Design, Implementation and Analysis; Team KAIST at the Indy Autonomous Challenge

    Authors: Chanyoung Jung, Andrea Finazzi, Hyunki Seong, Daegyu Lee, Seungwook Lee, Bosung Kim, Gyuri Gang, Seungil Han, David Hyunchul Shim

    Abstract: While the majority of autonomous driving research has concentrated on everyday driving scenarios, further safety and performance improvements of autonomous vehicles require a focus on extreme driving conditions. In this context, autonomous racing is a new area of research that has been attracting considerable interest recently. Due to the fact that a vehicle is driven by its perception, planning,… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 35 pages, 31 figures, 5 tables, Field Robotics (accepted)

  10. arXiv:2301.04685  [pdf, other

    cs.CV

    SHUNIT: Style Harmonization for Unpaired Image-to-Image Translation

    Authors: Seokbeom Song, Suhyeon Lee, Hongje Seong, Kyoungwon Min, Euntai Kim

    Abstract: We propose a novel solution for unpaired image-to-image (I2I) translation. To translate complex images with a wide range of objects to a different domain, recent approaches often use the object annotations to perform per-class source-to-target style map**. However, there remains a point for us to exploit in the I2I. An object in each class consists of multiple components, and all the sub-object… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2023

  11. arXiv:2301.01470  [pdf, other

    cs.RO cs.LG eess.SY

    Model Parameter Identification via a Hyperparameter Optimization Scheme for Autonomous Racing Systems

    Authors: Hyunki Seong, Chanyoung Chung, David Hyunchul Shim

    Abstract: In this letter, we propose a model parameter identification method via a hyperparameter optimization scheme (MI-HPO). Our method adopts an efficient explore-exploit strategy to identify the parameters of dynamic models in a data-driven optimization manner. We utilize our method for model parameter identification of the AV-21, a full-scaled autonomous race vehicle. We then incorporate the optimized… ▽ More

    Submitted 6 August, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 6 pages, 8 figures. Published in IEEE Control Systems Letters (L-CSS)

    Journal ref: IEEE Control Systems Letters, vol. 7, pp. 1652-1657, 2023

  12. arXiv:2211.13471  [pdf, other

    cs.CV

    Minority-Oriented Vicinity Expansion with Attentive Aggregation for Video Long-Tailed Recognition

    Authors: WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

    Abstract: A dramatic increase in real-world video volume with extremely diverse and emerging topics naturally forms a long-tailed video distribution in terms of their categories, and it spotlights the need for Video Long-Tailed Recognition (VLTR). In this work, we summarize the challenges in VLTR and explore how to overcome them. The challenges are: (1) it is impractical to re-train the whole model for high… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI 2023. Code is available at https://github.com/wjun0830/MOVE

  13. arXiv:2211.02307  [pdf, other

    cs.CV

    Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing

    Authors: Kyusik Cho, Suhyeon Lee, Hongje Seong, Euntai Kim

    Abstract: The network trained for domain adaptation is prone to bias toward the easy-to-transfer classes. Since the ground truth label on the target domain is unavailable during training, the bias problem leads to skewed predictions, forgetting to predict hard-to-transfer classes. To address this problem, we propose Cross-domain Moving Object Mixing (CMOM) that cuts several objects, including hard-to-transf… ▽ More

    Submitted 27 January, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: WACV 2023

  14. arXiv:2211.01213  [pdf, other

    cs.IT cs.DC cs.LG cs.NI cs.SI

    FiFo: Fishbone Forwarding in Massive IoT Networks

    Authors: Hayoung Seong, Junseon Kim, Won-Yong Shin, Howon Lee

    Abstract: Massive Internet of Things (IoT) networks have a wide range of applications, including but not limited to the rapid delivery of emergency and disaster messages. Although various benchmark algorithms have been developed to date for message delivery in such applications, they pose several practical challenges such as insufficient network coverage and/or highly redundant transmissions to expand the c… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 13 pages, 16 figures, 5 tables; to appear in the IEEE Internet of Things Journal (Please cite our journal version that will appear in an upcoming issue.)

  15. arXiv:2210.17302  [pdf, other

    cs.RO eess.SY

    Design, Field Evaluation, and Traffic Analysis of a Competitive Autonomous Driving Model in a Congested Environment

    Authors: Daegyu Lee, Hyunki Seong, Seungil Han, Gyuree Kang, D. Hyunchul Shim, Yoon** Yoon

    Abstract: Recently, numerous studies have investigated cooperative traffic systems using the communication among vehicle-to-everything (V2X). Unfortunately, when multiple autonomous vehicles are deployed while exposed to communication failure, there might be a conflict of ideal conditions between various autonomous vehicles leading to adversarial situation on the roads. In South Korea, virtual and real-worl… ▽ More

    Submitted 6 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

  16. arXiv:2207.13353  [pdf, other

    cs.CV

    One-Trimap Video Matting

    Authors: Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, Joon-Young Lee

    Abstract: Recent studies made great progress in video matting by extending the success of trimap-based image matting to the video domain. In this paper, we push this task toward a more practical setting and propose One-Trimap Video Matting network (OTVM) that performs video matting robustly using only one user-annotated trimap. A key of OTVM is the joint modeling of trimap propagation and alpha prediction.… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  17. arXiv:2207.12232  [pdf, other

    cs.RO eess.SY

    A Resilient Navigation and Path Planning System for High-speed Autonomous Race Car

    Authors: Daegyu Lee, Chanyoung Jung, Andrea Finazzi, Hyunki Seong, D. Hyunchul Shim

    Abstract: This paper describes a resilient navigation and planning system used in the Indy Autonomous Challenge (IAC) competition. The IAC is a competition where full-scale race cars run autonomously on Indianapolis Motor Speedway(IMS) up to 290 km/h (180 mph). Race cars will experience severe vibrations. Especially at high speeds. These vibrations can degrade standard localization algorithms based on preci… ▽ More

    Submitted 15 September, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

  18. arXiv:2207.10024  [pdf, other

    cs.CV

    Difficulty-Aware Simulator for Open Set Recognition

    Authors: WonJun Moon, Junho Park, Hyun Seok Seong, Cheol-Ho Cho, Jae-Pil Heo

    Abstract: Open set recognition (OSR) assumes unknown instances appear out of the blue at the inference time. The main challenge of OSR is that the response of models for unknowns is totally unpredictable. Furthermore, the diversity of open set makes it harder since instances have different difficulty levels. Therefore, we present a novel framework, DIfficulty-Aware Simulator (DIAS), that generates fakes wit… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022. Code is available at github.com/wjun0830/Difficulty-Aware-Simulator

  19. arXiv:2204.01458  [pdf, other

    cs.CV

    Correlation Verification for Image Retrieval

    Authors: Seongwon Lee, Hongje Seong, Suhyeon Lee, Euntai Kim

    Abstract: Geometric verification is considered a de facto solution for the re-ranking task in image retrieval. In this study, we propose a novel image retrieval re-ranking network named Correlation Verification Networks (CVNet). Our proposed network, comprising deeply stacked 4D convolutional layers, gradually compresses dense feature correlation into image similarity while learning diverse geometric matchi… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022 (Oral Presentation)

  20. arXiv:2204.01446  [pdf, other

    cs.CV

    WildNet: Learning Domain Generalized Semantic Segmentation from the Wild

    Authors: Suhyeon Lee, Hongje Seong, Seongwon Lee, Euntai Kim

    Abstract: We present a new domain generalized semantic segmentation network named WildNet, which learns domain-generalized features by leveraging a variety of contents and styles from the wild. In domain generalization, the low generalization ability for unseen target domains is clearly due to overfitting to the source domain. To address this problem, previous works have focused on generalizing the domain b… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022

  21. arXiv:2112.12402  [pdf, other

    cs.CV

    Iteratively Selecting an Easy Reference Frame Makes Unsupervised Video Object Segmentation Easier

    Authors: Youngjo Lee, Hongje Seong, Euntai Kim

    Abstract: Unsupervised video object segmentation (UVOS) is a per-pixel binary labeling problem which aims at separating the foreground object from the background in the video without using the ground truth (GT) mask of the foreground object. Most of the previous UVOS models use the first frame or the entire video as a reference frame to specify the mask of the foreground object. Our question is why the firs… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: Accepted to AAAI 2022

  22. arXiv:2109.11404  [pdf, other

    cs.CV

    Hierarchical Memory Matching Network for Video Object Segmentation

    Authors: Hongje Seong, Seoung Wug Oh, Joon-Young Lee, Seongwon Lee, Suhyeon Lee, Euntai Kim

    Abstract: We present Hierarchical Memory Matching Network (HMMN) for semi-supervised video object segmentation. Based on a recent memory-based method [33], we propose two advanced memory read modules that enable us to perform memory reading in multiple scales while exploiting temporal smoothness. We first propose a kernel guided memory matching module that replaces the non-local dense memory read, commonly… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: Accepted to ICCV 2021

  23. arXiv:2106.04094  [pdf, other

    cs.RO

    Game-Theoretic Model Predictive Control with Data-Driven Identification of Vehicle Model for Head-to-Head Autonomous Racing

    Authors: Chanyoung Jung, Seungwook Lee, Hyunki Seong, Andrea Finazzi, David Hyunchul Shim

    Abstract: Resolving edge-cases in autonomous driving, head-to-head autonomous racing is getting a lot of attention from the industry and academia. In this study, we propose a game-theoretic model predictive control (MPC) approach for head-to-head autonomous racing and data-driven model identification method. For the practical estimation of nonlinear model parameters, we adopted the hyperband algorithm, whic… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: 6 pages, 7 figures, ICRA workshop on Opportunities and Challenges with Autonomous Racing, 31 May, 2021(accepted)

    ACM Class: J.7.1

  24. arXiv:2012.12545  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation for Semantic Segmentation by Content Transfer

    Authors: Suhyeon Lee, Junhyuk Hyun, Hongje Seong, Euntai Kim

    Abstract: In this paper, we tackle the unsupervised domain adaptation (UDA) for semantic segmentation, which aims to segment the unlabeled real data using labeled synthetic data. The main problem of UDA for semantic segmentation relies on reducing the domain gap between the real image and synthetic image. To solve this problem, we focused on separating information in an image into content and style. Here, o… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.

    Comments: Accepted to AAAI 2021

  25. arXiv:2007.08270  [pdf, other

    cs.CV

    Kernelized Memory Network for Video Object Segmentation

    Authors: Hongje Seong, Junhyuk Hyun, Euntai Kim

    Abstract: Semi-supervised video object segmentation (VOS) is a task that involves predicting a target object in a video when the ground truth segmentation mask of the target object is given in the first frame. Recently, space-time memory networks (STM) have received significant attention as a promising solution for semi-supervised VOS. However, an important point is overlooked when applying STM to VOS. The… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted to ECCV 2020

  26. arXiv:1907.11440  [pdf, other

    cs.CV

    Universal Pooling -- A New Pooling Method for Convolutional Neural Networks

    Authors: Junhyuk Hyun, Hongje Seong, Euntai Kim

    Abstract: Pooling is one of the main elements in convolutional neural networks. The pooling reduces the size of the feature map, enabling training and testing with a limited amount of computation. This paper proposes a new pooling method named universal pooling. Unlike the existing pooling methods such as average pooling, max pooling, and stride pooling with fixed pooling function, universal pooling generat… ▽ More

    Submitted 26 July, 2019; originally announced July 2019.

  27. arXiv:1907.07570  [pdf, other

    cs.CV

    FOSNet: An End-to-End Trainable Deep Neural Network for Scene Recognition

    Authors: Hongje Seong, Junhyuk Hyun, Euntai Kim

    Abstract: Scene recognition is an image recognition problem aimed at predicting the category of the place at which the image is taken. In this paper, a new scene recognition method using the convolutional neural network (CNN) is proposed. The proposed method is based on the fusion of the object and the scene information in the given image and the CNN framework is named as FOS (fusion of object and scene) Ne… ▽ More

    Submitted 18 July, 2019; v1 submitted 17 July, 2019; originally announced July 2019.

    Comments: 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works