Skip to main content

Showing 1–50 of 81 results for author: Jung, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01073  [pdf, other

    cs.RO

    No More Potentially Dynamic Objects: Static Point Cloud Map Generation based on 3D Object Detection and Ground Projection

    Authors: Soo** Woo, Donghwi Jung, Seong-Woo Kim

    Abstract: In this paper, we propose an algorithm to generate a static point cloud map based on LiDAR point cloud data. Our proposed pipeline detects dynamic objects using 3D object detectors and projects points of dynamic objects onto the ground. Typically, point cloud data acquired in real-time serves as a snapshot of the surrounding areas containing both static objects and dynamic objects. The static obje… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.00289  [pdf, other

    eess.SY cs.IT

    Personalised Outfit Recommendation via History-aware Transformers

    Authors: David Jung, Julien Monteil, Philip Schulz, Volodymyr Vaskovych

    Abstract: We present the history-aware transformer (HAT), a transformer-based model that uses shoppers' purchase history to personalise outfit predictions. The aim of this work is to recommend outfits that are internally coherent while matching an individual shopper's style and taste. To achieve this, we stack two transformer models, one that produces outfit representations and another one that processes th… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  3. arXiv:2406.19848  [pdf, other

    cs.RO

    3D Operation of Autonomous Excavator based on Reinforcement Learning through Independent Reward for Individual Joints

    Authors: Yoonkyu Yoo, Donghwi Jung, Seong-Woo Kim

    Abstract: In this paper, we propose a control algorithm based on reinforcement learning, employing independent rewards for each joint to control excavators in a 3D space. The aim of this research is to address the challenges associated with achieving precise control of excavators, which are extensively utilized in construction sites but prove challenging to control with precision due to their hydraulic stru… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  4. arXiv:2406.17869  [pdf, other

    cs.CV

    Burst Image Super-Resolution with Base Frame Selection

    Authors: Sanghyun Kim, Min Jung Lee, Woohyeok Kim, Deunsol Jung, Jaesung Rim, Sunghyun Cho, Minsu Cho

    Abstract: Burst image super-resolution has been a topic of active research in recent years due to its ability to obtain a high-resolution image by using complementary information between multiple frames in the burst. In this work, we explore using burst shots with non-uniform exposures to confront real-world practical scenarios by introducing a new benchmark dataset, dubbed Non-uniformly Exposed Burst Image… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: CVPR2024W NTIRE accepted

  5. arXiv:2406.17256  [pdf, other

    cs.CV

    Disentangled Motion Modeling for Video Frame Interpolation

    Authors: Jaihyun Lew, Jooyoung Choi, Chaehun Shin, Dahuin Jung, Sungroh Yoon

    Abstract: Video frame interpolation (VFI) aims to synthesize intermediate frames in between existing frames to enhance visual smoothness and quality. Beyond the conventional methods based on the reconstruction loss, recent works employ the high quality generative models for perceptual quality. However, they require complex training and large computational cost for modeling on the pixel space. In this paper,… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  6. arXiv:2404.04819  [pdf, other

    cs.CV

    Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer

    Authors: Hyeong** Nam, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Human-object contact serves as a strong cue to understand how humans physically interact with objects. Nevertheless, it is not widely explored to utilize human-object contact information for the joint reconstruction of 3D human and object from a single image. In this work, we present a novel joint 3D human-object reconstruction method (CONTHO) that effectively exploits contact information between… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Published at CVPR 2024, 19 pages including the supplementary material

  7. arXiv:2404.00450  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Planning and Editing What You Retrieve for Enhanced Tool Learning

    Authors: Tenghao Huang, Dongwon Jung, Muhao Chen

    Abstract: Recent advancements in integrating external tools with Large Language Models (LLMs) have opened new frontiers, with applications in mathematical reasoning, code generators, and smart assistants. However, existing methods, relying on simple one-time retrieval strategies, fall short on effectively and accurately shortlisting relevant tools. This paper introduces a novel PLUTO (Planning, Learning, an… ▽ More

    Submitted 4 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: This paper is accepted at NAACL-Findings 2024

  8. arXiv:2403.10911  [pdf, other

    cs.CV

    Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation

    Authors: Yeongtak Oh, Jonghyun Lee, Jooyoung Choi, Dahuin Jung, Uiwon Hwang, Sungroh Yoon

    Abstract: Test-time adaptation (TTA) addresses the unforeseen distribution shifts occurring during test time. In TTA, both performance and, memory and time consumption serve as crucial considerations. A recent diffusion-based TTA approach for restoring corrupted images involves image-level updates. However, using pixel space diffusion significantly increases resource requirements compared to conventional mo… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  9. arXiv:2403.09055  [pdf, other

    cs.CV

    StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control

    Authors: Jaerin Lee, Daniel Sungho Jung, Kanggeon Lee, Kyoung Mu Lee

    Abstract: The enormous success of diffusion models in text-to-image synthesis has made them promising candidates for the next generation of end-user applications for image generation and editing. Previous works have focused on improving the usability of diffusion models by reducing the inference time or increasing user interactivity by allowing new, fine-grained controls such as region-based text prompts. H… ▽ More

    Submitted 1 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 29 pages, 16 figures. v2: typos corrected, references added. Project page: https://jaerinlee.com/research/StreamMultiDiffusion

  10. arXiv:2403.07366  [pdf, other

    cs.CV cs.LG

    Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors

    Authors: Jonghyun Lee, Dahuin Jung, Saehyung Lee, Junsung Park, Juhyeon Shin, Uiwon Hwang, Sungroh Yoon

    Abstract: Test-time adaptation (TTA) fine-tunes pre-trained deep neural networks for unseen test data. The primary challenge of TTA is limited access to the entire test dataset during online updates, causing error accumulation. To mitigate it, TTA methods have utilized the model output's entropy as a confidence metric that aims to determine which samples have a lower likelihood of causing error. Through exp… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 Spotlight; 26 pages, 9 figures, 20 tables;

  11. arXiv:2402.04535  [pdf, other

    cs.RO

    MuNES: Multifloor Navigation Including Elevators and Stairs

    Authors: Donghwi Jung, Chan Kim, Jae-Kyung Cho, Seong-Woo Kim

    Abstract: We propose a scheme called MuNES for single map** and trajectory planning including elevators and stairs. Optimized multifloor trajectories are important for optimal interfloor movements of robots. However, given two or more options of moving between floors, it is difficult to select the best trajectory because there are no suitable indoor multifloor maps in the existing methods. To solve this p… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  12. arXiv:2401.14616  [pdf, other

    cs.CL cs.AI

    Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse

    Authors: Seungyoon Lee, Dahyun Jung, Chanjun Park, Seolhwa Lee, Heuiseok Lim

    Abstract: We introduce the concept of "Alternative Speech" as a new way to directly combat hate speech and complement the limitations of counter-narrative. An alternative speech provides practical alternatives to hate speech in real-world scenarios by offering speech-level corrections to speakers while considering the surrounding context and promoting speakers to reform. Further, an alternative speech can c… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted for The First Workshop on Data-Centric AI (DCAI) at ICDM 2023

  13. arXiv:2401.04143  [pdf, other

    cs.CV

    RHOBIN Challenge: Reconstruction of Human Object Interaction

    Authors: Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeong** Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

    Abstract: Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 tables, 7 figure. Technical report of the CVPR'23 workshop: RHOBIN challenge (https://rhobin-challenge.github.io/)

  14. arXiv:2312.15924  [pdf, other

    cs.IT eess.SP

    Modeling and Analysis of GEO Satellite Networks

    Authors: Dong-Hyun Jung, Hongjae Nam, Junil Choi, David J. Love

    Abstract: The extensive coverage offered by satellites makes them effective in enhancing service continuity for users on dynamic airborne and maritime platforms, such as airplanes and ships. In particular, geosynchronous Earth orbit (GEO) satellites ensure stable connectivity for terrestrial users due to their stationary characteristics when observed from Earth. This paper introduces a novel approach to mod… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 12 pages, 9 figures, submitted to IEEE Transactions on Wireless Communications

  15. arXiv:2312.04266  [pdf, other

    cs.CV

    Activity Grammars for Temporal Action Segmentation

    Authors: Dayoung Gong, Joonseok Lee, Deunsol Jung, Suha Kwak, Minsu Cho

    Abstract: Sequence prediction on temporal data requires the ability to understand compositional structures of multi-level semantics beyond individual and contextual properties. The task of temporal action segmentation, which aims at translating an untrimmed activity video into a sequence of action segments, remains challenging for this reason. This paper addresses the problem by introducing an effective act… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted to NeurIPS 2023

  16. arXiv:2311.15890  [pdf, other

    cs.LG cs.CV

    Stability-Informed Initialization of Neural Ordinary Differential Equations

    Authors: Theodor Westny, Arman Mohammadi, Daniel Jung, Erik Frisk

    Abstract: This paper addresses the training of Neural Ordinary Differential Equations (neural ODEs), and in particular explores the interplay between numerical integration techniques, stability regions, step size, and initialization techniques. It is shown how the choice of integration technique implicitly regularizes the learned model, and how the solver's corresponding stability region affects training an… ▽ More

    Submitted 1 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

  17. arXiv:2310.16492  [pdf, other

    cs.CV cs.AI cs.LG

    On the Powerfulness of Textual Outlier Exposure for Visual OoD Detection

    Authors: Sangha Park, Jisoo Mok, Dahuin Jung, Saehyung Lee, Sungroh Yoon

    Abstract: Successful detection of Out-of-Distribution (OoD) data is becoming increasingly important to ensure safe deployment of neural networks. One of the main challenges in OoD detection is that neural networks output overconfident predictions on OoD data, make it difficult to determine OoD-ness of data solely based on their predictions. Outlier exposure addresses this issue by introducing an additional… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  18. arXiv:2310.10088  [pdf, other

    eess.IV cs.CV cs.LG

    PUCA: Patch-Unshuffle and Channel Attention for Enhanced Self-Supervised Image Denoising

    Authors: Hyemi Jang, Junsung Park, Dahuin Jung, Jaihyun Lew, Ho Bae, Sungroh Yoon

    Abstract: Although supervised image denoising networks have shown remarkable performance on synthesized noisy images, they often fail in practice due to the difference between real and synthesized noise. Since clean-noisy image pairs from the real world are extremely costly to gather, self-supervised learning, which utilizes noisy input itself as a target, has been studied. To prevent a self-supervised deno… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  19. arXiv:2309.01943  [pdf, other

    cs.CV

    Extract-and-Adaptation Network for 3D Interacting Hand Mesh Recovery

    Authors: JoonKyu Park, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: Understanding how two hands interact with each other is a key component of accurate 3D interacting hand mesh recovery. However, recent Transformer-based methods struggle to learn the interaction between two hands as they directly utilize two hand features as input tokens, which results in distant token problem. The distant token problem represents that input tokens are in heterogeneous spaces, lea… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted at ICCVW 2023

  20. arXiv:2308.06554  [pdf, other

    cs.CV

    Cyclic Test-Time Adaptation on Monocular Video for 3D Human Mesh Reconstruction

    Authors: Hyeong** Nam, Daniel Sungho Jung, Yeonguk Oh, Kyoung Mu Lee

    Abstract: Despite recent advances in 3D human mesh reconstruction, domain gap between training and test data is still a major challenge. Several prior works tackle the domain gap problem via test-time adaptation that fine-tunes a network relying on 2D evidence (e.g., 2D human keypoints) from test images. However, the high reliance on 2D evidence during adaptation causes two major issues. First, 2D evidence… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Published at ICCV 2023, 16 pages including the supplementary material

  21. arXiv:2307.14856  [pdf, other

    cs.CL cs.AI

    Exploiting the Potential of Seq2Seq Models as Robust Few-Shot Learners

    Authors: Jihyeon Lee, Dain Kim, Doohae Jung, Boseop Kim, Kyoung-Woon On

    Abstract: In-context learning, which offers substantial advantages over fine-tuning, is predominantly observed in decoder-only models, while encoder-decoder (i.e., seq2seq) models excel in methods that rely on weight updates. Recently, a few studies have demonstrated the feasibility of few-shot learning with seq2seq models; however, this has been limited to tasks that align well with the seq2seq architectur… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  22. arXiv:2306.14470  [pdf, other

    cs.CL cs.AI

    Knowledge Graph-Augmented Korean Generative Commonsense Reasoning

    Authors: Dahyun Jung, Jaehyung Seo, Jaewook Lee, Chanjun Park, Heuiseok Lim

    Abstract: Generative commonsense reasoning refers to the task of generating acceptable and logical assumptions about everyday situations based on commonsense understanding. By utilizing an existing dataset such as Korean CommonGen, language generation models can learn commonsense reasoning specific to the Korean language. However, language models often fail to consider the relationships between concepts and… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted for Data-centric Machine Learning Research (DMLR) Workshop at ICML 2023

  23. arXiv:2306.05067  [pdf, other

    cs.LG cs.AI cs.CV

    Improving Visual Prompt Tuning for Self-supervised Vision Transformers

    Authors: Seungryong Yoo, Eunji Kim, Dahuin Jung, Jungbeom Lee, Sungroh Yoon

    Abstract: Visual Prompt Tuning (VPT) is an effective tuning method for adapting pretrained Vision Transformers (ViTs) to downstream tasks. It leverages extra learnable tokens, known as prompts, which steer the frozen pretrained ViTs. Although VPT has demonstrated its applicability with supervised vision transformers, it often underperforms with self-supervised ones. Through empirical observations, we deduce… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  24. arXiv:2306.01574  [pdf, other

    cs.LG cs.AI cs.CV

    Probabilistic Concept Bottleneck Models

    Authors: Eunji Kim, Dahuin Jung, Sangha Park, Siwon Kim, Sungroh Yoon

    Abstract: Interpretable models are designed to make decisions in a human-interpretable manner. Representatively, Concept Bottleneck Models (CBM) follow a two-step process of concept prediction and class prediction based on the predicted concepts. CBM provides explanations with high-level concepts derived from concept predictions; thus, reliable concept predictions are important for trustworthiness. In this… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  25. arXiv:2305.18726  [pdf, other

    cs.CV

    Diffusion-Stego: Training-free Diffusion Generative Steganography via Message Projection

    Authors: Daegyu Kim, Chaehun Shin, Jooyoung Choi, Dahuin Jung, Sungroh Yoon

    Abstract: Generative steganography is the process of hiding secret messages in generated images instead of cover images. Existing studies on generative steganography use GAN or Flow models to obtain high hiding message capacity and anti-detection ability over cover images. However, they create relatively unrealistic stego images because of the inherent limitations of generative models. We propose Diffusion-… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  26. arXiv:2305.04670  [pdf, other

    cs.LG

    Analysis of Numerical Integration in RNN-Based Residuals for Fault Diagnosis of Dynamic Systems

    Authors: Arman Mohammadi, Theodor Westny, Daniel Jung, Mattias Krysander

    Abstract: Data-driven modeling and machine learning are widely used to model the behavior of dynamic systems. One application is the residual evaluation of technical systems where model predictions are compared with measurement data to create residuals for fault diagnosis applications. While recurrent neural network models have been shown capable of modeling complex non-linear dynamic systems, they are limi… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  27. arXiv:2305.01955  [pdf, other

    cs.IT eess.SP

    Satellite Clusters Flying in Formation: Orbital Configuration-Dependent Performance Analyses

    Authors: Dong-Hyun Jung, Joon-Gyu Ryu, Junil Choi

    Abstract: This paper considers a downlink satellite communication system where a satellite cluster, i.e., a satellite swarm consisting of one leader and multiple follower satellites, serves a ground terminal. The satellites in the cluster form either a linear or circular formation moving in a group and cooperatively send their signals by maximum ratio transmission precoding. We first conduct a coordinate tr… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: 5 pages, 5 figures, submitted to IEEE Transactions on Vehicular Technology

  28. arXiv:2304.04997  [pdf, other

    cs.CV cs.AI

    Relational Context Learning for Human-Object Interaction Detection

    Authors: Sanghyun Kim, Deunsol Jung, Minsu Cho

    Abstract: Recent state-of-the-art methods for HOI detection typically build on transformer architectures with two decoder branches, one for human-object pair detection and the other for interaction classification. Such disentangled transformers, however, may suffer from insufficient context exchange between the branches and lead to a lack of context information for relational reasoning, which is critical in… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: accepted to CVPR2023

  29. arXiv:2304.03495  [pdf, other

    cs.CV

    Devil's on the Edges: Selective Quad Attention for Scene Graph Generation

    Authors: Deunsol Jung, Sanghyun Kim, Won Hwa Kim, Minsu Cho

    Abstract: Scene graph generation aims to construct a semantic graph structure from an image such that its nodes and edges respectively represent objects and their relationships. One of the major challenges for the task lies in the presence of distracting objects and relationships in images; contextual reasoning is strongly distracted by irrelevant objects or backgrounds and, more importantly, a vast number… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR 2023; Project page at https://cvlab.postech.ac.kr/research/SQUAT/

  30. arXiv:2303.15060  [pdf, other

    cs.CV

    TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering

    Authors: Jaehoon Choi, Dongki Jung, Taejae Lee, Sangwook Kim, Youngdong Jung, Dinesh Manocha, Donghwan Lee

    Abstract: We present a new pipeline for acquiring a textured mesh in the wild with a single smartphone which offers access to images, depth maps, and valid poses. Our method first introduces an RGBD-aided structure from motion, which can yield filtered depth maps and refines camera poses guided by corresponding depth. Then, we adopt the neural implicit surface reconstruction method, which allows for high-qu… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR23. Project Page: https://jh-choi.github.io/TMO/

  31. arXiv:2303.11853  [pdf, other

    cs.RO cs.AI

    LoRCoN-LO: Long-term Recurrent Convolutional Network-based LiDAR Odometry

    Authors: Donghwi Jung, Jae-Kyung Cho, Younghwa Jung, Soohyun Shin, Seong-Woo Kim

    Abstract: We propose a deep learning-based LiDAR odometry estimation method called LoRCoN-LO that utilizes the long-term recurrent convolutional network (LRCN) structure. The LRCN layer is a structure that can process spatial and temporal information at once by using both CNN and LSTM layers. This feature is suitable for predicting continuous robot movements as it uses point clouds that contain spatial info… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 4 pages, ICEIC 2023

  32. arXiv:2303.07846  [pdf, other

    cs.LG cs.AI

    Sample-efficient Adversarial Imitation Learning

    Authors: Dahuin Jung, Hyungyu Lee, Sungroh Yoon

    Abstract: Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods still require numerous expert demonstration samples to successfully imitate an expert's behavior. To improve sample efficiency, we utilize self-supervised representation learning, wh… ▽ More

    Submitted 23 January, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Published at JMLR (Journal of Machine Learning Research), A preliminary version of this manuscript was presented at Deep RL Workshop, NeurIPS 2022

  33. arXiv:2302.08741  [pdf, other

    cs.CV cs.AI cs.LG

    New Insights for the Stability-Plasticity Dilemma in Online Continual Learning

    Authors: Dahuin Jung, Dong** Lee, Sunwon Hong, Hyemi Jang, Ho Bae, Sungroh Yoon

    Abstract: The aim of continual learning is to learn new tasks continuously (i.e., plasticity) without forgetting previously learned knowledge from old tasks (i.e., stability). In the scenario of online continual learning, wherein data comes strictly in a streaming manner, the plasticity of online continual learning is more vulnerable than offline continual learning because the training signal that can be ob… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted to ICLR2023

  34. Satellite Clustering for Non-Terrestrial Networks: Concept, Architectures, and Applications

    Authors: Dong-Hyun Jung, Gyeongrae Im, Joon-Gyu Ryu, Seungkeun Park, Heejung Yu, Junil Choi

    Abstract: Recently, mega-constellations with a massive number of low Earth orbit (LEO) satellites are being considered as a possible solution for providing global coverage due to relatively low latency and high throughput compared to geosynchronous orbit satellites. However, as the number of satellites and operators participating in the LEO constellation increases, inter-satellite interference will become m… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: 7 pages, 7 figures, 1 table, submitted to IEEE Vehicular Technology Magazine

    Journal ref: IEEE Vehicular Technology Magazine, vol. 18, no. 3, pp. 29-37, Sep. 2023

  35. arXiv:2211.07116  [pdf, other

    cs.CV

    Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval

    Authors: Deunsol Jung, Dahyun Kang, Suha Kwak, Minsu Cho

    Abstract: Metric learning aims to build a distance metric typically by learning an effective embedding function that maps similar objects into nearby points in its embedding space. Despite recent advances in deep metric learning, it remains challenging for the learned metric to generalize to unseen classes with a substantial domain gap. To tackle the issue, we explore a new problem of few-shot metric learni… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted at ACCV 2022

  36. arXiv:2211.06390  [pdf, other

    cs.AR

    The BlackParrot BedRock Cache Coherence System

    Authors: Mark Wyse, Daniel Petrisko, Farzam Gilani, Yuan-Mao Chueh, Paul Gao, Dai Cheol Jung, Sripathi Muralitharan, Shashank Vijaya Ranga, Mark Oskin, Michael Taylor

    Abstract: This paper presents BP-BedRock, the open-source cache coherence protocol and system implemented within the BlackParrot 64-bit RISC-V multicore processor. BP-BedRock implements the BedRock directory-based MOESIF cache coherence protocol and includes two different open-source coherence protocol engines, one FSM-based and the other microcode programmable. Both coherence engines support coherent uncac… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  37. arXiv:2210.14226  [pdf, other

    cs.LG cs.AI cs.DC

    FedClassAvg: Local Representation Learning for Personalized Federated Learning on Heterogeneous Neural Networks

    Authors: Jaehee Jang, Heonseok Ha, Dahuin Jung, Sungroh Yoon

    Abstract: Personalized federated learning is aimed at allowing numerous clients to train personalized models while participating in collaborative training in a communication-efficient manner without exchanging private data. However, many personalized federated learning algorithms assume that clients have the same neural network architecture, and those for heterogeneous models remain understudied. In this st… ▽ More

    Submitted 26 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to ICPP 2022. Code: https://github.com/hukla/fedclassavg

  38. When Satellites Work as Eavesdroppers

    Authors: Dong-Hyun Jung, Joon-Gyu Ryu, Junil Choi

    Abstract: This paper considers satellite eavesdroppers in uplink satellite communication systems where the eavesdroppers are randomly distributed at arbitrary altitudes according to homogeneous binomial point processes and attempt to overhear signals that a ground terminal transmits to a serving satellite. Non-colluding eavesdrop** satellites are assumed, i.e., they do not cooperate with each other, so th… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 16 pages, 11 figures, 1 table, accepted by IEEE Transactions on Information Forensics and Security

    Journal ref: IEEE Transactions on Information Forensics and Security, vol. 17, pp. 2784-2799, July 2022

  39. arXiv:2206.07567  [pdf, other

    cs.DC

    A Unifying Approach to Efficient (Near)-Gathering of Disoriented Robots with Limited Visibility

    Authors: Jannik Castenow, Jonas Harbig, Daniel Jung, Peter Kling, Till Knollmann, Friedhelm Meyer auf der Heide

    Abstract: We consider a swarm of $n$ robots in \mathbb{R}^d. The robots are oblivious, disoriented (no common coordinate system/compass), and have limited visibility (observe other robots up to a constant distance). The basic formation task gathering requires that all robots reach the same, not predefined position. In the related near-gathering task, they must reach distinct positions such that every robot… ▽ More

    Submitted 9 September, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

  40. arXiv:2206.06640  [pdf, other

    cs.CV cs.LG

    Confidence Score for Source-Free Unsupervised Domain Adaptation

    Authors: Jonghyun Lee, Dahuin Jung, Junho Yim, Sungroh Yoon

    Abstract: Source-free unsupervised domain adaptation (SFUDA) aims to obtain high performance in the unlabeled target domain using the pre-trained source model, not the source data. Existing SFUDA methods assign the same importance to all target samples, which is vulnerable to incorrect pseudo-labels. To differentiate between sample importance, in this study, we propose a novel sample-wise confidence score,… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: ICML 2022 camera ready

  41. arXiv:2203.10996  [pdf, other

    cs.IR cs.AI cs.MM

    Technologies for AI-Driven Fashion Social Networking Service with E-Commerce

    Authors: **seok Seol, Seongjae Kim, Sungchan Park, Holim Lim, Hyunsoo Na, Eunyoung Park, Dohee Jung, Soyoung Park, Kangwoo Lee, Sang-goo Lee

    Abstract: The rapid growth of the online fashion market brought demands for innovative fashion services and commerce platforms. With the recent success of deep learning, many applications employ AI technologies such as visual search and recommender systems to provide novel and beneficial services. In this paper, we describe applied technologies for AI-driven fashion social networking service that incorporat… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: 16 pages, accepted in International Semantic Intelligence Conference (ISIC) 2022, The Applications and Deployment Track

  42. arXiv:2203.05332  [pdf, other

    cs.CV

    SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning

    Authors: Jaehoon Choi, Dongki Jung, Yonghan Lee, Deokhwa Kim, Dinesh Manocha, Donghwan Lee

    Abstract: Monocular depth estimation in the wild inherently predicts depth up to an unknown scale. To resolve scale ambiguity issue, we present a learning algorithm that leverages monocular simultaneous localization and map** (SLAM) with proprioceptive sensors. Such monocular SLAM systems can provide metrically scaled camera poses. Given these metric poses and monocular sequences, we propose a self-superv… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

  43. arXiv:2110.09286  [pdf, other

    cs.CV eess.SP

    Gait-based Human Identification through Minimum Gait-phases and Sensors

    Authors: Muhammad Zeeshan Arshad, Dawoon Jung, Mina Park, Kyung-Ryoul Mun, **wook Kim

    Abstract: Human identification is one of the most common and critical tasks for condition monitoring, human-machine interaction, and providing assistive services in smart environments. Recently, human gait has gained new attention as a biometric for identification to achieve contactless identification from a distance robust to physical appearances. However, an important aspect of gait identification through… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2021)

    ACM Class: I.2

  44. arXiv:2110.07821  [pdf, other

    cs.CV cs.LG eess.SP

    Gait-based Frailty Assessment using Image Representation of IMU Signals and Deep CNN

    Authors: Muhammad Zeeshan Arshad, Dawoon Jung, Mina Park, Hyungeun Shin, **wook Kim, Kyung-Ryoul Mun

    Abstract: Frailty is a common and critical condition in elderly adults, which may lead to further deterioration of health. However, difficulties and complexities exist in traditional frailty assessments based on activity-related questionnaires. These can be overcome by monitoring the effects of frailty on the gait. In this paper, it is shown that by encoding gait signals as images, deep learning-based model… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2021)

    ACM Class: I.2

  45. arXiv:2110.06383  [pdf, other

    cs.LG

    Real-time Drift Detection on Time-series Data

    Authors: Nandini Ramanan, Rasool Tahmasbi, Marjorie Sayer, Deokwoo Jung, Shalini Hemachandran, Claudionor Nunes Coelho Jr

    Abstract: Practical machine learning applications involving time series data, such as firewall log analysis to proactively detect anomalous behavior, are concerned with real time analysis of streaming data. Consequently, we need to update the ML models as the statistical characteristics of such data may shift frequently with time. One alternative explored in the literature is to retrain models with updated… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: 5 pages, 5 figures

  46. arXiv:2108.05615  [pdf, other

    cs.CV

    DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes

    Authors: Dongki Jung, Jaehoon Choi, Yonghan Lee, Deokhwa Kim, Changick Kim, Dinesh Manocha, Donghwan Lee

    Abstract: We present a novel approach for estimating depth from a monocular camera as it moves through complex and crowded indoor environments, e.g., a department store or a metro station. Our approach predicts absolute scale depth maps over the entire scene consisting of a static background and multiple moving people, by training on dynamic scenes. Since it is difficult to collect dense depth maps from cro… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  47. arXiv:2108.02949  [pdf, other

    cs.LG

    Auxiliary Class Based Multiple Choice Learning

    Authors: Sihwan Kim, Dae Yon Jung, Taejang Park

    Abstract: The merit of ensemble learning lies in having different outputs from many individual models on a single input, i.e., the diversity of the base models. The high quality of diversity can be achieved when each model is specialized to different subsets of the whole dataset. Moreover, when each model explicitly knows to which subsets it is specialized, more opportunities arise to improve diversity. In… ▽ More

    Submitted 7 December, 2021; v1 submitted 6 August, 2021; originally announced August 2021.

  48. arXiv:2106.07473  [pdf, other

    cs.LG

    Time Series Anomaly Detection with label-free Model Selection

    Authors: Deokwoo Jung, Nandini Ramanan, Mehrnaz Amjadi, Sankeerth Rao Karingula, Jake Taylor, Claudionor Nunes Coelho Jr

    Abstract: Anomaly detection for time-series data becomes an essential task for many data-driven applications fueled with an abundance of data and out-of-the-box machine-learning algorithms. In many real-world settings, develo** a reliable anomaly model is highly challenging due to insufficient anomaly labels and the prohibitively expensive cost of obtaining anomaly examples. It imposes a significant bottl… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 11 pages, 1 Figure, 4 tables

  49. arXiv:2106.05319  [pdf, other

    cs.LG stat.ML

    Stein Latent Optimization for Generative Adversarial Networks

    Authors: Uiwon Hwang, Heeseung Kim, Dahuin Jung, Hyemi Jang, Hyungyu Lee, Sungroh Yoon

    Abstract: Generative adversarial networks (GANs) with clustered latent spaces can perform conditional generation in a completely unsupervised manner. In the real world, the salient attributes of unlabeled data can be imbalanced. However, most of existing unsupervised conditional GANs cannot cluster attributes of these data in their latent spaces properly because they assume uniform distributions of the attr… ▽ More

    Submitted 15 March, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: ICLR 2022 camera ready

  50. Performance Analysis of Satellite Communication System Under the Shadowed-Rician Fading: A Stochastic Geometry Approach

    Authors: Dong-Hyun Jung, Joon-Gyu Ryu, Woo-** Byun, Junil Choi

    Abstract: In this paper, we consider downlink low Earth orbit (LEO) satellite communication systems where multiple LEO satellites are uniformly distributed over a sphere at a certain altitude according to a homogeneous binomial point process (BPP). Based on the characteristics of the BPP, we analyze the distance distributions and the distribution cases for the serving satellite. We analytically derive the e… ▽ More

    Submitted 23 June, 2022; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: 15 pages, 14 figures, 1 table, accepted by IEEE Transactions on Communications

    Journal ref: IEEE Transactions on Communications, vol. 70, no. 4, pp. 2707-2721, Apr. 2022