Skip to main content

Showing 1–11 of 11 results for author: Wee, D

.
  1. arXiv:2310.05878  [pdf

    cs.LG

    A Machine Learning Approach to Predicting Single Event Upsets

    Authors: Archit Gupta, Chong Yock Eng, Deon Lim Meng Wee, Rashna Analia Ahmed, See Min Sim

    Abstract: A single event upset (SEU) is a critical soft error that occurs in semiconductor devices on exposure to ionising particles from space environments. SEUs cause bit flips in the memory component of semiconductors. This creates a multitude of safety hazards as stored information becomes less reliable. Currently, SEUs are only detected several hours after their occurrence. CREMER, the model presented… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  2. arXiv:2306.01395  [pdf, other

    cs.CV

    Masked Autoencoder for Unsupervised Video Summarization

    Authors: Minho Shim, Taeoh Kim, **hyung Kim, Dongyoon Wee

    Abstract: Summarizing a video requires a diverse understanding of the video, ranging from recognizing scenes to evaluating how much each frame is essential enough to be selected as a summary. Self-supervised learning (SSL) is acknowledged for its robustness and flexibility to multiple downstream tasks, but the video SSL has not shown its value for dense understanding tasks like video summarization. We claim… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

  3. arXiv:2303.17285  [pdf, other

    cs.CV

    Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection

    Authors: Pilhyeon Lee, Taeoh Kim, Minho Shim, Dongyoon Wee, Hyeran Byun

    Abstract: Temporal action detection aims to predict the time intervals and the classes of action instances in the video. Despite the promising performance, existing two-stream models exhibit slow inference speed due to their reliance on computationally expensive optical flow. In this paper, we introduce a decomposed cross-modal distillation framework to build a strong RGB-based detector by transferring know… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023

  4. arXiv:2303.05835  [pdf, other

    cs.CV

    You Only Train Once: Multi-Identity Free-Viewpoint Neural Human Rendering from Monocular Videos

    Authors: Jaehyeok Kim, Dongyoon Wee, Dan Xu

    Abstract: We introduce You Only Train Once (YOTO), a dynamic human generation framework, which performs free-viewpoint rendering of different human identities with distinct motions, via only one-time training from monocular videos. Most prior works for the task require individualized optimization for each input video that contains a distinct human identity, leading to a significant amount of time and resour… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

  5. arXiv:2210.14165  [pdf, other

    cs.CV

    MEEV: Body Mesh Estimation On Egocentric Video

    Authors: Nicolas Monet, Dongyoon Wee

    Abstract: This technical report introduces our solution, MEEV, proposed to the EgoBody Challenge at ECCV 2022. Captured from head-mounted devices, the dataset consists of human body shape and motion of interacting people. The EgoBody dataset has challenges such as occluded body or blurry image. In order to overcome the challenges, MEEV is designed to exploit multiscale features for rich spatial information.… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 5 pages

  6. arXiv:2206.15015  [pdf, other

    cs.CV

    Exploring Temporally Dynamic Data Augmentation for Video Recognition

    Authors: Taeoh Kim, **hyung Kim, Minho Shim, Sangdoo Yun, Myunggu Kang, Dongyoon Wee, Sangyoun Lee

    Abstract: Data augmentation has recently emerged as an essential component of modern training recipes for visual recognition tasks. However, data augmentation for video recognition has been rarely explored despite its effectiveness. Few existing augmentation recipes for video recognition naively extend the image augmentation methods by applying the same operations to the whole video frames. Our main idea is… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Technical Report

  7. arXiv:2206.04906  [pdf, other

    cs.CV cs.LG

    Out of Sight, Out of Mind: A Source-View-Wise Feature Aggregation for Multi-View Image-Based Rendering

    Authors: Geonho Cha, Chaehun Shin, Sungroh Yoon, Dongyoon Wee

    Abstract: To estimate the volume density and color of a 3D point in the multi-view image-based rendering, a common approach is to inspect the consensus existence among the given source image features, which is one of the informative cues for the estimation procedure. To this end, most of the previous methods utilize equally-weighted aggregation features. However, this could make it hard to check the consens… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  8. arXiv:2205.10006  [pdf, other

    cs.CV cs.AI cs.LG

    Self-Supervised Depth Estimation with Isometric-Self-Sample-Based Learning

    Authors: Geonho Cha, Ho-Deok Jang, Dongyoon Wee

    Abstract: Managing the dynamic regions in the photometric loss formulation has been a main issue for handling the self-supervised depth estimation problem. Most previous methods have alleviated this issue by removing the dynamic regions in the photometric loss formulation based on the masks estimated from another module, making it difficult to fully utilize the training images. In this paper, to handle this… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  9. arXiv:2205.00968  [pdf, other

    cs.CV

    Detection Recovery in Online Multi-Object Tracking with Sparse Graph Tracker

    Authors: Jeongseok Hyun, Myunggu Kang, Dongyoon Wee, Dit-Yan Yeung

    Abstract: In existing joint detection and tracking methods, pairwise relational features are used to match previous tracklets to current detections. However, the features may not be discriminative enough for a tracker to identify a target from a large number of detections. Selecting only high-scored detections for tracking may lead to missed detections whose confidence score is low. Consequently, in the onl… ▽ More

    Submitted 19 September, 2023; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Accepted to WACV 2023; fix figures

  10. arXiv:2204.03865  [pdf, other

    cs.CV

    Frequency Selective Augmentation for Video Representation Learning

    Authors: **hyung Kim, Taeoh Kim, Minho Shim, Dongyoon Han, Dongyoon Wee, Junmo Kim

    Abstract: Recent self-supervised video representation learning methods focus on maximizing the similarity between multiple augmented views from the same video and largely rely on the quality of generated views. However, most existing methods lack a mechanism to prevent representation learning from bias towards static information in the video. In this paper, we propose frequency augmentation (FreqAug), a spa… ▽ More

    Submitted 6 December, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: AAAI23

  11. Electronic, vibrational and transport properties of pnictogen substituted ternary skutterudites

    Authors: Dmitri Volja, Boris Kozinsky, An Li, Daehyun Wee, Nicola Marzari, Marco Fornari

    Abstract: First principles calculations are used to investigate electronic band structure and vibrational spectra of pnictogen substituted ternary skutterudites. We compare the results with the prototypical binary composition CoSb$_3$ to identify the effects of substitutions on the Sb site, and evaluate the potential of ternary skutterudites for thermoelectric applications. Electronic transport coefficients… ▽ More

    Submitted 7 December, 2011; originally announced December 2011.