Skip to main content

Showing 1–37 of 37 results for author: Kim, S W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12095  [pdf, other

    cs.CV cs.AI cs.RO

    DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

    Authors: Letian Wang, Seung Wook Kim, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang, Sanja Fidler, Marco Pavone, Peter Karkus

    Abstract: We propose DistillNeRF, a self-supervised learning framework addressing the challenge of understanding 3D environments from limited 2D observations in autonomous driving. Our method is a generalizable feedforward model that predicts a rich neural scene representation from sparse, single-frame multi-view camera inputs, and is trained self-supervised with differentiable rendering to reconstruct RGB,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.10324  [pdf, other

    cs.CV cs.LG

    L4GM: Large 4D Gaussian Reconstruction Model

    Authors: Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng, Karsten Kreis, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling

    Abstract: We present L4GM, the first 4D Large Reconstruction Model that produces animated objects from a single-view video input -- in a single feed-forward pass that takes only a second. Key to our success is a novel dataset of multiview videos containing curated, rendered animated objects from Objaverse. This dataset depicts 44K diverse objects with 110K animations rendered in 48 viewpoints, resulting in… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Project page: https://research.nvidia.com/labs/toronto-ai/l4gm

  3. arXiv:2406.06650  [pdf, other

    eess.IV cs.CV

    Predicting the risk of early-stage breast cancer recurrence using H\&E-stained tissue images

    Authors: Geongyu Lee, Joonho Lee, Tae-Yeong Kwak, Sun Woo Kim, Youngmee Kwon, Chungyeul Kim, Hyeyoon Chang

    Abstract: Accurate prediction of the likelihood of recurrence is important in the selection of postoperative treatment for patients with early-stage breast cancer. In this study, we investigated whether deep learning algorithms can predict patients' risk of recurrence by analyzing the pathology images of their cancer histology. A total of 125 hematoxylin and eosin stained breast cancer whole slide images la… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 7 figures

  4. arXiv:2405.14126  [pdf, other

    cs.LG cs.AI cs.CV

    The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks

    Authors: Bum Jun Kim, Yoshinobu Kawahara, Sang Woo Kim

    Abstract: Dynamical systems are often time-varying, whose modeling requires a function that evolves with respect to time. Recent studies such as the neural ordinary differential equation proposed a time-dependent neural network, which provides a neural network varying with respect to time. However, we claim that the architectural choice to build a time-dependent neural network significantly affects its time… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 14 pages, 7 figures

  5. arXiv:2405.14115  [pdf, other

    cs.CV cs.AI cs.LG

    Configuring Data Augmentations to Reduce Variance Shift in Positional Embedding of Vision Transformers

    Authors: Bum Jun Kim, Sang Woo Kim

    Abstract: Vision transformers (ViTs) have demonstrated remarkable performance in a variety of vision tasks. Despite their promising capabilities, training a ViT requires a large amount of diverse data. Several studies empirically found that using rich data augmentations, such as Mixup, Cutmix, and random erasing, is critical to the successful training of ViTs. Now, the use of rich data augmentations has bec… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 16 pages, 4 figures

  6. arXiv:2404.10765  [pdf, other

    cs.CV

    RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting

    Authors: Ashkan Mirzaei, Riccardo De Lutio, Seung Wook Kim, David Acuna, Jonathan Kelly, Sanja Fidler, Igor Gilitschenski, Zan Gojcic

    Abstract: Neural reconstruction approaches are rapidly emerging as the preferred representation for 3D scenes, but their limited editability is still posing a challenge. In this work, we propose an approach for 3D scene inpainting -- the task of coherently replacing parts of the reconstructed scene with desired content. Scene inpainting is an inherently ill-posed task as there exist many solutions that plau… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Project page: https://reffusion.github.io

  7. arXiv:2402.01149  [pdf, other

    cs.CV

    Scale Equalization for Multi-Level Feature Fusion

    Authors: Bum Jun Kim, Sang Woo Kim

    Abstract: Deep neural networks have exhibited remarkable performance in a variety of computer vision fields, especially in semantic segmentation tasks. Their success is often attributed to multi-level feature fusion, which enables them to understand both global and local information from an image. However, we found that multi-level features from parallel branches are on different scales. The scale disequili… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 10 pages, 3 figures

  8. arXiv:2401.11739  [pdf, other

    cs.CV cs.LG

    EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models

    Authors: Koichi Namekata, Amirmojtaba Sabour, Sanja Fidler, Seung Wook Kim

    Abstract: Diffusion models have recently received increasing research attention for their remarkable transfer abilities in semantic segmentation tasks. However, generating fine-grained segmentation masks with diffusion models often requires additional training on annotated datasets, leaving it unclear to what extent pre-trained diffusion models alone understand the semantic relations of their generated imag… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: ICLR 2024. Project page: https://kmcode1.github.io/Projects/EmerDiff/

  9. arXiv:2312.13763  [pdf, other

    cs.CV cs.LG

    Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models

    Authors: Huan Ling, Seung Wook Kim, Antonio Torralba, Sanja Fidler, Karsten Kreis

    Abstract: Text-guided diffusion models have revolutionized image and video generation and have also been successfully used for optimization-based 3D object synthesis. Here, we instead focus on the underexplored text-to-4D setting and synthesize dynamic, animated 3D objects using score distillation methods with an additional temporal dimension. Compared to previous work, we pursue a novel compositional gener… ▽ More

    Submitted 3 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Project page: https://research.nvidia.com/labs/toronto-ai/AlignYourGaussians/

  10. arXiv:2311.13570  [pdf, other

    cs.CV

    WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space

    Authors: Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger, Karsten Kreis

    Abstract: Modern learning-based approaches to 3D-aware image synthesis achieve high photorealism and 3D-consistent viewpoint changes for the generated images. Existing approaches represent instances in a shared canonical space. However, for in-the-wild datasets a shared canonical system can be difficult to define or might not even exist. In this work, we instead model instances in view space, alleviating th… ▽ More

    Submitted 12 April, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  11. arXiv:2311.03938  [pdf, other

    cs.CV

    Analysis of NaN Divergence in Training Monocular Depth Estimation Model

    Authors: Bum Jun Kim, Hyeonah Jang, Sang Woo Kim

    Abstract: The latest advances in deep learning have facilitated the development of highly accurate monocular depth estimation models. However, when training a monocular depth estimation network, practitioners and researchers have observed not a number (NaN) loss, which disrupts gradient descent optimization. Although several practitioners have reported the stochastic and mysterious occurrence of NaN loss th… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures

  12. arXiv:2311.02077  [pdf, other

    cs.CV

    EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

    Authors: Jiawei Yang, Boris Ivanovic, Or Litany, Xinshuo Weng, Seung Wook Kim, Boyi Li, Tong Che, Danfei Xu, Sanja Fidler, Marco Pavone, Yue Wang

    Abstract: We present EmerNeRF, a simple yet powerful approach for learning spatial-temporal representations of dynamic driving scenes. Grounded in neural fields, EmerNeRF simultaneously captures scene geometry, appearance, motion, and semantics via self-bootstrap**. EmerNeRF hinges upon two core components: First, it stratifies scenes into static and dynamic fields. This decomposition emerges purely from… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: See the project page for code, data, and request pre-trained models: https://emernerf.github.io

  13. arXiv:2307.14179  [pdf, other

    cs.CV

    Resolution-Aware Design of Atrous Rates for Semantic Segmentation Networks

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: DeepLab is a widely used deep neural network for semantic segmentation, whose success is attributed to its parallel architecture called atrous spatial pyramid pooling (ASPP). ASPP uses multiple atrous convolutions with different atrous rates to extract both local and global information. However, fixed values of atrous rates are used for the ASPP module, which restricts the size of its field of vie… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 18 pages, 12 figures

  14. arXiv:2307.07487  [pdf, other

    cs.CV cs.LG

    DreamTeacher: Pretraining Image Backbones with Deep Generative Models

    Authors: Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, Karsten Kreis, Antonio Torralba, Sanja Fidler

    Abstract: In this work, we introduce a self-supervised feature representation learning framework DreamTeacher that utilizes generative networks for pre-training downstream image backbones. We propose to distill knowledge from a trained generative model into standard image backbones that have been well engineered for specific perception tasks. We investigate two types of knowledge distillation: 1) distilling… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: Project page: https://research.nvidia.com/labs/toronto-ai/DreamTeacher/

  15. arXiv:2305.04722  [pdf, other

    cs.CV

    Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive Fields

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: Vision transformers (ViTs) that model an image as a sequence of partitioned patches have shown notable performance in diverse vision tasks. Because partitioning patches eliminates the image structure, to reflect the order of patches, ViTs utilize an explicit component called positional embedding. However, we claim that the use of positional embedding does not simply guarantee the order-awareness o… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 11 pages, 7 Figures

  16. arXiv:2304.09787  [pdf, other

    cs.CV

    NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models

    Authors: Seung Wook Kim, Bradley Brown, Kangxue Yin, Karsten Kreis, Katja Schwarz, Daiqing Li, Robin Rombach, Antonio Torralba, Sanja Fidler

    Abstract: Automatically generating high-quality real world 3D scenes is of enormous interest for applications such as virtual reality and robotics simulation. Towards this goal, we introduce NeuralField-LDM, a generative model capable of synthesizing complex 3D environments. We leverage Latent Diffusion Models that have been successfully utilized for efficient high-quality 2D content creation. We first trai… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  17. arXiv:2304.08818  [pdf, other

    cs.CV cs.LG

    Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

    Authors: Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, Karsten Kreis

    Abstract: Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding excessive compute demands by training a diffusion model in a compressed lower-dimensional latent space. Here, we apply the LDM paradigm to high-resolution video generation, a particularly resource-intensive task. We first pre-train an LDM on images only; then, we turn the image generator into a video generator by int… ▽ More

    Submitted 27 December, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: Conference on Computer Vision and Pattern Recognition (CVPR) 2023. Project page: https://research.nvidia.com/labs/toronto-ai/VideoLDM/

  18. arXiv:2302.06112  [pdf, other

    cs.LG cs.CV

    How to Use Dropout Correctly on Residual Networks with Batch Normalization

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Donggeon Lee, Sang Woo Kim

    Abstract: For the stable optimization of deep neural networks, regularization methods such as dropout and batch normalization have been used in various tasks. Nevertheless, the correct position to apply dropout has rarely been discussed, and different positions have been employed depending on the practitioners. In this study, we investigate the correct position to apply dropout. We demonstrate that for a re… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 10 pages, 4 figures

  19. arXiv:2302.03193  [pdf, other

    cs.LG cs.CV

    On the Ideal Number of Groups for Isometric Gradient Propagation

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Sang Woo Kim

    Abstract: Recently, various normalization layers have been proposed to stabilize the training of deep neural networks. Among them, group normalization is a generalization of layer normalization and instance normalization by allowing a degree of freedom in the number of groups it uses. However, to determine the optimal number of groups, trial-and-error-based hyperparameter tuning is required, and such experi… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 10 pages, 2 figures

  20. arXiv:2208.12544  [pdf

    cs.LG eess.SP physics.flu-dyn

    Deep learning-based denoising for fast time-resolved flame emission spectroscopy in high-pressure combustion environment

    Authors: Taekeun Yoon, Seon Woong Kim, Hosung Byun, Younsik Kim, Campbell D. Carter, Hyungrok Do

    Abstract: A deep learning strategy is developed for fast and accurate gas property measurements using flame emission spectroscopy (FES). Particularly, the short-gated fast FES is essential to resolve fast-evolving combustion behaviors. However, as the exposure time for capturing the flame emission spectrum gets shorter, the signal-to-noise ratio (SNR) decreases, and characteristic spectral features indicati… ▽ More

    Submitted 26 December, 2022; v1 submitted 29 July, 2022; originally announced August 2022.

    Comments: 25 pages, 12 figures, accepted to Combustion and Flame

    Report number: Combustion and Flame 248 (2023) 112583

  21. arXiv:2206.02903  [pdf, other

    cs.CV

    Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps

    Authors: Seung Wook Kim, Karsten Kreis, Daiqing Li, Antonio Torralba, Sanja Fidler

    Abstract: Modern image generative models show remarkable sample quality when trained on a single domain or class of objects. In this work, we introduce a generative adversarial network that can simultaneously generate aligned image samples from multiple related domains. We leverage the fact that a variety of object classes share common attributes, with certain geometric differences. We propose Polymorphic-G… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: CVPR 2022 Oral

  22. arXiv:2205.07260  [pdf, other

    cs.CV

    Guidelines for the Regularization of Gammas in Batch Normalization for Deep Residual Networks

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Dong Gu Lee, Wonseok Jeong, Sang Woo Kim

    Abstract: L2 regularization for weights in neural networks is widely used as a standard training trick. However, L2 regularization for gamma, a trainable parameter of batch normalization, remains an undiscussed mystery and is applied in different ways depending on the library and practitioner. In this paper, we study whether L2 regularization for gamma is valid. To explore this issue, we consider two approa… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: 12 pages, 6 figures

  23. arXiv:2201.04684  [pdf, other

    cs.CV

    BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations

    Authors: Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Kreis, Adela Barriuso, Sanja Fidler, Antonio Torralba

    Abstract: Annotating images with pixel-wise labels is a time-consuming and costly process. Recently, DatasetGAN showcased a promising alternative - to synthesize a large labeled dataset via a generative adversarial network (GAN) by exploiting a small set of manually labeled, GAN-generated images. Here, we scale DatasetGAN to ImageNet scale of class diversity. We take image samples from the class-conditional… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: https://nv-tlabs.github.io/big-datasetgan/

  24. arXiv:2111.08413  [pdf, other

    cs.CV

    Improved Robustness of Vision Transformer via PreLayerNorm in Patch Embedding

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Dong Gu Lee, Wonseok Jeong, Sang Woo Kim

    Abstract: Vision transformers (ViTs) have recently demonstrated state-of-the-art performance in a variety of vision tasks, replacing convolutional neural networks (CNNs). Meanwhile, since ViT has a different architecture than CNN, it may behave differently. To investigate the reliability of ViT, this paper studies the behavior and robustness of ViT. We compared the robustness of CNN and ViT by assuming vari… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 7 pages, 8 figures. Work in Progress

  25. arXiv:2111.03186  [pdf, other

    cs.CV cs.AI

    EditGAN: High-Precision Semantic Image Editing

    Authors: Huan Ling, Karsten Kreis, Daiqing Li, Seung Wook Kim, Antonio Torralba, Sanja Fidler

    Abstract: Generative adversarial networks (GANs) have recently found applications in image editing. However, most GAN based image editing methods often require large scale datasets with semantic segmentation annotations for training, only provide high level control, or merely interpolate between different images. Here, we propose EditGAN, a novel method for high quality, high precision semantic image editin… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

  26. arXiv:2108.13576  [pdf, other

    cs.CV

    Dead Pixel Test Using Effective Receptive Field

    Authors: Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Dong Gu Lee, Wonseok Jeong, Sang Woo Kim

    Abstract: Deep neural networks have been used in various fields, but their internal behavior is not well known. In this study, we discuss two counterintuitive behaviors of convolutional neural networks (CNNs). First, we evaluated the size of the receptive field. Previous studies have attempted to increase or control the size of the receptive field. However, we observed that the size of the receptive field d… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: 9 pages, 5 figures

  27. arXiv:2104.15060  [pdf, other

    cs.CV cs.RO

    DriveGAN: Towards a Controllable High-Quality Neural Simulation

    Authors: Seung Wook Kim, Jonah Philion, Antonio Torralba, Sanja Fidler

    Abstract: Realistic simulators are critical for training and verifying robotics systems. While most of the contemporary simulators are hand-crafted, a scaleable way to build simulators is to use machine learning to learn how the environment behaves in response to an action, directly from data. In this work, we aim to learn to simulate a dynamic environment directly in pixel-space, by watching unannotated se… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: CVPR 2021 Oral

  28. Self-supervised driven consistency training for annotation efficient histopathology image analysis

    Authors: Chetan L. Srinidhi, Seung Wook Kim, Fu-Der Chen, Anne L. Martel

    Abstract: Training a neural network with a large labeled dataset is still a dominant paradigm in computational histopathology. However, obtaining such exhaustive manual annotations is often expensive, laborious, and prone to inter and Intra-observer variability. While recent self-supervised and semi-supervised methods can alleviate this need by learn-ing unsupervised feature representations, they still stru… ▽ More

    Submitted 3 October, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Journal ref: Medical Image Analysis, Volume 75, January 2022

  29. arXiv:2008.07083  [pdf, other

    cs.NI cs.CV cs.LG

    Edge Network-Assisted Real-Time Object Detection Framework for Autonomous Driving

    Authors: Seung Wook Kim, Keunsoo Ko, Haneul Ko, Victor C. M. Leung

    Abstract: Autonomous vehicles (AVs) can achieve the desired results within a short duration by offloading tasks even requiring high computational power (e.g., object detection (OD)) to edge clouds. However, although edge clouds are exploited, real-time OD cannot always be guaranteed due to dynamic channel quality. To mitigate this problem, we propose an edge network-assisted real-time OD framework~(EODF). I… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: This paper will be published in IEEE Network

  30. arXiv:2005.12126  [pdf, other

    cs.CV

    Learning to Simulate Dynamic Environments with GameGAN

    Authors: Seung Wook Kim, Yuhao Zhou, Jonah Philion, Antonio Torralba, Sanja Fidler

    Abstract: Simulation is a crucial component of any robotic system. In order to simulate correctly, we need to write complex rules of the environment: how dynamic agents behave, and how the actions of each of the agents affect the behavior of others. In this paper, we aim to learn a simulator by simply watching an agent interact with an environment. We focus on graphics games as a proxy of the real environme… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: CVPR 2020

  31. arXiv:2001.05153  [pdf, other

    cs.CV

    Extending Class Activation Map** Using Gaussian Receptive Field

    Authors: Bum Jun Kim, Gyogwon Koo, Hyeyeon Choi, Sang Woo Kim

    Abstract: This paper addresses the visualization task of deep learning models. To improve Class Activation Map** (CAM) based visualization method, we offer two options. First, we propose Gaussian upsampling, an improved upsampling method that can reflect the characteristics of deep learning models. Second, we identify and modify unnatural terms in the mathematical derivation of the existing CAM studies. B… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: 7 pages, 5 figures

  32. arXiv:1912.13082  [pdf, other

    cs.CL cs.AI

    The Shmoop Corpus: A Dataset of Stories with Loosely Aligned Summaries

    Authors: Atef Chaudhury, Makarand Tapaswi, Seung Wook Kim, Sanja Fidler

    Abstract: Understanding stories is a challenging reading comprehension problem for machines as it requires reading a large volume of text and following long-range dependencies. In this paper, we introduce the Shmoop Corpus: a dataset of 231 stories that are paired with detailed multi-paragraph summaries for each individual chapter (7,234 chapters), where the summary is chronologically aligned with respect t… ▽ More

    Submitted 1 January, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: Project page: http://www.cs.toronto.edu/~makarand/shmoop/ Dataset at: https://github.com/achaudhury/shmoop-corpus/

  33. Selective Distillation of Weakly Annotated GTD for Vision-based Slab Identification System

    Authors: Sang Jun Lee, Sang Woo Kim, Wookyong Kwon, Gyogwon Koo, Jong Pil Yun

    Abstract: This paper proposes an algorithm for recognizing slab identification numbers in factory scenes. In the development of a deep-learning based system, manual labeling to make ground truth data (GTD) is an important but expensive task. Furthermore, the quality of GTD is closely related to the performance of a supervised learning algorithm. To reduce manual work in the labeling process, we generated we… ▽ More

    Submitted 13 December, 2018; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: 10 pages, 12 figures, submitted to a journal

    Journal ref: IEEE Access 7 (2019) 23177-23186

  34. arXiv:1810.01616  [pdf, other

    cs.CV

    Cascaded Pyramid Network for 3D Human Pose Estimation Challenge

    Authors: Sungeun Hong, Won** Jung, Ilsang Woo, Seung Wook Kim

    Abstract: Over the past decade, there has been a growing interest in human pose estimation. Although much work has been done on 2D pose estimation, 3D pose estimation has still been relatively studied less. In this paper, we propose a top-bottom based two-stage 3D estimation framework. GloabalNet and RefineNet in our 2D pose estimation process enable us to find occluded or invisible 2D joints while 2D-to-3D… ▽ More

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: Accepted to ECCV Workshop 2018

  35. arXiv:1806.02453  [pdf, other

    cs.CV

    Visual Reasoning by Progressive Module Networks

    Authors: Seung Wook Kim, Makarand Tapaswi, Sanja Fidler

    Abstract: Humans learn to solve tasks of increasing complexity by building on top of previously acquired knowledge. Typically, there exists a natural progression in the tasks that we learn - most do not require completely independent solutions, but can be broken down into simpler subtasks. We propose to represent a solver for each task as a neural module that calls existing modules (solvers for simpler task… ▽ More

    Submitted 27 September, 2018; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: 17 pages, 5 figures

  36. arXiv:1804.03533  [pdf, ps, other

    cs.NI

    Multi-band RF Energy and Spectrum Harvesting in Cognitive Radio Networks

    Authors: Ahmad Alsharoa, Nathan M Neihart, Sang W Kim, Ahmed E Kamal

    Abstract: This paper investigates a multi-band harvesting (EH) schemes under cognitive radio interweave framework. All secondary users are considered as EH nodes that are allowed to harvest energy from multiple bands of Radio Frequency (RF) sources. A win-win framework is proposed, where SUs can sense the spectrum to determine whether the spectrum is busy, and hence they may harvest from RF energy, or if it… ▽ More

    Submitted 10 April, 2018; originally announced April 2018.

  37. arXiv:1403.6555  [pdf, ps, other

    cs.IT

    Modify-and-Forward for Securing Cooperative Relay Communications

    Authors: Sang Wu Kim

    Abstract: We proposed a new physical layer technique that can enhance the security of cooperative relay communications. The proposed approach modifies the decoded message at the relay according to the unique channel state between the relay and the destination such that the destination can utilize the modified message to its advantage while the eavesdropper cannot. We present a practical method for securely… ▽ More

    Submitted 25 March, 2014; originally announced March 2014.

    Comments: IEEE International Zurich Seminar on Communications, Feb. 2014