Skip to main content

Showing 1–45 of 45 results for author: Choi, J Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12909  [pdf, other

    cs.LG physics.comp-ph

    Scalable Training of Graph Foundation Models for Atomistic Materials Modeling: A Case Study with HydraGNN

    Authors: Massimiliano Lupo Pasini, Jong Youl Choi, Kshitij Mehta, Pei Zhang, David Rogers, Jonghyun Bae, Khaled Z. Ibrahim, Ashwin M. Aji, Karl W. Schulz, Jorda Polo, Prasanna Balaprakash

    Abstract: We present our work on develo** and training scalable graph foundation models (GFM) using HydraGNN, a multi-headed graph convolutional neural network architecture. HydraGNN expands the boundaries of graph neural network (GNN) in both training scale and data diversity. It abstracts over message passing algorithms, allowing both reproduction of and comparison across algorithmic innovations that de… ▽ More

    Submitted 28 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 16 pages, 13 figures

    MSC Class: 68T07; 68T09 ACM Class: C.2.4; I.2.11

  2. arXiv:2405.16301  [pdf, other

    cs.CV cs.LG

    Active Learning for Finely-Categorized Image-Text Retrieval by Selecting Hard Negative Unpaired Samples

    Authors: Dae Ung Jo, Kyuewang Lee, JaeHo Chung, ** Young Choi

    Abstract: Securing a sufficient amount of paired data is important to train an image-text retrieval (ITR) model, but collecting paired data is very expensive. To address this issue, in this paper, we propose an active learning algorithm for ITR that can collect paired data cost-efficiently. Previous studies assume that image-text pairs are given and their category labels are asked to the annotator. However,… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  3. arXiv:2405.15780  [pdf, other

    cs.CV cs.LG

    Sequence Length Scaling in Vision Transformers for Scientific Images on Frontier

    Authors: Aristeidis Tsaris, Chengming Zhang, Xiao Wang, Junqi Yin, Siyan Liu, Moetasim Ashfaq, Ming Fan, Jong Youl Choi, Mohamed Wahib, Dan Lu, Prasanna Balaprakash, Feiyi Wang

    Abstract: Vision Transformers (ViTs) are pivotal for foundational models in scientific imagery, including Earth science applications, due to their capability to process large sequence lengths. While transformers for text has inspired scaling sequence lengths in ViTs, yet adapting these for ViTs introduces unique challenges. We develop distributed sequence parallelism for ViTs, enabling them to handle up to… ▽ More

    Submitted 17 April, 2024; originally announced May 2024.

  4. arXiv:2405.03958  [pdf, other

    cs.CV cs.AI cs.LG

    Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model

    Authors: Joo Young Choi, Jaesung R. Park, Inkyu Park, Jaewoong Cho, Albert No, Ernest K. Ryu

    Abstract: Current state-of-the-art diffusion models employ U-Net architectures containing convolutional and (qkv) self-attention layers. The U-Net processes images while being conditioned on the time embedding input for each sampling step and the class or caption embedding input corresponding to the desired conditional generation. Such conditioning involves scale-and-shift operations to the convolutional la… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  5. arXiv:2403.06225  [pdf, other

    cs.CV cs.AI

    MoST: Motion Style Transformer between Diverse Action Contents

    Authors: Boeun Kim, Jungho Kim, Hyung ** Chang, ** Young Choi

    Abstract: While existing motion style transfer methods are effective between two motions with identical content, their performance significantly diminishes when transferring style between motions with different contents. This challenge lies in the lack of clear separation between content and style of a motion. To tackle this challenge, we propose a novel motion style transformer that effectively disentangle… ▽ More

    Submitted 20 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  6. arXiv:2401.03846  [pdf, other

    cs.CV cs.LG

    UFO: Unidentified Foreground Object Detection in 3D Point Cloud

    Authors: Hyunjun Choi, Hawook Jeong, ** Young Choi

    Abstract: In this paper, we raise a new issue on Unidentified Foreground Object (UFO) detection in 3D point clouds, which is a crucial technology in autonomous driving in the wild. UFO detection is challenging in that existing 3D object detectors encounter extremely hard challenges in both 3D localization and Out-of-Distribution (OOD) detection. To tackle these challenges, we suggest a new UFO detection fra… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Under review

  7. Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Supervised Temporal Video Grounding

    Authors: Sunoh Kim, Jungchan Cho, Joonsang Yu, YoungJoon Yoo, ** Young Choi

    Abstract: In the weakly supervised temporal video grounding study, previous methods use predetermined single Gaussian proposals which lack the ability to express diverse events described by the sentence query. To enhance the expression ability of a proposal, we propose a Gaussian mixture proposal (GMP) that can depict arbitrary shapes by learning importance, centroid, and range of every Gaussian in the mixt… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted in AAAI 2024

  8. arXiv:2311.05407  [pdf

    physics.comp-ph cs.LG physics.chem-ph

    Data Distillation for Neural Network Potentials toward Foundational Dataset

    Authors: Gang Seob Jung, Sangkeun Lee, Jong Youl Choi

    Abstract: Machine learning (ML) techniques and atomistic modeling have rapidly transformed materials design and discovery. Specifically, generative models can swiftly propose promising materials for targeted applications. However, the predicted properties of materials through the generative models often do not match with calculated properties through ab initio calculations. This discrepancy can arise becaus… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  9. arXiv:2308.01030  [pdf, other

    cs.LG cs.AI cs.CV

    Three Factors to Improve Out-of-Distribution Detection

    Authors: Hyunjun Choi, JaeHo Chung, Hawook Jeong, ** Young Choi

    Abstract: In the problem of out-of-distribution (OOD) detection, the usage of auxiliary data as outlier data for fine-tuning has demonstrated encouraging performance. However, previous methods have suffered from a trade-off between classification accuracy (ACC) and OOD detection performance (AUROC, FPR, AUPR). To improve this trade-off, we make three contributions: (i) Incorporating a self-knowledge distill… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: Under review

  10. arXiv:2306.10485  [pdf, other

    cs.CV cs.AI cs.LG

    Balanced Energy Regularization Loss for Out-of-distribution Detection

    Authors: Hyunjun Choi, Hawook Jeong, ** Young Choi

    Abstract: In the field of out-of-distribution (OOD) detection, a previous method that use auxiliary data as OOD data has shown promising performance. However, the method provides an equal loss to all auxiliary data to differentiate them from inliers. However, based on our observation, in various tasks, there is a general imbalance in the distribution of the auxiliary OOD data across classes. We propose a ba… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: Accepted by CVPR2023

  11. arXiv:2305.16618  [pdf, other

    cs.LG cs.SI

    Confidence-Based Feature Imputation for Graphs with Partially Known Features

    Authors: Daeho Um, Jiwoong Park, Seulki Park, ** Young Choi

    Abstract: This paper investigates a missing feature imputation problem for graph learning tasks. Several methods have previously addressed learning tasks on graphs with missing features. However, in cases of high rates of missing features, they were unable to avoid significant performance degradation. To overcome this limitation, we introduce a novel concept of channel-wise confidence in a node feature, whi… ▽ More

    Submitted 28 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ICLR 2023. 28 pages

  12. arXiv:2304.13995  [pdf, other

    cs.CV cs.AI

    Rotation and Translation Invariant Representation Learning with Implicit Neural Representations

    Authors: Sehyun Kwon, Joo Young Choi, Ernest K. Ryu

    Abstract: In many computer vision applications, images are acquired with arbitrary or random rotations and translations, and in such setups, it is desirable to obtain semantic representations disentangled from the image orientation. Examples of such applications include semiconductor wafer defect inspection, plankton microscope images, and inference on single-particle cryo-electron microscopy (cryo-EM) micr… ▽ More

    Submitted 12 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

  13. arXiv:2304.10727  [pdf, other

    cs.CV cs.AI

    RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models

    Authors: Seulki Park, Daeho Um, Hajung Yoon, Sanghyuk Chun, Sangdoo Yun, ** Young Choi

    Abstract: In this paper, we propose a robustness benchmark for image-text matching models to assess their vulnerabilities. To this end, we insert adversarial texts and images into the search pool (i.e., gallery set) and evaluate models with the adversarial data. Specifically, we replace a word in the text to change the meaning of the text and mix images with different images to create perceptible changes in… ▽ More

    Submitted 14 July, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  14. arXiv:2304.05067  [pdf

    eess.AS cs.IR cs.SD

    Audio Bank: A High-Level Acoustic Signal Representation for Audio Event Recognition

    Authors: Tushar Sandhan, Sukanya Sonowal, ** Young Choi

    Abstract: Automatic audio event recognition plays a pivotal role in making human robot interaction more closer and has a wide applicability in industrial automation, control and surveillance systems. Audio event is composed of intricate phonic patterns which are harmonically entangled. Audio recognition is dominated by low and mid-level features, which have demonstrated their recognition capability but they… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 6 pages, 9 figures, published in IEEE International Conf ICCAS 2014 (Best paper award)

  15. arXiv:2211.10967  [pdf, other

    cs.CV

    Font Representation Learning via Paired-glyph Matching

    Authors: Junho Cho, Kyuewang Lee, ** Young Choi

    Abstract: Fonts can convey profound meanings of words in various forms of glyphs. Without typography knowledge, manually selecting an appropriate font or designing a new font is a tedious and painful task. To allow users to explore vast font styles and create new font styles, font retrieval and font style transfer methods have been proposed. These tasks increase the need for learning high-quality font repre… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted to BMVC2022

  16. arXiv:2210.14480  [pdf, other

    cs.LG cs.AI cs.SI

    Meta-node: A Concise Approach to Effectively Learn Complex Relationships in Heterogeneous Graphs

    Authors: Jiwoong Park, Jisu Jeong, Kyungmin Kim, ** Young Choi

    Abstract: Existing message passing neural networks for heterogeneous graphs rely on the concepts of meta-paths or meta-graphs due to the intrinsic nature of heterogeneous graphs. However, the meta-paths and meta-graphs need to be pre-configured before learning and are highly dependent on expert knowledge to construct them. To tackle this challenge, we propose a novel concept of meta-node for message passing… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  17. arXiv:2207.11333  [pdf, other

    cs.LG cs.DC physics.chem-ph physics.comp-ph

    Scalable training of graph convolutional neural networks for fast and accurate predictions of HOMO-LUMO gap in molecules

    Authors: Jong Youl Choi, Pei Zhang, Kshitij Mehta, Andrew Blanchard, Massimiliano Lupo Pasini

    Abstract: Graph Convolutional Neural Network (GCNN) is a popular class of deep learning (DL) models in material science to predict material properties from the graph representation of molecular structures. Training an accurate and comprehensive GCNN surrogate for molecular design requires large-scale graph datasets and is usually a time-consuming process. Recent advances in GPUs and distributed computing op… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: 19 pages, 9 figures

    MSC Class: 68Q85; 68M14; 68W15; 68W15 ACM Class: I.2.11

  18. arXiv:2207.06101  [pdf, other

    cs.CV cs.AI

    Global-local Motion Transformer for Unsupervised Skeleton-based Action Learning

    Authors: Boeun Kim, Hyung ** Chang, Jungho Kim, ** Young Choi

    Abstract: We propose a new transformer model for the task of unsupervised learning of skeleton motion sequences. The existing transformer model utilized for unsupervised skeleton-based action learning is learned the instantaneous velocity of each joint from adjacent frames without global motion information. Thus, the model has difficulties in learning the attention globally over whole-body motions and tempo… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022

  19. Position-aware Location Regression Network for Temporal Video Grounding

    Authors: Sunoh Kim, Kimin Yun, ** Young Choi

    Abstract: The key to successful grounding for video surveillance is to understand a semantic phrase corresponding to important actors and objects. Conventional methods ignore comprehensive contexts for the phrase or require heavy computation for multiple phrases. To understand comprehensive contexts with only one semantic phrase, we propose Position-aware Location Regression Network (PLRN) which exploits po… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted in AVSS 2021

  20. arXiv:2203.07837  [pdf, other

    cs.CV

    Pose-MUM : Reinforcing Key Points Relationship for Semi-Supervised Human Pose Estimation

    Authors: JongMok Kim, Hwijun Lee, Jaeseung Lim, Jongkeun Na, Nojun Kwak, ** Young Choi

    Abstract: A well-designed strong-weak augmentation strategy and the stable teacher to generate reliable pseudo labels are essential in the teacher-student framework of semi-supervised learning (SSL). Considering these in mind, to suit the semi-supervised human pose estimation (SSHPE) task, we propose a novel approach referred to as Pose-MUM that modifies Mix/UnMix (MUM) augmentation. Like MUM in the dense p… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  21. arXiv:2202.02981  [pdf, other

    cs.LG math.OC stat.ML

    Neural Tangent Kernel Analysis of Deep Narrow Neural Networks

    Authors: Jongmin Lee, Joo Young Choi, Ernest K. Ryu, Albert No

    Abstract: The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has primarily focused on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this work, we present the first trainability guarantee of infinitely deep but narrow neural networks. We study the infinite-depth limit of a multilayer perceptron (MLP) with a… ▽ More

    Submitted 27 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Journal ref: Published in International Conference on Machine Learning, 2022

  22. arXiv:2202.01954  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Multi-task graph neural networks for simultaneous prediction of global and atomic properties in ferromagnetic systems

    Authors: Massimiliano Lupo Pasini, Pei Zhang, Samuel Temple Reeve, Jong Youl Choi

    Abstract: We introduce a multi-tasking graph convolutional neural network, HydraGNN, to simultaneously predict both global and atomic physical properties and demonstrate with ferromagnetic materials. We train HydraGNN on an open-source ab initio density functional theory (DFT) dataset for iron-platinum (FePt) with a fixed body centered tetragonal (BCT) lattice structure and fixed volume to simultaneously pr… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 13 pages, 6 figures

    Journal ref: Mach. Learn.: Sci. Technol. 3 025007 (2022)

  23. arXiv:2112.00412  [pdf, other

    cs.CV cs.AI

    The Majority Can Help The Minority: Context-rich Minority Oversampling for Long-tailed Classification

    Authors: Seulki Park, Youngkyu Hong, Byeongho Heo, Sangdoo Yun, ** Young Choi

    Abstract: The problem of class imbalanced data is that the generalization performance of the classifier deteriorates due to the lack of data from minority classes. In this paper, we propose a novel minority over-sampling method to augment diversified minority samples by leveraging the rich context of the majority classes as background images. To diversify the minority samples, our key idea is to paste an im… ▽ More

    Submitted 27 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted by CVPR 2022, 14 pages

  24. arXiv:2110.02444  [pdf, other

    cs.CV cs.LG

    Influence-Balanced Loss for Imbalanced Visual Classification

    Authors: Seulki Park, Jongin Lim, Younghan Jeon, ** Young Choi

    Abstract: In this paper, we propose a balancing training method to address problems in imbalanced data learning. To this end, we derive a new loss used in the balancing training phase that alleviates the influence of samples that cause an overfitted decision boundary. The proposed loss efficiently improves the performance of any type of imbalance learning methods. In experiments on multiple benchmark data s… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Published in ICCV 2021

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 735-744

  25. arXiv:2106.07217  [pdf, other

    cs.CV cs.AI

    Influential Rank: A New Perspective of Post-training for Robust Model against Noisy Labels

    Authors: Seulki Park, Hwanjun Song, Daeho Um, Dae Ung Jo, Sangdoo Yun, ** Young Choi

    Abstract: Deep neural network can easily overfit to even noisy labels due to its high capacity, which degrades the generalization performance of a model. To overcome this issue, we propose a new approach for learning from noisy labels (LNL) via post-training, which can significantly improve the generalization performance of any pre-trained model on noisy label data. To this end, we rather exploit the overfi… ▽ More

    Submitted 19 April, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 15 pages

  26. arXiv:2105.12764  [pdf, other

    cs.DC

    Scalable Multigrid-based Hierarchical Scientific Data Refactoring on GPUs

    Authors: Jieyang Chen, Lipeng Wan, Xin Liang, Ben Whitney, Qing Liu, Qian Gong, David Pugmire, Nicholas Thompson, Jong Youl Choi, Matthew Wolf, Todd Munson, Ian Foster, Scott Klasky

    Abstract: Rapid growth in scientific data and a widening gap between computational speed and I/O bandwidth makes it increasingly infeasible to store and share all data produced by scientific simulations. Instead, we need methods for reducing data volumes: ideally, methods that can scale data volumes adaptively so as to enable negotiation of performance and fidelity tradeoffs in different situations. Multigr… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:2007.04457

  27. arXiv:2103.16046  [pdf, other

    cs.LG

    Unsupervised Hyperbolic Representation Learning via Message Passing Auto-Encoders

    Authors: Jiwoong Park, Junho Cho, Hyung ** Chang, ** Young Choi

    Abstract: Most of the existing literature regarding hyperbolic embedding concentrate upon supervised learning, whereas the use of unsupervised hyperbolic embedding is less well explored. In this paper, we analyze how unsupervised tasks can benefit from learned representations in hyperbolic space. To explore how well the hierarchical structure of unlabeled data can be represented in hyperbolic spaces, we des… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

  28. arXiv:2011.06388   

    cs.NE cs.LG

    An ensemble of Density based Geometric One-Class Classifier and Genetic Algorithm

    Authors: Do Gyun Kim, ** Young Choi

    Abstract: One of the most rising issues in recent machine learning research is One-Class Classification which considers data set composed of only one class and outliers. It is more reasonable than traditional Multi-Class Classification in dealing with some problematic data set or special cases. Generally, classification accuracy and interpretability for user are considered as trade-off in OCC methods. Class… ▽ More

    Submitted 25 February, 2021; v1 submitted 2 October, 2020; originally announced November 2020.

    Comments: This manuscript contains a wrong definitions and equations

  29. arXiv:2008.13742  [pdf, other

    cs.DC cs.PF

    Chimbuko: A Workflow-Level Scalable Performance Trace Analysis Tool

    Authors: Sungsoo Ha, Wonyong Jeong, Gyorgy Matyasfalvi, Cong Xie, Kevin Huck, Jong Youl Choi, Abid Malik, Li Tang, Hubertus Van Dam, Line Pouchard, Wei Xu, Shinjae Yoo, Nicholas D'Imperio, Kerstin Kleese Van Dam

    Abstract: Because of the limits input/output systems currently impose on high-performance computing systems, a new generation of workflows that include online data reduction and analysis is emerging. Diagnosing their performance requires sophisticated performance analysis capabilities due to the complexity of execution patterns and underlying hardware, and no tool could handle the voluminous performance tra… ▽ More

    Submitted 31 August, 2020; originally announced August 2020.

  30. arXiv:2006.10222  [pdf, other

    cs.LG stat.ML

    Class-Attentive Diffusion Network for Semi-Supervised Classification

    Authors: Jongin Lim, Daeho Um, Hyung ** Chang, Dae Ung Jo, ** Young Choi

    Abstract: Recently, graph neural networks for semi-supervised classification have been widely studied. However, existing methods only use the information of limited neighbors and do not deal with the inter-class connections in graphs. In this paper, we propose Adaptive aggregation with Class-Attentive Diffusion (AdaCAD), a new aggregation scheme that adaptively aggregates nodes probably of the same class am… ▽ More

    Submitted 29 December, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted to AAAI 2021

  31. arXiv:2002.06048  [pdf, other

    cs.CV

    AutoLR: Layer-wise Pruning and Auto-tuning of Learning Rates in Fine-tuning of Deep Networks

    Authors: Youngmin Ro, ** Young Choi

    Abstract: Existing fine-tuning methods use a single learning rate over all layers. In this paper, first, we discuss that trends of layer-wise weight variations by fine-tuning using a single learning rate do not match the well-known notion that lower-level layers extract general features and higher-level layers extract specific features. Based on our discussion, we propose an algorithm that improves fine-tun… ▽ More

    Submitted 3 January, 2021; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: Accepted to AAAI 2021

  32. arXiv:2002.02868  [pdf, ps, other

    cs.LG math.OC stat.ML

    Differentiable Forward and Backward Fixed-Point Iteration Layers

    Authors: Younghan Jeon, Minsik Lee, ** Young Choi

    Abstract: Recently, several studies proposed methods to utilize some classes of optimization problems in designing deep neural networks to encode constraints that conventional layers cannot capture. However, these methods are still in their infancy and require special treatments, such as analyzing the KKT condition, for deriving the backpropagation formula. In this paper, we propose a new layer formulation… ▽ More

    Submitted 14 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  33. arXiv:1908.04015  [pdf, other

    cs.CV cs.LG

    Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

    Authors: YoungJoon Yoo, Sangdoo Yun, Hyung ** Chang, Yiannis Demiris, ** Young Choi

    Abstract: This paper proposes a new high dimensional regression method by merging Gaussian process regression into a variational autoencoder framework. In contrast to other regression methods, the proposed method focuses on the case where output responses are on a complex high dimensional manifold, such as images. Our contributions are summarized as follows: (i) A new regression method estimating high dimen… ▽ More

    Submitted 12 August, 2019; originally announced August 2019.

    Comments: Published in CVPR 2017

  34. arXiv:1908.02441  [pdf, other

    cs.LG cs.CV stat.ML

    Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning

    Authors: Jiwoong Park, Minsik Lee, Hyung ** Chang, Kyuewang Lee, ** Young Choi

    Abstract: We propose a symmetric graph convolutional autoencoder which produces a low-dimensional latent representation from a graph. In contrast to the existing graph autoencoders with asymmetric decoder parts, the proposed autoencoder has a newly designed decoder which builds a completely symmetric autoencoder form. For the reconstruction of node features, the decoder is designed based on Laplacian sharpe… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: 10 pages, 3 figures, ICCV 2019 accepted

  35. arXiv:1905.12867  [pdf, other

    cs.LG stat.ML

    Cross-modal Variational Auto-encoder with Distributed Latent Spaces and Associators

    Authors: Dae Ung Jo, ByeongJu Lee, Jongwon Choi, Haanju Yoo, ** Young Choi

    Abstract: In this paper, we propose a novel structure for a cross-modal data association, which is inspired by the recent research on the associative learning structure of the brain. We formulate the cross-modal association in Bayesian inference framework realized by a deep neural network with multiple variational auto-encoders and variational associators. The variational associators transfer the latent spa… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 10 pages, 6 figures

  36. arXiv:1905.10079  [pdf, other

    cs.LG stat.ML

    Neuro-Optimization: Learning Objective Functions Using Neural Networks

    Authors: Younghan Jeon, Minsik Lee, ** Young Choi

    Abstract: Mathematical optimization is widely used in various research fields. With a carefully-designed objective function, mathematical optimization can be quite helpful in solving many problems. However, objective functions are usually hand-crafted and designing a good one can be quite challenging. In this paper, we propose a novel framework to learn the objective function based on a neural net-work. The… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: 10 pages,6 figures

  37. arXiv:1904.08833  [pdf, other

    cs.RO eess.SY

    A Passivity-based Nonlinear Admittance Control with Application to Powered Upper-limb Control under Unknown Environmental Interactions

    Authors: Min Jun Kim, Woongyong Lee, Jae Yeon Choi, Goobong Chung, Kyung-Lyong Han, Il Seop Choi, Christian Ott, Wan Kyun Chung

    Abstract: This paper presents an admittance controller based on the passivity theory for a powered upper-limb exoskeleton robot which is governed by the nonlinear equation of motion. Passivity allows us to include a human operator and environmental interaction in the control loop. The robot interacts with the human operator via F/T sensor and interacts with the environment mainly via end-effectors. Although… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: Accepted in IEEE/ASME Transactions on Mechatronics (T-MECH)

  38. arXiv:1904.01866  [pdf, other

    cs.CV cs.LG

    A Comprehensive Overhaul of Feature Distillation

    Authors: Byeongho Heo, Jeesoo Kim, Sangdoo Yun, Hyo** Park, Nojun Kwak, ** Young Choi

    Abstract: We investigate the design aspects of feature distillation methods achieving network compression and propose a novel feature distillation method in which the distillation loss is designed to make a synergy among various aspects: teacher transform, student transform, distillation feature position and distance function. Our proposed distillation loss includes a feature transform with a newly designed… ▽ More

    Submitted 9 August, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted at ICCV 2019

  39. Skeleton-based Action Recognition of People Handling Objects

    Authors: Sunoh Kim, Kimin Yun, Jongyoul Park, ** Young Choi

    Abstract: In visual surveillance systems, it is necessary to recognize the behavior of people handling objects such as a phone, a cup, or a plastic bag. In this paper, to address this problem, we propose a new framework for recognizing object-related human actions by graph convolutional networks using human and object poses. In this framework, we construct skeletal graphs of reliable human poses by selectiv… ▽ More

    Submitted 21 January, 2019; originally announced January 2019.

    Comments: Accepted in WACV 2019

  40. arXiv:1901.06140  [pdf, other

    cs.CV

    Backbone Can Not be Trained at Once: Rolling Back to Pre-trained Network for Person Re-Identification

    Authors: Youngmin Ro, Jongwon Choi, Dae Ung Jo, Byeongho Heo, Jongin Lim, ** Young Choi

    Abstract: In person re-identification (ReID) task, because of its shortage of trainable dataset, it is common to utilize fine-tuning method using a classification network pre-trained on a large dataset. However, it is relatively difficult to sufficiently fine-tune the low-level layers of the network due to the gradient vanishing problem. In this work, we propose a novel fine-tuning strategy that allows low-… ▽ More

    Submitted 18 January, 2019; originally announced January 2019.

    Comments: Accepted to AAAI 2019

  41. arXiv:1811.03233  [pdf, other

    cs.LG cs.CV stat.ML

    Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons

    Authors: Byeongho Heo, Minsik Lee, Sangdoo Yun, ** Young Choi

    Abstract: An activation boundary for a neuron refers to a separating hyperplane that determines whether the neuron is activated or deactivated. It has been long considered in neural networks that the activations of neurons, rather than their exact output values, play the most important role in forming classification friendly partitions of the hidden feature space. However, as far as we know, this aspect of… ▽ More

    Submitted 14 December, 2018; v1 submitted 7 November, 2018; originally announced November 2018.

    Comments: Accepted to AAAI 2019

  42. arXiv:1805.05532  [pdf, other

    cs.LG cs.CV stat.ML

    Knowledge Distillation with Adversarial Samples Supporting Decision Boundary

    Authors: Byeongho Heo, Minsik Lee, Sangdoo Yun, ** Young Choi

    Abstract: Many recent works on knowledge distillation have provided ways to transfer the knowledge of a trained network for improving the learning process of a new one, but finding a good technique for knowledge distillation is still an open problem. In this paper, we provide a new perspective based on a decision boundary, which is one of the most important component of a classifier. The generalization perf… ▽ More

    Submitted 14 December, 2018; v1 submitted 14 May, 2018; originally announced May 2018.

    Comments: Accepted to AAAI 2019

  43. Context-aware Deep Feature Compression for High-speed Visual Tracking

    Authors: Jongwon Choi, Hyung ** Chang, Tobias Fischer, Sangdoo Yun, Kyuewang Lee, Jiyeoup Jeong, Yiannis Demiris, ** Young Choi

    Abstract: We propose a new context-aware correlation filter based tracking framework to achieve both high computational speed and state-of-the-art performance among real-time trackers. The major contribution to the high computational speed lies in the proposed deep feature compression that is achieved by a context-aware scheme utilizing multiple expert auto-encoders; a context in our framework refers to the… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: 9 pages, 6 figures, Accepted in CVPR2018 (IEEE conference on Computer Vision and Pattern Recognition)

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR2018)

  44. arXiv:1706.00522  [pdf, other

    cs.PF physics.comp-ph

    On the Scalability of Data Reduction Techniques in Current and Upcoming HPC Systems from an Application Perspective

    Authors: Axel Huebl, Rene Widera, Felix Schmitt, Alexander Matthes, Norbert Podhorszki, Jong Youl Choi, Scott Klasky, Michael Bussmann

    Abstract: We implement and benchmark parallel I/O methods for the fully-manycore driven particle-in-cell code PIConGPU. Identifying throughput and overall I/O size as a major challenge for applications on today's and future HPC systems, we present a scaling law characterizing performance bottlenecks in state-of-the-art approaches for data reduction. Consequently, we propose, implement and verify multi-threa… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

    Comments: 15 pages, 5 figures, accepted for DRBSD-1 in conjunction with ISC'17

    ACM Class: D.4.8; B.4.3; I.6.6

    Journal ref: J.M. Kunkel et al. (Eds.): ISC High Performance Workshops 2017, LNCS 10524, pp. 15-29, 2017

  45. arXiv:1505.03532  [pdf, other

    cs.DC cs.CE cs.DS physics.plasm-ph

    Towards Real-Time Detection and Tracking of Spatio-Temporal Features: Blob-Filaments in Fusion Plasma

    Authors: Lingfei Wu, Kesheng Wu, Alex Sim, Michael Churchill, Jong Y. Choi, Andreas Stathopoulos, Cs Chang, Scott Klasky

    Abstract: A novel algorithm and implementation of real-time identification and tracking of blob-filaments in fusion reactor data is presented. Similar spatio-temporal features are important in many other applications, for example, ignition kernels in combustion and tumor cells in a medical image. This work presents an approach for extracting these features by dividing the overall task into three steps: loca… ▽ More

    Submitted 2 July, 2016; v1 submitted 13 May, 2015; originally announced May 2015.

    Comments: 14 pages, 40 figures