Showing 1–44 of 44 results for author: Noh, H

Searching in archive cs.
  1. arXiv:2405.13996

    eess.SP cs.HC

    Detecting Gait Abnormalities in Foot-Floor Contacts During Walking Through Footstep-Induced Structural Vibrations

    Authors: Yiwen Dong, Yuyan Wu, Hae Young Noh

    Abstract: Gait abnormality detection is critical for the early discovery and progressive tracking of musculoskeletal and neurological disorders, such as Parkinson's and Cerebral Palsy. Especially, analyzing the foot-floor contacts during walking provides important insights into gait patterns, such as contact area, contact force, and contact time, enabling gait abnormality detection through these measurement…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: The 14th International Workshop on Structural Health Monitoring (IWSHM)

  2. arXiv:2404.02486

    eess.SY cs.IT

    Joint Optimization on Uplink OFDMA and MU-MIMO for IEEE 802.11ax: Deep Hierarchical Reinforcement Learning Approach

    Authors: Hyeonho Noh, Harim Lee, Hyun Jong Yang

    Abstract: This letter tackles a joint user scheduling, frequency resource allocation (USRA), multi-input-multi-output mode selection (MIMO MS) between single-user MIMO and multi-user (MU) MIMO, and MU-MIMO user selection problem, integrating uplink orthogonal frequency division multiple access (OFDMA) in IEEE 802.11ax. Specifically, we focus on unsaturated traffic conditions where users' data deman…

    Submitted 15 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  3. arXiv:2404.01954

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  4. arXiv:2403.01479

    cs.CL cs.AI

    Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation

    Authors: Heegon Jin, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh, Yeonsoo Lee

    Abstract: The advent of scalable deep models and large datasets has improved the performance of Neural Machine Translation. Knowledge Distillation (KD) enhances efficiency by transferring knowledge from a teacher model to a more compact student model. However, KD approaches to Transformer architecture often rely on heuristics, particularly when deciding which teacher layers to distill from. In this paper, w…

    Submitted 25 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

    MSC Class: 68T50 ACM Class: I.2.7

  5. arXiv:2312.13289

    cond-mat.mtrl-sci cs.LG

    Stoichiometry Representation Learning with Polymorphic Crystal Structures

    Authors: Namkyeong Lee, Heewoong Noh, Gyoung S. Na, Tianfan Fu, Jimeng Sun, Chanyoung Park

    Abstract: Despite the recent success of machine learning (ML) in materials science, its success heavily relies on the structural description of crystal, which is itself computationally demanding and occasionally unattainable. Stoichiometry descriptors can be an alternative approach, which reveals the ratio between elements involved to form a certain compound without any structural information. However, it i…

    Submitted 17 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 AI4Science Workshop

  6. arXiv:2311.12856

    cond-mat.mtrl-sci cs.AI cs.LG

    Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer

    Authors: Namkyeong Lee, Heewoong Noh, Sungwon Kim, Dongmin Hyun, Gyoung S. Na, Chanyoung Park

    Abstract: The density of states (DOS) is a spectral property of crystalline materials, which provides fundamental insights into various characteristics of the materials. While previous works mainly focus on obtaining high-quality representations of crystalline materials for DOS prediction, we focus on predicting the DOS from the obtained representations by reflecting the nature of DOS: DOS determines the ge…

    Submitted 22 November, 2023; v1 submitted 24 October, 2023; originally announced November 2023.

    Comments: NeurIPS 2023. arXiv admin note: text overlap with arXiv:2303.07000

  7. arXiv:2310.13805   

    cs.LG cs.CV

    Normalizing flow-based deep variational Bayesian network for seismic multi-hazards and impacts estimation from InSAR imagery

    Authors: Xuechun Li, Paula M. Burgi, Wei Ma, Hae Young Noh, David J. Wald, Susu Xu

    Abstract: Onsite disasters like earthquakes can trigger cascading hazards and impacts, such as landslides and infrastructure damage, leading to catastrophic losses; thus, rapid and accurate estimates are crucial for timely and effective post-disaster responses. Interferometric Synthetic aperture radar (InSAR) data is important in providing high-resolution onsite information for rapid hazard estimation. Most…

    Submitted 20 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: This paper needs to be reviewed by the USGS

  8. arXiv:2306.12626

    cs.CV eess.IV

    1st Place Solution to MultiEarth 2023 Challenge on Multimodal SAR-to-EO Image Translation

    Authors: Jingi Ju, Hyeoncheol Noh, Minwoo Kim, Dong-Geol Choi

    Abstract: The Multimodal Learning for Earth and Environment Workshop (MultiEarth 2023) aims to harness the substantial amount of remote sensing data gathered over extensive periods for the monitoring and analysis of Earth's ecosystems' health. The subtask, Multimodal SAR-to-EO Image Translation, involves the use of robust SAR data, even under adverse weather and lighting conditions, transforming it into high…

    Submitted 21 June, 2023; originally announced June 2023.

  9. arXiv:2304.08925

    cs.LG cs.PF

    Understand Data Preprocessing for Effective End-to-End Training of Deep Neural Networks

    Authors: Ping Gong, Yuxin Ma, Cheng Li, Xiaosong Ma, Sam H. Noh

    Abstract: In this paper, we primarily focus on understanding the data preprocessing pipeline for DNN Training in the public cloud. First, we run experiments to test the performance implications of the two major data preprocessing methods using either raw data or record files. The preliminary results show that data preprocessing is a clear bottleneck, even with the most efficient software and hardware config…

    Submitted 18 April, 2023; originally announced April 2023.

  10. arXiv:2303.11606

    cs.CV

    CAFS: Class Adaptive Framework for Semi-Supervised Semantic Segmentation

    Authors: Jingi Ju, Hyeoncheol Noh, Yooseung Wang, Minseok Seo, Dong-Geol Choi

    Abstract: Semi-supervised semantic segmentation learns a model for classifying pixels into specific classes using a few labeled samples and numerous unlabeled images. The recent leading approach is consistency regularization by self-training with pseudo-labeling pixels having high confidences for unlabeled images. However, using only high-confidence pixels for self-training may result in losing much of the in…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 13 pages, 9 figures

  11. arXiv:2303.08774

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  12. arXiv:2303.07000

    cs.LG cond-mat.mtrl-sci physics.comp-ph

    Predicting Density of States via Multi-modal Transformer

    Authors: Namkyeong Lee, Heewoong Noh, Sungwon Kim, Dongmin Hyun, Gyoung S. Na, Chanyoung Park

    Abstract: The density of states (DOS) is a spectral property of materials, which provides fundamental insights on various characteristics of materials. In this paper, we propose a model to predict the DOS by reflecting the nature of DOS: DOS determines the general distribution of states as a function of energy. Specifically, we integrate the heterogeneous information obtained from the crystal structure and…

    Submitted 10 April, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 Workshop on Machine Learning for Materials (ML4Materials)

  13. arXiv:2212.10504

    cs.CL

    Can Current Task-oriented Dialogue Models Automate Real-world Scenarios in the Wild?

    Authors: Sang-Woo Lee, Sungdong Kim, Donghyeon Ko, Donghoon Ham, Youngki Hong, Shin Ah Oh, Hyunhoon Jung, Wangkyo Jung, Kyunghyun Cho, Donghyun Kwak, Hyungsuk Noh, Woomyoung Park

    Abstract: Task-oriented dialogue (TOD) systems are mainly based on the slot-filling-based TOD (SF-TOD) framework, in which dialogues are broken down into smaller, controllable units (i.e., slots) to fulfill a specific task. A series of approaches based on this framework achieved remarkable success on various TOD benchmarks. However, we argue that the current TOD benchmarks are limited to surrogate real-worl…

    Submitted 24 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  14. arXiv:2212.07939

    cs.CL cs.LG cs.SD eess.AS

    RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis

    Authors: Shinhyeok Oh, HyeongRae Noh, Yoonseok Hong, Insoo Oh

    Abstract: With the advent of deep learning, a huge number of text-to-speech (TTS) models which produce human-like speech have emerged. Recently, by introducing syntactic and semantic information w.r.t the input text, various approaches have been proposed to enrich the naturalness and expressiveness of TTS models. Although these strategies showed impressive results, they still have some limitations in utiliz…

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI 2023

  15. GaitVibe+: Enhancing Structural Vibration-based Footstep Localization Using Temporary Cameras for In-home Gait Analysis

    Authors: Yiwen Dong, Jingxiao Liu, Hae Young Noh

    Abstract: In-home gait analysis is important for providing early diagnosis and adaptive treatments for individuals with gait disorders. Existing systems include wearables and pressure mats, but they have limited scalability. Recent studies have developed vision-based systems to enable scalable, accurate in-home gait analysis, but it faces privacy concerns due to the exposure of people's appearances. Our pri…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: 7 pages, 7 figures

    ACM Class: J.3

  16. arXiv:2211.12118

    cs.CL

    HaRiM$^+$: Evaluating Summary Quality with Hallucination Risk

    Authors: Seonil Son, Junsoo Park, Jeong-in Hwang, Junghwa Lee, Hyungjong Noh, Yeonsoo Lee

    Abstract: One of the challenges of developing a summarization model arises from the difficulty in measuring the factual inconsistency of the generated text. In this study, we reinterpret the decoder overconfidence-regularizing objective suggested in (Miao et al., 2021) as a hallucination risk measurement to better estimate the quality of generated summaries. We propose a reference-free metric, HaRiM+, which…

    Submitted 24 November, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 9 pages (+ 21 pages of Appendix), AACL 2022

    Journal ref: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2022), pages 895-924

  17. arXiv:2204.07653

    cs.CE

    Bayesian Updating of Seismic Ground Failure Estimates via Causal Graphical Models and Satellite Imagery

    Authors: Susu Xu, Joshua Dimasaka, David J. Wald, Hae Young Noh

    Abstract: Earthquake-induced secondary ground failure hazards, such as liquefaction and landslides, result in catastrophic building and infrastructure damage as well as human fatalities. To facilitate emergency responses and mitigate losses, the U.S. Geological Survey provides a rapid hazard estimation system for earthquake-triggered landslides and liquefaction using geospatial susceptibility proxies and Sh…

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: The 17th World Conference on Earthquake Engineering, Sendai, Japan, Sep. 2021

  18. arXiv:2204.01200

    cs.CV eess.IV

    Unsupervised Change Detection Based on Image Reconstruction Loss

    Authors: Hyeoncheol Noh, Jingi Ju, Minseok Seo, Jongchan Park, Dong-Geol Choi

    Abstract: To train the change detector, bi-temporal images taken at different times in the same area are used. However, collecting labeled bi-temporal images is expensive and time consuming. To solve this problem, various unsupervised change detection methods have been proposed, but they still require unlabeled bi-temporal images. In this paper, we propose unsupervised change detection based on image recons…

    Submitted 4 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: 10 pages, 7 figures

  19. arXiv:2201.00722

    math.AP cond-mat.mtrl-sci cs.LG

    Predicting Peak Stresses In Microstructured Materials Using Convolutional Encoder-Decoder Learning

    Authors: Ankit Shrivastava, Jingxiao Liu, Kaushik Dayal, Hae Young Noh

    Abstract: This work presents a machine learning approach to predict peak-stress clusters in heterogeneous polycrystalline materials. Prior work on using machine learning in the context of mechanics has largely focused on predicting the effective response and overall structure of stress fields. However, their ability to predict peak stresses -- which are of critical importance to failure -- is unexplored, be…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: To appear in Mathematics and Mechanics of Solids

  20. arXiv:2112.10360

    cs.CL

    May the Force Be with Your Copy Mechanism: Enhanced Supervised-Copy Method for Natural Language Generation

    Authors: Sanghyuk Choi, Jeong-in Hwang, Hyungjong Noh, Yeonsoo Lee

    Abstract: Recent neural sequence-to-sequence models with a copy mechanism have achieved remarkable progress in various text generation tasks. These models addressed out-of-vocabulary problems and facilitated the generation of rare words. However, the identification of the word which needs to be copied is difficult, as observed by prior copy models, which suffer from incorrect generation and lacking abstract…

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: 8 pages, 3 figures, 8 tables and 4 pages of appendices

  21. HierMUD: Hierarchical Multi-task Unsupervised Domain Adaptation between Bridges for Drive-by Damage Diagnosis

    Authors: Jingxiao Liu, Susu Xu, Mario Bergés, Hae Young Noh

    Abstract: Monitoring bridge health using vibrations of drive-by vehicles has various benefits, such as no need for directly installing and maintaining sensors on the bridge. However, many of the existing drive-by monitoring approaches are based on supervised learning models that require labeled data from every bridge of interest, which is expensive and time-consuming, if not impossible, to obtain. To this e…

    Submitted 23 July, 2021; originally announced July 2021.

    Journal ref: Structural Health Monitoring 22(3):1941-1968, 2023

  22. arXiv:2101.04882

    cs.LG cs.AI cs.CV cs.RO

    Asymmetric self-play for automatic goal discovery in robotic manipulation

    Authors: OpenAI OpenAI, Matthias Plappert, Raul Sampedro, Tao Xu, Ilge Akkaya, Vineet Kosaraju, Peter Welinder, Ruben D'Sa, Arthur Petron, Henrique P. d. O. Pinto, Alex Paino, Hyeonwoo Noh, Lilian Weng, Qiming Yuan, Casey Chu, Wojciech Zaremba

    Abstract: We train a single, goal-conditioned policy that can solve many robotic manipulation tasks, including tasks with previously unseen goals and objects. We rely on asymmetric self-play for goal discovery, where two agents, Alice and Bob, play a game. Alice is asked to propose challenging goals and Bob aims to solve them. We show that this method can discover highly diverse and complex goals without an…

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Videos are shown at https://robotics-self-play.github.io

  23. arXiv:2010.14274

    cs.AI cs.LG

    Behavior Priors for Efficient Reinforcement Learning

    Authors: Dhruva Tirumala, Alexandre Galashov, Hyeonwoo Noh, Leonard Hasenclever, Razvan Pascanu, Jonathan Schwarz, Guillaume Desjardins, Wojciech Marian Czarnecki, Arun Ahuja, Yee Whye Teh, Nicolas Heess

    Abstract: As we deploy reinforcement learning agents to solve increasingly challenging problems, methods that allow us to inject prior knowledge about the structure of the world and effective solution strategies becomes increasingly important. In this work we consider how information and architectural constraints can be combined with ideas from the probabilistic modeling literature to learn behavior priors…

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: Submitted to Journal of Machine Learning Research (JMLR)

  24. arXiv:2008.06867

    eess.AS cs.CL cs.SD

    Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder

    Authors: Hyun-Wook Yoon, Sang-Hoon Lee, Hyeong-Rae Noh, Seong-Whan Lee

    Abstract: In recent works, a flow-based neural vocoder has shown significant improvement in real-time speech generation task. The sequence of invertible flow operations allows the model to convert samples from simple distribution to audio samples. However, training a continuous density model on discrete audio data can degrade model performance due to the topological difference between latent and actual dist…

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted in INTERSPEECH2020

  25. arXiv:2006.03641

    cs.CV

    Knowledge transfer between bridges for drive-by monitoring using adversarial and multi-task learning

    Authors: Jingxiao Liu, Mario Bergés, Jacobo Bielak, Hae Young Noh

    Abstract: Monitoring bridge health using the vibrations of drive-by vehicles has various benefits, such as low cost and no need for direct installation or on-site maintenance of equipment on the bridge. However, many such approaches require labeled data from every bridge, which is expensive and time-consuming, if not impossible, to obtain. This is further exacerbated by having multiple diagnostic tasks, suc…

    Submitted 5 June, 2020; originally announced June 2020.

  26. arXiv:2005.14038

    cs.DC

    HetPipe: Enabling Large DNN Training on (Whimpy) Heterogeneous GPU Clusters through Integration of Pipelined Model Parallelism and Data Parallelism

    Authors: Jay H. Park, Gyeongchan Yun, Chang M. Yi, Nguyen T. Nguyen, Seungmin Lee, Jaesik Choi, Sam H. Noh, Young-ri Choi

    Abstract: Deep Neural Network (DNN) models have continuously been growing in size in order to improve the accuracy and quality of the models. Moreover, for training of large DNN models, the use of heterogeneous GPUs is inevitable due to the short release cycle of new GPU architectures. In this paper, we investigate how to enable training of large DNN models on a heterogeneous GPU cluster that possibly inclu…

    Submitted 28 May, 2020; originally announced May 2020.

  27. arXiv:2004.02822

    cs.CV

    LaNet: Real-time Lane Identification by Learning Road Surface Characteristics from Accelerometer Data

    Authors: Madhumitha Harishankar, Jun Han, Sai Vineeth Kalluru Srinivas, Faisal Alqarni, Shi Su, Shijia Pan, Hae Young Noh, Pei Zhang, Marco Gruteser, Patrick Tague

    Abstract: The resolution of GPS measurements, especially in urban areas, is insufficient for identifying a vehicle's lane. In this work, we develop a deep LSTM neural network model LaNet that determines the lane vehicles are on by periodically classifying accelerometer samples collected by vehicles as they drive in real time. Our key finding is that even adjacent patches of road surfaces contain characteris…

    Submitted 6 April, 2020; originally announced April 2020.

  28. arXiv:2002.02105

    cs.CE cs.LG eess.SP physics.app-ph

    Damage-sensitive and domain-invariant feature extraction for vehicle-vibration-based bridge health monitoring

    Authors: Jingxiao Liu, Bingqing Chen, Siheng Chen, Mario Berges, Jacobo Bielak, HaeYoung Noh

    Abstract: We introduce a physics-guided signal processing approach to extract a damage-sensitive and domain-invariant (DS & DI) feature from acceleration response data of a vehicle traveling over a bridge to assess bridge health. Motivated by indirect sensing methods' benefits, such as low-cost and low-maintenance, vehicle-vibration-based bridge health monitoring has been studied to efficiently monitor brid…

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: To appear in Proc. ICASSP2020, May 04-08, 2020, Barcelona, Spain. IEEE

    MSC Class: 68T10 (Primary); 37N20 (Secondary) ACM Class: I.5.4; J.2

  29. arXiv:1911.11170

    cs.CV

    Real-Time Object Tracking via Meta-Learning: Efficient Model Adaptation and One-Shot Channel Pruning

    Authors: Ilchae Jung, Kihyun You, Hyeonwoo Noh, Minsu Cho, Bohyung Han

    Abstract: We propose a novel meta-learning framework for real-time object tracking with efficient model adaptation and channel pruning. Given an object tracker, our framework learns to fine-tune its model parameters in only a few iterations of gradient-descent during tracking while pruning its network channels using the target ground-truth at the first frame. Such a learning problem is formulated as a meta-…

    Submitted 4 December, 2019; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 9 pages, 5 figures, AAAI 2020 accepted

  30. arXiv:1909.09595

    cs.CL cs.LG cs.NE

    SANVis: Visual Analytics for Understanding Self-Attention Networks

    Authors: Cheonbok Park, Inyoup Na, Yongjang Jo, Sungbok Shin, Jaehyo Yoo, Bum Chul Kwon, Jian Zhao, Hyungjong Noh, Yeonsoo Lee, Jaegul Choo

    Abstract: Attention networks, a deep neural network architecture inspired by humans' attention mechanism, have seen significant success in image captioning, machine translation, and many other applications. Recently, they have been further evolved into an advanced approach called multi-head self-attention networks, which can encode a set of input vectors, e.g., word vectors in a sentence, into another set o…

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: VAST Short - IEEE VIS 2019

  31. arXiv:1908.10508

    cs.LG cs.CV eess.IV stat.ML

    O-MedAL: Online Active Deep Learning for Medical Image Analysis

    Authors: Asim Smailagic, Pedro Costa, Alex Gaudio, Kartik Khandelwal, Mostafa Mirshekari, Jonathon Fagert, Devesh Walawalkar, Susu Xu, Adrian Galdran, Pei Zhang, Aurélio Campilho, Hae Young Noh

    Abstract: Active Learning methods create an optimized labeled training set from unlabeled data. We introduce a novel Online Active Deep Learning method for Medical Image Analysis. We extend our MedAL active learning framework to present new results in this paper. Our novel sampling method queries the unlabeled examples that maximize the average distance to all training set examples. Our online method enhanc…

    Submitted 27 July, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: Code: https://github.com/adgaudio/o-medal ; Accepted and published by Wiley Journal of Pattern Recognition and Knowledge Discovery ; Journal URL: https://doi.org/10.1002/widm.1353

    Journal ref: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 10.4 (2020): e1353

  32. arXiv:1903.07438

    cs.LG stat.ML

    Exploiting Hierarchy for Learning and Transfer in KL-regularized RL

    Authors: Dhruva Tirumala, Hyeonwoo Noh, Alexandre Galashov, Leonard Hasenclever, Arun Ahuja, Greg Wayne, Razvan Pascanu, Yee Whye Teh, Nicolas Heess

    Abstract: As reinforcement learning agents are tasked with solving more challenging and diverse tasks, the ability to incorporate prior knowledge into the learning system and to exploit reusable structure in solution space is likely to become increasingly important. The KL-regularized expected reward objective constitutes one possible tool to this end. It introduces an additional component, a default or pri…

    Submitted 23 January, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

  33. arXiv:1901.05803

    cs.DC

    Accelerated Training for CNN Distributed Deep Learning through Automatic Resource-Aware Layer Placement

    Authors: Jay H. Park, Sunghwan Kim, Jinwon Lee, Myeongjae Jeon, Sam H. Noh

    Abstract: The Convolutional Neural Network (CNN) model, often used for image classification, requires significant training time to obtain high accuracy. To this end, distributed training is performed with the parameter server (PS) architecture using multiple servers. Unfortunately, scalability has been found to be poor in existing architectures. We find that the PS network is the bottleneck as it communicat…

    Submitted 17 January, 2019; originally announced January 2019.

  34. arXiv:1810.02358

    cs.LG cs.CL cs.CV stat.ML

    Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

    Authors: Hyeonwoo Noh, Taehoon Kim, Jonghwan Mun, Bohyung Han

    Abstract: We study how to leverage off-the-shelf visual and linguistic data to cope with out-of-vocabulary answers in visual question answering task. Existing large-scale visual datasets with annotations such as image class labels, bounding boxes and region descriptions are good sources for learning rich and diverse visual concepts. However, it is not straightforward how the visual concepts can be captured…

    Submitted 7 April, 2019; v1 submitted 3 October, 2018; originally announced October 2018.

    Comments: CVPR 2019

  35. arXiv:1809.09287

    cs.CV

    MedAL: Deep Active Learning Sampling Method for Medical Image Analysis

    Authors: Asim Smailagic, Hae Young Noh, Pedro Costa, Devesh Walawalkar, Kartik Khandelwal, Mostafa Mirshekari, Jonathon Fagert, Adrián Galdrán, Susu Xu

    Abstract: Deep learning models have been successfully used in medical image analysis problems but they require a large amount of labeled images to obtain good performance. However, such large labeled datasets are costly to acquire. Active learning t…

    Submitted 28 September, 2018; v1 submitted 24 September, 2018; originally announced September 2018.

    Comments: Accepted as conference paper for ICMLA 2018

  36. arXiv:1809.05161

    cs.GT

    An Incentive Mechanism for Crowd Sensing with Colluding Agents

    Authors: Susu Xu, Weiguang Mao, Yue Cao, Hae Young Noh, Nihar B. Shah

    Abstract: Vehicular mobile crowd sensing is a fast-emerging paradigm to collect data about the environment by mounting sensors on vehicles such as taxis. An important problem in vehicular crowd sensing is to design payment mechanisms to incentivize drivers (agents) to collect data, with the overall goal of obtaining the maximum amount of data (across multiple vehicles) for a given budget. Past works on this…

    Submitted 13 September, 2018; originally announced September 2018.

  37. arXiv:1807.07964

    cs.CL cs.AI

    Question-Aware Sentence Gating Networks for Question and Answering

    Authors: Minjeong Kim, David Keetae Park, Hyungjong Noh, Yeonsoo Lee, Jaegul Choo

    Abstract: Machine comprehension question answering, which finds an answer to the question given a passage, involves high-level reasoning processes of understanding and tracking the relevant contents across various semantic units such as words, phrases, and sentences in a document. This paper proposes the novel question-aware sentence gating networks that directly incorporate the sentence-level information i…

    Submitted 20 July, 2018; originally announced July 2018.

  38. arXiv:1710.05179

    cs.LG cs.CV

    Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization

    Authors: Hyeonwoo Noh, Tackgeun You, Jonghwan Mun, Bohyung Han

    Abstract: Overfitting is one of the most critical challenges in deep neural networks, and there are various types of regularization methods to improve generalization performance. Injecting noises to hidden units during training, e.g., dropout, is known as a successful regularizer, but it is still not clear enough why such training techniques work well in practice and how we can maximize their benefit in the…

    Submitted 9 November, 2017; v1 submitted 14 October, 2017; originally announced October 2017.

    Comments: NIPS 2017 camera ready

  39. arXiv:1612.06321

    cs.CV

    Large-Scale Image Retrieval with Attentive Deep Local Features

    Authors: Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, Bohyung Han

    Abstract: We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELF (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for keypoint selecti…

    Submitted 2 February, 2018; v1 submitted 19 December, 2016; originally announced December 2016.

    Comments: ICCV 2017. Code and dataset available: https://github.com/tensorflow/models/tree/master/research/delf

  40. arXiv:1606.03647

    cs.CV

    Training Recurrent Answering Units with Joint Loss Minimization for VQA

    Authors: Hyeonwoo Noh, Bohyung Han

    Abstract: We propose a novel algorithm for visual question answering based on a recurrent deep neural network, where every module in the network corresponds to a complete answering unit with attention mechanism by itself. The network is optimized by minimizing loss aggregated from all the units, which share model parameters while receiving different information to compute attention probability. For training…

    Submitted 29 September, 2016; v1 submitted 11 June, 2016; originally announced June 2016.

  41. arXiv:1511.05756

    cs.CV cs.CL cs.LG

    Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction

    Authors: Hyeonwoo Noh, Paul Hongsuck Seo, Bohyung Han

    Abstract: We tackle image question answering (ImageQA) problem by learning a convolutional neural network (CNN) with a dynamic parameter layer whose weights are determined adaptively based on questions. For the adaptive parameter prediction, we employ a separate parameter prediction network, which consists of gated recurrent unit (GRU) taking a question as its input and a fully-connected layer generating a…

    Submitted 18 November, 2015; originally announced November 2015.

  42. Automated Synchronization of Driving Data Using Vibration and Steering Events

    Authors: Lex Fridman, Daniel E Brown, William Angell, Irman Abdić, Bryan Reimer, Hae Young Noh

    Abstract: We propose a method for automated synchronization of vehicle sensors useful for the study of multi-modal driver behavior and for the design of advanced driver assistance systems. Multi-sensor decision fusion relies on synchronized data streams in (1) the offline supervised learning context and (2) the online prediction context. In practice, such data streams are often out of sync due to the absenc…

    Submitted 1 March, 2016; v1 submitted 20 October, 2015; originally announced October 2015.

    Comments: Accepted for Publication in Elsevier Pattern Recognition Letters

  43. arXiv:1506.04924

    cs.CV

    Decoupled Deep Neural Network for Semi-supervised Semantic Segmentation

    Authors: Seunghoon Hong, Hyeonwoo Noh, Bohyung Han

    Abstract: We propose a novel deep neural network architecture for semi-supervised semantic segmentation using heterogeneous annotations. Contrary to existing approaches posing semantic segmentation as a single task of region-based classification, our algorithm decouples classification and segmentation, and learns a separate network for each task. In this architecture, labels associated with an image are ide…

    Submitted 17 June, 2015; v1 submitted 16 June, 2015; originally announced June 2015.

    Comments: Added a link to the project page for more comprehensive illustration of results

  44. arXiv:1505.04366

    cs.CV

    Learning Deconvolution Network for Semantic Segmentation

    Authors: Hyeonwoo Noh, Seunghoon Hong, Bohyung Han

    Abstract: We propose a novel semantic segmentation algorithm by learning a deconvolution network. We learn the network on top of the convolutional layers adopted from VGG 16-layer net. The deconvolution network is composed of deconvolution and unpooling layers, which identify pixel-wise class labels and predict segmentation masks. We apply the trained network to each proposal in an input image, and construc…

    Submitted 17 May, 2015; originally announced May 2015.