Skip to main content

Showing 1–50 of 64 results for author: Yoon, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10296  [pdf, other

    cs.CL cs.AI cs.CY

    CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer

    Authors: Heeseok Jung, Jaesang Yoo, Yohaan Yoon, Yeonju Jang

    Abstract: Knowledge tracing (KT), wherein students' problem-solving histories are used to estimate their current levels of knowledge, has attracted significant interest from researchers. However, most existing KT models were developed with an ID-based paradigm, which exhibits limitations in cold-start performance. These limitations can be mitigated by leveraging the vast quantities of external knowledge pos… ▽ More

    Submitted 17 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2405.19691  [pdf, other

    cs.HC

    Designing Prompt Analytics Dashboards to Analyze Student-ChatGPT Interactions in EFL Writing

    Authors: Minsun Kim, SeonGyeom Kim, Suyoun Lee, Yoosang Yoon, Junho Myung, Haneul Yoo, Hyungseung Lim, Jieun Han, Yoonsu Kim, So-Yeon Ahn, Juho Kim, Alice Oh, Hwajung Hong, Tak Yeon Lee

    Abstract: While ChatGPT has significantly impacted education by offering personalized resources for students, its integration into educational settings poses unprecedented risks, such as inaccuracies and biases in AI-generated content, plagiarism and over-reliance on AI, and privacy and security issues. To help teachers address such risks, we conducted a two-phase iterative design process that comprises sur… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  4. arXiv:2403.18277  [pdf, other

    cs.CL

    BlendX: Complex Multi-Intent Detection with Blended Patterns

    Authors: Ye** Yoon, Jungyeon Lee, Kangsan Kim, Chanhee Park, Taeuk Kim

    Abstract: Task-oriented dialogue (TOD) systems are commonly designed with the presumption that each utterance represents a single intent. However, this assumption may not accurately reflect real-world situations, where users frequently express multiple intents within a single utterance. While there is an emerging interest in multi-intent detection (MID), existing in-domain datasets such as MixATIS and MixSN… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING2024

  5. arXiv:2402.11159  [pdf, other

    cs.CL cs.CV

    Assessing News Thumbnail Representativeness: Counterfactual text can enhance the cross-modal matching ability

    Authors: Yejun Yoon, Seunghyun Yoon, Kunwoo Park

    Abstract: This paper addresses the critical challenge of assessing the representativeness of news thumbnail images, which often serve as the first visual engagement for readers when an article is disseminated on social media. We focus on whether a news image represents the actors discussed in the news text. To serve the challenge, we introduce NewsTT, a manually annotated dataset of 1000 news thumbnail imag… ▽ More

    Submitted 6 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL 2024 (findings), 16 pages

  6. arXiv:2402.08178  [pdf, other

    cs.AI

    LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents

    Authors: Jae-Woo Choi, Youngwoo Yoon, Hyobin Ong, Jaehong Kim, Minsu Jang

    Abstract: Large language models (LLMs) have recently received considerable attention as alternative solutions for task planning. However, comparing the performance of language-oriented task planners becomes difficult, and there exists a dearth of detailed exploration regarding the effects of various factors such as pre-trained model selection and prompt construction. To address this, we propose a benchmark… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: ICLR 2024. Code: https://github.com/lbaa2022/LLMTaskPlanning

  7. arXiv:2401.15938  [pdf, other

    cs.CV eess.SY

    Motion-induced error reduction for high-speed dynamic digital fringe projection system

    Authors: Sanghoon Jeon, Hyo-Geon Lee, Jae-Sung Lee, Bo-Min Kang, Byung-Wook Jeon, Jun Young Yoon, Jae-Sang Hyun

    Abstract: In phase-shifting profilometry (PSP), any motion during the acquisition of fringe patterns can introduce errors because it assumes both the object and measurement system are stationary. Therefore, we propose a method to pixel-wise reduce the errors when the measurement system is in motion due to a motorized linear stage. The proposed method introduces motion-induced error reduction algorithm, whic… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 9 pages, 7 figures

  8. arXiv:2312.03005  [pdf, other

    cs.LG cs.CV

    Few-Shot Anomaly Detection with Adversarial Loss for Robust Feature Representations

    Authors: Jae Young Lee, Wonjun Lee, Jaehyun Choi, Yongkwi Lee, Young Seog Yoon

    Abstract: Anomaly detection is a critical and challenging task that aims to identify data points deviating from normal patterns and distributions within a dataset. Various methods have been proposed using a one-class-one-model approach, but these techniques often face practical problems such as memory inefficiency and the requirement of sufficient data for training. In particular, few-shot anomaly detection… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: BMVC 2023

  9. arXiv:2311.08439  [pdf, other

    eess.IV cs.CV cs.LG

    A Unified Approach for Comprehensive Analysis of Various Spectral and Tissue Doppler Echocardiography

    Authors: Jaeik Jeon, Jiyeon Kim, Yeonggul Jang, Yeonyee E. Yoon, Dawun Jeong, Youngtaek Hong, Seung-Ah Lee, Hyuk-Jae Chang

    Abstract: Doppler echocardiography offers critical insights into cardiac function and phases by quantifying blood flow velocities and evaluating myocardial motion. However, previous methods for automating Doppler analysis, ranging from initial signal processing techniques to advanced deep learning approaches, have been constrained by their reliance on electrocardiogram (ECG) data and their inability to proc… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  10. arXiv:2310.11651  [pdf, other

    eess.SY cs.CR

    US Microelectronics Packaging Ecosystem: Challenges and Opportunities

    Authors: Rouhan Noor, Himanandhan Reddy Kottur, Patrick J Craig, Liton Kumar Biswas, M Shafkat M Khan, Nitin Varshney, Hamed Dalir, Elif Akçalı, Bahareh Ghane Motlagh, Charles Woychik, Yong-Kyu Yoon, Navid Asadizanjani

    Abstract: The semiconductor industry is experiencing a significant shift from traditional methods of shrinking devices and reducing costs. Chip designers actively seek new technological solutions to enhance cost-effectiveness while incorporating more features into the silicon footprint. One promising approach is Heterogeneous Integration (HI), which involves advanced packaging techniques to integrate indepe… ▽ More

    Submitted 30 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 22 pages, 8 figures

  11. arXiv:2310.08897  [pdf, other

    eess.IV cs.CV cs.LG

    Self supervised convolutional kernel based handcrafted feature harmonization: Enhanced left ventricle hypertension disease phenoty** on echocardiography

    Authors: **a Lee, Youngtaek Hong, Dawun Jeong, Yeonggul Jang, Jaeik Jeon, Sihyeon Jeong, Taekgeun Jung, Yeonyee E. Yoon, Inki Moon, Seung-Ah Lee, Hyuk-Jae Chang

    Abstract: Radiomics, a medical imaging technique, extracts quantitative handcrafted features from images to predict diseases. Harmonization in those features ensures consistent feature extraction across various imaging devices and protocols. Methods for harmonization include standardized imaging protocols, statistical adjustments, and evaluating feature robustness. Myocardial diseases such as Left Ventricul… ▽ More

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 11 pages, 7 figures

  12. arXiv:2308.16483  [pdf, other

    eess.SP cs.HC cs.LG

    Improving Out-of-Distribution Detection in Echocardiographic View Classication through Enhancing Semantic Features

    Authors: Jaeik Jeon, Seongmin Ha, Yeonggul Jang, Yeonyee E. Yoon, Jiyeon Kim, Hyunseok Jeong, Dawun Jeong, Youngtaek Hong, Seung-Ah Lee Hyuk-Jae Chang

    Abstract: In echocardiographic view classification, accurately detecting out-of-distribution (OOD) data is essential but challenging, especially given the subtle differences between in-distribution and OOD data. While conventional OOD detection methods, such as Mahalanobis distance (MD) are effective in far-OOD scenarios with clear distinctions between distributions, they struggle to discern the less obviou… ▽ More

    Submitted 23 November, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  13. arXiv:2308.12646  [pdf, other

    cs.HC cs.GR cs.LG

    The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settings

    Authors: Taras Kucherenko, Rajmund Nagy, Youngwoo Yoon, Jieyeon Woo, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

    Abstract: This paper reports on the GENEA Challenge 2023, in which participating teams built speech-driven gesture-generation systems using the same speech and motion dataset, followed by a joint evaluation. This year's challenge provided data on both sides of a dyadic interaction, allowing teams to generate full-body motion for an agent given its speech (text and audio) and the speech and motion of the int… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: The first three authors made equal contributions. Accepted for publication at the ACM International Conference on Multimodal Interaction (ICMI)

    ACM Class: I.3; I.2

  14. arXiv:2308.11901  [pdf, other

    cs.CV

    Camera-Driven Representation Learning for Unsupervised Domain Adaptive Person Re-identification

    Authors: Geon Lee, Sanghoon Lee, Dohyung Kim, Younghoon Shin, Yongsang Yoon, Bumsub Ham

    Abstract: We present a novel unsupervised domain adaption method for person re-identification (reID) that generalizes a model trained on a labeled source domain to an unlabeled target domain. We introduce a camera-driven curriculum learning (CaCL) framework that leverages camera labels of person images to transfer knowledge from source to target domains progressively. To this end, we divide target domain da… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  15. arXiv:2304.10768  [pdf, other

    cs.PL

    Inductive Program Synthesis via Iterative Forward-Backward Abstract Interpretation

    Authors: Yongho Yoon, Woosuk Lee, Kwangkeun Yi

    Abstract: A key challenge in example-based program synthesis is the gigantic search space of programs. To address this challenge, various work proposed to use abstract interpretation to prune the search space. However, most of existing approaches have focused only on forward abstract interpretation, and thus cannot fully exploit the power of abstract interpretation. In this paper, we propose a novel approac… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  16. arXiv:2303.12822  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Co-Speech Gesture Synthesis using Discrete Gesture Token Learning

    Authors: Shuhong Lu, Youngwoo Yoon, Andrew Feng

    Abstract: Synthesizing realistic co-speech gestures is an important and yet unsolved problem for creating believable motions that can drive a humanoid robot to interact and communicate with human users. Such capability will improve the impressions of the robots by human users and will find applications in education, training, and medical services. One challenge in learning the co-speech gesture model is tha… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 3 tables

  17. arXiv:2303.08737  [pdf, other

    cs.HC cs.LG cs.MM

    Evaluating gesture generation in a large-scale open challenge: The GENEA Challenge 2022

    Authors: Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

    Abstract: This paper reports on the second GENEA Challenge to benchmark data-driven automatic co-speech gesture generation. Participating teams used the same speech and motion dataset to build gesture-generation systems. Motion generated by all these systems was rendered to video using a standardised visualisation pipeline and evaluated in several large, crowdsourced user studies. Unlike when comparing diff… ▽ More

    Submitted 28 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: The first three authors made equal contributions and share joint first authorship. Accepted for publication in the ACM Transactions on Graphics (TOG).Please see https://youngwoo-yoon.github.io/GENEAchallenge2022/ for all challenge materials. arXiv admin note: text overlap with arXiv:2208.10441

    ACM Class: I.3; I.2

  18. arXiv:2210.17302  [pdf, other

    cs.RO eess.SY

    Design, Field Evaluation, and Traffic Analysis of a Competitive Autonomous Driving Model in a Congested Environment

    Authors: Daegyu Lee, Hyunki Seong, Seungil Han, Gyuree Kang, D. Hyunchul Shim, Yoon** Yoon

    Abstract: Recently, numerous studies have investigated cooperative traffic systems using the communication among vehicle-to-everything (V2X). Unfortunately, when multiple autonomous vehicles are deployed while exposed to communication failure, there might be a conflict of ideal conditions between various autonomous vehicles leading to adversarial situation on the roads. In South Korea, virtual and real-worl… ▽ More

    Submitted 6 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

  19. arXiv:2208.10441  [pdf, other

    cs.HC cs.GR cs.LG cs.MM cs.SD eess.AS

    The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation

    Authors: Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

    Abstract: This paper reports on the second GENEA Challenge to benchmark data-driven automatic co-speech gesture generation. Participating teams used the same speech and motion dataset to build gesture-generation systems. Motion generated by all these systems was rendered to video using a standardised visualisation pipeline and evaluated in several large, crowdsourced user studies. Unlike when comparing diff… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 12 pages, 5 figures; final version for ACM ICMI 2022

    ACM Class: I.3; I.2

  20. arXiv:2207.05297  [pdf, other

    cs.CR cs.LG cs.NI

    Efficient and Privacy Preserving Group Signature for Federated Learning

    Authors: Sneha Kanchan, Jae Won Jang, Jun Yong Yoon, Bong Jun Choi

    Abstract: Federated Learning (FL) is a Machine Learning (ML) technique that aims to reduce the threats to user data privacy. Training is done using the raw data on the users' device, called clients, and only the training results, called gradients, are sent to the server to be aggregated and generate an updated model. However, we cannot assume that the server can be trusted with private information, such as… ▽ More

    Submitted 15 July, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

  21. arXiv:2207.02182  [pdf, other

    cs.CV cs.LG

    ST-CoNAL: Consistency-Based Acquisition Criterion Using Temporal Self-Ensemble for Active Learning

    Authors: Jae Soon Baik, In Young Yoon, Jun Won Choi

    Abstract: Modern deep learning has achieved great success in various fields. However, it requires the labeling of huge amounts of data, which is expensive and labor-intensive. Active learning (AL), which identifies the most informative samples to be labeled, is becoming increasingly important to maximize the efficiency of the training process. The existing AL methods mostly use only a single final fixed mod… ▽ More

    Submitted 16 October, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

  22. DBN-Mix: Training Dual Branch Network Using Bilateral Mixup Augmentation for Long-Tailed Visual Recognition

    Authors: Jae Soon Baik, In Young Yoon, Jun Won Choi

    Abstract: There is growing interest in the challenging visual perception task of learning from long-tailed class distributions. The extreme class imbalance in the training dataset biases the model to prefer recognizing majority class data over minority class data. Furthermore, the lack of diversity in minority class samples makes it difficult to find a good representation. In this paper, we propose an effec… ▽ More

    Submitted 20 August, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

  23. arXiv:2205.15567  [pdf, ps, other

    cs.LG

    Few-Shot Unlearning by Model Inversion

    Authors: Youngsik Yoon, **hwan Nam, Hyojeong Yun, Jaeho Lee, Dongwoo Kim, Jungseul Ok

    Abstract: We consider a practical scenario of machine unlearning to erase a target dataset, which causes unexpected behavior from the trained model. The target dataset is often assumed to be fully identifiable in a standard unlearning scenario. Such a flawless identification, however, is almost impossible if the training dataset is inaccessible at the time of unlearning. Unlike previous approaches requiring… ▽ More

    Submitted 14 March, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

  24. arXiv:2204.12318  [pdf, other

    cs.CV

    Evaluating the Quality of a Synthesized Motion with the Fréchet Motion Distance

    Authors: Antoine Maiorca, Youngwoo Yoon, Thierry Dutoit

    Abstract: Evaluating the Quality of a Synthesized Motion with the Fréchet Motion Distance

    Submitted 27 April, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: 2 pages, 2 figures

  25. arXiv:2204.05533  [pdf, other

    cs.CL cs.SI

    How does fake news use a thumbnail? CLIP-based Multimodal Detection on the Unrepresentative News Image

    Authors: Hyewon Choi, Yejun Yoon, Seunghyun Yoon, Kunwoo Park

    Abstract: This study investigates how fake news uses a thumbnail for a news article with a focus on whether a news article's thumbnail represents the news content correctly. A news article shared with an irrelevant thumbnail can mislead readers into having a wrong impression of the issue, especially in social media environments where users are less likely to click the link and consume the entire content. We… ▽ More

    Submitted 27 April, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: 9 pages, 8 figures including appendix figure, 2 tables. Published in Findings of ACL workshop, CONSTRAINT 2022 (Long paper). The manuscript is slightly revised after the camera ready version

  26. arXiv:2203.16518  [pdf, other

    cs.CV cs.AI cs.LG

    Collaborative Transformers for Grounded Situation Recognition

    Authors: Junhyeong Cho, Youngseok Yoon, Suha Kwak

    Abstract: Grounded situation recognition is the task of predicting the main activity, entities playing certain roles within the activity, and bounding-box groundings of the entities in the given image. To effectively deal with this challenging task, we introduce a novel approach where the two processes for activity classification and entity estimation are interactive and complementary. To implement this ide… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022, Code: https://github.com/jhcho99/CoFormer

  27. arXiv:2203.01495  [pdf

    cs.CR

    Disperse rotation operator DRT and use in some stream ciphers

    Authors: Yong-** Kim, Yong-Ho Yon, Son-Gyong Kim

    Abstract: The rotation operator is frequently used in several stream ciphers, including HC-128, Rabbit, and Salsa20, the final candidates for eSTREAM. This is because the rotation operator (ROT) is simple but has very good dispersibility. In this paper, we propose a disperse rotation operator (DRT), which has the same structure as ROT but has better dispersibility. In addition, the use of DRT instead of ROT… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 12 pages, 1 figures, 20 tables

    ACM Class: K.6.5; D.2.7

  28. arXiv:2202.09021  [pdf, other

    cs.LG

    Effective Urban Region Representation Learning Using Heterogeneous Urban Graph Attention Network (HUGAT)

    Authors: Namwoo Kim, Yoon** Yoon

    Abstract: Revealing the hidden patterns sha** the urban environment is essential to understand its dynamics and to make cities smarter. Recent studies have demonstrated that learning the representations of urban regions can be an effective strategy to uncover the intrinsic characteristics of urban areas. However, existing studies lack in incorporating diversity in urban data sources. In this work, we prop… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 9 pages, 6 figures

  29. PGCN: Progressive Graph Convolutional Networks for Spatial-Temporal Traffic Forecasting

    Authors: Yuyol Shin, Yoon** Yoon

    Abstract: The complex spatial-temporal correlations in transportation networks make the traffic forecasting problem challenging. Since transportation system inherently possesses graph structures, many research efforts have been put with graph neural networks. Recently, constructing adaptive graphs to the data has shown promising results over the models relying on a single static graph structure. However, th… ▽ More

    Submitted 21 March, 2024; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 12 pages, 6 figures

    Journal ref: IEEE Transactions on Intelligent Transportation Systems (2024) 1-12

  30. arXiv:2112.06536  [pdf, other

    cs.CV

    SphereSR: 360° Image Super-Resolution with Arbitrary Projection via Continuous Spherical Image Representation

    Authors: Youngho Yoon, Inchul Chung, Lin Wang, Kuk-** Yoon

    Abstract: The 360°imaging has recently gained great attention; however, its angular resolution is relatively lower than that of a narrow field-of-view (FOV) perspective image as it is captured by using fisheye lenses with the same sensor size. Therefore, it is beneficial to super-resolve a 360°image. Some attempts have been made but mostly considered the equirectangular projection (ERP) as one of the way fo… ▽ More

    Submitted 13 December, 2021; v1 submitted 13 December, 2021; originally announced December 2021.

  31. arXiv:2112.06171  [pdf, other

    cs.CV

    Pixel-wise Deep Image Stitching

    Authors: Hyeokjun Kweon, Hyeonseong Kim, Yoonsu Kang, Youngho Yoon, Wooseong Jeong, Kuk-** Yoon

    Abstract: Image stitching aims at stitching the images taken from different viewpoints into an image with a wider field of view. Existing methods warp the target image to the reference image using the estimated warp function, and a homography is one of the most commonly used war** functions. However, when images have large parallax due to non-planar scenes and translational motion of a camera, the homogra… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

  32. arXiv:2111.11647  [pdf, other

    cs.AI cs.LG cs.NE

    Inducing Functions through Reinforcement Learning without Task Specification

    Authors: Junmo Cho, Dong-Hwan Lee, Young-Gyu Yoon

    Abstract: We report a bio-inspired framework for training a neural network through reinforcement learning to induce high level functions within the network. Based on the interpretation that animals have gained their cognitive functions such as object recognition - without ever being specifically trained for - as a result of maximizing their fitness to the environment, we place our agent in an environment wh… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 14 pages

  33. arXiv:2111.10135  [pdf, other

    cs.CV cs.AI cs.LG

    Grounded Situation Recognition with Transformers

    Authors: Junhyeong Cho, Youngseok Yoon, Hyeonjun Lee, Suha Kwak

    Abstract: Grounded Situation Recognition (GSR) is the task that not only classifies a salient action (verb), but also predicts entities (nouns) associated with semantic roles and their locations in the given image. Inspired by the remarkable success of Transformers in vision tasks, we propose a GSR model based on a Transformer encoder-decoder architecture. The attention mechanism of our model enables accura… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: Accepted to BMVC 2021, Code: https://github.com/jhcho99/gsrtr

  34. arXiv:2111.07513  [pdf

    cs.LG stat.ML

    A Comparative Study on Basic Elements of Deep Learning Models for Spatial-Temporal Traffic Forecasting

    Authors: Yuyol Shin, Yoon** Yoon

    Abstract: Traffic forecasting plays a crucial role in intelligent transportation systems. The spatial-temporal complexities in transportation networks make the problem especially challenging. The recently suggested deep learning models share basic elements such as graph convolution, graph attention, recurrent units, and/or attention mechanism. In this study, we designed an in-depth comparative study for fou… ▽ More

    Submitted 22 March, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: 14 pages, 4 figures, 3 Tables, This paper is accepted for AAAI-22 Workshop: AI for Transportation

  35. SGToolkit: An Interactive Gesture Authoring Toolkit for Embodied Conversational Agents

    Authors: Youngwoo Yoon, Keunwoo Park, Minsu Jang, Jaehong Kim, Geehyuk Lee

    Abstract: Non-verbal behavior is essential for embodied agents like social robots, virtual avatars, and digital humans. Existing behavior authoring approaches including keyframe animation and motion capture are too expensive to use when there are numerous utterances requiring gestures. Automatic generation methods show promising results, but their output quality is not satisfactory yet, and it is hard to mo… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: Accepted to UIST'21

  36. arXiv:2102.11617  [pdf, other

    cs.HC cs.GR cs.MM

    A large, crowdsourced evaluation of gesture generation systems on common data: The GENEA Challenge 2020

    Authors: Taras Kucherenko, Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Gustav Eje Henter

    Abstract: Co-speech gestures, gestures that accompany speech, play an important role in human communication. Automatic co-speech gesture generation is thus a key enabling technology for embodied conversational agents (ECAs), since humans expect ECAs to be capable of multi-modal communication. Research into gesture generation is rapidly gravitating towards data-driven methods. Unfortunately, individual resea… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Accepted for publication at the 26th International Conference on Intelligent User Interfaces (IUI'21). 11 pages, 5 figures

    ACM Class: I.3; I.2

  37. HEMVIP: Human Evaluation of Multiple Videos in Parallel

    Authors: Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Gustav Eje Henter

    Abstract: In many research areas, for example motion and gesture generation, objective measures alone do not provide an accurate impression of key stimulus traits such as perceived quality or appropriateness. The gold standard is instead to evaluate these aspects through user studies, especially subjective evaluations of video stimuli. Common evaluation paradigms either present individual stimuli to be scor… ▽ More

    Submitted 20 October, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 6 pages, 1 figures. Proceedings of the 22th ACM International Conference on Multimodal Interaction. 2021. Montreal, Canada

  38. arXiv:2009.09073  [pdf

    cs.CY

    Running the COVID-19 marathon: the behavioral adaptations in mobility and facemask over 27 weeks of pandemic in Seoul, South Korea

    Authors: Jungwoo Cho, Yuyol Shin, Seyun Kim, Namwoo Kim, Soohwan Oh, Haechan Cho, Yoon** Yoon

    Abstract: Battle with COVID-19 turned out to be a marathon, not a sprint, and behavioral adjustments have been unavoidable to stay viable. In this paper, we employ a data-centric approach to investigate individual mobility adaptations and mask-wearing in Seoul, South Korea. We first identify six epidemic phases and two waves based on COVID-19 case count and its geospatial dispersion. The phase-specific line… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: 22 pages of manuscript, 19 pages of supplementary information

  39. arXiv:2009.02119  [pdf, other

    cs.GR cs.CV cs.HC

    Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity

    Authors: Youngwoo Yoon, Bok Cha, Joo-Haeng Lee, Minsu Jang, Jaeyeon Lee, Jaehong Kim, Geehyuk Lee

    Abstract: For human-like agents, including virtual avatars and social robots, making proper gestures while speaking is crucial in human--agent interaction. Co-speech gestures enhance interaction experiences and make the agents look alive. However, it is difficult to generate human-like gestures due to the lack of understanding of how people gesture. Data-driven approaches attempt to learn gesticulation skil… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

    Comments: 16 pages; ACM Transactions on Graphics (SIGGRAPH Asia 2020)

  40. arXiv:2009.00712  [pdf, other

    cs.LG cs.AI eess.SP

    Short-term Traffic Prediction with Deep Neural Networks: A Survey

    Authors: Kyungeun Lee, Moonjung Eo, Euna Jung, Yoon** Yoon, Wonjong Rhee

    Abstract: In modern transportation systems, an enormous amount of traffic data is generated every day. This has led to rapid progress in short-term traffic prediction (STTP), in which deep learning methods have recently been applied. In traffic networks with complex spatiotemporal relationships, deep neural networks (DNNs) often perform well because they are capable of automatically extracting the most impo… ▽ More

    Submitted 28 August, 2020; originally announced September 2020.

  41. arXiv:2009.00100  [pdf, other

    cs.CV cs.LG cs.MM

    Online Multi-Object Tracking and Segmentation with GMPHD Filter and Mask-based Affinity Fusion

    Authors: Young-min Song, Young-chul Yoon, Kwang** Yoon, Moongu Jeon, Seong-Whan Lee, Witold Pedrycz

    Abstract: In this paper, we propose a highly practical fully online multi-object tracking and segmentation (MOTS) method that uses instance segmentation results as an input. The proposed method is based on the Gaussian mixture probability hypothesis density (GMPHD) filter, a hierarchical data association (HDA), and a mask-based affinity fusion (MAF) model to achieve high-performance online tracking. The HDA… ▽ More

    Submitted 11 June, 2021; v1 submitted 31 August, 2020; originally announced September 2020.

  42. COVID-19 Mobility Data Collection of Seoul, South Korea

    Authors: Jungwoo Cho, Soohwan Oh, Seyun Kim, Namwoo Kim, Yuyol Shin, Haechan Cho, Yoon** Yoon

    Abstract: The relationship between pandemic and human mobility has received considerable attention from scholars, as it can provide an indication of how mobility patterns change in response to a public health crisis or whether reduced mobility contributes to preventing the spread of an infectious disease. While several studies attempted to unveil such relationship, no studies have focused on changes in huma… ▽ More

    Submitted 12 August, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

  43. arXiv:2005.03818  [pdf, other

    cs.HC cs.AI

    Choose Your Own Question: Encouraging Self-Personalization in Learning Path Construction

    Authors: Youngduck Choi, Yoonho Na, Youngjik Yoon, Jonghun Shin, Chan Bae, Hongseok Suh, Byungsoo Kim, Jaewe Heo

    Abstract: Learning Path Recommendation is the heart of adaptive learning, the educational paradigm of an Interactive Educational System (IES) providing a personalized learning experience based on the student's history of learning activities. In typical existing IESs, the student must fully consume a recommended learning item to be provided a new recommendation. This workflow comes with several limitations.… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

  44. arXiv:2004.14025  [pdf, other

    cs.AI cs.CL

    Multi-View Attention Network for Visual Dialog

    Authors: Sung** Park, Taesun Whang, Yeochan Yoon, Heuiseok Lim

    Abstract: Visual dialog is a challenging vision-language task in which a series of questions visually grounded by a given image are answered. To resolve the visual dialog task, a high-level understanding of various multimodal inputs (e.g., question, dialog history, and image) is required. Specifically, it is necessary for an agent to 1) determine the semantic intent of question and 2) align question-relevan… ▽ More

    Submitted 6 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

  45. arXiv:2003.07514  [pdf, other

    cs.CV cs.LG eess.IV

    Predictively Encoded Graph Convolutional Network for Noise-Robust Skeleton-based Action Recognition

    Authors: Jongmin Yu, Yongsang Yoon, Moongu Jeon

    Abstract: In skeleton-based action recognition, graph convolutional networks (GCNs), which model human body skeletons using graphical components such as nodes and connections, have achieved remarkable performance recently. However, current state-of-the-art methods for skeleton-based action recognition usually work on the assumption that the completely observed skeletons will be provided. This may be problem… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: Submitted to ECCV 2020

  46. arXiv:1912.03443  [pdf, other

    cs.DB

    Joins on Samples: A Theoretical Guide for Practitioners

    Authors: Dawei Huang, Dong Young Yoon, Seth Pettie, Barzan Mozafari

    Abstract: Despite decades of research on approximate query processing (AQP), our understanding of sample-based joins has remained limited and, to some extent, even superficial. The common belief in the community is that joining random samples is futile. This belief is largely based on an early result showing that the join of two uniform samples is not an independent sample of the original join, and that it… ▽ More

    Submitted 24 January, 2020; v1 submitted 7 December, 2019; originally announced December 2019.

    Comments: 19 pages

  47. Incorporating dynamicity of transportation network with multi-weight traffic graph convolutional network for traffic forecasting

    Authors: Yuyol Shin, Yoon** Yoon

    Abstract: Traffic forecasting problem remains a challenging task in the intelligent transportation system due to its spatio-temporal complexity. Although temporal dependency has been well studied and discussed, spatial dependency is relatively less explored due to its large variations, especially in the urban environment. In this study, a novel graph convolutional network model, Multi-Weight Traffic Graph C… ▽ More

    Submitted 26 May, 2021; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: 11 pages, 7 figures, Accepted to IEEE Transactions on Intelligent Transportation Systems (2020)

    MSC Class: 68T99

    Journal ref: IEEE Trans. Intell. Transp. Syst., 0 (2020) 1-11

  48. arXiv:1908.11060  [pdf, other

    cs.CV

    PopEval: A Character-Level Approach to End-To-End Evaluation Compatible with Word-Level Benchmark Dataset

    Authors: Hong-Seok Lee, Youngmin Yoon, Pil-Hoon Jang, Chankyu Choi

    Abstract: The most prevalent scope of interest for OCR applications used to be scanned documents, but it has now shifted towards the natural scene. Despite the change of times, the existing evaluation methods are still based on the old criteria suited better for the past interests. In this paper, we propose PopEval, a novel evaluation approach for the recent OCR interests. The new and past evaluation algori… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

    Comments: Accepted by ICDAR 2019

  49. Online Multi-Object Tracking Framework with the GMPHD Filter and Occlusion Group Management

    Authors: Young-min Song, Kwang** Yoon, Young-Chul Yoon, Kin-Choong Yow, Moongu Jeon

    Abstract: In this paper, we propose an efficient online multi-object tracking framework based on the GMPHD filter and occlusion group management scheme where the GMPHD filter utilizes hierarchical data association to reduce the false negatives caused by miss detection. The hierarchical data association consists of two steps: detection-to-track and track-to-track associations, which can recover the lost trac… ▽ More

    Submitted 31 July, 2019; originally announced July 2019.

    Comments: This paper includes 15 pages and 9 figures, and has been prepared for a journal (not yet submitted anywhere)

  50. arXiv:1907.01256  [pdf, other

    cs.CL cs.LG

    A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning

    Authors: Yo Joong Choe, Jiyeon Ham, Kyubyong Park, Yeoil Yoon

    Abstract: Grammatical error correction can be viewed as a low-resource sequence-to-sequence task, because publicly available parallel corpora are limited. To tackle this challenge, we first generate erroneous versions of large unannotated corpora using a realistic noising function. The resulting parallel corpora are subsequently used to pre-train Transformer models. Then, by sequentially applying transfer l… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: Accepted to ACL 2019 Workshop on Innovative Use of NLP for Building Educational Applications (BEA)