Skip to main content

Showing 1–50 of 82 results for author: Son, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16042  [pdf, other

    cs.CV

    Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification

    Authors: Inès Hyeonsu Kim, JoungBin Lee, Soowon Son, Woojeong **, Kyusun Cho, Junyoung Seo, Min-Seop Kwak, Seokju Cho, JeongYeol Baek, Byeongwon Lee, Seungryong Kim

    Abstract: Person re-identification (Re-ID) often faces challenges due to variations in human poses and camera viewpoints, which significantly affect the appearance of individuals across images. Existing datasets frequently lack diversity and scalability in these aspects, hindering the generalization of Re-ID models to new camera systems. Previous methods have attempted to address these issues through data a… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: The project page is available at https://ku-cvlab.github.io/Diff-ID/

  2. arXiv:2406.12721  [pdf

    eess.AS cs.SD

    Sound event detection based on auxiliary decoder and maximum probability aggregation for DCASE Challenge 2024 Task 4

    Authors: Sang Won Son, Jongyeon Park, Hong Kook Kim, Sulaiman Vesal, Jeong Eun Lim

    Abstract: In this report, we propose three novel methods for develo** a sound event detection (SED) model for the DCASE 2024 Challenge Task 4. First, we propose an auxiliary decoder attached to the final convolutional block to improve feature extraction capabilities while reducing dependency on embeddings from pre-trained large models. The proposed auxiliary decoder operates independently from the main de… ▽ More

    Submitted 24 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: DCASE 2024 challenge Task4, 4 pages

  3. arXiv:2406.12016  [pdf, other

    cs.LG

    Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization

    Authors: Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee

    Abstract: Despite recent advances in LLM quantization, activation quantization remains to be challenging due to the activation outliers. Conventional remedies, e.g., mixing precisions for different channels, introduce extra overhead and reduce the speedup. In this work, we develop a simple yet effective strategy to facilitate per-tensor activation quantization by preventing the generation of problematic tok… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2406.10809  [pdf, other

    cs.CL cs.AI

    Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

    Authors: Yoonna Jang, Suhyune Son, Jeongwoo Lee, Junyoung Son, Yuna Hur, Jungwoo Lim, Hyeonseok Moon, Kisu Yang, Heuiseok Lim

    Abstract: Despite the striking advances in recent language generation performance, model-generated responses have suffered from the chronic problem of hallucinations that are either untrue or unfaithful to a given source. Especially in the task of knowledge grounded conversation, the models are required to generate informative responses, but hallucinated utterances lead to miscommunication. In particular, e… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at EMNLP 2023

  5. arXiv:2406.01431  [pdf, other

    cs.RO

    Deep Stochastic Kinematic Models for Probabilistic Motion Forecasting in Traffic

    Authors: Laura Zheng, Sanghyun Son, **g Liang, Xijun Wang, Brian Clipp, Ming C. Lin

    Abstract: Kinematic priors have shown to be helpful in boosting generalization and performance in prior work on trajectory forecasting. Specifically, kinematic priors have been applied such that models predict a set of actions instead of future output trajectories. By unrolling predicted trajectories via time integration and models of kinematic dynamics, predicted trajectories are not only kinematically fea… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 8 pages

  6. arXiv:2405.09862  [pdf, other

    quant-ph cs.NI

    Performance of Quantum Networks Using Heterogeneous Link Architectures

    Authors: Kento Samuel Soon, Naphan Benchasattabuse, Michal Hajdušek, Kentaro Teramoto, Shota Nagayama, Rodney Van Meter

    Abstract: The heterogeneity of quantum link architectures is an essential theme in designing quantum networks for technological interoperability and possibly performance optimization. However, the performance of heterogeneously connected quantum links has not yet been addressed. Here, we investigate the integration of two inherently different technologies, with one link where the photons flow from the nodes… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 10 pages, 10 figures

  7. arXiv:2405.09861  [pdf, other

    quant-ph cs.NI

    An Implementation and Analysis of a Practical Quantum Link Architecture Utilizing Entangled Photon Sources

    Authors: Kento Samuel Soon, Michal Hajdušek, Shota Nagayama, Naphan Benchasattabuse, Kentaro Teramoto, Ryosuke Satoh, Rodney Van Meter

    Abstract: Quantum repeater networks play a crucial role in distributing entanglement. Various link architectures have been proposed to facilitate the creation of Bell pairs between distant nodes, with entangled photon sources emerging as a primary technology for building quantum networks. Our work advances the Memory-Source-Memory (MSM) link architecture, addressing the absence of practical implementation d… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 8 figures

  8. arXiv:2405.04537  [pdf, other

    cs.CV cs.AI cs.GR

    An intuitive multi-frequency feature representation for SO(3)-equivariant networks

    Authors: Dongwon Son, Jaehyung Kim, Sanghyeon Son, Beomjoon Kim

    Abstract: The usage of 3D vision algorithms, such as shape reconstruction, remains limited because they require inputs to be at a fixed canonical rotation. Recently, a simple equivariant network, Vector Neuron (VN) has been proposed that can be easily used with the state-of-the-art 3D neural network (NN) architectures. However, its performance is limited because it is designed to use only three-dimensional… ▽ More

    Submitted 15 March, 2024; originally announced May 2024.

    Comments: ICLR 2024

  9. arXiv:2404.13445  [pdf, other

    cs.CV cs.GR

    DMesh: A Differentiable Mesh Representation

    Authors: Sanghyun Son, Matheus Gadelha, Yang Zhou, Zexiang Xu, Ming C. Lin, Yi Zhou

    Abstract: We present a differentiable representation, DMesh, for general 3D triangular meshes. DMesh considers both the geometry and connectivity information of a mesh. In our design, we first get a set of convex tetrahedra that compactly tessellates the domain based on Weighted Delaunay Triangulation (WDT), and select triangular faces on the tetrahedra to define the final mesh. We formulate probability of… ▽ More

    Submitted 1 June, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: 35 pages, 22 figures. Updated with more analysis and experimental results

  10. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  11. arXiv:2403.16049  [pdf, other

    cs.LG physics.soc-ph

    Improving Demand Forecasting in Open Systems with Cartogram-Enhanced Deep Learning

    Authors: Sangjoon Park, Yongsung Kwon, Hyungjoon Soh, Mi ** Lee, Seung-Woo Son

    Abstract: Predicting temporal patterns across various domains poses significant challenges due to their nuanced and often nonlinear trajectories. To address this challenge, prediction frameworks have been continuously refined, employing data-driven statistical methods, mathematical models, and machine learning. Recently, as one of the challenging systems, shared transport systems such as public bicycles hav… ▽ More

    Submitted 26 May, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: 11 pages, 7 figures

  12. PECI-Net: Bolus segmentation from video fluoroscopic swallowing study images using preprocessing ensemble and cascaded inference

    Authors: Dougho Park, Younghun Kim, Harim Kang, Junmyeoung Lee, **young Choi, Taeyeon Kim, Sangeok Lee, Seokil Son, Minsol Kim, Injung Kim

    Abstract: Bolus segmentation is crucial for the automated detection of swallowing disorders in videofluoroscopic swallowing studies (VFSS). However, it is difficult for the model to accurately segment a bolus region in a VFSS image because VFSS images are translucent, have low contrast and unclear region boundaries, and lack color information. To overcome these challenges, we propose PECI-Net, a network arc… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 20 pages, 8 figures,

    Journal ref: Computers in Biology and Medicine (2024)

  13. arXiv:2403.01479  [pdf, other

    cs.CL cs.AI

    Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation

    Authors: Heegon **, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh, Yeonsoo Lee

    Abstract: The advent of scalable deep models and large datasets has improved the performance of Neural Machine Translation. Knowledge Distillation (KD) enhances efficiency by transferring knowledge from a teacher model to a more compact student model. However, KD approaches to Transformer architecture often rely on heuristics, particularly when deciding which teacher layers to distill from. In this paper, w… ▽ More

    Submitted 25 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

    MSC Class: 68T50 ACM Class: I.2.7

  14. arXiv:2402.14886  [pdf

    cs.LG cs.AI

    Applying Reinforcement Learning to Optimize Traffic Light Cycles

    Authors: Seungah Son, Juhee **

    Abstract: Manual optimization of traffic light cycles is a complex and time-consuming task, necessitating the development of automated solutions. In this paper, we propose the application of reinforcement learning to optimize traffic light cycles in real-time. We present a case study using the Simulation Urban Mobility simulator to train a Deep Q-Network algorithm. The experimental results showed 44.16% dec… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  15. arXiv:2402.14854  [pdf, other

    cs.CL cs.AI

    A Dual-Prompting for Interpretable Mental Health Language Models

    Authors: Hyolim Jeon, Dongje Yoo, Daeun Lee, Sejung Son, Seungbae Kim, **young Han

    Abstract: Despite the increasing demand for AI-based mental health monitoring tools, their practical utility for clinicians is limited by the lack of interpretability.The CLPsych 2024 Shared Task (Chim et al., 2024) aims to enhance the interpretability of Large Language Models (LLMs), particularly in mental health analysis, by providing evidence of suicidality through linguistic content. We propose a dual-p… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the Ninth Workshop on Computational Linguistics and Clinical Psychology 2024

  16. arXiv:2401.15726  [pdf, other

    cs.CV

    Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data

    Authors: Young-Jae Park, Minseok Seo, Doyi Kim, Hyeri Kim, Sanghoon Choi, Beomkyu Choi, Jeongwon Ryu, Sohee Son, Hae-Gon Jeon, Yeji Choi

    Abstract: In the face of escalating climate changes, typhoon intensities and their ensuing damage have surged. Accurate trajectory prediction is crucial for effective damage control. Traditional physics-based models, while comprehensive, are computationally intensive and rely heavily on the expertise of forecasters. Contemporary data-driven methods often rely on reanalysis data, which can be considered to b… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: This paper was accepted for a Spotlight presentation at ICLR 2024

  17. arXiv:2401.00460  [pdf, other

    cs.CV

    RainSD: Rain Style Diversification Module for Image Synthesis Enhancement using Feature-Level Style Distribution

    Authors: Hyeonjae Jeon, Junghyun Seo, Taesoo Kim, Sungho Son, Jungki Lee, Gyeungho Choi, Yongseob Lim

    Abstract: Autonomous driving technology nowadays targets to level 4 or beyond, but the researchers are faced with some limitations for develo** reliable driving algorithms in diverse challenges. To promote the autonomous vehicles to spread widely, it is important to address safety issues on this technology. Among various safety concerns, the sensor blockage problem by severe weather conditions can be one… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Under Review

    MSC Class: 14J60 (Autonomous Vehicles) 14F05; 14J26 (Adverse Weather Condition)

  18. arXiv:2312.08710  [pdf, other

    cs.LG cs.AI

    Gradient Informed Proximal Policy Optimization

    Authors: Sanghyun Son, Laura Yu Zheng, Ryan Sullivan, Yi-Ling Qiao, Ming C. Lin

    Abstract: We introduce a novel policy learning method that integrates analytical gradients from differentiable environments with the Proximal Policy Optimization (PPO) algorithm. To incorporate analytical gradients into the PPO framework, we introduce the concept of an α-policy that stands as a locally superior policy. By adaptively modifying the α value, we can effectively manage the influence of analytica… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 27 pages, NeurIPS 2023 Conference

  19. Learning Co-Speech Gesture for Multimodal Aphasia Type Detection

    Authors: Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, **young Han

    Abstract: Aphasia, a language disorder resulting from brain damage, requires accurate identification of specific aphasia types, such as Broca's and Wernicke's aphasia, for effective treatment. However, little attention has been paid to develo** methods to detect different types of aphasia. Recognizing the importance of analyzing co-speech gestures for distinguish aphasia types, we propose a multimodal gra… ▽ More

    Submitted 20 October, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 accepted

    Journal ref: EMNLP 2023:Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

  20. arXiv:2308.02596  [pdf, other

    physics.soc-ph cond-mat.dis-nn cs.DM stat.CO

    Revisiting small-world network models: Exploring technical realizations and the equivalence of the Newman-Watts and Harary models

    Authors: Seora Son, Eun Ji Choi, Sang Hoon Lee

    Abstract: We address the relatively less known facts on the equivalence and technical realizations surrounding two network models showing the "small-world" property, namely the Newman-Watts and the Harary models. We provide the most accurate (in terms of faithfulness to the original literature) versions of these models to clarify the deviation from them existing in their variants adopted in one of the most… ▽ More

    Submitted 12 December, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: 11 pages, 5 figures, 1 table

    Journal ref: J. Korean Phys. Soc. 83, 879 (2023)

  21. arXiv:2307.12751  [pdf, other

    eess.IV cs.CV

    ICF-SRSR: Invertible scale-Conditional Function for Self-Supervised Real-world Single Image Super-Resolution

    Authors: Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee

    Abstract: Single image super-resolution (SISR) is a challenging ill-posed problem that aims to up-sample a given low-resolution (LR) image to a high-resolution (HR) counterpart. Due to the difficulty in obtaining real LR-HR training pairs, recent approaches are trained on simulated LR images degraded by simplified down-sampling operators, e.g., bicubic. Such an approach can be problematic in practice becaus… ▽ More

    Submitted 31 August, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  22. Towards Suicide Prevention from Bipolar Disorder with Temporal Symptom-Aware Multitask Learning

    Authors: Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, **young Han

    Abstract: Bipolar disorder (BD) is closely associated with an increased risk of suicide. However, while the prior work has revealed valuable insight into understanding the behavior of BD patients on social media, little attention has been paid to develo** a model that can predict the future suicidality of a BD patient. Therefore, this study proposes a multi-task learning model for predicting the future su… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: KDD 2023 accepted

    Journal ref: KDD 2023: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

  23. arXiv:2306.06461  [pdf

    eess.AS cs.SD

    Semi-supervsied Learning-based Sound Event Detection using Freuqency Dynamic Convolution with Large Kernel Attention for DCASE Challenge 2023 Task 4

    Authors: Ji Won Kim, Sang Won Son, Yoonah Song, Hong Kook Kim, Il Hoon Song, Jeong Eun Lim

    Abstract: This report proposes a frequency dynamic convolution (FDY) with a large kernel attention (LKA)-convolutional recurrent neural network (CRNN) with a pre-trained bidirectional encoder representation from audio transformers (BEATs) embedding-based sound event detection (SED) model that employs a mean-teacher and pseudo-label approach to address the challenge of limited labeled data for DCASE 2023 Tas… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: DCASE 2023 Challenge Task 4A, 5 pages

  24. arXiv:2305.15417  [pdf, other

    eess.IV cs.CV cs.LG

    Entropy-Aware Similarity for Balanced Clustering: A Case Study with Melanoma Detection

    Authors: Seok Bin Son, Soohyun Park, Joongheon Kim

    Abstract: Clustering data is an unsupervised learning approach that aims to divide a set of data points into multiple groups. It is a crucial yet demanding subject in machine learning and data mining. Its successful applications span various fields. However, conventional clustering techniques necessitate the consideration of balance significance in specific applications. Therefore, this paper addresses the… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  25. Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

    Authors: Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

    Abstract: Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study,… ▽ More

    Submitted 22 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023, Code URL: https://github.com/raymin0223/patch-mix_contrastive_learning

  26. arXiv:2305.06110  [pdf, other

    cs.CV

    Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase

    Authors: Shreya Ghosh, Md Rakibul Hasan, Pradyumna Agrawal, Zhixi Cai, Susannah Soon, Abhinav Dhall, Tom Gedeon

    Abstract: This paper proposes a feedback mechanism to 'break bad habits' using the Pavlok device. Pavlok utilises beeps, vibration and shocks as a mode of aversion technique to help individuals with behaviour modification. While the device can be useful in certain periodic daily life situations, like alarms and exercise notifications, the device relies on manual operations that limit its usage. To this end,… ▽ More

    Submitted 10 May, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: Shreya Ghosh, Md Rakibul Hasan and Pradyumna Agrawal contributed equally to this research

  27. arXiv:2302.13509  [pdf, other

    cs.RO

    GeoLCR: Attention-based Geometric Loop Closure and Registration

    Authors: **g Liang, Sanghyun Son, Ming Lin, Dinesh Manocha

    Abstract: We present a novel algorithm specially designed for loop detection and registration that utilizes Lidar-based perception. Our approach to loop detection involves voxelizing point clouds, followed by an overlap calculation to confirm whether a vehicle has completed a loop. We further enhance the current pose's accuracy via an innovative point-level registration model. The efficacy of our algorithm… ▽ More

    Submitted 16 July, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

  28. arXiv:2302.10494  [pdf, other

    cs.LG cs.CV

    MaskedKD: Efficient Distillation of Vision Transformers with Masked Images

    Authors: Seungwoo Son, Namhoon Lee, Jaeho Lee

    Abstract: Knowledge distillation is an effective method for training lightweight models, but it introduces a significant amount of computational overhead to the training cost, as the method requires acquiring teacher supervisions on training samples. This additional cost -- called distillation cost -- is most pronounced when we employ large-scale teacher models such as vision transformers (ViTs). We present… ▽ More

    Submitted 31 May, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  29. arXiv:2211.12118  [pdf, other

    cs.CL

    HaRiM$^+$: Evaluating Summary Quality with Hallucination Risk

    Authors: Seonil Son, Junsoo Park, Jeong-in Hwang, Junghwa Lee, Hyungjong Noh, Yeonsoo Lee

    Abstract: One of the challenges of develo** a summarization model arises from the difficulty in measuring the factual inconsistency of the generated text. In this study, we reinterpret the decoder overconfidence-regularizing objective suggested in (Miao et al., 2021) as a hallucination risk measurement to better estimate the quality of generated summaries. We propose a reference-free metric, HaRiM+, which… ▽ More

    Submitted 24 November, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 9 pages (+ 21 pages of Appendix), AACL 2022

    Journal ref: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2022), pages 895-924

  30. arXiv:2210.10482  [pdf, other

    cs.LG

    Effective Targeted Attacks for Adversarial Self-Supervised Learning

    Authors: Minseon Kim, Hyeonjeong Ha, Sooel Son, Sung Ju Hwang

    Abstract: Recently, unsupervised adversarial training (AT) has been highlighted as a means of achieving robustness in models without any label information. Previous studies in unsupervised AT have mostly focused on implementing self-supervised learning (SSL) frameworks, which maximize the instance-wise classification loss to generate adversarial examples. However, we observe that simply maximizing the self-… ▽ More

    Submitted 26 October, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2023

  31. arXiv:2210.08046  [pdf, other

    cs.GR cs.LG cs.MA

    Differentiable Hybrid Traffic Simulation

    Authors: Sanghyun Son, Yi-Ling Qiao, Jason Sewall, Ming C. Lin

    Abstract: We introduce a novel differentiable hybrid traffic simulator, which simulates traffic using a hybrid model of both macroscopic and microscopic models and can be directly integrated into a neural network for traffic control and flow optimization. This is the first differentiable traffic simulator for macroscopic and hybrid models that can compute gradients for traffic states across time steps and i… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 13 pages, Siggraph Asia 2022 Journal Paper

    ACM Class: I.6.1; I.6.3

  32. arXiv:2210.03772  [pdf, other

    cs.RO cs.MA

    Traffic-Aware Autonomous Driving with Differentiable Traffic Simulation

    Authors: Laura Zheng, Sanghyun Son, Ming C. Lin

    Abstract: While there have been advancements in autonomous driving control and traffic simulation, there have been little to no works exploring their unification with deep learning. Works in both areas seem to focus on entirely different exclusive problems, yet traffic and driving are inherently related in the real world. In this paper, we present Traffic-Aware Autonomous Driving (TrAAD), a generalizable di… ▽ More

    Submitted 6 April, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

  33. arXiv:2209.10922  [pdf, other

    cs.CL

    Learning to Write with Coherence From Negative Examples

    Authors: Seonil Son, Jaeseo Lim, Youwon Jang, Jaeyoung Lee, Byoung-Tak Zhang

    Abstract: Coherence is one of the critical factors that determine the quality of writing. We propose writing relevance (WR) training method for neural encoder-decoder natural language generation (NLG) models which improves coherence of the continuation by leveraging negative examples. WR loss regresses the vector representation of the context and generated sentence toward positive continuation by contrastin… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 4+1 pages, 4 figures, 2 tables. ICASSP 2022 rejected

  34. arXiv:2209.09777  [pdf, other

    cs.RO

    WGICP: Differentiable Weighted GICP-Based Lidar Odometry

    Authors: Sanghyun Son, **g Liang, Ming Lin, Dinesh Manocha

    Abstract: We present a novel differentiable weighted generalized iterative closest point (WGICP) method applicable to general 3D point cloud data, including that from Lidar. Our method builds on differentiable generalized ICP (GICP), and we propose using the differentiable K-Nearest Neighbor (KNN) algorithm to enhance differentiability. The differentiable GICP algorithm provides the gradient of output pose… ▽ More

    Submitted 3 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: 6 pages

  35. arXiv:2209.06422  [pdf, other

    cs.CL

    Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models

    Authors: Suhyune Son, Chanjun Park, Jungseob Lee, Midan Shim, Chanhee Lee, Yoonna Jang, Jaehyung Seo, Heuiseok Lim

    Abstract: As pre-trained language models become more resource-demanding, the inequality between resource-rich languages such as English and resource-scarce languages is worsening. This can be attributed to the fact that the amount of available training data in each language follows the power-law distribution, and most of the languages belong to the long tail of the distribution. Some research areas attempt… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  36. arXiv:2209.00862  [pdf, other

    cs.CR cs.AI

    Spatio-Temporal Attack Course-of-Action (COA) Search Learning for Scalable and Time-Varying Networks

    Authors: Haemin Lee, Seok Bin Son, Won Joon Yun, Joongheon Kim, Soyi Jung, Dong Hwa Kim

    Abstract: One of the key topics in network security research is the autonomous COA (Couse-of-Action) attack search method. Traditional COA attack search methods that passively search for attacks can be difficult, especially as the network gets bigger. To address these issues, new autonomous COA techniques are being developed, and among them, an intelligent spatial algorithm is designed in this paper for eff… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  37. arXiv:2205.13763  [pdf, other

    cs.AI

    Tutorial on Course-of-Action (COA) Attack Search Methods in Computer Networks

    Authors: Seok Bin Son, Soohyun Park, Haemin Lee, Joongheon Kim, Soyi Jung, Donghwa Kim

    Abstract: In the literature of modern network security research, deriving effective and efficient course-of-action (COA) attach search methods are of interests in industry and academia. As the network size grows, the traditional COA attack search methods can suffer from the limitations to computing and communication resources. Therefore, various methods have been developed to solve these problems, and reinf… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  38. arXiv:2204.04892  [pdf, other

    cs.LG cs.AI

    JORLDY: a fully customizable open source framework for reinforcement learning

    Authors: Kyushik Min, Hyunho Lee, Kwansu Shin, Taehak Lee, Hojoon Lee, **won Choi, Sungho Son

    Abstract: Recently, Reinforcement Learning (RL) has been actively researched in both academic and industrial fields. However, there exist only a few RL frameworks which are developed for researchers or students who want to study RL. In response, we propose an open-source RL framework "Join Our Reinforcement Learning framework for Develo** Yours" (JORLDY). JORLDY provides more than 20 widely used RL algori… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 12 pages, 6 figures

  39. arXiv:2203.15662  [pdf, other

    cs.CV

    MatteFormer: Transformer-Based Image Matting via Prior-Tokens

    Authors: GyuTae Park, SungJoon Son, JaeYoung Yoo, SeHo Kim, Nojun Kwak

    Abstract: In this paper, we propose a transformer-based image matting model called MatteFormer, which takes full advantage of trimap information in the transformer block. Our method first introduces a prior-token which is a global representation of each trimap region (e.g. foreground, background and unknown). These prior-tokens are used as global priors and participate in the self-attention mechanism of eac… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  40. arXiv:2203.13009  [pdf, other

    cs.CV

    CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image

    Authors: Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee

    Abstract: Recently, significant progress has been made on image denoising with strong supervision from large-scale datasets. However, obtaining well-aligned noisy-clean training image pairs for each specific scenario is complicated and costly in practice. Consequently, applying a conventional supervised denoising network on in-the-wild noisy inputs is not straightforward. Although several studies have chall… ▽ More

    Submitted 29 March, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Published at CVPR 2022

  41. arXiv:2203.11799  [pdf, other

    cs.CV eess.IV

    AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network

    Authors: Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Blind-spot network (BSN) and its variants have made significant advances in self-supervised denoising. Nevertheless, they are still bound to synthetic noisy inputs due to less practical assumptions like pixel-wise independent noise. Hence, it is challenging to deal with spatially correlated real-world noise using self-supervised BSN. Recently, pixel-shuffle downsampling (PD) has been proposed to r… ▽ More

    Submitted 24 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR2022

  42. arXiv:2203.11593  [pdf, other

    cs.CV

    Unified Negative Pair Generation toward Well-discriminative Feature Space for Face Recognition

    Authors: Junuk Jung, Seonhoon Lee, Heung-Seon Oh, Yongjun Park, Joochan Park, Sungbin Son

    Abstract: The goal of face recognition (FR) can be viewed as a pair similarity optimization problem, maximizing a similarity set $\mathcal{S}^p$ over positive pairs, while minimizing similarity set $\mathcal{S}^n$ over negative pairs. Ideally, it is expected that FR models form a well-discriminative feature space (WDFS) that satisfies $\inf{\mathcal{S}^p} > \sup{\mathcal{S}^n}$. With regard to WDFS, the exi… ▽ More

    Submitted 18 April, 2024; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 9 pages, 6 figures, Published at BMVC22

  43. Effective Training Strategies for Deep-learning-based Precipitation Nowcasting and Estimation

    Authors: Jihoon Ko, Kyuhan Lee, Hyun** Hwang, Seok-Geun Oh, Seok-Woo Son, Kijung Shin

    Abstract: Deep learning has been successfully applied to precipitation nowcasting. In this work, we propose a pre-training scheme and a new loss function for improving deep-learning-based nowcasting. First, we adapt U-Net, a widely-used deep-learning model, for the two problems of interest here: precipitation nowcasting and precipitation estimation from radar images. We formulate the former as a classificat… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: to appear in Computers & Geosciences

  44. arXiv:2202.09533  [pdf, other

    cs.CV eess.IV

    C2N: Practical Generative Noise Modeling for Real-World Denoising

    Authors: Geonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Learning-based image denoising methods have been bounded to situations where well-aligned noisy and clean images are given, or samples are synthesized from predetermined noise models, e.g., Gaussian. While recent generative noise modeling methods aim to simulate the unknown distribution of real-world noise, several limitations still exist. In a practical scenario, a noise generator should learn to… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 2350-2359

  45. arXiv:2201.03239  [pdf, other

    cs.CL

    There is no rose without a thorn: Finding weaknesses on BlenderBot 2.0 in terms of Model, Data and User-Centric Approach

    Authors: Jungseob Lee, Midan Shim, Suhyune Son, Chanjun Park, Yu** Kim, Heuiseok Lim

    Abstract: BlenderBot 2.0 is a dialogue model that represents open-domain chatbots by reflecting real-time information and remembering user information for an extended period using an internet search module and multi-session. Nonetheless, the model still has room for improvement. To this end, we examine BlenderBot 2.0 limitations and errors from three perspectives: model, data, and user. From the data point… ▽ More

    Submitted 8 July, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: English and Extention Version of "Empirical study on BlenderBot 2.0 errors analysis in terms of model, data and dialogue" (Journal of the Korea Convergence Society)

  46. arXiv:2112.08619  [pdf, other

    cs.CL cs.AI

    Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge

    Authors: Yoonna Jang, Jungwoo Lim, Yuna Hur, Dongsuk Oh, Suhyune Son, Yeonsoo Lee, Donghoon Shin, Seungryong Kim, Heuiseok Lim

    Abstract: Humans usually have conversations by making use of prior knowledge about a topic and background information of the people whom they are talking to. However, existing conversational agents and datasets do not consider such comprehensive information, and thus they have a limitation in generating the utterances where the knowledge and persona are fused properly. To address this issue, we introduce a… ▽ More

    Submitted 16 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted paper at the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  47. arXiv:2111.01717  [pdf, other

    cs.CV

    MixFace: Improving Face Verification Focusing on Fine-grained Conditions

    Authors: Junuk Jung, Sungbin Son, Joochan Park, Yongjun Park, Seonhoon Lee, Heung-Seon Oh

    Abstract: The performance of face recognition has become saturated for public benchmark datasets such as LFW, CFP-FP, and AgeDB, owing to the rapid advances in CNNs. However, the effects of faces with various fine-grained conditions on FR models have not been investigated because of the absence of such datasets. This paper analyzes their effects in terms of different conditions and loss functions using K-FA… ▽ More

    Submitted 19 June, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 9 pages, 6 figures

  48. Toward Real-World Super-Resolution via Adaptive Downsampling Models

    Authors: Sanghyun Son, Jaeha Kim, Wei-Sheng Lai, Ming-Husan Yang, Kyoung Mu Lee

    Abstract: Most image super-resolution (SR) methods are developed on synthetic low-resolution (LR) and high-resolution (HR) image pairs that are constructed by a predetermined operation, e.g., bicubic downsampling. As existing methods typically learn an inverse map** of the specific function, they produce blurry results when applied to real-world images whose exact formulation is different and unknown. The… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted at TPAMI

  49. arXiv:2106.10147  [pdf, other

    cs.CR cs.LG

    Evaluating the Robustness of Trigger Set-Based Watermarks Embedded in Deep Neural Networks

    Authors: Suyoung Lee, Wonho Song, Suman Jana, Meeyoung Cha, Sooel Son

    Abstract: Trigger set-based watermarking schemes have gained emerging attention as they provide a means to prove ownership for deep neural network model owners. In this paper, we argue that state-of-the-art trigger set-based watermarking algorithms do not achieve their designed goal of proving ownership. We posit that this impaired capability stems from two common experimental flaws that the existing resear… ▽ More

    Submitted 19 January, 2023; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 15 pages, accepted at IEEE TDSC

  50. arXiv:2106.03839  [pdf, other

    cs.CV

    NTIRE 2021 Challenge on Burst Super-Resolution: Methods and Results

    Authors: Goutam Bhat, Martin Danelljan, Radu Timofte, Kazutoshi Akita, Wooyeong Cho, Haoqiang Fan, Lanpeng Jia, Daeshik Kim, Bruno Lecouat, Youwei Li, Shuaicheng Liu, Ziluan Liu, Ziwei Luo, Takahiro Maeda, Julien Mairal, Christian Micheloni, Xuan Mo, Takeru Oba, Pavel Ostyakov, Jean Ponce, Sanghyeok Son, Jian Sun, Norimichi Ukita, Rao Muhammad Umer, Youliang Yan , et al. (3 additional authors not shown)

    Abstract: This paper reviews the NTIRE2021 challenge on burst super-resolution. Given a RAW noisy burst as input, the task in the challenge was to generate a clean RGB image with 4 times higher resolution. The challenge contained two tracks; Track 1 evaluating on synthetically generated data, and Track 2 using real-world bursts from mobile camera. In the final testing phase, 6 teams submitted results using… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: NTIRE 2021 Burst Super-Resolution challenge report