Showing 1–50 of 102 results for author: Jung, Y

  1. arXiv:2406.15664  [pdf, other]

    stat.ML cs.LG

    Flat Posterior Does Matter For Bayesian Transfer Learning

    Authors: Sungjun Lim, Jeyoon Yeom, Sooyon Kim, Hoyoon Byun, Jinho Kang, Yohan Jung, Jiyoung Jung, Kyungwoo Song

    Abstract: The large-scale pre-trained neural network has achieved notable success in enhancing performance for downstream tasks. Another promising approach for generalization is Bayesian Neural Network (BNN), which integrates Bayesian methods into neural network architectures, offering advantages such as Bayesian Model averaging (BMA) and uncertainty quantification. Despite these benefits, transfer learning… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.07923  [pdf, other]

    cs.SD cs.AI eess.AS

    CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting

    Authors: Sichen Jin, Youngmoon Jung, Seungjin Lee, Jaeyoung Roh, Changwoo Han, Hoonyoung Cho

    Abstract: This paper introduces a novel approach for streaming open-vocabulary keyword spotting (KWS) with text-based keyword enrollment. For every input frame, the proposed method finds the optimal alignment ending at the frame using connectionist temporal classification (CTC) and aggregates the frame-level acoustic embedding (AE) to obtain higher-level (i.e., character, word, or phrase) AE that aligns with… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.05314  [pdf, other]

    eess.AS cs.AI eess.SP

    Relational Proxy Loss for Audio-Text based Keyword Spotting

    Authors: Youngmoon Jung, Seungjin Lee, Joon-Young Yang, Jaeyoung Roh, Chang Woo Han, Hoon-Young Cho

    Abstract: In recent years, there has been an increasing focus on user convenience, leading to increased interest in text-based keyword enrollment systems for keyword spotting (KWS). Since the system utilizes text input during the enrollment phase and audio input during actual usage, we call this task audio-text based KWS. To enable this task, both acoustic and text encoders are typically trained using deep… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, Accepted by Interspeech 2024

  4. arXiv:2406.00798  [pdf, other]

    cs.CV cs.AI

    PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency

    Authors: Yeonsung Jung, Heecheol Yun, Joonhyung Park, Jin-Hwa Kim, Eunho Yang

    Abstract: Neural Radiance Fields (NeRF) have shown remarkable performance in learning 3D scenes. However, NeRF exhibits vulnerability when confronted with distractors in the training images -- unexpected objects are present only within specific views, such as moving entities like pedestrians or birds. Excluding distractors during dataset construction is a straightforward solution, but without prior knowledg… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  5. arXiv:2405.15092  [pdf, other]

    cs.AI cs.CL

    Dissociation of Faithful and Unfaithful Reasoning in LLMs

    Authors: Evelyn Yee, Alice Li, Chenyu Tang, Yeon Ho Jung, Ramamohan Paturi, Leon Bergen

    Abstract: Large language models (LLMs) improve their performance in downstream tasks when they generate Chain of Thought reasoning text before producing an answer. Our research investigates how LLMs recover from errors in Chain of Thought, reaching the correct final answer despite mistakes in the reasoning text. Through analysis of these error recovery behaviors, we find evidence for unfaithfulness in Chain… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: code published at https://github.com/CoTErrorRecovery/CoTErrorRecovery

  6. arXiv:2404.06808  [pdf, other]

    cs.LG

    Formation-Controlled Dimensionality Reduction

    Authors: Taeuk Jeong, Yoon Mo Jung

    Abstract: Dimensionality reduction represents the process of generating a low dimensional representation of high dimensional data. Motivated by the formation control of mobile agents, we propose a nonlinear dynamical system for dimensionality reduction. The system consists of two parts; the control of neighbor points, addressing local structures, and the control of remote points, accounting for global struc… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  7. arXiv:2404.03138  [pdf, other]

    cs.CV cs.GR

    Discontinuity-preserving Normal Integration with Auxiliary Edges

    Authors: Hyomin Kim, Yucheol Jung, Seungyong Lee

    Abstract: Many surface reconstruction methods incorporate normal integration, which is a process to obtain a depth map from surface gradients. In this process, the input may represent a surface with discontinuities, e.g., due to self-occlusion. To reconstruct an accurate depth map from the input normal map, hidden surface gradients occurring from the jumps must be handled. To model these jumps correctly, we… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: To appear at CVPR 2024. For supplementary video, see https://youtu.be/MTTcW5kAOFE

    ACM Class: I.4.5

  8. arXiv:2404.02949  [pdf, other]

    cs.LG cs.AI

    The SaTML '24 CNN Interpretability Competition: New Innovations for Concept-Level Interpretability

    Authors: Stephen Casper, Jieun Yun, Joonhyuk Baek, Yeseong Jung, Minhwan Kim, Kiwan Kwon, Saerom Park, Hayden Moore, David Shriver, Marissa Connor, Keltin Grimes, Angus Nicolson, Arush Tagade, Jessica Rumbelow, Hieu Minh Nguyen, Dylan Hadfield-Menell

    Abstract: Interpretability techniques are valuable for helping humans understand and oversee AI systems. The SaTML 2024 CNN Interpretability Competition solicited novel methods for studying convolutional neural networks (CNNs) at the ImageNet scale. The objective of the competition was to help human crowd-workers identify trojans in CNNs. This report showcases the methods and results of four featured compet… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Competition for SaTML 2024

  9. arXiv:2403.03960  [pdf, other]

    physics.chem-ph cs.LG

    Assessing the Extrapolation Capability of Template-Free Retrosynthesis Models

    Authors: Shuan Chen, Yousung Jung

    Abstract: Despite the acknowledged capability of template-free models in exploring unseen reaction spaces compared to template-based models for retrosynthesis prediction, their ability to venture beyond established boundaries remains relatively uncharted. In this study, we empirically assess the extrapolation capability of state-of-the-art template-free models by meticulously assembling an extensive set of… ▽ More

    Submitted 28 February, 2024; originally announced March 2024.

  10. arXiv:2402.08601  [pdf, other]

    cs.CV

    Latent Inversion with Timestep-aware Sampling for Training-free Non-rigid Editing

    Authors: Yunji Jung, Seokju Lee, Tair Djanibekov, Hyunjung Shim, Jong Chul Ye

    Abstract: Text-guided non-rigid editing involves complex edits for input images, such as changing motion or compositions within their surroundings. Since it requires manipulating the input structure, existing methods often struggle with preserving object identity and background, particularly when combined with Stable Diffusion. In this work, we propose a training-free approach for non-rigid editing with Sta… ▽ More

    Submitted 14 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  11. arXiv:2402.05448  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application

    Authors: Bumsoo Kim, Sanghyun Byun, Yonghoon Jung, Wonseop Shin, Sareer UI Amin, Sanghyun Seo

    Abstract: In this paper, we first present the character texture generation system Minecraft-ify, specified to Minecraft video game toward in-game application. Ours can generate face-focused image for texture mapping tailored to 3D virtual character having cube manifold. While existing projects or works only generate texture, proposed system can inverse the user-provided real image, or generate aver… ▽ More

    Submitted 3 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 2 pages, 2 figures. Accepted as Spotlight to NeurIPS 2023 Workshop on Machine Learning for Creativity and Design

  12. arXiv:2401.08998  [pdf, other]

    cs.LG cs.CR cs.CV

    Attack and Reset for Unlearning: Exploiting Adversarial Noise toward Machine Unlearning through Parameter Re-initialization

    Authors: Yoonhwa Jung, Ikhyun Cho, Shun-Hsiang Hsu, Julia Hockenmaier

    Abstract: With growing concerns surrounding privacy and regulatory compliance, the concept of machine unlearning has gained prominence, aiming to selectively forget or erase specific learned information from a trained model. In response to this critical need, we introduce a novel approach called Attack-and-Reset for Unlearning (ARU). This algorithm leverages meticulously crafted adversarial noise to generat… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  13. arXiv:2312.11890  [pdf, other]

    cs.CL cs.SI

    Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction

    Authors: Unggi Lee, Sungjun Yoon, Joon Seo Yun, Kyoungsoo Park, YoungHoon Jung, Damji Stratton, Hyeoncheol Kim

    Abstract: This paper presents novel techniques for enhancing the performance of knowledge tracing (KT) models by focusing on the crucial factor of question and concept difficulty level. Despite the acknowledged significance of difficulty, previous KT research has yet to exploit its potential for model optimization and has struggled to predict difficulty from unseen data. To address these problems, we propos… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 10 pages, 4 figures, 2 tables

  14. arXiv:2312.05611  [pdf, other]

    cs.LG cs.AI

    Triplet Edge Attention for Algorithmic Reasoning

    Authors: Yeonjoon Jung, Sungsoo Ahn

    Abstract: This work investigates neural algorithmic reasoning to develop neural networks capable of learning from classical algorithms. The main challenge is to develop graph neural networks that are expressive enough to predict the given algorithm outputs while generalizing well to out-of-distribution data. In this work, we introduce a new graph neural network layer called Triplet Edge Attention (TEA), an… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  15. arXiv:2311.10309  [pdf, other]

    cs.LG cs.RO

    Imagination-Augmented Hierarchical Reinforcement Learning for Safe and Interactive Autonomous Driving in Urban Environments

    Authors: Sang-Hyun Lee, Yoonjae Jung, Seung-Woo Seo

    Abstract: Hierarchical reinforcement learning (HRL) incorporates temporal abstraction into reinforcement learning (RL) by explicitly taking advantage of hierarchical structure. Modern HRL typically designs a hierarchical agent composed of a high-level policy and low-level policies. The high-level policy selects which low-level policy to activate at a lower frequency and the activated low-level policy select… ▽ More

    Submitted 23 January, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 15 pages, 9 figures; corrected typos, added references, revised experiments (results unchanged)

  16. arXiv:2310.18119  [pdf, other]

    cs.CL cs.AI

    Towards a Unified Conversational Recommendation System: Multi-task Learning via Contextualized Knowledge Distillation

    Authors: Yeongseo Jung, Eunseo Jung, Lei Chen

    Abstract: In Conversational Recommendation System (CRS), an agent is asked to recommend a set of items to users within natural language conversations. To address the need for both conversational capability and personalized recommendations, prior works have utilized separate recommendation and dialogue modules. However, such approach inevitably results in a discrepancy between recommendation results and gene… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference

  17. arXiv:2310.05538  [pdf, other]

    eess.IV cs.CV cs.LG

    M3FPolypSegNet: Segmentation Network with Multi-frequency Feature Fusion for Polyp Localization in Colonoscopy Images

    Authors: Ju-Hyeon Nam, Seo-Hyeong Park, Nur Suriza Syazwany, Yerim Jung, Yu-Han Im, Sang-Chul Lee

    Abstract: Polyp segmentation is crucial for preventing colorectal cancer, a common type of cancer. Deep learning has been used to segment polyps automatically, which reduces the risk of misdiagnosis. Localizing small polyps in colonoscopy images is challenging because of its complex characteristics, such as color, occlusion, and various shapes of polyps. To address this challenge, a novel frequency-based ful… ▽ More

    Submitted 9 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 5 pages. 2023 IEEE International Conference on Image Processing (ICIP). IEEE, 2023

    MSC Class: 92C55

  18. arXiv:2309.14888  [pdf, other]

    cs.CV

    Nearest Neighbor Guidance for Out-of-Distribution Detection

    Authors: Jaewoo Park, Yoon Gyo Jung, Andrew Beng Jin Teoh

    Abstract: Detecting out-of-distribution (OOD) samples is crucial for machine learning models deployed in open-world environments. Classifier-based scores are a standard approach for OOD detection due to their fine-grained detection capability. However, these scores often suffer from overconfidence issues, misclassifying OOD samples distant from the in-distribution region. To address this challenge, we prop… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV2023

  19. arXiv:2309.00237  [pdf, other]

    cs.CL cs.AI

    Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes

    Authors: Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi

    Abstract: The development of large language models tailored for handling patients' clinical notes is often hindered by the limited accessibility and usability of these notes due to strict privacy regulations. To address these challenges, we first create synthetic large-scale clinical notes using publicly available case reports extracted from biomedical literature. We then use these synthetic notes to train… ▽ More

    Submitted 13 June, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: ACL 2024 (Findings)

  20. arXiv:2308.16529  [pdf]

    cs.RO cs.AI cs.HC

    Developing Social Robots with Empathetic Non-Verbal Cues Using Large Language Models

    Authors: Yoon Kyung Lee, Yoonwon Jung, Gyuyi Kang, Sowon Hahn

    Abstract: We propose augmenting the empathetic capacities of social robots by integrating non-verbal cues. Our primary contribution is the design and labeling of four types of empathetic non-verbal cues, abbreviated as SAFE: Speech, Action (gesture), Facial expression, and Emotion, in a social robot. These cues are generated using a Large Language Model (LLM). We developed an LLM-based conversational system… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Journal ref: In Proceedings of 2023 IEEE International Conference on Robot & Human Interactive Communication (RO-MAN)

  21. Mesh Density Adaptation for Template-based Shape Reconstruction

    Authors: Yucheol Jung, Hyomin Kim, Gyeongha Hwang, Seung-Hwan Baek, Seungyong Lee

    Abstract: In 3D shape reconstruction based on template mesh deformation, a regularization, such as smoothness energy, is employed to guide the reconstruction into a desirable direction. In this paper, we highlight an often overlooked property in the regularization: the vertex density in the mesh. Without careful control on the density, the reconstruction may suffer from under-sampling of vertices near shape… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: To appear at SIGGRAPH 2023. Jung and Kim shares equal contribution. For codes, see https://github.com/ycjungSubhuman/density-adaptation/

    ACM Class: I.4.5; I.3.5

  22. arXiv:2307.05916  [pdf, other]

    cs.CV

    SwiFT: Swin 4D fMRI Transformer

    Authors: Peter Yongho Kim, Junbeom Kwon, Sunghwan Joo, Sangyoon Bae, Donggyu Lee, Yoonho Jung, Shinjae Yoo, Jiook Cha, Taesup Moon

    Abstract: Modeling spatiotemporal brain dynamics from high-dimensional data, such as functional Magnetic Resonance Imaging (fMRI), is a formidable task in neuroscience. Existing approaches for fMRI analysis utilize hand-crafted features, but the process of feature extraction risks losing essential information in fMRI scans. To address this challenge, we present SwiFT (Swin 4D fMRI Transformer), a Swin Trans… ▽ More

    Submitted 31 October, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  23. arXiv:2307.01350  [pdf, other]

    cs.RO

    Dynamic Mobile Manipulation via Whole-Body Bilateral Teleoperation of a Wheeled Humanoid

    Authors: Amartya Purushottam, Yeongtae Jung, Christopher Xu, Joao Ramos

    Abstract: Humanoid robots have the potential to help human workers by realizing physically demanding manipulation tasks such as moving large boxes within warehouses. We define such tasks as Dynamic Mobile Manipulation (DMM). This paper presents a framework for DMM via whole-body teleoperation, built upon three key contributions: Firstly, a teleoperation framework employing a Human Machine Interface (HMI) an… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  24. arXiv:2306.08126  [pdf, other]

    cs.CL cs.AI

    PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer

    Authors: Xu Han, Bin Guo, Yoon Jung, Benjamin Yao, Yu Zhang, Xiaohu Liu, Chenlei Guo

    Abstract: Personalized dialogue agents (DAs) powered by large pre-trained language models (PLMs) often rely on explicit persona descriptions to maintain personality consistency. However, such descriptions may not always be available or may pose privacy concerns. To tackle this bottleneck, we introduce PersonaPKT, a lightweight transfer learning approach that can build persona-consistent dialogue models with… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 10 pages, 3 figures, accepted to SustaiNLP 2023

  25. Social Robots As Companions for Lonely Hearts: The Role of Anthropomorphism and Robot Appearance

    Authors: Yoonwon Jung, Sowon Hahn

    Abstract: Loneliness is a distressing personal experience and a growing social issue. Social robots could alleviate the pain of loneliness, particularly for those who lack in-person interaction. This paper investigated how the effect of loneliness on the anthropomorphism of social robots differs by robot appearance, and how it influences purchase intention. Participants viewed a video of one of the three ro… ▽ More

    Submitted 4 July, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted for oral presentation at the 32nd IEEE International Conference on Robot and Human Interactive Communication(RO-MAN 2023). Camera-ready (ver2)

    Journal ref: 2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Busan, Korea, Republic of, 2023, pp. 2520-2525

  26. arXiv:2305.00278  [pdf, other]

    cs.CV cs.AI cs.LG

    Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected

    Authors: Dongsheng Han, Chaoning Zhang, Yu Qiao, Maryam Qamar, Yuna Jung, SeungKyu Lee, Sung-Ho Bae, Choong Seon Hong

    Abstract: Meta AI Research has recently released SAM (Segment Anything Model) which is trained on a large segmentation dataset of over 1 billion masks. As a foundation model in the field of computer vision, SAM (Segment Anything Model) has gained attention for its impressive performance in generic object segmentation. Despite its strong capability in a wide range of zero-shot transfer tasks, it remains unkn… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  27. A Survey on Graph Diffusion Models: Generative AI in Science for Molecule, Protein and Material

    Authors: Mengchun Zhang, Maryam Qamar, Taegoo Kang, Yuna Jung, Chenshuang Zhang, Sung-Ho Bae, Chaoning Zhang

    Abstract: Diffusion models have become a new SOTA generative modeling method in various fields, for which there are multiple survey works that provide an overall survey. With the number of articles on diffusion models increasing exponentially in the past few years, there is an increasing need for surveys of diffusion models on specific fields. In this work, we are committed to conducting a survey on the gra… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  28. arXiv:2303.15060  [pdf, other]

    cs.CV

    TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering

    Authors: Jaehoon Choi, Dongki Jung, Taejae Lee, Sangwook Kim, Youngdong Jung, Dinesh Manocha, Donghwan Lee

    Abstract: We present a new pipeline for acquiring a textured mesh in the wild with a single smartphone which offers access to images, depth maps, and valid poses. Our method first introduces an RGBD-aided structure from motion, which can yield filtered depth maps and refines camera poses guided by corresponding depth. Then, we adopt the neural implicit surface reconstruction method, which allows for high-qu… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR23. Project Page: https://jh-choi.github.io/TMO/

  29. arXiv:2303.11853  [pdf, other]

    cs.RO cs.AI

    LoRCoN-LO: Long-term Recurrent Convolutional Network-based LiDAR Odometry

    Authors: Donghwi Jung, Jae-Kyung Cho, Younghwa Jung, Soohyun Shin, Seong-Woo Kim

    Abstract: We propose a deep learning-based LiDAR odometry estimation method called LoRCoN-LO that utilizes the long-term recurrent convolutional network (LRCN) structure. The LRCN layer is a structure that can process spatial and temporal information at once by using both CNN and LSTM layers. This feature is suitable for predicting continuous robot movements as it uses point clouds that contain spatial info… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 4 pages, ICEIC 2023

  30. arXiv:2301.10413  [pdf, other]

    cs.CV

    Local Feature Extraction from Salient Regions by Feature Map Transformation

    Authors: Yerim Jung, Nur Suriza Syazwany Binti Ahmad Nizam, Sang-Chul Lee

    Abstract: Local feature matching is essential for many applications, such as localization and 3D reconstruction. However, it is challenging to match feature points accurately in various camera viewpoints and illumination conditions. In this paper, we propose a framework that robustly extracts and describes salient local features regardless of changing light and viewpoints. The framework suppresses illuminat… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: British Machine Vision Conference (BMVC) 2022

  31. arXiv:2211.15950  [pdf, other]

    eess.IV cs.CV

    Enhanced artificial intelligence-based diagnosis using CBCT with internal denoising: Clinical validation for discrimination of fungal ball, sinusitis, and normal cases in the maxillary sinus

    Authors: Kyungsu Kim, Chae Yeon Lim, Joong Bo Shin, Myung Jin Chung, Yong Gi Jung

    Abstract: The cone-beam computed tomography (CBCT) provides 3D volumetric imaging of a target with low radiation dose and cost compared with conventional computed tomography, and it is widely used in the detection of paranasal sinus disease. However, it lacks the sensitivity to detect soft tissue lesions owing to reconstruction constraints. Consequently, only physicians with expertise in CBCT reading can di… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  32. arXiv:2210.16423  [pdf]

    cs.RO cs.HC

    Transferability-based Chain Motion Mapping from Humans to Humanoids for Teleoperation

    Authors: Matthew Stanley, Yunsik Jung, Michael Bowman, Lingfeng Tao, Xiaoli Zhang

    Abstract: Although data-driven motion mapping methods are promising to allow intuitive robot control and teleoperation that generate human-like robot movement, they normally require tedious pair-wise training for each specific human and robot pair. This paper proposes a transferability-based mapping scheme to allow new robot and human input systems to leverage the mapping of existing trained pairs to form a… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

  33. arXiv:2210.12363  [pdf, other]

    stat.ML cs.LG stat.ME

    Bayesian Convolutional Deep Sets with Task-Dependent Stationary Prior

    Authors: Yohan Jung, Jinkyoo Park

    Abstract: Convolutional deep sets are the architecture of a deep neural network (DNN) that can model stationary stochastic process. This architecture uses the kernel smoother and the DNN to construct the translation equivariant functional representations, and thus reflects the inductive bias of the stationarity into DNN. However, since this architecture employs the kernel smoother known as the non-parametri… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: 13 pages, 7 figures

  34. arXiv:2210.11153  [pdf, other]

    eess.IV cs.CV

    Reversed Image Signal Processing and RAW Reconstruction. AIM 2022 Challenge Report

    Authors: Marcos V. Conde, Radu Timofte, Yibin Huang, Jingyang Peng, Chang Chen, Cheng Li, Eduardo Pérez-Pellitero, Fenglong Song, Furui Bai, Shuai Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yu Zhu, Chenghua Li, Yingying Jiang, Yong A, Peisong Wang, Cong Leng, Jian Cheng, Xiaoyu Liu, Zhicun Yin, Zhilu Zhang, Junyi Li, Ming Liu, et al. (18 additional authors not shown)

    Abstract: Cameras capture sensor RAW images and transform them into pleasant RGB images, suitable for the human eyes, using their integrated Image Signal Processor (ISP). Numerous low-level vision tasks operate in the RAW domain (e.g. image denoising, white balance) due to its linear relationship with the scene irradiance, wide-range of information at 12bits, and sensor designs. Despite this, RAW image data… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 Advances in Image Manipulation (AIM) workshop

  35. arXiv:2210.07762  [pdf, other]

    cs.CV

    Controllable Style Transfer via Test-time Training of Implicit Neural Representation

    Authors: Sunwoo Kim, Youngjo Min, Younghun Jung, Seungryong Kim

    Abstract: We propose a controllable style transfer framework based on Implicit Neural Representation that pixel-wisely controls the stylized output via test-time training. Unlike traditional image optimization methods that often suffer from unstable convergence and learning-based methods that require intensive training and have limited generalization ability, we present a model optimization framework that o… ▽ More

    Submitted 17 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Project Page: https://ku-cvlab.github.io/INR-st/

  36. arXiv:2209.06421  [pdf, other]

    cs.GR

    A Transfer Function Design Using A Knowledge Database based on Deep Image and Primitive Intensity Profile Features Retrieval

    Authors: Younhyun Jung, Jim Kong, Jinman Kim

    Abstract: Transfer function (TF) plays a key role for the generation of direct volume rendering (DVR), by enabling accurate identification of structures of interest (SOIs) interactively as well as ensuring appropriate visibility of them. Attempts at mitigating the repetitive manual process of TF design have led to approaches that make use of a knowledge database consisting of pre-designed TFs by domain expe… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: submitted to Computer Graphics Forum for review

  37. Deep Deformable 3D Caricatures with Learned Shape Control

    Authors: Yucheol Jung, Wonjong Jang, Soongjin Kim, Jiaolong Yang, Xin Tong, Seungyong Lee

    Abstract: A 3D caricature is an exaggerated 3D depiction of a human face. The goal of this paper is to model the variations of 3D caricatures in a compact parameter space so that we can provide a useful data-driven toolkit for handling 3D caricature deformations. To achieve the goal, we propose an MLP-based framework for building a deformable surface model, which takes a latent code and produces a 3D surfac… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: ACM SIGGRAPH 2022. For the project page, see https://ycjungsubhuman.github.io/DeepDeformable3DCaricatures

    ACM Class: I.3.4; I.2.6

  38. arXiv:2207.10025  [pdf, other]

    cs.CV

    Learning from Synthetic Data: Facial Expression Classification based on Ensemble of Multi-task Networks

    Authors: Jae-Yeop Jeong, Yeong-Gi Hong, JiYeon Oh, Sumin Hong, Jin-Woo Jeong, Yuchul Jung

    Abstract: Facial expression in-the-wild is essential for various interactive computing domains. Especially, "Learning from Synthetic Data" (LSD) is an important topic in the facial expression recognition task. In this paper, we propose a multi-task learning-based facial expression recognition approach which consists of emotion and appearance learning branches that can share all face information, and present… ▽ More

    Submitted 21 July, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Page 3, Added reference [2], [33]

  39. arXiv:2207.05374  [pdf, other]

    cs.CV

    Rethinking gradient weights' influence over saliency map estimation

    Authors: Masud An Nur Islam Fahim, Nazmus Saqib, Shafkat Khan Siam, Ho Yub Jung

    Abstract: Class activation map (CAM) helps to formulate saliency maps that aid in interpreting the deep neural network's prediction. Gradient-based methods are generally faster than other branches of vision interpretability and independent of human guidance. The performance of CAM-like studies depends on the governing model's layer response, and the influences of the gradients. Typical gradient-oriented CAM… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  40. arXiv:2207.05176  [pdf, other]

    cs.CV cs.LG eess.IV

    Denoising single images by feature ensemble revisited

    Authors: Masud An Nur Islam Fahim, Nazmus Saqib, Shafkat Khan Siam, Ho Yub Jung

    Abstract: Image denoising is still a challenging issue in many computer vision sub-domains. Recent studies show that significant improvements are made possible in a supervised setting. However, few challenges, such as spatial fidelity and cartoon-like smoothing remain unresolved or decisively overlooked. Our study proposes a simple yet efficient architecture for the denoising problem that addresses the afor… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  41. arXiv:2207.00555  [pdf, other]

    eess.AS cs.CL cs.LG

    FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning

    Authors: Yeonghyeon Lee, Kangwook Jang, Jahyun Goo, Youngmoon Jung, Hoirin Kim

    Abstract: Large-scale speech self-supervised learning (SSL) has emerged to the main field of speech processing, however, the problem of computational cost arising from its vast size makes a high entry barrier to academia. In addition, existing distillation techniques of speech SSL models compress the model by reducing layers, which induces performance degradation in linguistic pattern recognition tasks such… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to Interspeech 2022

  42. arXiv:2205.13561  [pdf]

    cs.RO cs.AI

    Physics-Guided Hierarchical Reward Mechanism for Learning-Based Robotic Grasping

    Authors: Yunsik Jung, Lingfeng Tao, Michael Bowman, Jiucai Zhang, Xiaoli Zhang

    Abstract: Learning-based grasping can afford real-time grasp motion planning of multi-fingered robotics hands thanks to its high computational efficiency. However, learning-based methods are required to explore large search spaces during the learning process. The search space causes low learning efficiency, which has been the main barrier to its practical adoption. In addition, the trained policy lacks a ge…

    Submitted 23 July, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

  43. arXiv:2203.13235  [pdf, other]

    cs.CV

    Facial Expression Recognition based on Multi-head Cross Attention Network

    Authors: Jae-Yeop Jeong, Yeong-Gi Hong, Daun Kim, Yuchul Jung, Jin-Woo Jeong

    Abstract: Facial expression in-the-wild is essential for various interactive computing domains. In this paper, we proposed an extended version of DAN model to address the VA estimation and facial expression challenges introduced in ABAW 2022. Our method produced preliminary results of 0.44 of mean CCC value for the VA estimation task, and 0.33 of the average F1 score for the expression classification task.

    Submitted 24 March, 2022; originally announced March 2022.

  44. arXiv:2203.03558  [pdf, other]

    cs.RO

    Hands-free Telelocomotion of a Wheeled Humanoid toward Dynamic Mobile Manipulation via Teleoperation

    Authors: Amartya Purushottam, Yeongtae Jung, Kevin Murphy, Donghoon Baek, Joao Ramos

    Abstract: Robotic systems that can dynamically combine manipulation and locomotion could facilitate dangerous or physically demanding labor. For instance, firefighter humanoid robots could leverage their body by leaning against collapsed building rubble to push it aside. Here we introduce a teleoperation system that targets the realization of these tasks using human whole-body motor skills. We describe a ne…

    Submitted 7 March, 2022; originally announced March 2022.

  45. arXiv:2203.03516  [pdf, other]

    cs.RO

    A Large Force Haptic Interface with Modular Linear Actuators

    Authors: Yeongtae Jung, Joao Ramos

    Abstract: This paper presents a haptic interface with modular linear actuators which can address limitations of conventional devices based on rotatory joints. The proposed haptic interface is composed of parallel linear actuators that provide high backdrivability and small inertia. The performance of the haptic interface is compared with the conventional mechanisms in terms of force capability, reflected in…

    Submitted 7 March, 2022; originally announced March 2022.

  46. arXiv:2201.10704  [pdf]

    cs.HC

    Mixed reality hologram slicer (mxdR-HS): a marker-less tangible user interface for interactive holographic volume visualization

    Authors: Hoijoon Jung, Younhyun Jung, Michael Fulham, Jinman Kim

    Abstract: Mixed reality head-mounted displays (mxdR-HMD) have the potential to visualize volumetric medical imaging data in holograms to provide a true sense of volumetric depth. An effective user interface, however, has yet to be thoroughly studied. Tangible user interfaces (TUIs) enable a tactile interaction with a hologram through an object. The object has physical properties indicating how it might be u…

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 17 pages

  47. arXiv:2112.12353  [pdf]

    cs.LG cs.DL cs.IR

    LAME: Layout Aware Metadata Extraction Approach for Research Articles

    Authors: Jongyun Choi, Hyesoo Kong, Hwamook Yoon, Heung-Seon Oh, Yuchul Jung

    Abstract: The volume of academic literature, such as academic conference papers and journals, has increased rapidly worldwide, and research on metadata extraction is ongoing. However, high-performing metadata extraction is still challenging due to diverse layout formats according to journal publishers. To accommodate the diversity of the layouts of academic journals, we propose a novel LAyout-aware Metadata…

    Submitted 22 December, 2021; originally announced December 2021.

    ACM Class: I.2.7

  48. arXiv:2112.01021  [pdf, other]

    cs.LG

    Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation

    Authors: Yeonsung Jung, Hajin Shim, June Yong Yang, Eunho Yang

    Abstract: Deep neural networks (DNNs), despite their impressive ability to generalize over-capacity networks, often rely heavily on malignant bias as shortcuts instead of task-related information for discriminative tasks. To address this problem, recent studies utilize auxiliary information related to the bias, which is rarely obtainable in practice, or sift through a handful of bias-free samples for debias…

    Submitted 5 July, 2023; v1 submitted 2 December, 2021; originally announced December 2021.

  49. arXiv:2112.00290  [pdf, other]

    cs.CV

    Unsupervised Statistical Learning for Die Analysis in Ancient Numismatics

    Authors: Andreas Heinecke, Emanuel Mayer, Abhinav Natarajan, Yoonju Jung

    Abstract: Die analysis is an essential numismatic method, and an important tool of ancient economic history. Yet, manual die studies are too labor-intensive to comprehensively study large coinages such as those of the Roman Empire. We address this problem by proposing a model for unsupervised computational die analysis, which can reduce the time investment necessary for large-scale die studies by several or…

    Submitted 1 December, 2021; originally announced December 2021.

  50. arXiv:2110.14794  [pdf, other]

    cs.CR cs.LG stat.ML

    Masked LARk: Masked Learning, Aggregation and Reporting worKflow

    Authors: Joseph J. Pfeiffer III, Denis Charles, Davis Gilton, Young Hun Jung, Mehul Parsana, Erik Anderson

    Abstract: Today, many web advertising data flows involve passive cross-site tracking of users. Enabling such a mechanism through the usage of third party tracking cookies (3PC) exposes sensitive user data to a large number of parties, with little oversight on how that data can be used. Thus, most browsers are moving towards removal of 3PC in subsequent browser iterations. In order to substantially improve e…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Microsoft Journal of Applied Research (MSJAR Volume 16)

    MSC Class: 68T07