
Showing 1–50 of 323 results for author: Shin, J

Searching in archive cs.
  1. arXiv:2406.12904  [pdf, other]

    cs.LG physics.comp-ph physics.optics

    Meent: Differentiable Electromagnetic Simulator for Machine Learning

    Authors: Yongha Kim, Anthony W. Jung, Sanmun Kim, Kevin Octavian, Doyoung Heo, Chaejin Park, Jeongmin Shin, Sunghyun Nam, Chanhyung Park, Juho Park, Sangjun Han, Jinmyoung Lee, Seolho Kim, Min Seok Jang, Chan Y. Park

    Abstract: Electromagnetic (EM) simulation plays a crucial role in analyzing and designing devices with sub-wavelength scale structures such as solar cells, semiconductor devices, image sensors, future displays and integrated photonic devices. Specifically, optics problems such as estimating semiconductor device structures and designing nanophotonic devices provide intriguing research topics with far-reachin…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: under review

  2. arXiv:2406.08796  [pdf, other]

    cs.CL

    Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning

    Authors: Janghoon Han, Changho Lee, Joongbo Shin, Stanley Jungkyu Choi, Honglak Lee, Kynghoon Bae

    Abstract: Instruction tuning has emerged as a powerful technique, significantly boosting zero-shot performance on unseen tasks. While recent work has explored cross-lingual generalization by applying instruction tuning to multilingual models, previous studies have primarily focused on English, with a limited exploration of non-English tasks. For an in-depth exploration of cross-lingual generalization in ins…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024 (Camera-ready), by Janghoon Han and Changho Lee, with equal contribution

  3. arXiv:2406.08527  [pdf, other]

    cs.LG cs.AI

    Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

    Authors: Jaehyun Nam, Kyuyoung Kim, Seunghyuk Oh, Jihoon Tack, Jaehyung Kim, Jinwoo Shin

    Abstract: Learning effective representations from raw data is crucial for the success of deep learning methods. However, in the tabular domain, practitioners often prefer augmenting raw column features over using learned representations, as conventional tree-based algorithms frequently outperform competing approaches. As a result, feature engineering methods that automatically generate candidate features ha…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 18 pages

  4. arXiv:2406.07398  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Visual Representation Learning with Stochastic Frame Prediction

    Authors: Huiwon Jang, Dongyoung Kim, Junsu Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo

    Abstract: Self-supervised learning of image representations by predicting future frames is a promising direction but still remains a challenge. This is because of the under-determined nature of frame prediction; multiple potential futures can arise from a single current frame. To tackle this challenge, in this paper, we revisit the idea of stochastic video generation that learns to capture uncertainty in fr…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: International Conference on Machine Learning (ICML) 2024

  5. arXiv:2406.05761  [pdf, other]

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  6. arXiv:2406.04639  [pdf, other]

    cs.LG cs.AI cs.CV

    Cooperative Meta-Learning with Gradient Augmentation

    Authors: Jongyun Shin, Seunjin Han, Jangho Kim

    Abstract: Model agnostic meta-learning (MAML) is one of the most widely used gradient-based meta-learning, consisting of two optimization loops: an inner loop and outer loop. MAML learns the new task from meta-initialization parameters with an inner update and finds the meta-initialization parameters in the outer loop. In general, the injection of noise into the gradient of the model for augmenting the grad…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted to UAI 2024

  7. arXiv:2406.04412  [pdf, other]

    cs.LG cs.AI cs.CL

    Aligning Large Language Models with Self-generated Preference Data

    Authors: Dongyoung Kim, Kimin Lee, Jinwoo Shin, Jaehyung Kim

    Abstract: Aligning large language models (LLMs) with human preferences becomes a key component to obtaining state-of-the-art performance, but it yields a huge cost to construct a large human-annotated preference dataset. To tackle this problem, we propose a new framework that boosts the alignment of LLMs through Self-generated Preference data (Selfie) using only a very small amount of human-annotated prefer…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 18 pages, under review

  8. arXiv:2406.04064  [pdf, other]

    cs.CL cs.AI cs.CY

    Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models

    Authors: Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong C. Park

    Abstract: Social bias is shaped by the accumulation of social perceptions towards targets across various demographic identities. To fully understand such social bias in large language models (LLMs), it is essential to consider the composite of social perceptions from diverse perspectives among identities. Previous studies have either evaluated biases in LLMs by indirectly assessing the presence of sentiment…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024

  9. arXiv:2405.17382  [pdf, other]

    cs.LG cs.CL

    ReMoDetect: Reward Models Recognize Aligned LLM's Generations

    Authors: Hyunseok Lee, Jihoon Tack, Jinwoo Shin

    Abstract: The remarkable capabilities and easy accessibility of large language models (LLMs) have significantly increased societal risks (e.g., fake news generation), necessitating the development of LLM-generated text (LGT) detection methods for safe usage. However, detecting LGTs is challenging due to the vast number of LLMs, making it impractical to account for each LLM individually; hence, it is crucial…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 20 pages

  10. arXiv:2405.16082  [pdf]

    cs.CV cs.AI

    Uncertainty Measurement of Deep Learning System based on the Convex Hull of Training Sets

    Authors: Hyekyoung Hwang, Jitae Shin

    Abstract: Deep Learning (DL) has made remarkable achievements in computer vision and adopted in safety critical domains such as medical imaging or autonomous drive. Thus, it is necessary to understand the uncertainty of the model to effectively reduce accidents and losses due to misjudgment of the Deep Neural Networks (DNN). This can start by efficiently selecting data that could potentially malfunction to…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 11 pages

  11. arXiv:2405.14750   

    astro-ph.SR cs.AI cs.LG

    Extreme Solar Flare Prediction Using Residual Networks with HMI Magnetograms and Intensitygrams

    Authors: Juyoung Yun, Jungmin Shin

    Abstract: Solar flares, especially C, M, and X class, pose significant risks to satellite operations, communication systems, and power grids. We present a novel approach for predicting extreme solar flares using HMI intensitygrams and magnetograms. By detecting sunspots from intensitygrams and extracting magnetic field patches from magnetograms, we train a Residual Network (ResNet) to classify extreme class…

    Submitted 19 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: The dataset has some noise, so we need to make new data to train more robust model

  12. arXiv:2405.11828  [pdf, other]

    cs.LG

    Federated Learning with Incomplete Sensing Modalities

    Authors: Adiba Orzikulova, Jaehyun Kwak, Jaemin Shin, Sung-Ju Lee

    Abstract: Many mobile sensing applications utilize data from various modalities, including motion and physiological sensors in mobile and wearable devices. Federated Learning (FL) is particularly suitable for these applications thanks to its privacy-preserving feature. However, challenges such as limited battery life, poor network conditions, and sensor malfunctions can restrict the use of all available mod…

    Submitted 20 May, 2024; originally announced May 2024.

  13. arXiv:2405.09802  [pdf, other]

    astro-ph.SR astro-ph.EP cs.AI cs.LG

    Analysis and Predictive Modeling of Solar Coronal Holes Using Computer Vision and LSTM Networks

    Authors: Juyoung Yun, Jungmin Shin

    Abstract: In the era of space exploration, coronal holes on the sun play a significant role due to their impact on satellites and aircraft through their open magnetic fields and increased solar wind emissions. This study employs computer vision techniques to detect coronal hole regions and estimate their sizes using imagery from the Solar Dynamics Observatory (SDO). Additionally, we utilize deep learning me…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: submitted to SPAICE Conference 2024

  14. arXiv:2405.09550  [pdf, other]

    cs.CV cs.AI cs.CR

    Mask-based Invisible Backdoor Attacks on Object Detection

    Authors: Jeongjin Shin

    Abstract: Deep learning models have achieved unprecedented performance in the domain of object detection, resulting in breakthroughs in areas such as autonomous driving and security. However, deep learning models are vulnerable to backdoor attacks. These attacks prompt models to behave similarly to standard models without a trigger; however, they act maliciously upon detecting a predefined trigger. Despite…

    Submitted 4 June, 2024; v1 submitted 20 March, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures

    ACM Class: I.4.8

  15. arXiv:2405.02845  [pdf, other]

    cs.LG q-bio.MN

    Data-Efficient Molecular Generation with Hierarchical Textual Inversion

    Authors: Seojin Kim, Jaehyun Nam, Sihyun Yu, Younghoon Shin, Jinwoo Shin

    Abstract: Developing an effective molecular generation framework even with a limited number of molecules is often important for its practical deployment, e.g., drug discovery, since acquiring task-related molecular data requires expensive and time-consuming experimental costs. To tackle this issue, we introduce Hierarchical textual Inversion for Molecular generation (HI-Mol), a novel data-efficient molecula…

    Submitted 5 May, 2024; originally announced May 2024.

  16. arXiv:2405.01535  [pdf, other]

    cs.CL

    Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

    Authors: Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo

    Abstract: Proprietary LMs such as GPT-4 are often employed to assess the quality of responses from various LMs. However, concerns including transparency, controllability, and affordability strongly motivate the development of open-source LMs specialized in evaluations. On the other hand, existing open evaluator LMs exhibit critical shortcomings: 1) they issue scores that significantly diverge from those ass…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Work in Progress

  17. arXiv:2404.13949  [pdf, other]

    cs.CV cs.RO

    PeLiCal: Targetless Extrinsic Calibration via Penetrating Lines for RGB-D Cameras with Limited Co-visibility

    Authors: Jaeho Shin, Seungsang Yun, Ayoung Kim

    Abstract: RGB-D cameras are crucial in robotic perception, given their ability to produce images augmented with depth data. However, their limited FOV often requires multiple cameras to cover a broader area. In multi-camera RGB-D setups, the goal is typically to reduce camera overlap, optimizing spatial coverage with as few cameras as possible. The extrinsic calibration of these systems introduces additiona…

    Submitted 23 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  18. arXiv:2404.13081  [pdf, other]

    cs.CL cs.AI cs.LG

    SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs

    Authors: Jaehyung Kim, Jaehyun Nam, Sangwoo Mo, Jongjin Park, Sang-Woo Lee, Minjoon Seo, Jung-Woo Ha, Jinwoo Shin

    Abstract: Large language models (LLMs) have made significant advancements in various natural language processing tasks, including question answering (QA) tasks. While incorporating new information with the retrieval of relevant passages is a promising way to improve QA with LLMs, the existing methods often require additional fine-tuning which becomes infeasible with recent LLMs. Augmenting retrieved passage…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted at ICLR 2024

  19. arXiv:2404.12168  [pdf, other]

    cs.CV cs.AI

    Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization

    Authors: Insoo Kim, Jae Seok Choi, Geonseok Seo, Kinam Kwon, Jinwoo Shin, Hyong-Euk Lee

    Abstract: As recent advances in mobile camera technology have enabled the capability to capture high-resolution images, such as 4K images, the demand for an efficient deblurring model handling large motion has increased. In this paper, we discover that the image residual errors, i.e., blur-sharp pixel differences, can be grouped into some categories according to their motion blur type and how complex their…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: CVPR2024 Camera-Ready

  20. arXiv:2404.10308  [pdf, other]

    cs.LG cs.AI

    Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs

    Authors: Woomin Song, Seunghyuk Oh, Sangwoo Mo, Jaehyung Kim, Sukmin Yun, Jung-Woo Ha, Jinwoo Shin

    Abstract: Large language models (LLMs) have shown remarkable performance in various natural language processing tasks. However, a primary constraint they face is the context limit, i.e., the maximum number of tokens they can process. Previous works have explored architectural changes and modifications in positional encoding to relax the constraint, but they often require expensive training or do not address…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted to ICLR 2024. The first two authors contributed equally

  21. arXiv:2404.02135  [pdf]

    cs.CV eess.IV

    Enhancing Ship Classification in Optical Satellite Imagery: Integrating Convolutional Block Attention Module with ResNet for Improved Performance

    Authors: Ryan Donghan Kwon, Gangjoo Robin Nam, Jisoo Tak, Junseob Shin, Hyerin Cha, Yeom Hyeok, Seung Won Lee

    Abstract: This study presents an advanced Convolutional Neural Network (CNN) architecture for ship classification from optical satellite imagery, significantly enhancing performance through the integration of the Convolutional Block Attention Module (CBAM) and additional architectural innovations. Building upon the foundational ResNet50 model, we first incorporated a standard CBAM to direct the model's focu…

    Submitted 8 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  22. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  23. arXiv:2404.01863  [pdf, other]

    cs.LG cs.AI

    Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

    Authors: Kyuyoung Kim, Jongheon Jeong, Minyong An, Mohammad Ghavamzadeh, Krishnamurthy Dvijotham, Jinwoo Shin, Kimin Lee

    Abstract: Fine-tuning text-to-image models with reward functions trained on human feedback data has proven effective for aligning model behavior with human intent. However, excessive optimization with such reward models, which serve as mere proxy objectives, can compromise the performance of fine-tuned models, a phenomenon known as reward overoptimization. To investigate this issue in depth, we introduce th…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: ICLR 2024

  24. arXiv:2404.00972  [pdf, other]

    cs.IR

    Cross-channel Recommendation for Multi-channel Retail

    Authors: Yijin Choi, Jongkyung Shin, Chiehyeon Lim

    Abstract: An increasing number of brick-and-mortar retailers are expanding their channels to the online domain, transforming them into multi-channel retailers. This transition emphasizes the need for cross-channel recommender systems, aiming to enhance revenue across both offline and online channels. Given that each retail channel represents a separate domain with a unique context, this can be regarded as a…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 5 pages, 2 figures, 3 tables

  25. arXiv:2403.17287  [pdf, other]

    cs.LG cs.DC

    Not All Federated Learning Algorithms Are Created Equal: A Performance Evaluation Study

    Authors: Gustav A. Baumgart, Jaemin Shin, Ali Payani, Myungjin Lee, Ramana Rao Kompella

    Abstract: Federated Learning (FL) emerged as a practical approach to training a model from decentralized data. The proliferation of FL led to the development of numerous FL algorithms and mechanisms. Many prior efforts have given their primary focus on accuracy of those approaches, but there exists little understanding of other aspects such as computational overheads, performance and training stability, etc…

    Submitted 25 March, 2024; originally announced March 2024.

  26. arXiv:2403.15594  [pdf, other]

    cs.CY cs.LG

    Analyzing Male Domestic Violence through Exploratory Data Analysis and Explainable Machine Learning Insights

    Authors: Md Abrar Jahin, Saleh Akram Naife, Fatema Tuj Johora Lima, M. F. Mridha, Jungpil Shin

    Abstract: Domestic violence, which is often perceived as a gendered issue among female victims, has gained increasing attention in recent years. Despite this focus, male victims of domestic abuse remain primarily overlooked, particularly in Bangladesh. Our study represents a pioneering exploration of the underexplored realm of male domestic violence (MDV) within the Bangladeshi context, shedding light on it…

    Submitted 22 March, 2024; originally announced March 2024.

  27. arXiv:2403.14966  [pdf, other]

    cs.CV

    DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow

    Authors: Kyungmin Lee, Kihyuk Sohn, Jinwoo Shin

    Abstract: Recent progress in text-to-3D generation has been achieved through the utilization of score distillation methods: they make use of the pre-trained text-to-image (T2I) diffusion models by distilling via the diffusion model training objective. However, such an approach inevitably results in the use of random timesteps at each update, which increases the variance of the gradient and ultimately prolon…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  28. arXiv:2403.14148  [pdf, other]

    cs.CV cs.LG

    Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

    Authors: Sihyun Yu, Weili Nie, De-An Huang, Boyi Li, Jinwoo Shin, Anima Anandkumar

    Abstract: Video diffusion models have recently made great progress in generation quality, but are still limited by the high memory and computational requirements. This is because current video diffusion models often attempt to process high-dimensional videos directly. To tackle this issue, we propose content-motion latent diffusion model (CMD), a novel efficient extension of pretrained image diffusion model…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: ICLR 2024. Project page: https://sihyun.me/CMD

  29. arXiv:2403.14111  [pdf, other]

    cs.CR cs.LG

    HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption

    Authors: Seewoo Lee, Garam Lee, Jung Woo Kim, Junbum Shin, Mun-Kyu Lee

    Abstract: Transfer learning is a de facto standard method for efficiently training machine learning models for data-scarce problems by adding and fine-tuning new classification layers to a model pre-trained on large datasets. Although numerous previous studies proposed to use homomorphic encryption to resolve the data privacy issue in transfer learning in the machine learning as a service setting, most of t…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: ICML 2023, Appendix D includes some updates after official publication

    Journal ref: PMLR 202:19010-19035, 2023

  30. arXiv:2403.10834  [pdf, other]

    cs.CV cs.AI cs.LG

    SF(DA)$^2$: Source-free Domain Adaptation Through the Lens of Data Augmentation

    Authors: Uiwon Hwang, Jonghyun Lee, Juhyeon Shin, Sungroh Yoon

    Abstract: In the face of the deep learning model's vulnerability to domain shift, source-free domain adaptation (SFDA) methods have been proposed to adapt models to new, unseen target domains without requiring access to source domain data. Although the potential benefits of applying data augmentation to SFDA are attractive, several challenges arise such as the dependence on prior knowledge of class-preservi…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: ICLR 2024. Code: https://github.com/shinyflight/SFDA2

  31. arXiv:2403.07366  [pdf, other]

    cs.CV cs.LG

    Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors

    Authors: Jonghyun Lee, Dahuin Jung, Saehyung Lee, Junsung Park, Juhyeon Shin, Uiwon Hwang, Sungroh Yoon

    Abstract: Test-time adaptation (TTA) fine-tunes pre-trained deep neural networks for unseen test data. The primary challenge of TTA is limited access to the entire test dataset during online updates, causing error accumulation. To mitigate it, TTA methods have utilized the model output's entropy as a confidence metric that aims to determine which samples have a lower likelihood of causing error. Through exp…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 Spotlight; 26 pages, 9 figures, 20 tables;

  32. arXiv:2403.07355  [pdf, ps, other]

    eess.SP cs.AI cs.CV

    Vector Quantization for Deep-Learning-Based CSI Feedback in Massive MIMO Systems

    Authors: Junyong Shin, Yujin Kang, Yo-Seb Jeon

    Abstract: This paper presents a finite-rate deep-learning (DL)-based channel state information (CSI) feedback method for massive multiple-input multiple-output (MIMO) systems. The presented method provides a finite-bit representation of the latent vector based on a vector-quantized variational autoencoder (VQ-VAE) framework while reducing its computational complexity based on shape-gain vector quantization.…

    Submitted 12 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  33. arXiv:2403.05139  [pdf, other]

    cs.CV

    Improving Diffusion Models for Virtual Try-on

    Authors: Yisol Choi, Sangkyung Kwak, Kyungmin Lee, Hyungwon Choi, Jinwoo Shin

    Abstract: This paper considers image-based virtual try-on, which renders an image of a person wearing a curated garment, given a pair of images depicting the person and the garment, respectively. Previous works adapt existing exemplar-based inpainting diffusion models for virtual try-on to improve the naturalness of the generated visuals compared to other methods (e.g., GAN-based), but they fail to preserve…

    Submitted 19 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  34. arXiv:2403.04583  [pdf, other]

    cs.CV

    Unbiased Estimator for Distorted Conics in Camera Calibration

    Authors: Chaehyeon Song, Jaeho Shin, Myung-Hwan Jeon, Jongwoo Lim, Ayoung Kim

    Abstract: In the literature, points and conics have been major features for camera geometric calibration. Although conics are more informative features than points, the loss of the conic property under distortion has critically limited the utility of conic features in camera calibration. Many existing approaches addressed conic-based calibration by ignoring distortion or introducing 3D spherical targets to…

    Submitted 9 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  35. arXiv:2403.04317  [pdf, other]

    cs.LG cs.CL

    Online Adaptation of Language Models with a Memory of Amortized Contexts

    Authors: Jihoon Tack, Jaehyung Kim, Eric Mitchell, Jinwoo Shin, Yee Whye Teh, Jonathan Richard Schwarz

    Abstract: Due to the rapid generation and dissemination of information, large language models (LLMs) quickly run out of date despite enormous development costs. Due to this crucial need to keep models updated, online learning has emerged as a critical necessity when utilizing LLMs for real-world applications. However, given the ever-expanding corpus of unseen documents and the large parameter space of moder…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 14 pages

  36. arXiv:2402.12004  [pdf, other]

    cs.CV

    Direct Consistency Optimization for Compositional Text-to-Image Personalization

    Authors: Kyungmin Lee, Sangkyung Kwak, Kihyuk Sohn, Jinwoo Shin

    Abstract: Text-to-image (T2I) diffusion models, when fine-tuned on a few personal images, are able to generate visuals with a high degree of consistency. However, they still lack in synthesizing images of different scenarios or styles that are possible in the original pretrained models. To address this, we propose to fine-tune the T2I model by maximizing consistency to reference images, while penalizing the…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Preprint. See our project page (https://dco-t2i.github.io/) for more examples and codes

  37. arXiv:2402.09004  [pdf, other]

    cs.CV cs.LG

    Gradient Alignment with Prototype Feature for Fully Test-time Adaptation

    Authors: Juhyeon Shin, Jonghyun Lee, Saehyung Lee, Minjun Park, Dongjun Lee, Uiwon Hwang, Sungroh Yoon

    Abstract: In context of Test-time Adaptation(TTA), we propose a regularizer, dubbed Gradient Alignment with Prototype feature (GAP), which alleviates the inappropriate guidance from entropy minimization loss from misclassified pseudo label. We developed a gradient alignment loss to precisely manage the adaptation process, ensuring that changes made for some data don't negatively impact the model's performan…

    Submitted 14 February, 2024; originally announced February 2024.

  38. arXiv:2402.06264  [pdf]

    cs.AI cs.CL cs.SI

    LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education

    Authors: Unggi Lee, Minji Jeon, Yunseo Lee, Gyuri Byun, Yoorim Son, Jaeyoon Shin, Hongkyu Ko, Hyeoncheol Kim

    Abstract: Art appreciation is vital in nurturing critical thinking and emotional intelligence among learners. However, traditional art appreciation education has often been hindered by limited access to art resources, especially for disadvantaged students, and an imbalanced emphasis on STEM subjects in mainstream education. In response to these challenges, recent technological advancements have paved the wa…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 37 pages, 4 figures, 10 tables

  39. arXiv:2402.02834  [pdf, other]

    cs.LG cs.CL

    Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods

    Authors: Bo-Kyeong Kim, Geonmin Kim, Tae-Ho Kim, Thibault Castells, Shinkook Choi, Junho Shin, Hyoung-Kyu Song

    Abstract: Structured pruning of modern large language models (LLMs) has emerged as a way of decreasing their high computational needs. Width pruning reduces the size of projection weight matrices (e.g., by removing attention heads) while maintaining the number of layers. Depth pruning, in contrast, removes entire layers or blocks, while keeping the size of the remaining weights unchanged. Most current resea…

    Submitted 23 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Update (arXiv-v2): continued pretraining for severe pruning ratios, compatibility with quantization, and enhanced baselines. Preliminary work (arXiv-v1) accepted at ICLR 2024 Workshop on ME-FoMo: https://openreview.net/forum?id=18VGxuOdpu

  40. arXiv:2401.09787  [pdf, other]

    cs.LG cs.AI stat.ML

    Querying Easily Flip-flopped Samples for Deep Active Learning

    Authors: Seong Jin Cho, Gwangsu Kim, Junghyun Lee, Jinwoo Shin, Chang D. Yoo

    Abstract: Active learning is a machine learning paradigm that aims to improve the performance of a model by strategically selecting and querying unlabeled data. One effective selection strategy is to base it on the model's predictive uncertainty, which can be interpreted as a measure of how informative a sample is. The sample's distance to the decision boundary is a natural measure of predictive uncertainty…

    Submitted 16 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 34 pages, 17 figures, 5 tables. Accepted to the 12th International Conference on Learning Representations (ICLR 2024) (ver2: fixed some typos and improved some parts of the writing)

  41. arXiv:2401.06243  [pdf, other]

    cs.RO

    Modularis: Modular Underwater Robot for Rapid Development and Validation of Autonomous Systems

    Authors: Baker Herrin, Victoria Close, Nathan Berner, Joshua Herbert, Ethan Reussow, Ryan James, Cale Woodward, Jared Mindlin, Sebastian Paez, Nilson Bretas, Jane Shin

    Abstract: Autonomous underwater robots typically require higher cost and time for demonstrations compared to other domains due to the complexity of the environment. Due to the limited capacity and payload flexibility, it is challenging to find off-the-shelf underwater robots that are affordable, customizable, and subject to environmental variability. Custom-built underwater robots may be necessary for speci…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 7 pages, 13 figures, presented at OCEANS 2023

  42. arXiv:2401.03123  [pdf, ps, other]

    stat.ME cs.LG

    A least distance estimator for a multivariate regression model using deep neural networks

    Authors: Jungmin Shin, Seung Jun Shin, Sungwan Bang

    Abstract: We propose a deep neural network (DNN) based least distance (LD) estimator (DNN-LD) for a multivariate regression problem, addressing the limitations of the conventional methods. Due to the flexibility of a DNN structure, both linear and nonlinear conditional mean functions can be easily modeled, and a multivariate regression model can be realized by simply adding extra nodes at the output layer.…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Submitted to 'Journal of Statistical Computation and Simulation'

  43. arXiv:2312.16397  [pdf, other]

    cs.CG cs.DS

    Approximate Distance and Shortest-Path Oracles for Fault-Tolerant Geometric Spanners

    Authors: Kyung** Cho, Jihun Shin, Eun** Oh

    Abstract: In this paper, we present approximate distance and shortest-path oracles for fault-tolerant Euclidean spanners motivated by the routing problem in real-world road networks. An $f$-fault-tolerant Euclidean $t$-spanner for a set $V$ of $n$ points in $\mathbb{R}^d$ is a graph $G=(V,E)$ where, for any two points $p$ and $q$ in $V$ and a set $F$ of $f$ vertices of $V$, the distance between $p$ and $q$…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  44. arXiv:2312.13521  [pdf]

    q-bio.GN cs.AI

    Preparing to Integrate Generative Pretrained Transformer Series 4 models into Genetic Variant Assessment Workflows: Assessing Performance, Drift, and Nondeterminism Characteristics Relative to Classifying Functional Evidence in Literature

    Authors: Samuel J. Aronson, Kalotina Machini, Jiyeon Shin, Pranav Sriraman, Sean Hamill, Emma R. Henricks, Charlotte Mailly, Angie J. Nottage, Sami S. Amr, Michael Oates, Matthew S. Lebo

    Abstract: Background. Large Language Models (LLMs) hold promise for improving genetic variant literature review in clinical testing. We assessed Generative Pretrained Transformer 4's (GPT-4) performance, nondeterminism, and drift to inform its suitability for use in complex clinical processes. Methods. A 2-prompt process for classification of functional evidence was optimized using a development set of 45 a…

    Submitted 16 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 5 pages, 1 table, 4 figures, 2 supplementary tables, 1 supplementary figure. These authors contributed equally: Samuel J. Aronson, Kalotina Machini, and Jiyeon Shin. Corresponding author: Samuel J. Aronson

  45. arXiv:2312.05465  [pdf, other]

    cs.LG eess.SY

    On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR

    Authors: Jaeuk Shin, Giho Kim, Howon Lee, Joonho Han, Insoon Yang

    Abstract: Designing a competent meta-reinforcement learning (meta-RL) algorithm in terms of data usage remains a central challenge to be tackled for its successful real-world applications. In this paper, we propose a sample-efficient meta-RL algorithm that learns a model of the system or environment at hand in a task-directed manner. As opposed to the standard model-based approaches to meta-RL, our method e…

    Submitted 8 December, 2023; originally announced December 2023.

  46. arXiv:2311.10306  [pdf, other]

    eess.IV cs.CV cs.LG

    MPSeg : Multi-Phase strategy for coronary artery Segmentation

    Authors: Jonghoe Ku, Yong-Hee Lee, Junsup Shin, In Kyu Lee, Hyun-Woo Kim

    Abstract: Accurate segmentation of coronary arteries is a pivotal process in assessing cardiovascular diseases. However, the intricate structure of the cardiovascular system presents significant challenges for automatic segmentation, especially when utilizing methodologies like the SYNTAX Score, which relies extensively on detailed structural information for precise risk stratification. To address these dif…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: MICCAI 2023 Conference ARCADE Challenge

  47. arXiv:2311.10281  [pdf, other]

    cs.CV

    SSASS: Semi-Supervised Approach for Stenosis Segmentation

    Authors: In Kyu Lee, Junsup Shin, Yong-Hee Lee, Jonghoe Ku, Hyun-Woo Kim

    Abstract: Coronary artery stenosis is a critical health risk, and its precise identification in Coronary Angiography (CAG) can significantly aid medical practitioners in accurately evaluating the severity of a patient's condition. The complexity of coronary artery structures combined with the inherent noise in X-ray images poses a considerable challenge to this task. To tackle these obstacles, we introduce…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: MICCAI 2023 Conference ARCADE Challenge

  48. arXiv:2310.16779  [pdf, other]

    cs.LG cs.AI stat.ML

    Multi-scale Diffusion Denoised Smoothing

    Authors: Jongheon Jeong, **woo Shin

    Abstract: Along with recent diffusion models, randomized smoothing has become one of a few tangible approaches that offers adversarial robustness to models at scale, e.g., those of large pre-trained models. Specifically, one can perform randomized smoothing on any classifier via a simple "denoise-and-classify" pipeline, so-called denoised smoothing, given that an accurate denoiser is available - such as dif…

    Submitted 27 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at NeurIPS 2023; Code is available at https://github.com/jh-jeong/smoothing-multiscale

  49. arXiv:2310.16538  [pdf, other]

    cs.CL cs.AI cs.LG

    FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning

    Authors: Jaemin Shin, Hyungjun Yoon, Seungjoo Lee, Sungjoon Park, Yunxin Liu, **ho D. Choi, Sung-Ju Lee

    Abstract: Psychiatrists diagnose mental disorders via the linguistic use of patients. Still, due to data privacy, existing passive mental health monitoring systems use alternative features such as activity, app usage, and location via mobile devices. We propose FedTherapist, a mobile mental health monitoring system that utilizes continuous speech and keyboard input in a privacy-preserving way via federated…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  50. arXiv:2310.16360  [pdf]

    cs.AI cs.RO

    A Comprehensive Review of AI-enabled Unmanned Aerial Vehicle: Trends, Vision, and Challenges

    Authors: Osim Kumar Pal, Md Sakib Hossain Shovon, M. F. Mridha, Jungpil Shin

    Abstract: In recent years, the combination of artificial intelligence (AI) and unmanned aerial vehicles (UAVs) has brought about advancements in various areas. This comprehensive analysis explores the changing landscape of AI-powered UAVs and friendly computing in their applications. It covers emerging trends, futuristic visions, and the inherent challenges that come with this relationship. The study examin…

    Submitted 25 October, 2023; originally announced October 2023.