Skip to main content

Showing 1–7 of 7 results for author: Choi, H K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.02501  [pdf, other

    cs.CL cs.AI

    PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning

    Authors: Hyeong Kyu Choi, Yixuan Li

    Abstract: Large Language Models (LLMs) are trained on massive text corpora, which are encoded with diverse personality traits. This triggers an interesting goal of eliciting a desired personality trait from the LLM, and probing its behavioral preferences. Accordingly, we formalize the persona elicitation task, aiming to customize LLM behaviors to align with a target persona. We present Persona In-Context Le… ▽ More

    Submitted 14 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  2. arXiv:2310.15484  [pdf, other

    cs.CL cs.AI

    NuTrea: Neural Tree Search for Context-guided Multi-hop KGQA

    Authors: Hyeong Kyu Choi, Seunghun Lee, Jaewon Chu, Hyunwoo J. Kim

    Abstract: Multi-hop Knowledge Graph Question Answering (KGQA) is a task that involves retrieving nodes from a knowledge graph (KG) to answer natural language questions. Recent GNN-based approaches formulate this task as a KG path searching problem, where messages are sequentially propagated from the seed node towards the answer nodes. However, these messages are past-oriented, and they do not consider the f… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Neural Information Processing Systems (NeurIPS) 2023

  3. arXiv:2303.13009  [pdf, other

    cs.CV

    MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models

    Authors: Dohwan Ko, Joonmyung Choi, Hyeong Kyu Choi, Kyoung-Woon On, Byungseok Roh, Hyunwoo J. Kim

    Abstract: Foundation models have shown outstanding performance and generalization capabilities across domains. Since most studies on foundation models mainly focus on the pretraining phase, a naive strategy to minimize a single task-specific loss is adopted for fine-tuning. However, such fine-tuning methods do not fully leverage other losses that are potentially beneficial for the target task. Therefore, we… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted paper at CVPR 2023

  4. arXiv:2212.00975  [pdf, other

    cs.CL cs.AI

    Relation-Aware Language-Graph Transformer for Question Answering

    Authors: **young Park, Hyeong Kyu Choi, Juyeon Ko, Hyeon** Park, Ji-Hoon Kim, Jisu Jeong, Kyungmin Kim, Hyunwoo J. Kim

    Abstract: Question Answering (QA) is a task that entails reasoning over natural language contexts, and many relevant works augment language models (LMs) with graph neural networks (GNNs) to encode the Knowledge Graph (KG) information. However, most existing GNN-based modules for QA do not take advantage of rich relational information of KGs and depend on limited information interaction between the LM and th… ▽ More

    Submitted 25 April, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: AAAI2023 (accepted)

  5. arXiv:2210.07562  [pdf, other

    cs.CV

    TokenMixup: Efficient Attention-guided Token-level Data Augmentation for Transformers

    Authors: Hyeong Kyu Choi, Joonmyung Choi, Hyunwoo J. Kim

    Abstract: Mixup is a commonly adopted data augmentation technique for image classification. Recent advances in mixup methods primarily focus on mixing based on saliency. However, many saliency detectors require intense computation and are especially burdensome for parameter-heavy transformer models. To this end, we propose TokenMixup, an efficient attention-guided token-level data augmentation method that a… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted paper at NeurIPS 2022

  6. arXiv:2204.04836  [pdf, other

    cs.CV

    Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection

    Authors: Jihwan Park, SeungJun Lee, Hwan Heo, Hyeong Kyu Choi, Hyunwoo J. Kim

    Abstract: Human-Object Interaction detection is a holistic visual recognition task that entails object detection as well as interaction classification. Previous works of HOI detection has been addressed by the various compositions of subset predictions, e.g., Image -> HO -> I, Image -> HI -> O. Recently, transformer based architecture for HOI has emerged, which directly predicts the HOI triplets in an end-t… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: CVPR2022 accepted

  7. arXiv:1808.01560  [pdf, other

    cs.CE cs.LG q-fin.PM

    Stock Price Correlation Coefficient Prediction with ARIMA-LSTM Hybrid Model

    Authors: Hyeong Kyu Choi

    Abstract: Predicting the price correlation of two assets for future time periods is important in portfolio optimization. We apply LSTM recurrent neural networks (RNN) in predicting the stock price correlation coefficient of two individual stocks. RNNs are competent in understanding temporal dependencies. The use of LSTM cells further enhances its long term predictive properties. To encompass both linearity… ▽ More

    Submitted 1 October, 2018; v1 submitted 5 August, 2018; originally announced August 2018.

    Comments: I'd appreciate any kind of comments on my work. Feel free to email me!