Showing 1–5 of 5 results for author: Kwon, H J

  1. arXiv:2406.17990  [pdf, other]

    cs.CL cs.AI cs.LG

    Explicit Diversity Conditions for Effective Question Answer Generation with Large Language Models

    Authors: Vikas Yadav, Hyuk Joon Kwon, Vijay Srinivasan, Hongxia **

    Abstract: Question Answer Generation (QAG) is an effective data augmentation technique to improve the accuracy of question answering systems, especially in low-resource domains. While recent pretrained and large language model-based QAG methods have made substantial progress, they face the critical issue of redundant QA pair generation, affecting downstream QA systems. Implicit diversity techniques such as…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Published at COLING 2024

  2. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  3. arXiv:2402.10462  [pdf, other]

    cs.LG cs.CL

    QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning

    Authors: Hossein Rajabzadeh, Mojtaba Valipour, Tianshu Zhu, Marzieh Tahaei, Hyock Ju Kwon, Ali Ghodsi, Boxing Chen, Mehdi Rezagholizadeh

    Abstract: Finetuning large language models requires huge GPU memory, restricting the choice to acquire larger models. While the quantized version of the Low-Rank Adaptation technique, named QLoRA, significantly alleviates this issue, finding the efficient LoRA rank is still challenging. Moreover, QLoRA is trained on a pre-defined rank and, therefore, cannot be reconfigured for its lower ranks without requir…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Best Paper Award AAAI EIW Workshop

  4. arXiv:2309.08922  [pdf, other]

    cs.CL

    Multimodal Multi-Hop Question Answering Through a Conversation Between Tools and Efficiently Finetuned Large Language Models

    Authors: Hossein Rajabzadeh, Suyuchen Wang, Hyock Ju Kwon, Bang Liu

    Abstract: We employ a tool-interacting divide-and-conquer strategy enabling large language models (LLMs) to answer complex multimodal multi-hop questions. In particular, we harness the power of large language models to divide a given multimodal multi-hop question into unimodal single-hop sub-questions to be answered by the appropriate tool from a predefined set of tools. After all corresponding tools provid…

    Submitted 16 September, 2023; originally announced September 2023.

  5. arXiv:2201.12572  [pdf, ps, other]

    cs.LO

    Logical Pseudocode: Connecting Algorithms with Proofs

    Authors: Keehang Kwon, Hyung Joon Kwon

    Abstract: Proofs (sequent calculus, natural deduction) and imperative algorithms (pseudocodes) are two well-known coexisting concepts. Then what is their relationship? Our answer is that \[ imperative\ algorithms\ =\ proofs\ with\ cuts \] This observation leads to a generalization to pseudocodes which we call {\it logical pseudocodes}. It is similar to natural deduction proof of computability logic\cite…

    Submitted 14 February, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

    Comments: 4 pages. Induction is missing in version 1 but added in version 2. arXiv admin note: text overlap with arXiv:2108.10728