Skip to main content

Showing 1–11 of 11 results for author: Son, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14272  [pdf, other

    cs.CV cs.GR

    MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset

    Authors: Kim Sung-Bin, Lee Chae-Yeon, Gihun Son, Oh Hyun-Bin, Janghoon Ju, Suekyeong Nam, Tae-Hyun Oh

    Abstract: Recent studies in speech-driven 3D talking head generation have achieved convincing results in verbal articulations. However, generating accurate lip-syncs degrades when applied to input speech in other languages, possibly due to the lack of datasets covering a broad spectrum of facial movements across languages. In this work, we introduce a novel task to generate 3D talking heads from speeches of… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  2. arXiv:2406.05761  [pdf, other

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Gui** Son, Ye** Cho, Sheikh Shafayat, **heon Baek, Sue Hyun Park, Hyeonbin Hwang, **kyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang , et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  3. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  4. arXiv:2403.15040  [pdf, other

    cs.CL

    ESG Classification by Implicit Rule Learning via GPT-4

    Authors: Hyo Jeong Yun, Chanyoung Kim, Moonjeong Hahm, Kyuri Kim, Gui** Son

    Abstract: Environmental, social, and governance (ESG) factors are widely adopted as higher investment return indicators. Accordingly, ongoing efforts are being made to automate ESG evaluation with language models to extract signals from massive web text easily. However, recent approaches suffer from a lack of training data, as rating agencies keep their evaluation metrics confidential. This paper investigat… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted as Shared Track Paper at 7th FinNLP Workshop @ LREC-COLING 2024

  5. arXiv:2402.11597  [pdf, other

    cs.CL

    Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?

    Authors: Gui** Son, Sangwon Baek, Sangdae Nam, Ilgyun Jeong, Seungone Kim

    Abstract: Large language models (LLMs) are typically prompted to follow a single instruction per inference call. In this work, we analyze whether LLMs also hold the capability to handle multiple instructions simultaneously, denoted as Multi-Task Inference. For this purpose, we introduce the MTI Bench(Multi-Task Inference Benchmark), a comprehensive evaluation benchmark encompassing 5,000 instances across 25… ▽ More

    Submitted 6 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: acl 2024 (main)

  6. arXiv:2402.11548  [pdf, other

    cs.CL

    KMMLU: Measuring Massive Multitask Language Understanding in Korean

    Authors: Gui** Son, Hanwool Lee, Sungdong Kim, Seungone Kim, Niklas Muennighoff, Taekyoon Choi, Cheonbok Park, Kang Min Yoo, Stella Biderman

    Abstract: We propose KMMLU, a new Korean benchmark with 35,030 expert-level multiple-choice questions across 45 subjects ranging from humanities to STEM. While prior Korean benchmarks are translated from existing English benchmarks, KMMLU is collected from original Korean exams, capturing linguistic and cultural aspects of the Korean language. We test 27 public and proprietary LLMs and observe the best publ… ▽ More

    Submitted 6 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Under Review

  7. arXiv:2309.02706  [pdf, other

    cs.CL

    HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models

    Authors: Gui** Son, Hanwool Lee, Suwan Kim, Huiseo Kim, Jaecheol Lee, Je Won Yeom, Jihyu Jung, Jung Woo Kim, Songseong Kim

    Abstract: Large language models (LLMs) trained on massive corpora demonstrate impressive capabilities in a wide range of tasks. While there are ongoing efforts to adapt these models to languages beyond English, the attention given to their evaluation methodologies remains limited. Current multilingual benchmarks often rely on back translations or re-implementations of English tests, limiting their capacity… ▽ More

    Submitted 20 March, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at LREC-COLING 2024

  8. arXiv:2305.01505  [pdf, other

    cs.CL cs.AI cs.CY

    Beyond Classification: Financial Reasoning in State-of-the-Art Language Models

    Authors: Gui** Son, Hanearl Jung, Moonjeong Hahm, Keonju Na, Sol **

    Abstract: Large Language Models (LLMs), consisting of 100 billion or more parameters, have demonstrated remarkable ability in complex multi-step reasoning tasks. However, the application of such generic advancements has been limited to a few fields, such as clinical or legal, with the field of financial reasoning remaining largely unexplored. To the best of our knowledge, the ability of LLMs to solve financ… ▽ More

    Submitted 25 June, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: Accepted by FinNLP (Financial Technology and Natural Language Processing) @ IJCAI2023 as long paper

  9. arXiv:2301.03136  [pdf, other

    cs.CL cs.LG q-fin.GN

    Removing Non-Stationary Knowledge From Pre-Trained Language Models for Entity-Level Sentiment Classification in Finance

    Authors: Gui** Son, Hanwool Lee, Nahyeon Kang, Moonjeong Hahm

    Abstract: Extraction of sentiment signals from news text, stock message boards, and business reports, for stock movement prediction, has been a rising field of interest in finance. Building upon past literature, the most recent works attempt to better capture sentiment from sentences with complex syntactic structures by introducing aspect-level sentiment classification (ASC). Despite the growing interest, h… ▽ More

    Submitted 24 January, 2023; v1 submitted 8 January, 2023; originally announced January 2023.

    Comments: Published at The AAAI-2023 Workshop On Multimodal AI For Financial Forecasting (muffin@AAAI2023)

  10. arXiv:2202.07252  [pdf, other

    physics.soc-ph cs.DL

    Quantifying team chemistry in scientific collaboration

    Authors: Gangmin Son, **hyuk Yun, Hawoong Jeong

    Abstract: Team chemistry is the holy grail of understanding collaborative human behavior, yet its quantitative understanding remains inconclusive. To reveal the presence and mechanisms of team chemistry in scientific collaboration, we reconstruct the publication histories of 560,689 individual scientists and 1,026,196 duos of scientists. We identify ability discrepancies between teams and their members, ena… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  11. arXiv:1511.02435  [pdf

    cs.CL

    A Chinese POS Decision Method Using Korean Translation Information

    Authors: Son-Il Kwak, O-Chol Kown, Chang-Sin Kim, Yong-Il Pak, Gum-Chol Son, Chol-Jun Hwang, Hyon-Chol Kim, Hyok-Chol Sin, Gyong-Il Hyon, Sok-Min Han

    Abstract: In this paper we propose a method that imitates a translation expert using the Korean translation information and analyse the performance. Korean is good at tagging than Chinese, so we can use this property in Chinese POS tagging.

    Submitted 7 November, 2015; originally announced November 2015.

    Comments: 6 pages, 0 figures