Skip to main content

Showing 1–9 of 9 results for author: Choi, S J

.
  1. arXiv:2406.08796  [pdf, other

    cs.CL

    Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning

    Authors: Janghoon Han, Changho Lee, Joongbo Shin, Stanley Jungkyu Choi, Honglak Lee, Kynghoon Bae

    Abstract: Instruction tuning has emerged as a powerful technique, significantly boosting zero-shot performance on unseen tasks. While recent work has explored cross-lingual generalization by applying instruction tuning to multilingual models, previous studies have primarily focused on English, with a limited exploration of non-English tasks. For an in-depth exploration of cross-lingual generalization in ins… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024 (Camera-ready), by Janghoon Han and Changho Lee, with equal contribution

  2. arXiv:2404.16418  [pdf, other

    cs.CL

    Instruction Matters, a Simple yet Effective Task Selection Approach in Instruction Tuning for Specific Tasks

    Authors: Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae

    Abstract: Instruction tuning has shown its ability to not only enhance zero-shot generalization across various tasks but also its effectiveness in improving the performance of specific tasks. A crucial aspect in instruction tuning for a particular task is a strategic selection of related tasks that offer meaningful supervision, thereby enhancing efficiency and preventing performance degradation from irrelev… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 21 pages, 6 figures, 16 tables

  3. arXiv:2402.13380  [pdf, ps, other

    cs.AI cs.LG math.CO math.OC stat.ML

    Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers

    Authors: Joshua F. Cooper, Seung ** Choi, I. Esra Buyuktahtakin

    Abstract: In this study, we introduce an innovative deep learning framework that employs a transformer model to address the challenges of mixed-integer programs, specifically focusing on the Capacitated Lot Sizing Problem (CLSP). Our approach, to our knowledge, is the first to utilize transformers to predict the binary variables of a mixed-integer programming (MIP) problem. Specifically, our approach harnes… ▽ More

    Submitted 24 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  4. arXiv:2303.07592  [pdf, other

    eess.AS cs.SD

    Lightweight feature encoder for wake-up word detection based on self-supervised speech representation

    Authors: Hyungjun Lim, Younggwan Kim, Kiho Yeom, Eunjoo Seo, Hoodong Lee, Stanley Jungkyu Choi, Honglak Lee

    Abstract: Self-supervised learning method that provides generalized speech representations has recently received increasing attention. Wav2vec 2.0 is the most famous example, showing remarkable performance in numerous downstream speech processing tasks. Despite its success, it is challenging to use it directly for wake-up word detection on mobile devices due to its expensive computational cost. In this work… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  5. arXiv:2209.02251  [pdf, other

    cs.CL

    External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems

    Authors: Janghoon Han, Joongbo Shin, Hosung Song, Hyunjik Jo, Gyeonghun Kim, Yireun Kim, Stanley Jungkyu Choi

    Abstract: Constructing a robust dialogue system on spoken conversations bring more challenge than written conversation. In this respect, DSTC10-Track2-Task2 is proposed, which aims to build a task-oriented dialogue (TOD) system incorporating unstructured external knowledge on a spoken conversation, extending DSTC9-Track1. This paper introduces our system containing four advanced methods: data construction,… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: 7page, DSTC10-Track2-task2

  6. arXiv:2110.03215  [pdf, other

    cs.CL cs.LG

    Towards Continual Knowledge Learning of Language Models

    Authors: Joel Jang, Seonghyeon Ye, Sohee Yang, Joongbo Shin, Janghoon Han, Gyeonghun Kim, Stanley Jungkyu Choi, Minjoon Seo

    Abstract: Large Language Models (LMs) are known to encode world knowledge in their parameters as they pretrain on a vast amount of web corpus, which is often utilized for performing knowledge-dependent downstream tasks such as question answering, fact-checking, and open dialogue. In real-world scenarios, the world knowledge stored in the LMs can quickly become outdated as the world changes, but it is non-tr… ▽ More

    Submitted 24 May, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: published at ICLR 2022

  7. arXiv:1909.04020  [pdf

    cond-mat.mtrl-sci

    Strong near-field light-matter interaction in plasmon-resonant tip-enhanced Raman scattering in indium nitride

    Authors: Emanuele Poliani, Daniel Seidlitz, Maximilian Ries, Soo J. Choi, Jim S. Speck, Axel Hoffmann, Markus R. Wagner

    Abstract: We report a detailed study of the strong near-field Raman scattering enhancement which takes place in tip-enhanced Raman scattering (TERS) in indium nitride. In addition to the well-known first-order optical phonons of indium nitride, near-field Raman modes, not detectable in the far-field, appear when approaching the plasmonic probe. The frequencies of these modes coincide with calculated energie… ▽ More

    Submitted 30 October, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

  8. arXiv:1904.02058  [pdf, other

    econ.GN stat.AP

    Do Hospital Data Breaches Reduce Patient Care Quality?

    Authors: Sung J. Choi, M. Eric Johnson

    Abstract: Objective: To estimate the relationship between a hospital data breach and hospital quality outcome Materials and Methods: Hospital data breaches reported to the U.S. Department of Health and Human Services breach portal and the Privacy Rights Clearinghouse database were merged with the Medicare Hospital Compare data to assemble a panel of non-federal acutecare inpatient hospitals for years 2011… ▽ More

    Submitted 3 April, 2019; originally announced April 2019.

    Comments: 32 pages, 6 figures, 4 tables, presented at the Workshop on the Economics of Information Security 2017

  9. arXiv:1804.07946  [pdf, other

    cs.CL cs.AI

    Extrofitting: Enriching Word Representation and its Vector Space with Semantic Lexicons

    Authors: Hwiyeol Jo, Stanley Jungkyu Choi

    Abstract: We propose post-processing method for enriching not only word representation but also its vector space using semantic lexicons, which we call extrofitting. The method consists of 3 steps as follows: (i) Expanding 1 or more dimension(s) on all the word vectors, filling with their representative value. (ii) Transferring semantic knowledge by averaging each representative values of synonyms and filli… ▽ More

    Submitted 3 June, 2018; v1 submitted 21 April, 2018; originally announced April 2018.

    Comments: In Proceedings of the 3rd ACL Workshop on Representation Learning for NLP (RepL4NLP)