Skip to main content

Showing 1–50 of 73 results for author: Yoo, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15481  [pdf, other

    cs.AI cs.CL

    CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset

    Authors: Haneul Yoo, Yong** Yang, Hwaran Lee

    Abstract: Recent studies in large language models (LLMs) shed light on their multilingual ability and safety, beyond conventional tasks in language modeling. Still, current benchmarks reveal their inability to comprehensively evaluate them and are excessively dependent on manual annotations. In this paper, we introduce code-switching red-teaming (CSRT), a simple yet effective red-teaming technique that simu… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2405.19691  [pdf, other

    cs.HC

    Designing Prompt Analytics Dashboards to Analyze Student-ChatGPT Interactions in EFL Writing

    Authors: Minsun Kim, SeonGyeom Kim, Suyoun Lee, Yoosang Yoon, Junho Myung, Haneul Yoo, Hyungseung Lim, Jieun Han, Yoonsu Kim, So-Yeon Ahn, Juho Kim, Alice Oh, Hwajung Hong, Tak Yeon Lee

    Abstract: While ChatGPT has significantly impacted education by offering personalized resources for students, its integration into educational settings poses unprecedented risks, such as inaccuracies and biases in AI-generated content, plagiarism and over-reliance on AI, and privacy and security issues. To help teachers address such risks, we conducted a two-phase iterative design process that comprises sur… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.00670  [pdf, other

    cs.CV eess.IV

    Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays

    Authors: Andrei Chubarau, Hyun** Yoo, Tara Akhavan, James Clark

    Abstract: Conventional image quality metrics (IQMs), such as PSNR and SSIM, are designed for perceptually uniform gamma-encoded pixel values and cannot be directly applied to perceptually non-uniform linear high-dynamic-range (HDR) colors. Similarly, most of the available datasets consist of standard-dynamic-range (SDR) images collected in standard and possibly uncontrolled viewing conditions. Popular pre-t… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures, 3 tables. Submitted to Human Vision and Electronic Imaging 2024 (HVEI)

  4. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  5. arXiv:2403.11399  [pdf, other

    cs.CL

    X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment

    Authors: Dongjae Shin, Hyeonseok Lim, Inho Won, Changsu Choi, Minjun Kim, Seungwoo Song, Hangyeol Yoo, Sangmin Kim, Kyungtae Lim

    Abstract: The impressive development of large language models (LLMs) is expanding into the realm of large multimodal models (LMMs), which incorporate multiple types of data beyond text. However, the nature of multimodal models leads to significant expenses in the creation of training data. Furthermore, constructing multilingual data for LMMs presents its own set of challenges due to language diversity and c… ▽ More

    Submitted 1 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  6. arXiv:2403.09490  [pdf, other

    cs.CL

    Hyper-CL: Conditioning Sentence Representations with Hypernetworks

    Authors: Young Hyun Yoo, Jii Cha, Changhyeon Kim, Taeuk Kim

    Abstract: While the introduction of contrastive learning frameworks in sentence representation learning has significantly contributed to advancements in the field, it still remains unclear whether state-of-the-art sentence embeddings can capture the fine-grained semantics of sentences, particularly when conditioned on specific perspectives. In this paper, we introduce Hyper-CL, an efficient methodology that… ▽ More

    Submitted 6 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: ACL 2024

  7. arXiv:2403.08272  [pdf, other

    cs.CL

    RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education

    Authors: Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn, Alice Oh

    Abstract: The integration of generative AI in education is expanding, yet empirical analyses of large-scale and real-world interactions between students and AI systems still remain limited. Addressing this gap, we present RECIPE4U (RECIPE for University), a dataset sourced from a semester-long experiment with 212 college students in English as Foreign Language (EFL) writing courses. During the study, studen… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2309.13243

  8. arXiv:2403.06880  [pdf, other

    cs.LG cs.AI

    Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning

    Authors: Junseok Park, Yoonsung Kim, Hee Bin Yoo, Min Whoo Lee, Kibeom Kim, Won-Seok Choi, Minsu Lee, Byoung-Tak Zhang

    Abstract: Toddlers evolve from free exploration with sparse feedback to exploiting prior experiences for goal-directed learning with denser rewards. Drawing inspiration from this Toddler-Inspired Reward Transition, we set out to explore the implications of varying reward transitions when incorporated into Reinforcement Learning (RL) tasks. Central to our inquiry is the transition from sparse to potential-ba… ▽ More

    Submitted 18 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted as a full paper at AAAI 2024 (Oral presentation): 7 pages (main paper), 2 pages (references), 17 pages (appendix) each

  9. arXiv:2403.06412  [pdf, other

    cs.CL

    CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

    Authors: Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne, Alice Oh

    Abstract: Despite the rapid development of large language models (LLMs) for the Korean language, there remains an obvious lack of benchmark datasets that test the requisite Korean cultural and linguistic knowledge. Because many existing Korean benchmark datasets are derived from the English counterparts through translation, they often overlook the different cultural contexts. For the few benchmark datasets… ▽ More

    Submitted 15 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  10. arXiv:2403.04982  [pdf, other

    cs.AR

    A 28.6 mJ/iter Stable Diffusion Processor for Text-to-Image Generation with Patch Similarity-based Sparsity Augmentation and Text-based Mixed-Precision

    Authors: Jiwon Choi, Wooyoung Jo, Seongyon Hong, Beomseok Kwon, Wonhoon Park, Hoi-Jun Yoo

    Abstract: This paper presents an energy-efficient stable diffusion processor for text-to-image generation. While stable diffusion attained attention for high-quality image synthesis results, its inherent characteristics hinder its deployment on mobile platforms. The proposed processor achieves high throughput and energy efficiency with three key features as solutions: 1) Patch similarity-based sparsity augm… ▽ More

    Submitted 14 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted at 2024 IEEE International Symposium on Circuits and Systems (ISCAS)

  11. arXiv:2402.16733  [pdf, other

    cs.CL cs.AI

    DREsS: Dataset for Rubric-based Essay Scoring on EFL Writing

    Authors: Haneul Yoo, Jieun Han, So-Yeon Ahn, Alice Oh

    Abstract: Automated essay scoring (AES) is a useful tool in English as a Foreign Language (EFL) writing education, offering real-time essay scores for students and instructors. However, previous AES models were trained on essays and scores irrelevant to the practical scenarios of EFL writing education and usually provided a single holistic score due to the lack of appropriate datasets. In this paper, we rel… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.05191

  12. arXiv:2402.05402  [pdf, other

    cs.NI eess.SP eess.SY

    A State-of-the-art Survey on Full-duplex Network Design

    Authors: Yonghwi Kim, Hyung-Joo Moon, Hanju Yoo, Byoungnam, Kim, Kai-Kit Wong, Chan-Byoung Chae

    Abstract: Full-duplex (FD) technology is gaining popularity for integration into a wide range of wireless networks due to its demonstrated potential in recent studies. In contrast to half-duplex (HD) technology, the implementation of FD in networks necessitates considering inter-node interference (INI) from various network perspectives. When deploying FD technology in networks, several critical factors must… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 23 pages, 10 figures, To appear in Proceedings of the IEEE

  13. A 0.5V, 6.2$μ$W, 0.059mm$^{2}$ Sinusoidal Current Generator IC with 0.088% THD for Bio-Impedance Sensing

    Authors: Kwantae Kim, Changhyeon Kim, Sungpill Choi, Hoi-Jun Yoo

    Abstract: This paper presents the first sub-10$μ$W, sub-0.1% total harmonic distortion (THD) sinusoidal current generator (CG) integrated circuit (IC) that is capable of 20kHz output for the bio-impedance (Bio-Z) sensing applications. To benefit from the ultra-low-power nature of near-threshold operation, a 9b pseudo-sine lookup table (LUT) is 3b $ΔΣ$ modulated in the digital domain, thus linearity burden o… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 5 pages, 8 figures, 1 table, 2020 IEEE Symposium on VLSI Circuits

  14. arXiv:2401.00642  [pdf, other

    cs.CL

    Predicting Anti-microbial Resistance using Large Language Models

    Authors: Hyunwoo Yoo, Bahrad Sokhansanj, James R. Brown, Gail Rosen

    Abstract: During times of increasing antibiotic resistance and the spread of infectious diseases like COVID-19, it is important to classify genes related to antibiotic resistance. As natural language processing has advanced with transformer-based language models, many language models that learn characteristics of nucleotide sequences have also emerged. These models show good performance in classifying vario… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

  15. arXiv:2310.07078  [pdf, other

    cs.LG cs.AI cs.CL

    Auditing and Robustifying COVID-19 Misinformation Datasets via Anticontent Sampling

    Authors: Clay H. Yoo, Ashiqur R. KhudaBukhsh

    Abstract: This paper makes two key contributions. First, it argues that highly specialized rare content classifiers trained on small data typically have limited exposure to the richness and topical diversity of the negative class (dubbed anticontent) as observed in the wild. As a result, these classifiers' strong performance observed on the test set may not translate into real-world settings. In the context… ▽ More

    Submitted 5 August, 2023; originally announced October 2023.

    Comments: This paper has been accepted at AAAI 2023 (Robust and Safe AI track)

  16. arXiv:2310.05191  [pdf, other

    cs.CL

    FABRIC: Automated Scoring and Feedback Generation for Essays

    Authors: Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Hyunseung Lim, Yoonsu Kim, Tak Yeon Lee, Hwajung Hong, Juho Kim, So-Yeon Ahn, Alice Oh

    Abstract: Automated essay scoring (AES) provides a useful tool for students and instructors in writing classes by generating essay scores in real-time. However, previous AES models do not provide more specific rubric-based scores nor feedback on how to improve the essays, which can be even more important than the overall scores for learning. We present FABRIC, a pipeline to help students and instructors in… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  17. arXiv:2310.04152  [pdf, other

    cs.CV

    Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation

    Authors: Hye Bin Yoo, Hyun Min Han, Sung Soo Hwang, Il Yong Chun

    Abstract: Neural radiance field (NeRF) is an emerging view synthesis method that samples points in a three-dimensional (3D) space and estimates their existence and color probabilities. The disadvantage of NeRF is that it requires a long training time since it samples many 3D points. In addition, if one samples points from occluded regions or in the space where an object is unlikely to exist, the rendering q… ▽ More

    Submitted 17 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 14 figures, 3 tables

  18. arXiv:2309.13243   

    cs.CL

    ChEDDAR: Student-ChatGPT Dialogue in EFL Writing Education

    Authors: Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn, Alice Oh

    Abstract: The integration of generative AI in education is expanding, yet empirical analyses of large-scale, real-world interactions between students and AI systems still remain limited. In this study, we present ChEDDAR, ChatGPT & EFL Learner's Dialogue Dataset As Revising an essay, which is collected from a semester-long longitudinal experiment involving 212 college students enrolled in English as Foreign… ▽ More

    Submitted 20 March, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: The new version of this paper is on arXiv as arXiv:2403.08272

  19. arXiv:2309.00349  [pdf

    physics.chem-ph cs.LG

    Bespoke Nanoparticle Synthesis and Chemical Knowledge Discovery Via Autonomous Experimentations

    Authors: Hyuk Jun Yoo, Nayeon Kim, Heeseung Lee, Daeho Kim, Leslie Tiong Ching Ow, Hyobin Nam, Chansoo Kim, Seung Yong Lee, Kwan-Young Lee, Donghun Kim, Sang Soo Han

    Abstract: The optimization of nanomaterial synthesis using numerous synthetic variables is considered to be extremely laborious task because the conventional combinatorial explorations are prohibitively expensive. In this work, we report an autonomous experimentation platform developed for the bespoke design of nanoparticles (NPs) with targeted optical properties. This platform operates in a closed-loop man… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  20. arXiv:2308.15651  [pdf, other

    cs.IR cs.CY cs.LG

    Ensuring User-side Fairness in Dynamic Recommender Systems

    Authors: Hyunsik Yoo, Zhichen Zeng, Jian Kang, Ruizhong Qiu, David Zhou, Zhining Liu, Fei Wang, Charlie Xu, Eunice Chan, Hanghang Tong

    Abstract: User-side group fairness is crucial for modern recommender systems, aiming to alleviate performance disparities among user groups defined by sensitive attributes like gender, race, or age. In the ever-evolving landscape of user-item interactions, continual adaptation to newly collected data is crucial for recommender systems to stay aligned with the latest user preferences. However, we observe tha… ▽ More

    Submitted 31 March, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: 19 pages, 20 figures, 2 tables, ACM Web Conference 2024

  21. arXiv:2308.14181  [pdf, other

    cs.LG cs.AI

    Class-Imbalanced Graph Learning without Class Rebalancing

    Authors: Zhining Liu, Ruizhong Qiu, Zhichen Zeng, Hyunsik Yoo, David Zhou, Zhe Xu, Yada Zhu, Kommy Weldemariam, **grui He, Hanghang Tong

    Abstract: Class imbalance is prevalent in real-world node classification tasks and poses great challenges for graph learning models. Most existing studies are rooted in a class-rebalancing (CR) perspective and address class imbalance with class-wise reweighting or resampling. In this work, we approach the root cause of class-imbalance bias from an topological paradigm. Specifically, we theoretically reveal… ▽ More

    Submitted 19 May, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: In ICML 2024; 26 pages, 9 figures, 12 tables

  22. arXiv:2307.16778  [pdf, other

    cs.CL cs.AI

    KoBBQ: Korean Bias Benchmark for Question Answering

    Authors: Jiho **, Jiseon Kim, Nayeon Lee, Haneul Yoo, Alice Oh, Hwaran Lee

    Abstract: The Bias Benchmark for Question Answering (BBQ) is designed to evaluate social biases of language models (LMs), but it is not simple to adapt this benchmark to cultural contexts other than the US because social biases depend heavily on the cultural context. In this paper, we present KoBBQ, a Korean bias benchmark dataset, and we propose a general framework that addresses considerations for cultura… ▽ More

    Submitted 25 January, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: TACL 2024 (pre-MIT Press publication version)

  23. RECIPE: How to Integrate ChatGPT into EFL Writing Education

    Authors: Jieun Han, Haneul Yoo, Yoonsu Kim, Junho Myung, Minsun Kim, Hyunseung Lim, Juho Kim, Tak Yeon Lee, Hwajung Hong, So-Yeon Ahn, Alice Oh

    Abstract: The integration of generative AI in the field of education is actively being explored. In particular, ChatGPT has garnered significant interest, offering an opportunity to examine its effectiveness in English as a foreign language (EFL) education. To address this need, we present a novel learning platform called RECIPE (Revising an Essay with ChatGPT on an Interactive Platform for EFL learners). O… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  24. arXiv:2303.16050  [pdf, other

    cs.CV cs.LG

    Information-Theoretic GAN Compression with Variational Energy-based Model

    Authors: Minsoo Kang, Hyewon Yoo, Eunhee Kang, Sehwan Ki, Hyong-Euk Lee, Bohyung Han

    Abstract: We propose an information-theoretic knowledge distillation approach for the compression of generative adversarial networks, which aims to maximize the mutual information between teacher and student networks via a variational optimization based on an energy-based model. Because the direct computation of the mutual information in continuous domains is intractable, our approach alternatively optimize… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted at Neurips2022

  25. arXiv:2302.09461  [pdf, other

    cs.CV cs.AI

    Liveness score-based regression neural networks for face anti-spoofing

    Authors: Youngjun Kwak, Minyoung Jung, Hunjae Yoo, **Ho Shin, Changick Kim

    Abstract: Previous anti-spoofing methods have used either pseudo maps or user-defined labels, and the performance of each approach depends on the accuracy of the third party networks generating pseudo maps and the way in which the users define the labels. In this paper, we propose a liveness score-based regression network for overcoming the dependency on third party networks and users. First, we introduce a… ▽ More

    Submitted 20 March, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: Submission to ICASSP 2023

  26. arXiv:2302.00319  [pdf, other

    cs.LG cs.AI q-bio.QM

    Development of deep biological ages aware of morbidity and mortality based on unsupervised and semi-supervised deep learning approaches

    Authors: Seong-Eun Moon, Ji Won Yoon, Shinyoung Joo, Yoohyung Kim, Jae Hyun Bae, Seokho Yoon, Haanju Yoo, Young Min Cho

    Abstract: Background: While deep learning technology, which has the capability of obtaining latent representations based on large-scale data, can be a potential solution for the discovery of a novel aging biomarker, existing deep learning methods for biological age estimation usually depend on chronological ages and lack of consideration of mortality and morbidity that are the most significant outcomes of a… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  27. arXiv:2301.00891  [pdf, other

    cs.CL

    Understanding Political Polarisation using Language Models: A dataset and method

    Authors: Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo

    Abstract: Our paper aims to analyze political polarization in US political system using Language Models, and thereby help candidates make an informed decision. The availability of this information will help voters understand their candidates views on the economy, healthcare, education and other social issues. Our main contributions are a dataset extracted from Wikipedia that spans the past 120 years and a L… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

  28. arXiv:2212.08568  [pdf, other

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano , et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,… ▽ More

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  29. arXiv:2212.04734  [pdf, other

    cs.LG cs.AI cs.CL

    MED-SE: Medical Entity Definition-based Sentence Embedding

    Authors: Hyeonbin Hwang, Haanju Yoo, Yera Choi

    Abstract: We propose Medical Entity Definition-based Sentence Embedding (MED-SE), a novel unsupervised contrastive learning framework designed for clinical texts, which exploits the definitions of medical entities. To this end, we conduct an extensive analysis of multiple sentence embedding techniques in clinical semantic textual similarity (STS) settings. In the entity-centric setting that we have designed… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: 8 pages, 2 figures, 9 tables

  30. arXiv:2211.08429  [pdf, other

    cs.LG cs.AI cs.IR

    An Automatic ICD Coding Network Using Partition-Based Label Attention

    Authors: Daeseong Kim, Haanju Yoo, Sewon Kim

    Abstract: International Classification of Diseases (ICD) is a global medical classification system which provides unique codes for diagnoses and procedures appropriate to a patient's clinical record. However, manual coding by human coders is expensive and error-prone. Automatic ICD coding has the potential to solve this problem. With the advancement of deep learning technologies, many deep learning-based me… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 9 pages, 3 figures, 5 tables

  31. arXiv:2210.06828  [pdf, other

    cs.CL

    Rethinking Annotation: Can Language Learners Contribute?

    Authors: Haneul Yoo, Rifki Afina Putri, Changyoon Lee, Youngin Lee, So-Yeon Ahn, Dongyeop Kang, Alice Oh

    Abstract: Researchers have traditionally recruited native speakers to provide annotations for widely used benchmark datasets. However, there are languages for which recruiting native speakers can be difficult, and it would help to find learners of those languages to annotate the data. In this paper, we investigate whether language learners can contribute annotations to benchmark datasets. In a carefully con… ▽ More

    Submitted 29 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: ACL 2023

  32. HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea

    Authors: Haneul Yoo, Jiho **, Juhee Son, **Yeong Bak, Kyunghyun Cho, Alice Oh

    Abstract: Historical records in Korea before the 20th century were primarily written in Hanja, an extinct language based on Chinese characters and not understood by modern Korean or Chinese speakers. Historians with expertise in this time period have been analyzing the documents, but that process is very difficult and time-consuming, and language models would significantly speed up the process. Toward build… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: Findings of NAACL 2022

  33. arXiv:2209.08274  [pdf, other

    cs.RO

    Topological Semantic Graph Memory for Image-Goal Navigation

    Authors: Nuri Kim, Obin Kwon, Hwiyeon Yoo, Yunho Choi, Jeongho Park, Songhwai Oh

    Abstract: A novel framework is proposed to incrementally collect landmark-based graph memory and use the collected memory for image goal navigation. Given a target image to search, an embodied robot utilizes semantic memory to find the target in an unknown environment. % The semantic graph memory is collected from a panoramic observation of an RGB-D camera without knowing the robot's pose. In this paper, we… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

  34. arXiv:2208.00693  [pdf, other

    cs.AR cs.SD eess.AS

    A 23 $μ$W Keyword Spotting IC with Ring-Oscillator-Based Time-Domain Feature Extraction

    Authors: Kwantae Kim, Chang Gao, Rui Graça, Ilya Kiselev, Hoi-Jun Yoo, Tobi Delbruck, Shih-Chii Liu

    Abstract: This article presents the first keyword spotting (KWS) IC which uses a ring-oscillator-based time-domain processing technique for its analog feature extractor (FEx). Its extensive usage of time-encoding schemes allows the analog audio signal to be processed in a fully time-domain manner except for the voltage-to-time conversion stage of the analog front-end. Benefiting from fundamental building bl… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 14 pages, 21 figures, 2 tables

  35. arXiv:2206.07272  [pdf

    cs.CV cs.AI

    Machine vision for vial positioning detection toward the safe automation of material synthesis

    Authors: Leslie Ching Ow Tiong, Hyuk Jun Yoo, Na Yeon Kim, Kwan-Young Lee, Sang Soo Han, Donghun Kim

    Abstract: Although robot-based automation in chemistry laboratories can accelerate the material development process, surveillance-free environments may lead to dangerous accidents primarily due to machine control errors. Object detection techniques can play vital roles in addressing these safety issues; however, state-of-the-art detectors, including single-shot detector (SSD) models, suffer from insufficien… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  36. arXiv:2206.04663  [pdf, other

    quant-ph cs.LG stat.ML

    Provably efficient variational generative modeling of quantum many-body systems via quantum-probabilistic information geometry

    Authors: Faris M. Sbahi, Antonio J. Martinez, Sahil Patel, Dmitri Saberi, Jae Hyeon Yoo, Geoffrey Roeder, Guillaume Verdon

    Abstract: The dual tasks of quantum Hamiltonian learning and quantum Gibbs sampling are relevant to many important problems in physics and chemistry. In the low temperature regime, algorithms for these tasks often suffer from intractabilities, for example from poor sample- or time-complexity. With the aim of addressing such intractabilities, we introduce a generalization of quantum natural gradient descent… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 24 + 49 pages, 5 + 4 figures

  37. arXiv:2205.11224  [pdf

    cs.CV eess.IV

    Around View Monitoring System for Hydraulic Excavators

    Authors: Dong Jun Yeom, Yu Na Hong, Yoojun Kim, Hyun Seok Yoo, Youngsuk Kim

    Abstract: This paper describes the Around View Monitoring (AVM) system for hydraulic excavators that prevents the safety accidents caused by blind spots and increases the operational efficiency. To verify the developed system, experiments were conducted with its prototype. The experimental results demonstrate its applicability in the field with the following values: 7m of a visual range, 15fps of image refr… ▽ More

    Submitted 4 April, 2022; originally announced May 2022.

    Comments: 9 pages, 11 figures

    Journal ref: The 7th International Conference on Construction Engineering and Project Management (ICCEPM 2017), Oct. 27-30, 2017, Chengdu, China

  38. arXiv:2205.10405  [pdf, other

    cs.NI eess.SP

    Demo: A Transparent Antenna System for In-Building Networks

    Authors: Sang-Hyun Park, Soo-Min Kim, Seonghoon Kim, HongIl Yoo, Byoungnam Kim, Chan-Byoung Chae

    Abstract: For in-building networks, the potential of transparent antennas, which are used as windows of a building, is presented in this paper. In this scenario, a transparent window antenna communicates with outdoor devices or base stations, and the indoor repeaters act as relay stations of the transparent window antenna for indoor devices. At indoor, back lobe waves of the transparent window antenna are d… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 2 pages, 3 figures

  39. arXiv:2205.10019  [pdf, other

    cs.CL cs.AI cs.LG

    Translating Hanja Historical Documents to Contemporary Korean and English

    Authors: Juhee Son, Jiho **, Haneul Yoo, **Yeong Bak, Kyunghyun Cho, Alice Oh

    Abstract: The Annals of Joseon Dynasty (AJD) contain the daily records of the Kings of Joseon, the 500-year kingdom preceding the modern nation of Korea. The Annals were originally written in an archaic Korean writing system, `Hanja', and were translated into Korean from 1968 to 1993. The resulting translation was however too literal and contained many archaic Korean words; thus, a new expert translation ef… ▽ More

    Submitted 29 December, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: EMNLP Findings 2022

  40. arXiv:2205.09185  [pdf, other

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann , et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to… ▽ More

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  41. arXiv:2205.03886  [pdf, other

    eess.SP cs.AI

    Demo: Real-Time Semantic Communications with a Vision Transformer

    Authors: Hanju Yoo, Taehun Jung, Linglong Dai, Songkuk Kim, Chan-Byoung Chae

    Abstract: Semantic communications are expected to enable the more effective delivery of meaning rather than a precise transfer of symbols. In this paper, we propose an end-to-end deep neural network-based architecture for image transmission and demonstrate its feasibility in a real-time wireless channel by implementing a prototype based on a field-programmable gate array (FPGA). We demonstrate that this sys… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

  42. arXiv:2203.12147  [pdf, other

    eess.SP cs.LG

    3D-EDM: Early Detection Model for 3D-Printer Faults

    Authors: Harim Jeong, Joo Hun Yoo

    Abstract: With the advent of 3D printers in different price ranges and sizes, they are no longer just for professionals. However, it is still challenging to use a 3D printer perfectly. Especially, in the case of the Fused Deposition Method, it is very difficult to perform with accurate calibration. Previous studies have suggested that these problems can be detected using sensor data and image data with mach… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted by KSII The 13th International Conference on Internet(ICONI)2021. Copyright 2021 KSII

  43. arXiv:2203.07679  [pdf, other

    cs.AR cs.LG

    Energy-efficient Dense DNN Acceleration with Signed Bit-slice Architecture

    Authors: Dongseok Im, Gwangtae Park, Zhiyong Li, Junha Ryu, Hoi-Jun Yoo

    Abstract: As the number of deep neural networks (DNNs) to be executed on a mobile system-on-chip (SoC) increases, the mobile SoC suffers from the real-time DNN acceleration within its limited hardware resources and power budget. Although the previous mobile neural processing units (NPUs) take advantage of low-bit computing and exploitation of the sparsity, it is incapable of accelerating high-precision and… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  44. arXiv:2202.03601  [pdf

    cs.AR

    Two-Step Spike Encoding Scheme and Architecture for Highly Sparse Spiking-Neural-Network

    Authors: Sangyeob Kim, Sang** Kim, Soyeon Um, Soyeon Kim, Hoi-Jun Yoo

    Abstract: This paper proposes a two-step spike encoding scheme, which consists of the source encoding and the process encoding for a high energy-efficient spiking-neural-network (SNN) acceleration. The eigen-train generation and its superposition generate spike trains which show high accuracy with low spike ratio. Sparsity boosting (SB) and spike generation skip** (SGS) reduce the amount of operations for… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: 5 pages, 10 figures

  45. arXiv:2110.07566  [pdf, other

    cs.CL cs.AI cs.LG

    Practical Benefits of Feature Feedback Under Distribution Shift

    Authors: Anurag Katakkar, Clay H. Yoo, Weiqin Wang, Zachary C. Lipton, Divyansh Kaushik

    Abstract: In attempts to develop sample-efficient and interpretable algorithms, researcher have explored myriad mechanisms for collecting and exploiting feature feedback (or rationales) auxiliary annotations provided for training (but not test) instances that highlight salient evidence. Examples include bounding boxes around objects and salient spans in text. Despite its intuitive appeal, feature feedback h… ▽ More

    Submitted 17 October, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  46. arXiv:2109.09057  [pdf, other

    cs.CL

    Knowledge-Enhanced Evidence Retrieval for Counterargument Generation

    Authors: Yohan Jo, Haneul Yoo, **Yeong Bak, Alice Oh, Chris Reed, Eduard Hovy

    Abstract: Finding counterevidence to statements is key to many tasks, including counterargument generation. We build a system that, given a statement, retrieves counterevidence from diverse sources on the Web. At the core of this system is a natural language inference (NLI) model that determines whether a candidate sentence is valid counterevidence or not. Most NLI models to date, however, lack proper reaso… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: To appear in Findings of EMNLP 2021

  47. arXiv:2109.00202  [pdf, other

    cs.LG cs.AI

    Federated Learning: Issues in Medical Application

    Authors: Joo Hun Yoo, Hyejun Jeong, Jaehyeok Lee, Tai-Myoung Chung

    Abstract: Since the federated learning, which makes AI learning possible without moving local data around, was introduced by google in 2017 it has been actively studied particularly in the field of medicine. In fact, the idea of machine learning in AI without collecting data from local clients is very attractive because data remain in local sites. However, federated learning techniques still have various op… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: 20 pages, 3 figures, 1 table, submitted to FDSE2021

  48. arXiv:2108.01903  [pdf, other

    cs.LG cs.AI

    Personalized Federated Learning with Clustering: Non-IID Heart Rate Variability Data Application

    Authors: Joo Hun Yoo, Ha Min Son, Hyejun Jeong, Eun-Hye Jang, Ah Young Kim, Han Young Yu, Hong ** Jeon, Tai-Myoung Chung

    Abstract: While machine learning techniques are being applied to various fields for their exceptional ability to find complex relations in large datasets, the strengthening of regulations on data ownership and privacy is causing increasing difficulty in its application to medical data. In light of this, Federated Learning has recently been proposed as a solution to train on private data without breach of co… ▽ More

    Submitted 10 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 6 pages with two columns, 4 figures, 3 tables

  49. arXiv:2106.12044  [pdf, other

    cs.SI cs.CY

    Empathy and Hope: Resource Transfer to Model Inter-country Social Media Dynamics

    Authors: Clay H. Yoo, Shriphani Palakodety, Rupak Sarkar, Ashiqur R. KhudaBukhsh

    Abstract: The ongoing COVID-19 pandemic resulted in significant ramifications for international relations ranging from travel restrictions, global ceasefires, and international vaccine production and sharing agreements. Amidst a wave of infections in India that resulted in a systemic breakdown of healthcare infrastructure, a social welfare organization based in Pakistan offered to procure medical-grade oxyg… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  50. arXiv:2106.07036  [pdf, other

    q-bio.BM cs.LG

    Protein-Ligand Docking Surrogate Models: A SARS-CoV-2 Benchmark for Deep Learning Accelerated Virtual Screening

    Authors: Austin Clyde, Thomas Brettin, Alexander Partin, Hyunseung Yoo, Yadu Babuji, Ben Blaiszik, Andre Merzky, Matteo Turilli, Shantenu Jha, Arvind Ramanathan, Rick Stevens

    Abstract: We propose a benchmark to study surrogate model accuracy for protein-ligand docking. We share a dataset consisting of 200 million 3D complex structures and 2D structure scores across a consistent set of 13 million "in-stock" molecules over 15 receptors, or binding sites, across the SARS-CoV-2 proteome. Our work shows surrogate docking models have six orders of magnitude more throughput than standa… ▽ More

    Submitted 30 June, 2021; v1 submitted 13 June, 2021; originally announced June 2021.