Skip to main content

Showing 101–150 of 3,305 results for author: Lee, Y

.
  1. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  2. arXiv:2404.01288  [pdf, other

    cs.CL

    Large Language Models are Capable of Offering Cognitive Reappraisal, if Guided

    Authors: Hongli Zhan, Allen Zheng, Yoon Kyung Lee, **a Suh, Junyi Jessy Li, Desmond C. Ong

    Abstract: Large language models (LLMs) have offered new opportunities for emotional support, and recent work has shown that they can produce empathic responses to people in distress. However, long-term mental well-being requires emotional self-regulation, where a one-time empathic response falls short. This work takes a first step by engaging with cognitive reappraisals, a strategy from psychology practitio… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  3. arXiv:2404.01039  [pdf, other

    cs.LG

    A Survey on Hypergraph Neural Networks: An In-Depth and Step-By-Step Guide

    Authors: Sunwoo Kim, Soo Yong Lee, Yue Gao, Alessia Antelmi, Mirko Polato, Kijung Shin

    Abstract: Higher-order interactions (HOIs) are ubiquitous in real-world complex systems and applications, and thus investigation of deep learning for HOIs has become a valuable agenda for the data mining and machine learning communities. As networks of HOIs are expressed mathematically as hypergraphs, hypergraph neural networks (HNNs) have emerged as a powerful tool for representation learning on hypergraph… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  4. arXiv:2404.00638  [pdf, other

    cs.LG

    HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs

    Authors: Sunwoo Kim, Shinhwan Kang, Fanchen Bu, Soo Yong Lee, Jaemin Yoo, Kijung Shin

    Abstract: Hypergraphs are marked by complex topology, expressing higher-order interactions among multiple nodes with hyperedges, and better capturing the topology is essential for effective representation learning. Recent advances in generative self-supervised learning (SSL) suggest that hypergraph neural networks learned from generative self supervision have the potential to effectively encode the complex… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Published as a conference paper at ICLR 2024

  5. arXiv:2404.00300  [pdf, other

    cs.HC

    Enhancing Empathy in Virtual Reality: An Embodied Approach to Mindset Modulation

    Authors: Seoyeon Bae, Yoon Kyung Lee, Jungcheol Lee, Jaeheon Kim, Haeseong Jeon, Seung-Hwan Lim, Byung-Cheol Kim, Sowon Hahn

    Abstract: A growth mindset has shown promising outcomes for increasing empathy ability. However, stimulating a growth mindset in VR-based empathy interventions is under-explored. In the present study, we implemented prosocial VR content, Our Neighbor Hero, focusing on embodying a virtual character to modulate players' mindsets. The virtual body served as a step** stone, enabling players to identify with t… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 9 pages, 2 figures, 1 table

  6. arXiv:2404.00154  [pdf, other

    math.NA

    Sampling error mitigation through spectrum smoothing in ensemble data assimilation

    Authors: Bosu Choi, Yoonsang Lee

    Abstract: In data assimilation, an ensemble provides a nonintrusive way to evolve a probability density described by a nonlinear prediction model. Although a large ensemble size is required for statistical accuracy, the ensemble size is typically limited to a small number due to the computational cost of running the prediction model, which leads to a sampling error. Several methods, such as localization, ex… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  7. arXiv:2404.00060  [pdf, other

    q-fin.ST cs.AI cs.LG

    Temporal Graph Networks for Graph Anomaly Detection in Financial Networks

    Authors: Ye** Kim, Youngbin Lee, Minyoung Choe, Sungju Oh, Yongjae Lee

    Abstract: This paper explores the utilization of Temporal Graph Networks (TGN) for financial anomaly detection, a pressing need in the era of fintech and digitized financial transactions. We present a comprehensive framework that leverages TGN, capable of capturing dynamic changes in edges within financial networks, for fraud detection. Our study compares TGN's performance against static Graph Neural Networ… ▽ More

    Submitted 27 March, 2024; originally announced April 2024.

    Comments: Presented at the AAAI 2024 Workshop on AI in Finance for Social Impact (https://sites.google.com/view/aifin-aaai2024)

  8. arXiv:2403.19146  [pdf, ps, other

    cs.DS cs.DC math.OC

    Improving the Bit Complexity of Communication for Distributed Convex Optimization

    Authors: Mehrdad Ghadiri, Yin Tat Lee, Swati Padmanabhan, William Swartworth, David Woodruff, Guanghao Ye

    Abstract: We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank app… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: To appear in STOC '24. Abstract shortened to meet the arXiv limits. Comments welcome!

  9. arXiv:2403.18881  [pdf

    q-bio.QM physics.optics

    Transmission IR Microscopy for the Quantitation of Biomolecular Mass In Live Cells

    Authors: Yow-Ren Chang, Seong-Min Kim, Young Jong Lee

    Abstract: Absolute quantity imaging of biomolecules on a single cell level is critical for measurement assurance in biosciences and bioindustries. While infrared (IR) transmission microscopy is a powerful label-free imaging modality capable of chemical quantification, its applicability to hydrated biological samples remains challenging due to the strong water absorption. We overcome this challenge by applyi… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Body: 19 pages, 5 figures. Supplemental: 11 pages, 6 figures

  10. arXiv:2403.18771  [pdf, other

    cs.CL

    CheckEval: Robust Evaluation Framework using Large Language Model via Checklist

    Authors: Yukyung Lee, Joonghoon Kim, Jaehee Kim, Hyowon Cho, Pilsung Kang

    Abstract: We introduce CheckEval, a novel evaluation framework using Large Language Models, addressing the challenges of ambiguity and inconsistency in current evaluation methods. CheckEval addresses these challenges by dividing evaluation criteria into detailed sub-aspects and constructing a checklist of Boolean questions for each, simplifying the evaluation. This approach not only renders the process more… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: HEAL at CHI 2024

  11. arXiv:2403.18305  [pdf, other

    cs.IR cs.AI

    A Recommender System for NFT Collectibles with Item Feature

    Authors: Minjoo Choi, Seonmi Kim, Ye** Kim, Youngbin Lee, Joohwan Hong, Yongjae Lee

    Abstract: Recommender systems have been actively studied and applied in various domains to deal with information overload. Although there are numerous studies on recommender systems for movies, music, and e-commerce, comparatively less attention has been paid to the recommender system for NFTs despite the continuous growth of the NFT market. This paper presents a recommender system for NFTs that utilizes a… ▽ More

    Submitted 3 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Presented at the AAAI 2023 Bridge on AI for Financial Services (https://sites.google.com/view/aaai-ai-fin/home)

  12. arXiv:2403.18177  [pdf, ps, other

    q-fin.MF q-fin.PR q-fin.TR

    Growth rate of liquidity provider's wealth in G3Ms

    Authors: Cheuk Yin Lee, Shen-Ning Tung, Tai-Ho Wang

    Abstract: Geometric mean market makers (G3Ms), such as Uniswap and Balancer, represent a widely used class of automated market makers (AMMs). These G3Ms are characterized by the following rule: the reserves of the AMM must maintain the same (weighted) geometric mean before and after each trade. This paper investigates the effects of trading fees on liquidity providers' (LP) profitability in a G3M, as well a… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 27 pages

    MSC Class: 91G15

  13. arXiv:2403.18148  [pdf, other

    cs.CL cs.AI

    Large Language Models Produce Responses Perceived to be Empathic

    Authors: Yoon Kyung Lee, **a Suh, Hongli Zhan, Junyi Jessy Li, Desmond C. Ong

    Abstract: Large Language Models (LLMs) have demonstrated surprising performance on many tasks, including writing supportive messages that display empathy. Here, we had these models generate empathic messages in response to posts describing common life experiences, such as workplace situations, parenting, relationships, and other anxiety- and anger-eliciting situations. Across two studies (N=192, 202), we sh… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  14. arXiv:2403.17938  [pdf, other

    cs.NE eess.SY

    Circuit-centric Genetic Algorithm (CGA) for Analog and Radio-Frequency Circuit Optimization

    Authors: Mingi Kwon, Yeonjun Lee, Ickhyun Song

    Abstract: This paper presents an automated method for optimizing parameters in analog/high-frequency circuits, aiming to maximize performance parameters of a radio-frequency (RF) receiver. The design target includes a reduction of power consumption and noise figure and an increase in conversion gain. This study investigates the use of an artificial algorithm for the optimization of a receiver, illustrating… ▽ More

    Submitted 18 November, 2023; originally announced March 2024.

    Comments: 15 pages, 6 figures, submission to Circuits, Systems and Signal Processing

  15. arXiv:2403.17069  [pdf, other

    cond-mat.str-el quant-ph

    Tensor network formulation of symmetry protected topological phases in mixed states

    Authors: Hanyu Xue, Jong Yeon Lee, Yimu Bao

    Abstract: We define and classify symmetry-protected topological (SPT) phases in mixed states based on the tensor network formulation of the density matrix. In one dimension, we introduce strong injective matrix product density operators (MPDO), which describe a broad class of short-range correlated mixed states, including the locally decohered SPT states. We map strong injective MPDO to a pure state in the… ▽ More

    Submitted 15 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Appendix D is fixed

  16. Impact of noise transients on gravitational-wave burst detection efficiency of the BayesWave pipeline with multi-detector networks

    Authors: Yi Shuen C. Lee, Margaret Millhouse, Andrew Melatos

    Abstract: Detection confidence of the source-agnostic gravitational-wave burst search pipeline BayesWave is quantified by the log signal-versus-glitch Bayes factor, $\ln\mathcal{B}_{\mathcal{S},\mathcal{G}}$. A recent study shows that $\ln\mathcal{B}_{\mathcal{S},\mathcal{G}}$ increases with the number of detectors. However, the increasing frequency of non-Gaussian noise transients (glitches) in expanded de… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 16 pages, 8 figures

  17. arXiv:2403.16066  [pdf, other

    cs.AI

    A Temporal Graph Network Framework for Dynamic Recommendation

    Authors: Ye** Kim, Youngbin Lee, Vincent Yuan, Annika Lee, Yongjae Lee

    Abstract: Recommender systems, crucial for user engagement on platforms like e-commerce and streaming services, often lag behind users' evolving preferences due to static data reliance. After Temporal Graph Networks (TGNs) were proposed, various studies have shown that TGN can significantly improve situations where the features of nodes and edges dynamically change over time. However, despite its promising… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Presented at the AAAI 2024 Workshop on Recommendation Ecosystems: Modeling, Optimization and Incentive Design

  18. arXiv:2403.15902  [pdf, other

    cs.GR

    Utilizing Motion Matching with Deep Reinforcement Learning for Target Location Tasks

    Authors: Jeongmin Lee, Taesoo Kwon, Hyunju Shin, Yoonsang Lee

    Abstract: We present an approach using deep reinforcement learning (DRL) to directly generate motion matching queries for long-term tasks, particularly targeting the reaching of specific locations. By integrating motion matching and DRL, our method demonstrates the rapid learning of policies for target location tasks within minutes on a standard desktop, employing a simple reward design. Additionally, we pr… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Eurographics 2024 Short Papers

  19. arXiv:2403.15388  [pdf, other

    cs.CV cs.AI cs.CL

    LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

    Authors: Yuzhang Shang, Mu Cai, Bingxin Xu, Yong Jae Lee, Yan Yan

    Abstract: Large Multimodal Models (LMMs) have shown significant visual reasoning capabilities by connecting a visual encoder and a large language model. LMMs typically take in a fixed and large amount of visual tokens, such as the penultimate layer features in the CLIP visual encoder, as the prefix content. Recent LMMs incorporate more complex visual inputs, such as high-resolution images and videos, which… ▽ More

    Submitted 22 May, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: Project page: https://llava-prumerge.github.io/

  20. arXiv:2403.14963  [pdf, other

    cs.CR

    Enabling Physical Localization of Uncooperative Cellular Devices

    Authors: Taekkyung Oh, Sangwook Bae, Junho Ahn, Yonghwa Lee, Dinh-Tuan Hoang, Min Suk Kang, Nils Ole Tippenhauer, Yongdae Kim

    Abstract: In cellular networks, it can become necessary for authorities to physically locate user devices for tracking criminals or illegal devices. While cellular operators can provide authorities with cell information the device is cam** on, fine-grained localization is still required. Therefore, the authorized agents trace the device by monitoring its uplink signals. However, tracking the uplink signal… ▽ More

    Submitted 25 March, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  21. arXiv:2403.14353  [pdf, other

    cs.AR cs.LG cs.RO

    DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics

    Authors: Yoonsung Kim, Changhun Oh, **woo Hwang, Wonung Kim, Seongryong Oh, Yubin Lee, Hardik Sharma, Amir Yazdanbakhsh, Jongse Park

    Abstract: Deep neural network (DNN) video analytics is crucial for autonomous systems such as self-driving vehicles, unmanned aerial vehicles (UAVs), and security robots. However, real-world deployment faces challenges due to their limited computational resources and battery power. To tackle these challenges, continuous learning exploits a lightweight "student" model at deployment (inference), leverages a l… ▽ More

    Submitted 28 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  22. arXiv:2403.13589  [pdf, other

    cs.CV

    ReGround: Improving Textual and Spatial Grounding at No Cost

    Authors: Yuseung Lee, Minhyuk Sung

    Abstract: When an image generation process is guided by both a text prompt and spatial cues, such as a set of bounding boxes, do these elements work in harmony, or does one dominate the other? Our analysis of a pretrained image diffusion model that integrates gated self-attention into the U-Net reveals that spatial grounding often outweighs textual grounding due to the sequential flow from gated self-attent… ▽ More

    Submitted 30 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Project page: https://re-ground.github.io/

  23. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important step** stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  24. arXiv:2403.11385  [pdf, other

    math.NA cs.CE cs.LG math.PR

    Stochastic approach for elliptic problems in perforated domains

    Authors: Jihun Han, Yoonsang Lee

    Abstract: A wide range of applications in science and engineering involve a PDE model in a domain with perforations, such as perforated metals or air filters. Solving such perforated domain problems suffers from computational challenges related to resolving the scale imposed by the geometries of perforations. We propose a neural network-based mesh-free approach for perforated domain problems. The method is… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 18 pages, 6 figures

    MSC Class: 65N99; 65C05; 68T07

  25. arXiv:2403.10882  [pdf, other

    cs.CL cs.AI

    Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean

    Authors: ChangSu Choi, Yongbin Jeong, Seoyoon Park, InHo Won, HyeonSeok Lim, SangMin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, Hye** Lee, Younggyun Hahm, Hansaem Kim, KyungTae Lim

    Abstract: Large language models (LLMs) use pretraining to predict the subsequent word; however, their expansion requires significant computing resources. Numerous big tech companies and research institutes have developed multilingual LLMs (MLLMs) to meet current demands, overlooking less-resourced languages (LRLs). This study proposed three strategies to enhance the performance of LRLs based on the publicly… ▽ More

    Submitted 21 March, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  26. arXiv:2403.10576  [pdf, other

    cs.CR cs.CL cs.LG

    Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain

    Authors: Eugene Jang, Jian Cui, Dayeon Yim, Young** **, **-Woo Chung, Seungwon Shin, Yongjae Lee

    Abstract: Cybersecurity information is often technically complex and relayed through unstructured text, making automation of cyber threat intelligence highly challenging. For such text domains that involve high levels of expertise, pretraining on in-domain corpora has been a popular method for language models to obtain domain expertise. However, cybersecurity texts often contain non-linguistic elements (suc… ▽ More

    Submitted 2 April, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: To appear in NAACL Findings 2024

    ACM Class: I.2.7

  27. arXiv:2403.10506  [pdf, other

    cs.RO cs.AI cs.LG

    HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation

    Authors: Carmelo Sferrazza, Dun-Ming Huang, Xingyu Lin, Youngwoon Lee, Pieter Abbeel

    Abstract: Humanoid robots hold great promise in assisting humans in diverse environments and tasks, due to their flexibility and adaptability leveraging human-like morphology. However, research in humanoid robots is often bottlenecked by the costly and fragile hardware setups. To accelerate algorithmic research in humanoid robots, we present a high-dimensional, simulated robot learning benchmark, HumanoidBe… ▽ More

    Submitted 18 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  28. arXiv:2403.09168  [pdf, other

    cs.HC

    VIVID: Human-AI Collaborative Authoring of Vicarious Dialogues from Lecture Videos

    Authors: Seulgi Choi, Hyewon Lee, Yoonjoo Lee, Juho Kim

    Abstract: The lengthy monologue-style online lectures cause learners to lose engagement easily. Designing lectures in a "vicarious dialogue" format can foster learners' cognitive activities more than monologue-style. However, designing online lectures in a dialogue style catered to the diverse needs of learners is laborious for instructors. We conducted a design workshop with eight educational experts and s… ▽ More

    Submitted 10 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  29. arXiv:2403.08500  [pdf

    cond-mat.mtrl-sci physics.optics

    Highly confined epsilon-near-zero- and surface-phonon polaritons in SrTiO3 membranes

    Authors: Ruijuan Xu, Iris Crassee, Hans A. Bechtel, Yixi Zhou, Adrien Bercher, Lukas Korosec, Carl Willem Rischau, Jérémie Teyssier, Kevin J. Crust, Yonghun Lee, Stephanie N. Gilbert Corder, Jiarui Li, Jennifer A. Dionne, Harold Y. Hwang, Alexey B. Kuzmenko, Yin Liu

    Abstract: Recent theoretical studies have suggested that transition metal perovskite oxide membranes can enable surface phonon polaritons in the infrared range with low loss and much stronger subwavelength confinement than bulk crystals. Such modes, however, have not been experimentally observed so far. Here, using a combination of far-field Fourier-transform infrared (FTIR) spectroscopy and near-field sync… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  30. arXiv:2403.08272  [pdf, other

    cs.CL

    RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education

    Authors: Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn, Alice Oh

    Abstract: The integration of generative AI in education is expanding, yet empirical analyses of large-scale and real-world interactions between students and AI systems still remain limited. Addressing this gap, we present RECIPE4U (RECIPE for University), a dataset sourced from a semester-long experiment with 212 college students in English as Foreign Language (EFL) writing courses. During the study, studen… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2309.13243

  31. arXiv:2403.08058  [pdf, other

    cs.LG cs.CL

    CHAI: Clustered Head Attention for Efficient LLM Inference

    Authors: Saurabh Agarwal, Bilge Acun, Basil Hosmer, Mostafa Elhoushi, Ye** Lee, Shivaram Venkataraman, Dimitris Papailiopoulos, Carole-Jean Wu

    Abstract: Large Language Models (LLMs) with hundreds of billions of parameters have transformed the field of machine learning. However, serving these models at inference time is both compute and memory intensive, where a single request can require multiple GPUs and tens of Gigabytes of memory. Multi-Head Attention is one of the key components of LLMs, which can account for over 50% of LLMs memory and comput… ▽ More

    Submitted 27 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  32. arXiv:2403.07598  [pdf, other

    cs.CV

    Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference

    Authors: Changmin Jeon, Seonjun Kim, Juheon Yi, Youngki Lee

    Abstract: In this paper, we present Mondrian, an edge system that enables high-performance object detection on high-resolution video streams. Many lightweight models and system optimization techniques have been proposed for resource-constrained devices, but they do not fully utilize the potential of the accelerators over dynamic, high-resolution videos. To enable such capability, we devise a novel Compressi… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  33. arXiv:2403.06009  [pdf, other

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  34. arXiv:2403.04613  [pdf, other

    stat.ME

    Simultaneous Conformal Prediction of Missing Outcomes with Propensity Score $ε$-Discretization

    Authors: Yonghoon Lee, Edgar Dobriban, Eric Tchetgen Tchetgen

    Abstract: We study the problem of simultaneous predictive inference on multiple outcomes missing at random. We consider a suite of possible simultaneous coverage properties, conditionally on the missingness pattern and on the -- possibly discretized/binned -- feature values. For data with discrete feature distributions, we develop a procedure which attains feature- and missingness-conditional coverage; and… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  35. Probing the mixing between sterile and tau neutrinos in the SHiP experiment

    Authors: Ki-Young Choi, Sung Hyun Kim, Yeong Gyun Kim, Kang Young Lee, Kyong Sei Lee, Byung Do Park, Jong Yoon Sohn, Seong Moon Yoo, Chun Sil Yoon

    Abstract: We study the expected sensitivity to the mixing between sterile and tau neutrinos directly from the tau neutrino disappearance in the high-energy fixed target experiment. Here, the beam energy is large enough to produce tau neutrinos at the target with large luminosity. During their propagation to the detector, tau neutrinos may oscillate into sterile neutrinos. By examining the energy spectrum of… ▽ More

    Submitted 26 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

    Journal ref: J. High Energ. Phys. 2024, 166 (2024)

  36. arXiv:2403.03742  [pdf, other

    cs.HC

    Mitigating Ageism through Virtual Reality: Intergenerational Collaborative Escape Room Design

    Authors: Ruotong Zou, Shuyu Yin, Tianqi Song, Peinuan Qin, Yi-Chieh Lee

    Abstract: As virtual reality (VR) becomes more popular for intergenerational collaboration, there is still a significant gap in research regarding understanding the potential for reducing ageism. Our study aims to address this gap by analyzing ageism levels before and after VR escape room collaborative experiences. We recruited 28 participants to collaborate with an older player in a challenging VR escape r… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  37. arXiv:2403.03392  [pdf, other

    physics.ins-det astro-ph.IM hep-ex

    Pulse shape discrimination in an organic scintillation phoswich detector using machine learning techniques

    Authors: Yu** Lee, **young Kim, Byoung-cheol Koh, Young Soo Yoon, Chang Hyon Ha

    Abstract: We developed machine learning algorithms for distinguishing scintillation signals from a plastic-liquid coupled detector known as a phoswich. The challenge lies in discriminating signals from organic scintillators with similar shapes and short decay times. Using a single-readout phoswich detector, we successfully identified $γ$ radiation signals from two scintillating components. Our Boosted Decis… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 11pages, 7 figures

  38. arXiv:2403.03004  [pdf, other

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  39. arXiv:2403.02939  [pdf, other

    cs.DL cs.AI cs.CL cs.HC

    PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected Papers

    Authors: Yoonjoo Lee, Hyeonsu B. Kang, Matt Latzke, Juho Kim, Jonathan Bragg, Joseph Chee Chang, Pao Siangliulue

    Abstract: With the rapid growth of scholarly archives, researchers subscribe to "paper alert" systems that periodically provide them with recommendations of recently published papers that are similar to previously collected papers. However, researchers sometimes struggle to make sense of nuanced connections between recommended papers and their own research context, as existing systems only present paper tit… ▽ More

    Submitted 9 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted to CHI 2024

  40. arXiv:2403.02892  [pdf, other

    cs.CV cs.AI

    Enhancing Long-Term Person Re-Identification Using Global, Local Body Part, and Head Streams

    Authors: Duy Tran Thanh, Yee** Lee, Byeongkeun Kang

    Abstract: This work addresses the task of long-term person re-identification. Typically, person re-identification assumes that people do not change their clothes, which limits its applications to short-term scenarios. To overcome this limitation, we investigate long-term person re-identification, which considers both clothes-changing and clothes-consistent scenarios. In this paper, we propose a novel framew… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 16 pages

    Journal ref: Neurocomputing, 2024

  41. arXiv:2403.02870  [pdf, other

    cs.AI cs.CR cs.LG

    Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices

    Authors: Younghan Lee, Sohee Jun, Yungi Cho, Woorim Han, Hyungon Moon, Yunheung Paek

    Abstract: With growing popularity, deep learning (DL) models are becoming larger-scale, and only the companies with vast training datasets and immense computing power can manage their business serving such large models. Most of those DL models are proprietary to the companies who thus strive to keep their private models safe from the model extraction attack (MEA), whose aim is to steal the model by training… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by 27th European Symposium on Research in Computer Security (ESORICS 2022)

  42. arXiv:2403.02846  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

    Authors: Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

    Abstract: Federated Learning (FL) thrives in training a global model with numerous clients by only sharing the parameters of their local models trained with their private training datasets. Therefore, without revealing the private dataset, the clients can obtain a deep learning (DL) model with high performance. However, recent research proposed poisoning attacks that cause a catastrophic loss in the accurac… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by 28th European Symposium on Research in Computer Security (ESORICS 2023)

  43. arXiv:2403.02752  [pdf, other

    cs.HC

    HINTs: Sensemaking on large collections of documents with Hypergraph visualization and INTelligent agents

    Authors: Sam Yu-Te Lee, Kwan-Liu Ma

    Abstract: Sensemaking on a large collection of documents (corpus) is a challenging task often found in fields such as market research, legal studies, intelligence analysis, political science, computational linguistics, etc. Previous works approach this problem either from a topic- or entity-based perspective, but they lack interpretability and trust due to poor model alignment. In this paper, we present HIN… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  44. arXiv:2403.02734  [pdf, ps, other

    cond-mat.str-el cond-mat.mtrl-sci

    Strain tunable electronic ground states in two-dimensional iridate thin films

    Authors: Donghan Kim, Byungmin Sohn, Yeonjae Lee, Jeongkeun Song, Mi Kyung Kim, Minjae Kim, Tae Won Noh, Changyoung Kim

    Abstract: Quantum phases of matter such as superconducting, ferromagnetic and Wigner crystal states are often driven by the two-dimensionality (2D) of correlated systems. Meanwhile, spin-orbit coupling (SOC) is a fundamental element leading to nontrivial topology which gives rise to quantum phenomena such as the large anomalous Hall effect and nontrivial superconductivity. However, the search for controllab… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures

  45. arXiv:2403.02638  [pdf, other

    hep-ex astro-ph.IM

    Real-time portable muography with Hankuk Atmospheric-muon Wide Landsca** : HAWL

    Authors: J. Seo, N. Carlin, D. F. F. S. Cavalcante, J. S. Chung, L. E. Franca, C. Ha, J. Kim, J. Y. Kim, H. Kimku, B. C. Koh, Y. J. Lee, B. B. Manzato, S. W. Oh, R. L. C. Pitta, S. J. Won

    Abstract: Cosmic ray muons prove valuable across various fields, from particle physics experiments to non-invasive tomography, thanks to their high flux and exceptional penetrating capability. Utilizing a scintillator detector, one can effectively study the topography of mountains situated above tunnels and underground spaces. The Hankuk Atmospheric-muon Wide Landsca** (HAWL) project successfully charts t… ▽ More

    Submitted 28 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 10pages, 12 figures

  46. arXiv:2403.01858  [pdf, other

    cs.CL

    An Improved Traditional Chinese Evaluation Suite for Foundation Model

    Authors: Zhi-Rui Tam, Ya-Ting Pai, Yen-Wei Lee, Sega Cheng, Hong-Han Shuai

    Abstract: We present TMMLU+, a comprehensive dataset designed for the Traditional Chinese massive multitask language understanding dataset. TMMLU+ is a multiple-choice question-answering dataset with 66 subjects from elementary to professional level. Compared to its predecessor, TMMLU, TMMLU+ is six times larger and boasts a more balanced subject distribution. We included benchmark results in TMMLU+ from cl… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  47. arXiv:2403.01749  [pdf, other

    cs.CL

    Differentially Private Synthetic Data via Foundation Model APIs 2: Text

    Authors: Chulin Xie, Zinan Lin, Arturs Backurs, Sivakanth Gopi, Da Yu, Huseyin A Inan, Harsha Nori, Haotian Jiang, Huishuai Zhang, Yin Tat Lee, Bo Li, Sergey Yekhanin

    Abstract: Text data has become extremely valuable due to the emergence of machine learning algorithms that learn from it. A lot of high-quality text data generated in the real world is private and therefore cannot be shared or used freely due to privacy concerns. Generating synthetic replicas of private text data with a formal privacy guarantee, i.e., differential privacy (DP), offers a promising and scalab… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  48. arXiv:2403.01479  [pdf, other

    cs.CL cs.AI

    Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation

    Authors: Heegon **, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh, Yeonsoo Lee

    Abstract: The advent of scalable deep models and large datasets has improved the performance of Neural Machine Translation. Knowledge Distillation (KD) enhances efficiency by transferring knowledge from a teacher model to a more compact student model. However, KD approaches to Transformer architecture often rely on heuristics, particularly when deciding which teacher layers to distill from. In this paper, w… ▽ More

    Submitted 25 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

    MSC Class: 68T50 ACM Class: I.2.7

  49. arXiv:2403.00827  [pdf, other

    cs.CL cs.AI cs.LG

    Self-Refinement of Language Models from External Proxy Metrics Feedback

    Authors: Keshav Ramji, Young-Suk Lee, Ramón Fernandez Astudillo, Md Arafat Sultan, Tahira Naseem, Asim Munawar, Radu Florian, Salim Roukos

    Abstract: It is often desirable for Large Language Models (LLMs) to capture multiple objectives when providing a response. In document-grounded response generation, for example, agent responses are expected to be relevant to a user's query while also being grounded in a given document. In this paper, we introduce Proxy Metric-based Self-Refinement (ProMiSe), which enables an LLM to refine its own initial re… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

  50. arXiv:2403.00334  [pdf, other

    cs.HC

    NOVA: A visual interface for assessing polarizing media coverage

    Authors: Keshav Dasu, Sam Yu-Te Lee, Ying-Cheng Chen, Kwan-Liu Ma

    Abstract: Within the United States, the majority of the populace receives their news online. U.S mainstream media outlets both generate and influence the news consumed by U.S citizens. Many of these citizens have their personal beliefs about these outlets and question the fairness of their reporting. We offer an interactive visualization system for the public to assess their perception of the mainstream med… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.