Showing 1–50 of 263 results for author: Lee, E

Searching in archive cs.
  1. arXiv:2407.00878  [pdf, other]

    cs.DC cs.LG

    A Robust Power Model Training Framework for Cloud Native Runtime Energy Metric Exporter

    Authors: Sunyanan Choochotkaew, Chen Wang, Huamin Chen, Tatsuhiro Chiba, Marcelo Amaral, Eun Kyung Lee, Tamar Eilam

    Abstract: Estimating power consumption in modern Cloud environments is essential for carbon quantification toward green computing. Specifically, it is important to properly account for the power consumed by each of the running applications, which are packaged as containers. This paper examines multiple challenges associated with this goal. The first challenge is that multiple customers are sharing the same…

    Submitted 9 April, 2024; originally announced July 2024.

    Comments: This is a full-version (8-page) paper of our previous publication in IEEE MASCOTS 2023, which has been accepted as a 4-page short paper (https://ieeexplore.ieee.org/document/10387542)

  2. arXiv:2407.00176  [pdf, other]

    cs.LG cs.CV

    The impact of model size on catastrophic forgetting in Online Continual Learning

    Authors: Eunhae Lee

    Abstract: This study investigates the impact of model size on Online Continual Learning performance, with a focus on catastrophic forgetting. Employing ResNet architectures of varying sizes, the research examines how network depth and width affect model performance in class-incremental learning using the SplitCIFAR-10 dataset. Key findings reveal that larger models do not guarantee better Continual Learning…

    Submitted 28 June, 2024; originally announced July 2024.

  3. arXiv:2406.19971  [pdf, other]

    cs.RO

    Perception Stitching: Zero-Shot Perception Encoder Transfer for Visuomotor Robot Policies

    Authors: Pingcheng Jian, Easop Lee, Zachary Bell, Michael M. Zavlanos, Boyuan Chen

    Abstract: Vision-based imitation learning has shown promising capabilities of endowing robots with various motion skills given visual observation. However, current visuomotor policies fail to adapt to drastic changes in their visual observations. We present Perception Stitching that enables strong zero-shot adaptation to large visual changes by directly stitching novel combinations of visual encoders. Our k…

    Submitted 28 June, 2024; originally announced June 2024.

  4. arXiv:2406.09334  [pdf, other]

    cs.CL

    ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models

    Authors: David Anugraha, Genta Indra Winata, Chenyue Li, Patrick Amadeus Irawan, En-Shiun Annie Lee

    Abstract: Performance prediction is a method to estimate the performance of Language Models (LMs) on various Natural Language Processing (NLP) tasks, mitigating computational costs associated with model capacity and data for fine-tuning. Our paper introduces ProxyLM, a scalable framework for predicting LM performance using proxy models in multilingual tasks. These proxy models act as surrogates, approximati…

    Submitted 14 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Preprint

  5. arXiv:2406.06253  [pdf, other]

    eess.SY cs.PL

    PretVM: Predictable, Efficient Virtual Machine for Real-Time Concurrency

    Authors: Shaokai Lin, Erling Jellum, Mirco Theile, Tassilo Tanneberger, Binqi Sun, Chadlia Jerad, Ruomu Xu, Guangyu Feng, Christian Menard, Marten Lohstroh, Jeronimo Castrillon, Sanjit Seshia, Edward Lee

    Abstract: This paper introduces the Precision-Timed Virtual Machine (PretVM), an intermediate platform facilitating the execution of quasi-static schedules compiled from a subset of programs written in the Lingua Franca (LF) coordination language. The subset consists of those programs that in principle should have statically verifiable and predictable timing behavior. The PretVM provides a schedule with wel…

    Submitted 25 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  6. arXiv:2406.03368  [pdf, other]

    cs.CL cs.AI

    IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

    Authors: David Ifeoluwa Adelani, Jessica Ojo, Israel Abebe Azime, Jian Yun Zhuang, Jesujoba O. Alabi, Xuanli He, Millicent Ochieng, Sara Hooker, Andiswa Bukula, En-Shiun Annie Lee, Chiamaka Chukwuneke, Happy Buzaaba, Blessing Sibanda, Godson Kalipe, Jonathan Mukiibi, Salomon Kabongo, Foutse Yuehgoh, Mmasibidi Setaka, Lolwethu Ndolela, Nkiruka Odu, Rooweither Mabuya, Shamsuddeen Hassan Muhammad, Salomey Osei, Sokhar Samb, Tadesse Kebede Guge, et al. (1 additional author not shown)

    Abstract: Despite the widespread adoption of Large language models (LLMs), their remarkable capabilities remain limited to a few high-resource languages. Additionally, many low-resource languages (e.g. African languages) are often evaluated only on basic text classification tasks due to the lack of appropriate or comprehensive benchmarks outside of high-resource languages. In this paper, we introduce IrokoB…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Under review

  7. arXiv:2405.20867  [pdf, other]

    cs.CV cs.AI cs.CC

    Automatic Channel Pruning for Multi-Head Attention

    Authors: Eunho Lee, Youngbae Hwang

    Abstract: Despite the strong performance of Transformers, their quadratic computation complexity presents challenges in applying them to vision tasks. Automatic pruning is one of the effective methods for reducing computation complexity without heuristic approaches. However, directly applying it to multi-head attention is not straightforward due to channel misalignment. In this paper, we propose an automatic ch…

    Submitted 31 May, 2024; originally announced May 2024.

  8. arXiv:2405.18148  [pdf, other]

    cs.CV cs.AI

    Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation

    Authors: JuneHyoung Kwon, Eunju Lee, Yunsung Cho, YoungBin Kim

    Abstract: Weakly supervised semantic segmentation (WSSS) employing weak forms of labels has been actively studied to alleviate the annotation cost of acquiring pixel-level labels. However, classifiers trained on biased datasets tend to exploit shortcut features and make predictions based on spurious correlations between certain backgrounds and objects, leading to a poor generalization performance. In this p…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to WACV 2024

  9. arXiv:2405.16021  [pdf, other]

    cs.RO

    VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

    Authors: Michael Ahn, Montserrat Gonzalez Arenas, Matthew Bennice, Noah Brown, Christine Chan, Byron David, Anthony Francis, Gavin Gonzalez, Rainer Hessmer, Tomas Jackson, Nikhil J Joshi, Daniel Lam, Tsang-Wei Edward Lee, Alex Luong, Sharath Maddineni, Harsh Patel, Jodilyn Peralta, Jornell Quiambao, Diego Reyes, Rosario M Jauregui Ruano, Dorsa Sadigh, Pannag Sanketi, Leila Takayama, Pavel Vodenski, Fei Xia

    Abstract: Robots today can exploit the rich world knowledge of large language models to chain simple behavioral skills into long-horizon tasks. However, robots often get interrupted during long-horizon tasks due to primitive skill failures and dynamic environments. We propose VADER, a plan, execute, detect framework with seeking help as a new skill that enables robots to recover and complete long-horizon ta…

    Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  10. arXiv:2405.12117  [pdf, other]

    cs.DC

    Strongly-Consistent Distributed Discrete-event Systems

    Authors: Peter Donovan, Erling Jellum, Byeonggil Jun, Hokeun Kim, Edward A. Lee, Shaokai Lin, Marten Lohstroh, Anirudh Rengarajan

    Abstract: Discrete-event (DE) systems are concurrent programs where components communicate via tagged events, where tags are drawn from a totally ordered set. Reactors are an emerging model of computation based on DE and realized in the open-source coordination language Lingua Franca. Distributed DE (DDE) systems are DE systems where the components (reactors) communicate over networks. The prior art has req…

    Submitted 20 May, 2024; originally announced May 2024.

  11. arXiv:2405.11125  [pdf, other]

    cs.CL

    A Reproducibility Study on Quantifying Language Similarity: The Impact of Missing Values in the URIEL Knowledge Base

    Authors: Hasti Toossi, Guo Qing Huai, Jinyu Liu, Eric Khiu, A. Seza Doğruöz, En-Shiun Annie Lee

    Abstract: In the pursuit of supporting more languages around the world, tools that characterize properties of languages play a key role in expanding the existing multilingual NLP research. In this study, we focus on a widely used typological knowledge base, URIEL, which aggregates linguistic information into numeric vectors. Specifically, we delve into the soundness and reproducibility of the approach taken…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: NAACL 2024 SRW

  12. arXiv:2405.00738  [pdf, other]

    cs.AR cs.AI cs.LG

    HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis

    Authors: Andy He, Darren Key, Mason Bulling, Andrew Chang, Skyler Shapiro, Everett Lee

    Abstract: Graphics Processing Units (GPUs) have become the leading hardware accelerator for deep learning applications and are used widely in training and inference of transformers; transformers have achieved state-of-the-art performance in many areas of machine learning and are especially used in most modern Large Language Models (LLMs). However, GPUs require large amounts of energy, which poses environmen…

    Submitted 29 April, 2024; originally announced May 2024.

    Comments: 7 pages, 2 figures

  13. Understanding the Cluster LP for Correlation Clustering

    Authors: Nairen Cao, Vincent Cohen-Addad, Euiwoong Lee, Shi Li, Alantha Newman, Lukas Vogl

    Abstract: In the classic Correlation Clustering problem introduced by Bansal, Blum, and Chawla (FOCS 2002), the input is a complete graph where edges are labeled either $+$ or $-$, and the goal is to find a partition of the vertices that minimizes the sum of the $+$edges across parts plus the sum of the $-$edges within parts. In recent years, Chawla, Makarychev, Schramm and Yaroslavtsev (STOC 2015) gave a 2.06-…

    Submitted 26 April, 2024; originally announced April 2024.

  14. arXiv:2404.13852  [pdf, other]

    cs.RO

    Toward Robust LiDAR based 3D Object Detection via Density-Aware Adaptive Thresholding

    Authors: Eunho Lee, Minwoo Jung, Ayoung Kim

    Abstract: Robust 3D object detection is a core challenge for autonomous mobile systems in field robotics. To tackle this issue, many researchers have demonstrated improvements in 3D object detection performance in datasets. However, real-world urban scenarios with unstructured and dynamic situations can still lead to numerous false positives, posing a challenge for robust 3D object detection models. This pa…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 5 pages, 4 figures, Accepted to the IEEE ICRA Workshop on Field Robotics 2024

  15. arXiv:2404.08672  [pdf, other]

    cs.IR cs.AI cs.CL cs.CY cs.LG

    Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

    Authors: Hwiyeol Jo, Taiwoo Park, Nayoung Choi, Changbong Kim, Ohjoon Kwon, Donghyeon Jeon, Hyunwoo Lee, Eui-Hyeon Lee, Kyoungho Shin, Sun Suk Lim, Kyungmi Kim, Jihye Lee, Sun Kim

    Abstract: Although there has been a growing interest among industries to integrate generative LLMs into their services, limited experience and scarcity of resources act as a barrier in launching and servicing large-scale LLM-based conversational services. In this paper, we share our experiences in developing and operating generative AI models within a national-scale search engine, with a specific focus on…

    Submitted 5 April, 2024; originally announced April 2024.

  16. arXiv:2404.04212  [pdf, other]

    cs.CL

    Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation

    Authors: Tong Su, Xin Peng, Sarubi Thillainathan, David Guzmán, Surangika Ranathunga, En-Shiun Annie Lee

    Abstract: Parameter-efficient fine-tuning (PEFT) methods are increasingly vital in adapting large-scale pre-trained language models for diverse tasks, offering a balance between adaptability and computational efficiency. They are important in Low-Resource Language (LRL) Neural Machine Translation (NMT) to enhance translation accuracy with minimal resources. However, their practical effectiveness varies sign…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted to the Findings of NAACL 2024

  17. arXiv:2404.03745  [pdf, other]

    cs.HC cs.AI cs.CL

    Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations

    Authors: Mahjabin Nahar, Haeseung Seo, Eun-Ju Lee, Dongwon Lee

    Abstract: The widespread adoption and transformative effects of large language models (LLMs) have sparked concerns regarding their capacity to produce inaccurate and fictitious content, referred to as 'hallucinations'. Given the potential risks associated with hallucinations, humans should be able to identify them. This research aims to understand the human perception of LLM hallucinations by systematically…

    Submitted 4 April, 2024; originally announced April 2024.

  18. Towards Pareto Optimal Throughput in Small Language Model Serving

    Authors: Pol G. Recasens, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Alaa Youssef, Jordi Torres, Josep Ll. Berral

    Abstract: Large language models (LLMs) have revolutionized the state-of-the-art of many different natural language processing tasks. Although serving LLMs is computationally and memory demanding, the rise of Small Language Models (SLMs) offers new opportunities for resource-constrained users, who now are able to serve small models with cutting-edge performance. In this paper, we present a set of experiments…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: It is going to be published at EuroMLSys'24

  19. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  20. arXiv:2403.19497  [pdf, other]

    cs.CV

    Surface-based parcellation and vertex-wise analysis of ultra high-resolution ex vivo 7 tesla MRI in Alzheimer's disease and related dementias

    Authors: Pulkit Khandelwal, Michael Tran Duong, Lisa Levorse, Constanza Fuentes, Amanda Denning, Winifred Trotman, Ranjit Ittyerah, Alejandra Bahena, Theresa Schuck, Marianna Gabrielyan, Karthik Prabhakaran, Daniel Ohm, Gabor Mizsei, John Robinson, Monica Munoz, John Detre, Edward Lee, David Irwin, Corey McMillan, M. Dylan Tisdall, Sandhitsu Das, David Wolk, Paul A. Yushkevich

    Abstract: Magnetic resonance imaging (MRI) is the standard modality to understand human brain structure and function in vivo (antemortem). Decades of research in human neuroimaging has led to the widespread development of methods and tools to provide automated volume-based segmentations and surface-based parcellations which help localize brain functions to specialized anatomical regions. Recently ex vivo (p…

    Submitted 2 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  21. arXiv:2403.12024  [pdf, other]

    cs.CL

    Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems

    Authors: Bo-Han Lu, Yi-Hsuan Lin, En-Shiun Annie Lee, Richard Tzong-Han Tsai

    Abstract: Machine translation focuses mainly on high-resource languages (HRLs), while low-resource languages (LRLs) like Taiwanese Hokkien are relatively under-explored. The study aims to address this gap by developing a dual translation model between Taiwanese Hokkien and both Traditional Mandarin Chinese and English. We employ a pre-trained LLaMA 2-7B model specialized in Traditional Mandarin Chinese to l…

    Submitted 14 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024 as a long oral paper

  22. arXiv:2403.08983  [pdf, ps, other]

    cs.DS

    Approximating Small Sparse Cuts

    Authors: Aditya Anand, Euiwoong Lee, Jason Li, Thatchaphol Saranurak

    Abstract: We study polynomial-time approximation algorithms for (edge/vertex) Sparsest Cut and Small Set Expansion in terms of $k$, the number of edges or vertices cut in the optimal solution. Our main results are $\mathcal{O}(\text{polylog}\, k)$-approximation algorithms for various versions in this setting. Our techniques involve an extension of the notion of sample sets (Feige and Mahdian STOC'06), ori…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 49 Pages, to appear at STOC 2024

  23. arXiv:2403.01628  [pdf, ps, other]

    cs.LG

    Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium

    Authors: Hyewon Jeong, Sarah Jabbour, Yuzhe Yang, Rahul Thapta, Hussein Mozannar, William Jongwon Han, Nikita Mehandru, Michael Wornow, Vladislav Lialin, Xin Liu, Alejandro Lozano, Jiacheng Zhu, Rafal Dariusz Kocielnik, Keith Harrigian, Haoran Zhang, Edward Lee, Milos Vukadinovic, Aparna Balagopalan, Vincent Jeanselme, Katherine Matton, Ilker Demirel, Jason Fries, Parisa Rashidi, Brett Beaulieu-Jones, Xuhai Orson Xu, et al. (18 additional authors not shown)

    Abstract: The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the ML4H community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir…

    Submitted 5 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: ML4H 2023, Research Roundtables

  24. arXiv:2402.18263  [pdf, ps, other]

    cs.DS cs.CC

    Max-Cut with $ε$-Accurate Predictions

    Authors: Vincent Cohen-Addad, Tommaso d'Orsi, Anupam Gupta, Euiwoong Lee, Debmalya Panigrahi

    Abstract: We study the approximability of the MaxCut problem in the presence of predictions. Specifically, we consider two models: in the noisy predictions model, for each vertex we are given its correct label in $\{-1,+1\}$ with some unknown probability $1/2 + ε$, and the other (incorrect) label otherwise. In the more-informative partial predictions model, for each vertex we are given its correct label wit…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 18 pages

    ACM Class: F.0

  25. arXiv:2402.18144  [pdf]

    cs.AI cs.CY

    Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information

    Authors: Seungjong Sun, Eungu Lee, Dongyan Nan, Xiangying Zhao, Wonbyung Lee, Bernard J. Jansen, Jang Hyun Kim

    Abstract: Large language models exhibit societal biases associated with demographic information, including race, gender, and others. Endowing such language models with personalities based on demographic data can enable generating opinions that align with those of humans. Building on this idea, we propose "random silicon sampling," a method to emulate the opinions of the human population sub-group. Our study…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 25 pages, 4 figures, 19 Tables

    ACM Class: I.2.7

  26. arXiv:2402.11450  [pdf, other]

    cs.RO

    Learning to Learn Faster from Human Feedback with Language Model Predictive Control

    Authors: Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil Joshi, Ben Jyenis, Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore, et al. (25 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to perform new tasks. However, these capabilities (driven by in-context learning) are limited to short-term interactions, where users' feedback remains relevant for o…

    Submitted 31 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  27. arXiv:2402.07872  [pdf, other]

    cs.RO cs.CL cs.CV cs.LG

    PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

    Authors: Soroush Nasiriany, Fei Xia, Wenhao Yu, Ted Xiao, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee, Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter

    Abstract: Vision language models (VLMs) have shown impressive capabilities across a variety of tasks, from logical reasoning to visual understanding. This opens the door to richer interaction with the world, for example robotic control. However, VLMs produce only textual outputs, while robotic control and other spatial tasks require outputting continuous coordinates, actions, or trajectories. How can we ena…

    Submitted 12 February, 2024; originally announced February 2024.

  28. arXiv:2402.05512  [pdf, other]

    cs.CL cs.AI

    GPTs Are Multilingual Annotators for Sequence Generation Tasks

    Authors: Juhwan Choi, Eunju Lee, Kyohoon Jin, YoungBin Kim

    Abstract: Data annotation is an essential step for constructing new datasets. However, the conventional approach of data annotation through crowdsourcing is both time-consuming and expensive. In addition, the complexity of this process increases when dealing with low-resource languages owing to the difference in the language pool of crowdworkers. To address these issues, this study proposes an autonomous an…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: EACL 2024 Findings: Camera-ready version

  29. arXiv:2402.02633  [pdf, other]

    cs.CL cs.LG

    Predicting Machine Translation Performance on Low-Resource Languages: The Role of Domain Similarity

    Authors: Eric Khiu, Hasti Toossi, David Anugraha, Jinyu Liu, Jiaxu Li, Juan Armando Parra Flores, Leandro Acros Roman, A. Seza Doğruöz, En-Shiun Annie Lee

    Abstract: Fine-tuning and testing a multilingual large language model is expensive and challenging for low-resource languages (LRLs). While previous studies have predicted the performance of natural language processing (NLP) tasks using machine learning methods, they primarily focus on high-resource languages, overlooking LRLs and shifts across domains. Focusing on LRLs, we investigate three factors: the si…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures, accepted to EACL 2024, findings

  30. arXiv:2402.00219  [pdf, other]

    cs.DC cs.AI cs.LG

    FedCore: Straggler-Free Federated Learning with Distributed Coresets

    Authors: Hongpeng Guo, Haotian Gu, Xiaoyang Wang, Bo Chen, Eun Kyung Lee, Tamar Eilam, Deming Chen, Klara Nahrstedt

    Abstract: Federated learning (FL) is a machine learning paradigm that allows multiple clients to collaboratively train a shared model while keeping their data on-premise. However, the straggler issue, due to slow clients, often hinders the efficiency and scalability of FL. This paper presents FedCore, an algorithm that innovatively tackles the straggler problem via the decentralized selection of coresets, r…

    Submitted 31 January, 2024; originally announced February 2024.

  31. arXiv:2401.15140  [pdf, other]

    math.DS cs.SI

    Link Prediction Accuracy on Real-World Networks Under Non-Uniform Missing Edge Patterns

    Authors: Xie He, Amir Ghasemian, Eun Lee, Alice Schwarze, Aaron Clauset, Peter J. Mucha

    Abstract: Real-world network datasets are typically obtained in ways that fail to capture all edges. The patterns of missing data are often non-uniform as they reflect biases and other shortcomings of different data collection methods. Nevertheless, uniform missing data is a common assumption made when no additional information is available about the underlying missing-edge pattern, and link prediction meth…

    Submitted 30 April, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Submitted to PLOS ONE

  32. arXiv:2401.13819  [pdf, ps, other]

    cs.DS

    Separating $k$-Median from the Supplier Version

    Authors: Aditya Anand, Euiwoong Lee

    Abstract: Given a metric space $(V, d)$ along with an integer $k$, the $k$-Median problem asks to open $k$ centers $C \subseteq V$ to minimize $\sum_{v \in V} d(v, C)$, where $d(v, C) := \min_{c \in C} d(v, c)$. While the best-known approximation ratio of $2.613$ holds for the more general supplier version where an additional set $F \subseteq V$ is given with the restriction $C \subseteq F$, the best known…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 20 pages; To appear at IPCO 2024

  33. arXiv:2401.12963  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

    Authors: Michael Ahn, Debidatta Dwibedi, Chelsea Finn, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Karol Hausman, Brian Ichter, Alex Irpan, Nikhil Joshi, Ryan Julian, Sean Kirmani, Isabel Leal, Edward Lee, Sergey Levine, Yao Lu, Isabel Leal, Sharath Maddineni, Kanishka Rao, Dorsa Sadigh, Pannag Sanketi, Pierre Sermanet, Quan Vuong, Stefan Welker, Fei Xia, Ted Xiao, et al. (3 additional authors not shown)

    Abstract: Foundation models that incorporate language, vision, and more recently actions have revolutionized the ability to harness internet scale data to reason about useful tasks. However, one of the key challenges of training embodied foundation models is the lack of data grounded in the physical world. In this paper, we propose AutoRT, a system that leverages existing foundation models to scale up the d…

    Submitted 1 July, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 26 pages, 9 figures, ICRA 2024 VLMNM Workshop

  34. arXiv:2401.09376  [pdf, other]

    cs.LG math.ST stat.ML

    Unlocking Unlabeled Data: Ensemble Learning with the Hui-Walter Paradigm for Performance Estimation in Online and Static Settings

    Authors: Kevin Slote, Elaine Lee

    Abstract: In the realm of machine learning and statistical modeling, practitioners often work under the assumption of accessible, static, labeled data for evaluation and training. However, this assumption often deviates from reality where data may be private, encrypted, difficult-to-measure, or unlabeled. In this paper, we bridge this gap by adapting the Hui-Walter paradigm, a method traditionally applied…

    Submitted 17 January, 2024; originally announced January 2024.

  35. arXiv:2401.09185  [pdf, other]

    cs.PL

    Behavior Trees with Dataflow: Coordinating Reactive Tasks in Lingua Franca

    Authors: Alexander Schulz-Rosengarten, Akash Ahmad, Malte Clement, Reinhard von Hanxleden, Benjamin Asch, Marten Lohstroh, Edward A. Lee, Gustavo Quiros Araya, Ankit Shukla

    Abstract: Behavior Trees (BTs) provide a lean set of control flow elements that are easily composable in a modular tree structure. They are well established for modeling the high-level behavior of non-player characters in computer games and recently gained popularity in other areas such as industrial automation. While BTs nicely express control, data handling aspects so far must be provided separately, e. g…

    Submitted 17 January, 2024; originally announced January 2024.

  36. arXiv:2401.05730  [pdf, other]

    cs.CV cs.AI

    Enhancing Contrastive Learning with Efficient Combinatorial Positive Pairing

    Authors: Jaeill Kim, Duhun Hwang, Eunjung Lee, Jangwon Suh, Jimyeong Kim, Wonjong Rhee

    Abstract: In the past few years, contrastive learning has played a central role for the success of visual unsupervised representation learning. Around the same time, high-performance non-contrastive learning methods have been developed as well. While most of the works utilize only two views, we carefully review the existing multi-view methods and propose a general multi-view strategy that can improve learni…

    Submitted 11 January, 2024; originally announced January 2024.

  37. arXiv:2401.00496  [pdf, other]

    cs.CV cs.AI cs.LG

    SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

    Authors: Dimitrios Psychogyios, Emanuele Colleoni, Beatrice Van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi, et al. (25 additional authors not shown)

    Abstract: Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segme…

    Submitted 23 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  38. arXiv:2312.09588  [pdf, other]

    cs.RO cs.AI

    NeuroFlow: Development of lightweight and efficient model integration scheduling strategy for autonomous driving system

    Authors: Eunbin Seo, Gwanjun Shin, Eunho Lee

    Abstract: This paper proposes a specialized autonomous driving system that takes into account the unique constraints and characteristics of automotive systems, aiming for innovative advancements in autonomous driving technology. The proposed system systematically analyzes the intricate data flow in autonomous driving and provides functionality to dynamically adjust various factors that influence deep learni…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 9 pages

  39. arXiv:2312.04704  [pdf, other]

    cs.DC cs.LG

    Efficient Parallel Reinforcement Learning Framework using the Reactor Model

    Authors: Jacky Kwok, Marten Lohstroh, Edward A. Lee

    Abstract: Parallel Reinforcement Learning (RL) frameworks are essential for mapping RL workloads to multiple computational resources, allowing for faster generation of samples, estimation of values, and policy improvement. These computational paradigms require a seamless integration of training, serving, and simulation workloads. Existing frameworks, such as Ray, are not managing this orchestration efficien…

    Submitted 2 February, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 10 pages, 11 figures

  40. arXiv:2312.01638  [pdf, other]

    eess.IV cs.CV

    J-Net: Improved U-Net for Terahertz Image Super-Resolution

    Authors: Woon-Ha Yeo, Seung-Hwan Jung, Seung Jae Oh, Inhee Maeng, Eui Su Lee, Han-Cheol Ryu

    Abstract: Terahertz (THz) waves are electromagnetic waves in the 0.1 to 10 THz frequency range, and THz imaging is utilized in a range of applications, including security inspections, biomedical fields, and the non-destructive examination of materials. However, THz images have low resolution due to the long wavelength of THz waves. Therefore, improving the resolution of THz images is one of the current hot…

    Submitted 4 December, 2023; originally announced December 2023.

  41. arXiv:2311.10922  [pdf, other]

    cs.AI cs.CL cs.DB cs.IR

    Explainable Product Classification for Customs

    Authors: Eunji Lee, Sihyeon Kim, Sundong Kim, Soyeon Jung, Heeja Kim, Meeyoung Cha

    Abstract: The task of assigning internationally accepted commodity codes (aka HS codes) to traded goods is a critical function of customs offices. Like court decisions made by judges, this task follows the doctrine of precedent and can be nontrivial even for experienced officers. Together with the Korea Customs Service (KCS), we propose a first-ever explainable decision supporting model that suggests the mo…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 24 pages, Accepted to ACM Transactions on Intelligent Systems and Technology

  42. Qualifying System F-sub

    Authors: Edward Lee, Yaoyu Zhao, James You, Kavin Satheeskumar, Ondřej Lhoták, Jonathan Brachthäuser

    Abstract: Type qualifiers offer a lightweight mechanism for enriching existing type systems to enforce additional, desirable, program invariants. They do so by offering a restricted but effective form of subtyping. While the theory of type qualifiers is well understood and present in many programming languages today, polymorphism over type qualifiers is an area that is less examined. We explore how such a p…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 24 pages

    Journal ref: Proc. ACM Program. Lang. 8, OOPSLA1, Article 115 (April 2024), 30 pages

  43. Visual-information-driven model for crowd simulation using temporal convolutional network

    Authors: Xuanwen Liang, Eric Wai Ming Lee

    Abstract: Crowd simulations play a pivotal role in building design, influencing both user experience and public safety. While traditional knowledge-driven models have their merits, data-driven crowd simulation models promise to bring a new dimension of realism to these simulations. However, most of the existing data-driven models are designed for specific geometries, leading to poor adaptability and applica…

    Submitted 9 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Journal ref: IEEE Transactions on Intelligent Transportation Systems

  44. arXiv:2311.01712  [pdf, other]

    cs.CL

    A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers

    Authors: Beomjune Kim, Eunsun Lee, Dongbin Na

    Abstract: Many users reading online articles in various magazines may suffer considerable difficulty in distinguishing the implicit intents in texts. In this work, we focus on automatically recognizing the political intents of a given online newspaper by understanding the context of the text. To solve this task, we present a novel Korean text classification dataset that contains various articles. We also pr…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 11 pages

  45. arXiv:2311.00928  [pdf, other]

    cs.RO

    Quatro++: Robust Global Registration Exploiting Ground Segmentation for Loop Closing in LiDAR SLAM

    Authors: Hyungtae Lim, Beomsoo Kim, Daebeom Kim, Eungchang Mason Lee, Hyun Myung

    Abstract: Global registration is a fundamental task that estimates the relative pose between two viewpoints of 3D point clouds. However, there are two issues that degrade the performance of global registration in LiDAR SLAM: one is the sparsity issue and the other is degeneracy. The sparsity issue is caused by the sparse characteristics of the 3D point cloud measurements in a mechanically spinning LiDAR sen…

    Submitted 21 January, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 26 pages, 23 figures

  46. arXiv:2311.00892  [pdf, other]

    cs.DS

    A PTAS for $\ell_0$-Low Rank Approximation: Solving Dense CSPs over Reals

    Authors: Vincent Cohen-Addad, Chenglin Fan, Suprovat Ghoshal, Euiwoong Lee, Arnaud de Mesmay, Alantha Newman, Tony Chang Wang

    Abstract: We consider the Low Rank Approximation problem, where the input consists of a matrix $A \in \mathbb{R}^{n_R \times n_C}$ and an integer $k$, and the goal is to find a matrix $B$ of rank at most $k$ that minimizes $\| A - B \|_0$, which is the number of entries where $A$ and $B$ differ. For any constant $k$ and $\varepsilon > 0$, we present a polynomial time $(1 + \varepsilon)$-approximation time f…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: To appear in SODA 24

  47. A Collaborative Filtering-Based Two Stage Model with Item Dependency for Course Recommendation

    Authors: Eric L. Lee, Tsung-Ting Kuo, Shou-De Lin

    Abstract: Recommender systems have been studied for decades, with numerous promising models having been proposed. Among them, Collaborative Filtering (CF) models are arguably the most successful due to their high accuracy in recommendation and elimination of privacy-concerned personal meta-data from training. This paper extends the usage of CF-based models to the task of course recommendation. We point out several…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures, 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

    Journal ref: In 2017 IEEE DSAA, pp. 496-503. IEEE, 2017

  48. arXiv:2310.10349  [pdf, other]

    cs.CR cs.AI

    Optimized Layerwise Approximation for Efficient Private Inference on Fully Homomorphic Encryption

    Authors: Junghyun Lee, Eunsang Lee, Young-Sik Kim, Yongwoo Lee, Joon-Woo Lee, Yongjune Kim, Jong-Seon No

    Abstract: Recent studies have explored the deployment of privacy-preserving deep neural networks utilizing homomorphic encryption (HE), especially for private inference (PI). Many works have attempted the approximation-aware training (AAT) approach in PI, changing the activation functions of a model to low-degree polynomials that are easier to compute on HE by allowing model retraining. However, due to cons…

    Submitted 28 May, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  49. arXiv:2310.04313  [pdf, other]

    cs.CL

    KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services

    Authors: Dasol Choi, Jooyoung Song, Eunsun Lee, Jinwoo Seo, Heejune Park, Dongbin Na

    Abstract: With the growth of online services, the need for advanced text classification algorithms, such as sentiment analysis and biased text detection, has become increasingly evident. The anonymous nature of online services often leads to the presence of biased and harmful language, posing challenges to maintaining the health of online communities. This phenomenon is especially relevant in South Korea, w…

    Submitted 12 November, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted to the NeurIPS 2023 Workshop on Socially Responsible Language Modelling Research (SoLaR)

  50. A Versatile Data Fabric for Advanced IoT-Based Remote Health Monitoring

    Authors: Italo Buleje, Vince S. Siu, Kuan Yu Hsieh, Nigel Hinds, Bing Dang, Erhan Bilal, Thanhnha Nguyen, Ellen E. Lee, Colin A. Depp, Jeffrey L. Rogers

    Abstract: This paper presents a data-centric and security-focused data fabric designed for digital health applications. With the increasing interest in digital health research, there has been a surge in the volume of Internet of Things (IoT) data derived from smartphones, wearables, and ambient sensors. Managing this vast amount of data, encompassing diverse data types and varying time scales, is crucial. M…

    Submitted 2 October, 2023; originally announced October 2023.

    Journal ref: 2023 IEEE International Conference on Digital Health (ICDH), Chicago, IL, USA, 2023, pp. 88-90