Showing 1–16 of 16 results for author: Kim, T S

Searching in archive cs.
  1. arXiv:2405.05581  [pdf, other]

    cs.HC cs.AI cs.CL

    One vs. Many: Comprehending Accurate Information from Multiple Erroneous and Inconsistent AI Generations

    Authors: Yoonjoo Lee, Kihoon Son, Tae Soo Kim, Jisu Kim, John Joon Young Chung, Eytan Adar, Juho Kim

    Abstract: As Large Language Models (LLMs) are nondeterministic, the same input can generate different outputs, some of which may be incorrect or hallucinated. If run again, the LLM may correct itself and produce the correct answer. Unfortunately, most LLM-powered systems resort to single results which, correct or not, users accept. Having the LLM produce multiple outputs may help identify disagreements or a…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted to FAccT 2024

  2. arXiv:2405.04497  [pdf, other]

    cs.HC

    Unveiling Disparities in Web Task Handling Between Human and Web Agent

    Authors: Kihoon Son, Jinhyeon Kwon, DaEun Choi, Tae Soo Kim, Young-Ho Kim, Sangdoo Yun, Juho Kim

    Abstract: With the advancement of Large-Language Models (LLMs) and Large Vision-Language Models (LVMs), agents have shown significant capabilities in various tasks, such as data analysis, gaming, or code generation. Recently, there has been a surge in research on web agents, capable of performing tasks within the web environment. However, the web poses unforeseeable scenarios, challenging the generalizabili…

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  3. arXiv:2403.06252  [pdf, other]

    cs.HC

    Demystifying Tacit Knowledge in Graphic Design: Characteristics, Instances, Approaches, and Guidelines

    Authors: Kihoon Son, DaEun Choi, Tae Soo Kim, Juho Kim

    Abstract: Despite the growing demand for professional graphic design knowledge, the tacit nature of design inhibits knowledge sharing. However, there is a limited understanding on the characteristics and instances of tacit knowledge in graphic design. In this work, we build a comprehensive set of tacit knowledge characteristics through a literature review. Through interviews with 10 professional graphic des…

    Submitted 10 March, 2024; originally announced March 2024.

  4. arXiv:2310.01287  [pdf, other]

    cs.HC

    GenQuery: Supporting Expressive Visual Search with Generative Models

    Authors: Kihoon Son, DaEun Choi, Tae Soo Kim, Young-Ho Kim, Juho Kim

    Abstract: Designers rely on visual search to explore and develop ideas in early design stages. However, designers can struggle to identify suitable text queries to initiate a search or to discover images for similarity-based search that can adequately express their intent. We propose GenQuery, a novel system that integrates generative models into the visual search process. GenQuery can automatically elabora…

    Submitted 4 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 18 pages and 12 figures

  5. arXiv:2309.13633  [pdf, other]

    cs.HC cs.AI cs.CL

    EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria

    Authors: Tae Soo Kim, Yoonjoo Lee, Jamin Shin, Young-Ho Kim, Juho Kim

    Abstract: By simply composing prompts, developers can prototype novel generative applications with Large Language Models (LLMs). To refine prototypes into products, however, developers must iteratively revise prompts by evaluating outputs to diagnose weaknesses. Formative interviews (N=8) revealed that developers invest significant effort in manually evaluating outputs as they assess context-specific and su…

    Submitted 27 February, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted to CHI 2024

  6. Papeos: Augmenting Research Papers with Talk Videos

    Authors: Tae Soo Kim, Matt Latzke, Jonathan Bragg, Amy X. Zhang, Joseph Chee Chang

    Abstract: Research consumption has been traditionally limited to the reading of academic papers: a static, dense, and formally written format. Alternatively, pre-recorded conference presentation videos, which are more dynamic, concise, and colloquial, have recently become more widely available but potentially under-utilized. In this work, we explore the design space and benefits for combining academic papers…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Accepted to UIST 2023

  7. arXiv:2304.05303  [pdf, other]

    cs.CV cs.CL

    ELVIS: Empowering Locality of Vision Language Pre-training with Intra-modal Similarity

    Authors: Sumin Seo, JaeWoong Shin, Jaewoo Kang, Tae Soo Kim, Thijs Kooi

    Abstract: Deep learning has shown great potential in assisting radiologists in reading chest X-ray (CXR) images, but its need for expensive annotations for improving performance prevents widespread clinical application. Visual language pre-training (VLP) can alleviate the burden and cost of annotation by leveraging routinely generated reports for radiographs, which exist in large quantities as well as in pa…

    Submitted 23 July, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Under review

  8. arXiv:2303.15125  [pdf, other]

    cs.HC cs.CL

    LMCanvas: Object-Oriented Interaction to Personalize Large Language Model-Powered Writing Environments

    Authors: Tae Soo Kim, Arghya Sarkar, Yoonjoo Lee, Minsuk Chang, Juho Kim

    Abstract: Large language models (LLMs) can enhance writing by automating or supporting specific tasks in writers' workflows (e.g., paraphrasing, creating analogies). Leveraging this capability, a collection of interfaces have been developed that provide LLM-powered tools for specific writing tasks. However, these interfaces provide limited support for writers to create personal tools for their own unique ta…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted to CHI 2023 Workshop on Generative AI and HCI

  9. arXiv:2303.14334  [pdf, other]

    cs.HC cs.AI cs.CL

    The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    Authors: Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, et al. (30 additional authors not shown)

    Abstract: Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has chan…

    Submitted 23 April, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  10. arXiv:2209.15314  [pdf, other]

    cs.CV

    Did You Get What You Paid For? Rethinking Annotation Cost of Deep Learning Based Computer Aided Detection in Chest Radiographs

    Authors: Tae Soo Kim, Geonwoon Jang, Sanghyup Lee, Thijs Kooi

    Abstract: As deep networks require large amounts of accurately labeled training data, a strategy to collect sufficiently large and accurate annotations is as important as innovations in recognition methods. This is especially true for building Computer Aided Detection (CAD) systems for chest X-rays where domain expertise of radiologists is required to annotate the presence and location of abnormalities on X…

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: MICCAI 2022, Contains Supplemental Material

  11. arXiv:2205.06416  [pdf, other]

    cs.CV

    Video-based assessment of intraoperative surgical skill

    Authors: Sanchit Hira, Digvijay Singh, Tae Soo Kim, Shobhit Gupta, Gregory Hager, Shameema Sikder, S. Swaroop Vedula

    Abstract: Purpose: The objective of this investigation is to provide a comprehensive analysis of state-of-the-art methods for video-based assessment of surgical skill in the operating room. Methods: Using a data set of 99 videos of capsulorhexis, a critical step in cataract surgery, we evaluate feature based methods previously developed for surgical skill assessment mostly under benchtop settings. In additi…

    Submitted 12 May, 2022; originally announced May 2022.

  12. arXiv:2104.00646  [pdf, other]

    cs.CV

    Motion Guided Attention Fusion to Recognize Interactions from Videos

    Authors: Tae Soo Kim, Jonathan Jones, Gregory D. Hager

    Abstract: We present a dual-pathway approach for recognizing fine-grained interactions from videos. We build on the success of prior dual-stream approaches, but make a distinction between the static and dynamic representations of objects and their interactions explicit by introducing separate motion and object detection pathways. Then, using our new Motion-Guided Attention Fusion module, we fuse the bottom-…

    Submitted 1 April, 2021; originally announced April 2021.

  13. arXiv:2012.02109  [pdf, other]

    cs.CV

    SAFCAR: Structured Attention Fusion for Compositional Action Recognition

    Authors: Tae Soo Kim, Gregory D. Hager

    Abstract: We present a general framework for compositional action recognition -- i.e. action recognition where the labels are composed out of simpler components such as subjects, atomic-actions and objects. The main challenge in compositional action recognition is that there is a combinatorially large set of possible actions that can be composed using basic components. However, compositionality also provide…

    Submitted 17 December, 2020; v1 submitted 3 December, 2020; originally announced December 2020.

  14. arXiv:1912.03613  [pdf, other]

    cs.CV

    DASZL: Dynamic Action Signatures for Zero-shot Learning

    Authors: Tae Soo Kim, Jonathan D. Jones, Michael Peven, Zihao Xiao, Jin Bai, Yi Zhang, Weichao Qiu, Alan Yuille, Gregory D. Hager

    Abstract: There are many realistic applications of activity recognition where the set of potential activity descriptions is combinatorially large. This makes end-to-end supervised training of a recognition system impractical as no training set is practically able to encompass the entire label set. In this paper, we present an approach to fine-grained recognition that models activities as compositions of dyn…

    Submitted 17 November, 2020; v1 submitted 7 December, 2019; originally announced December 2019.

    Comments: 10 pages, 4 figures, 3 tables, AAAI2021 submission

  15. arXiv:1711.08502  [pdf, other]

    cs.CV

    Train, Diagnose and Fix: Interpretable Approach for Fine-grained Action Recognition

    Authors: Jingxuan Hou, Tae Soo Kim, Austin Reiter

    Abstract: Despite the growing discriminative capabilities of modern deep learning methods for recognition tasks, the inner workings of the state-of-art models still remain mostly black-boxes. In this paper, we propose a systematic interpretation of model parameters and hidden representations of Residual Temporal Convolutional Networks (Res-TCN) for action recognition in time-series data. We also propose a F…

    Submitted 22 November, 2017; originally announced November 2017.

    Comments: 8 pages, 8 figures, CVPR18 submission

  16. arXiv:1704.04516  [pdf, other]

    cs.CV

    Interpretable 3D Human Action Analysis with Temporal Convolutional Networks

    Authors: Tae Soo Kim, Austin Reiter

    Abstract: The discriminative power of modern deep learning models for 3D human action recognition is growing ever so potent. In conjunction with the recent resurgence of 3D human action representation with 3D skeletons, the quality and the pace of recent progress have been significant. However, the inner workings of state-of-the-art learning based methods in 3D human action recognition still remain mostly b…

    Submitted 14 April, 2017; originally announced April 2017.

    Comments: 8 pages, 5 figures, BNMW CVPR 2017 Submission

    MSC Class: 68T45; 68T10 (Primary)

    ACM Class: I.2.10; I.5.4