Skip to main content

Showing 1–24 of 24 results for author: Kahng, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.07498  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Interactive Prompt Debugging with Sequence Salience

    Authors: Ian Tenney, Ryan Mullins, Bin Du, Shree Pandya, Minsuk Kahng, Lucas Dixon

    Abstract: We present Sequence Salience, a visual tool for interactive prompt debugging with input salience methods. Sequence Salience builds on widely used salience methods for text classification and single-token prediction, and extends this to a system tailored for debugging complex LLM prompts. Our system is well-suited for long texts, and expands on previous work by 1) providing controllable aggregation… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2404.01361  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    LLM Attributor: Interactive Visual Attribution for LLM Generation

    Authors: Seongmin Lee, Zijie J. Wang, Aishwarya Chakravarthy, Alec Helbling, ShengYun Peng, Mansi Phute, Duen Horng Chau, Minsuk Kahng

    Abstract: While large language models (LLMs) have shown remarkable capability to generate convincing text across diverse domains, concerns around its potential risks have highlighted the importance of understanding the rationale behind text generation. We present LLM Attributor, a Python library that provides interactive visualizations for training data attribution of an LLM's text generation. Our library o… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, For a video demo, see https://youtu.be/mIG2MDQKQxM

  3. arXiv:2403.12075  [pdf, other

    cs.CY cs.AI cs.CR cs.CV cs.LG

    Adversarial Nibbler: An Open Red-Teaming Method for Identifying Diverse Harms in Text-to-Image Generation

    Authors: Jessica Quaye, Alicia Parrish, Oana Inel, Charvi Rastogi, Hannah Rose Kirk, Minsuk Kahng, Erin van Liemt, Max Bartolo, Jess Tsang, Justin White, Nathan Clement, Rafael Mosquera, Juan Ciro, Vijay Janapa Reddi, Lora Aroyo

    Abstract: With the rise of text-to-image (T2I) generative AI models reaching wide audiences, it is critical to evaluate model robustness against non-obvious attacks to mitigate the generation of offensive images. By focusing on ``implicitly adversarial'' prompts (those that trigger T2I models to generate unsafe images for non-obvious reasons), we isolate a set of difficult safety issues that human creativit… ▽ More

    Submitted 13 May, 2024; v1 submitted 14 February, 2024; originally announced March 2024.

    Comments: 10 pages, 6 figures

  4. arXiv:2402.16611  [pdf, other

    cs.CL cs.AI cs.HC

    Understanding the Dataset Practitioners Behind Large Language Model Development

    Authors: Crystal Qian, Emily Reif, Minsuk Kahng

    Abstract: As large language models (LLMs) become more advanced and impactful, it is increasingly important to scrutinize the data that they rely upon and produce. What is it to be a dataset practitioner doing this work? We approach this in two parts: first, we define the role of "dataset practitioners" by performing a retrospective analysis on the responsibilities of teams contributing to LLM development at… ▽ More

    Submitted 1 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 7 pages, 2 figures. To be published in In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems (CHI EA '24). Revised to reflect updates from CHI LBW reviewer feedback

  5. arXiv:2402.14880  [pdf, other

    cs.CL cs.AI cs.HC

    Automatic Histograms: Leveraging Language Models for Text Dataset Exploration

    Authors: Emily Reif, Crystal Qian, James Wexler, Minsuk Kahng

    Abstract: Making sense of unstructured text datasets is perennially difficult, yet increasingly relevant with Large Language Models. Data workers often rely on dataset summaries, especially distributions of various derived features. Some features, like toxicity or topics, are relevant to many datasets, but many interesting features are domain specific: instruments and genres for a music dataset, or diseases… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  6. arXiv:2402.10524  [pdf, other

    cs.HC cs.AI cs.CL cs.LG

    LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

    Authors: Minsuk Kahng, Ian Tenney, Mahima Pushkarna, Michael Xieyang Liu, James Wexler, Emily Reif, Krystal Kallarackal, Minsuk Chang, Michael Terry, Lucas Dixon

    Abstract: Automatic side-by-side evaluation has emerged as a promising approach to evaluating the quality of responses from large language models (LLMs). However, analyzing the results from this evaluation approach raises scalability and interpretability challenges. In this paper, we present LLM Comparator, a novel visual analytics tool for interactively analyzing results from automatic side-by-side evaluat… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  7. arXiv:2309.06703  [pdf, other

    cs.CV cs.CL cs.HC cs.LG

    VLSlice: Interactive Vision-and-Language Slice Discovery

    Authors: Eric Slyman, Minsuk Kahng, Stefan Lee

    Abstract: Recent work in vision-and-language demonstrates that large-scale pretraining can learn generalizable models that are efficiently transferable to downstream tasks. While this may improve dataset-scale aggregate metrics, analyzing performance around hand-crafted subgroups targeting specific bias dimensions reveals systemic undesirable behaviors. However, this subgroup analysis is frequently stalled… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Conference paper at ICCV 2023. 17 pages, 11 figures. https://ericslyman.com/vlslice/

    ACM Class: I.4.10; I.2.7; J.4

  8. arXiv:2305.11364  [pdf, other

    cs.CL cs.AI

    Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models

    Authors: Emily Reif, Minsuk Kahng, Savvas Petridis

    Abstract: Large language models (LLMs) can be used to generate smaller, more refined datasets via few-shot prompting for benchmarking, fine-tuning or other use cases. However, understanding and evaluating these datasets is difficult, and the failure modes of LLM-generated data are still not well understood. Specifically, the data can be repetitive in surprising ways, not only semantically but also syntactic… ▽ More

    Submitted 27 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  9. arXiv:2206.02039  [pdf, other

    cs.AI cs.LG

    Beyond Value: CHECKLIST for Testing Inferences in Planning-Based RL

    Authors: Kin-Ho Lam, Delyar Tabatabai, Jed Irvine, Donald Bertucci, Anita Ruangrotsakun, Minsuk Kahng, Alan Fern

    Abstract: Reinforcement learning (RL) agents are commonly evaluated via their expected value over a distribution of test scenarios. Unfortunately, this evaluation approach provides limited evidence for post-deployment generalization beyond the test distribution. In this paper, we address this limitation by extending the recent CheckList testing methodology from natural language processing to planning-based… ▽ More

    Submitted 7 June, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: This work will appear in the Proceedings of the 32nd International Conference on Automated Planning and Scheduling (ICAPS2022) https://icaps22.icaps-conference.org/papers

  10. arXiv:2205.06935  [pdf, other

    cs.HC cs.AI cs.LG

    DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine Learning with Treemaps

    Authors: Donald Bertucci, Md Montaser Hamid, Yashwanthi Anand, Anita Ruangrotsakun, Delyar Tabatabai, Melissa Perez, Minsuk Kahng

    Abstract: In this paper, we present DendroMap, a novel approach to interactively exploring large-scale image datasets for machine learning (ML). ML practitioners often explore image datasets by generating a grid of images or projecting high-dimensional representations of images into 2-D using dimensionality reduction techniques (e.g., t-SNE). However, neither approach effectively scales to large datasets be… ▽ More

    Submitted 15 August, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: This paper has been accepted for the IEEE VIS 2022 Conference and will be published in the IEEE Transactions on Visualization and Computer Graphics

  11. arXiv:2109.13978  [pdf, other

    cs.AI

    Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations

    Authors: Kin-Ho Lam, Zhengxian Lin, Jed Irvine, Jonathan Dodge, Zeyad T Shureih, Roli Khanna, Minsuk Kahng, Alan Fern

    Abstract: Enabling humans to identify potential flaws in an agent's decision making is an important Explainable AI application. We consider identifying such flaws in a planning-based deep reinforcement learning (RL) agent for a complex real-time strategy game. In particular, the agent makes decisions via tree search using a learned model and evaluation function over interpretable states and actions. This gi… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

  12. arXiv:2109.06365  [pdf, other

    cs.CV cs.LG

    From Heatmaps to Structural Explanations of Image Classifiers

    Authors: Li Fuxin, Zhongang Qi, Saeed Khorram, Vivswan Shitole, Prasad Tadepalli, Minsuk Kahng, Alan Fern

    Abstract: This paper summarizes our endeavors in the past few years in terms of explaining image classifiers, with the aim of including negative results and insights we have gained. The paper starts with describing the explainable neural network (XNN), which attempts to extract and visualize several high-level concepts purely from the deep network, without relying on human linguistic concepts. This helps us… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: Submitted to Applied AI Letters

    Journal ref: Applied AI Letters.2021;2:e46

  13. arXiv:2108.08000  [pdf, other

    cs.LG cs.AI cs.HC

    Contrastive Identification of Covariate Shift in Image Data

    Authors: Matthew L. Olson, Thuy-Vy Nguyen, Gaurav Dixit, Neale Ratzlaff, Weng-Keen Wong, Minsuk Kahng

    Abstract: Identifying covariate shift is crucial for making machine learning systems robust in the real world and for detecting training data biases that are not reflected in test data. However, detecting covariate shift is challenging, especially when the data consists of high-dimensional images, and when multiple types of localized covariate shift affect different subspaces of the data. Although automated… ▽ More

    Submitted 19 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: IEEE VIS 2021

  14. arXiv:2011.06733  [pdf, other

    cs.CV cs.LG

    One Explanation is Not Enough: Structured Attention Graphs for Image Classification

    Authors: Vivswan Shitole, Li Fuxin, Minsuk Kahng, Prasad Tadepalli, Alan Fern

    Abstract: Attention maps are a popular way of explaining the decisions of convolutional networks for image classification. Typically, for each image of interest, a single attention map is produced, which assigns weights to pixels based on their importance to the classification. A single attention map, however, provides an incomplete understanding since there are often many other maps that explain a classifi… ▽ More

    Submitted 7 November, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 26 pages, 25 figures

    Journal ref: NeuRIPS 2021

  15. arXiv:2004.15004  [pdf, other

    cs.HC cs.AI cs.CV cs.LG

    CNN Explainer: Learning Convolutional Neural Networks with Interactive Visualization

    Authors: Zijie J. Wang, Robert Turko, Omar Shaikh, Haekyu Park, Nilaksh Das, Fred Hohman, Minsuk Kahng, Duen Horng Chau

    Abstract: Deep learning's great success motivates many practitioners and students to learn about this exciting technology. However, it is often challenging for beginners to take their first step due to the complexity of understanding and applying deep learning. We present CNN Explainer, an interactive visualization tool designed for non-experts to learn and examine convolutional neural networks (CNNs), a fo… ▽ More

    Submitted 28 August, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: 11 pages, 14 figures, to be presented at IEEE VIS 2020. For a demo video, see https://youtu.be/HnWIHWFbuUQ . For a live demo, visit https://poloclub.github.io/cnn-explainer/

  16. arXiv:2001.02004  [pdf, other

    cs.HC cs.AI cs.LG

    CNN 101: Interactive Visual Learning for Convolutional Neural Networks

    Authors: Zijie J. Wang, Robert Turko, Omar Shaikh, Haekyu Park, Nilaksh Das, Fred Hohman, Minsuk Kahng, Duen Horng Chau

    Abstract: The success of deep learning solving previously-thought hard problems has inspired many non-experts to learn and understand this exciting technology. However, it is often challenging for learners to take the first steps due to the complexity of deep learning models. We present our ongoing work, CNN 101, an interactive visualization system for explaining and teaching convolutional neural networks.… ▽ More

    Submitted 27 February, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: CHI'20 Late-Breaking Work (April 25-30, 2020), 7 pages, 3 figures

  17. FairVis: Visual Analytics for Discovering Intersectional Bias in Machine Learning

    Authors: Ángel Alexander Cabrera, Will Epperson, Fred Hohman, Minsuk Kahng, Jamie Morgenstern, Duen Horng Chau

    Abstract: The growing capability and accessibility of machine learning has led to its application to many real-world domains and data about people. Despite the benefits algorithmic systems may bring, models can reflect, inject, or exacerbate implicit and explicit societal biases into their outputs, disadvantaging certain demographic subgroups. Discovering which biases a machine learning model has introduced… ▽ More

    Submitted 1 September, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted as a VAST conference paper to IEEE VIS'19

  18. arXiv:1809.01587  [pdf, other

    cs.HC cs.AI cs.LG stat.ML

    GAN Lab: Understanding Complex Deep Generative Models using Interactive Visual Experimentation

    Authors: Minsuk Kahng, Nikhil Thorat, Duen Horng Chau, Fernanda Viégas, Martin Wattenberg

    Abstract: Recent success in deep learning has generated immense interest among practitioners and students, inspiring many to learn about this new technology. While visual and interactive approaches have been successfully developed to help people more easily learn deep learning, most existing tools focus on simpler models. In this work, we present GAN Lab, the first interactive visualization tool designed fo… ▽ More

    Submitted 5 September, 2018; originally announced September 2018.

    Comments: This paper will be published in the IEEE Transactions on Visualization and Computer Graphics, 25(1), January 2019, and presented at IEEE VAST 2018

  19. arXiv:1801.06889  [pdf, other

    cs.HC cs.AI cs.LG stat.ML

    Visual Analytics in Deep Learning: An Interrogative Survey for the Next Frontiers

    Authors: Fred Hohman, Minsuk Kahng, Robert Pienta, Duen Horng Chau

    Abstract: Deep learning has recently seen rapid development and received significant attention due to its state-of-the-art performance on previously-thought hard problems. However, because of the internal complexity and nonlinear structure of deep neural networks, the underlying decision making processes for why these models are achieving such performance are challenging and sometimes mystifying to interpre… ▽ More

    Submitted 14 May, 2018; v1 submitted 21 January, 2018; originally announced January 2018.

    Comments: Under review for IEEE Transactions on Visualization and Computer Graphics (TVCG)

    ACM Class: H.5.2; I.5.1.d; I.6.9.c; I.6.9.f; I.2.6.g

  20. arXiv:1704.01942  [pdf, other

    cs.HC stat.ML

    ActiVis: Visual Exploration of Industry-Scale Deep Neural Network Models

    Authors: Minsuk Kahng, Pierre Y. Andrews, Aditya Kalro, Duen Horng Chau

    Abstract: While deep learning models have achieved state-of-the-art accuracies for many prediction tasks, understanding these models remains a challenge. Despite the recent interest in develo** visual tools to help users interpret deep learning models, the complexity and wide variety of models deployed in industry, and the large-scale datasets that they used, pose unique design challenges that are inadequ… ▽ More

    Submitted 8 August, 2017; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: Will be presented at IEEE VAST 2017 and published in IEEE Transactions on Visualization and Computer Graphics, 24(1)

  21. arXiv:1609.08535  [pdf, other

    cs.HC

    Chronodes: Interactive Multi-focus Exploration of Event Sequences

    Authors: Peter J Polack Jr, Shang-Tse Chen, Minsuk Kahng, Kaya de Barbaro, Moushumi Sharmin, Rahul Basole, Duen Horng Chau

    Abstract: The advent of mobile health technologies presents new challenges that existing visualizations, interactive tools, and algorithms are not yet designed to support. In dealing with uncertainty in sensor data and high-dimensional physiological records, we must seek to improve current tools that make sense of health data from traditional perspectives in event-based trend discovery. With Chronodes, a sy… ▽ More

    Submitted 27 September, 2016; originally announced September 2016.

  22. Interactive Browsing and Navigation in Relational Databases

    Authors: Minsuk Kahng, Shamkant B. Navathe, John T. Stasko, Duen Horng Chau

    Abstract: Although researchers have devoted considerable attention to hel** database users formulate queries, many users still find it challenging to specify queries that involve joining tables. To help users construct join queries for exploring relational databases, we propose ETable, a novel presentation data model that provides users with a presentation-level interactive view. This view compactly prese… ▽ More

    Submitted 20 August, 2016; v1 submitted 7 March, 2016; originally announced March 2016.

    Comments: VLDB 2016

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 9, No. 12, pp. 1017-1028, 2016

  23. M-Flash: Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model

    Authors: Hugo Gualdron, Robson Cordeiro, Jose Rodrigues-Jr, Duen Chau, Minsuk Kahng, U Kang

    Abstract: Recent graph computation approaches have demonstrated that a single PC can perform efficiently on billion-scale graphs. While these approaches achieve scalability by optimizing I/O operations, they do not fully exploit the capabilities of modern hard drives and processors. To overcome their performance, in this work, we introduce the Bimodal Block Processing (BBP), an innovation that is able to bo… ▽ More

    Submitted 14 September, 2016; v1 submitted 3 June, 2015; originally announced June 2015.

    Comments: Hugo Gualdron, Robson Cordeiro, Jose Rodrigues-Jr, Duen Chau, Minsuk Kahng, U Kang (2016) M-Flash: Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model, In: ECML-PKDD16, pages 623-640, LNCS, Springer

  24. arXiv:1505.06792  [pdf, other

    cs.IR

    Seeing the Forest through the Trees: Adaptive Local Exploration of Large Graphs

    Authors: Robert Pienta, Zhiyuan Lin, Minsuk Kahng, Jilles Vreeken, Partha P. Talukdar, James Abello, Ganesh Parameswaran, Duen Horng Chau

    Abstract: Visualization is a powerful paradigm for exploratory data analysis. Visualizing large graphs, however, often results in a meaningless hairball. In this paper, we propose a different approach that helps the user adaptively explore large million-node graphs from a local perspective. For nodes that the user investigates, we propose to only show the neighbors with the most subjectively interesting nei… ▽ More

    Submitted 21 July, 2016; v1 submitted 25 May, 2015; originally announced May 2015.