Skip to main content

Showing 1–26 of 26 results for author: Heer, J

.
  1. arXiv:2406.13853  [pdf, other

    cs.HC

    AltGeoViz: Facilitating Accessible Geovisualization

    Authors: Chu Li, Rock Yuren Pang, Ather Sharif, Arnavi Chheda-Kothary, Jeffrey Heer, Jon E. Froehlich

    Abstract: Geovisualizations are powerful tools for exploratory spatial analysis, enabling sighted users to discern patterns, trends, and relationships within geographic data. However, these visual tools have remained largely inaccessible to screen-reader users. We present AltGeoViz, a new system we designed to facilitate geovisualization exploration for these users. AltGeoViz dynamically generates alt-text… ▽ More

    Submitted 21 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  2. Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM

    Authors: Michelle S. Lam, Janice Teoh, James Landay, Jeffrey Heer, Michael S. Bernstein

    Abstract: Data analysts have long sought to turn unstructured text data into meaningful concepts. Though common, topic modeling and clustering focus on lower-level keywords and require significant interpretative work. We introduce concept induction, a computational process that instead produces high-level concepts, defined by explicit inclusion criteria, from unstructured text. For a dataset of toxic online… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: To appear at CHI 2024

  3. arXiv:2404.11602  [pdf, other

    cs.HC

    Interaction Techniques for Exploratory Data Visualization on Mobile Devices

    Authors: Luke S. Snyder, Ryan A. Rossi, Eunyee Koh, Jeffrey Heer, Jane Hoffswell

    Abstract: The ubiquity and on-the-go availability of mobile devices makes them central to many tasks such as interpersonal communication and media consumption. However, despite the potential of mobile devices for on-demand exploratory data visualization, existing mobile interactions are difficult, often using highly custom interactions, complex gestures, or multi-modal input. We synthesize limitations from… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 4 pages, 1 figure, 1 table, EuroVis 2024 Short Papers

  4. arXiv:2312.11681  [pdf, other

    cs.HC cs.AI cs.CL

    Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows

    Authors: Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer

    Abstract: LLM chains enable complex tasks by decomposing work into a sequence of subtasks. Similarly, the more established techniques of crowdsourcing workflows decompose complex tasks into smaller tasks for human crowdworkers. Chains address LLM errors analogously to the way crowdsourcing workflows address human error. To characterize opportunities for LLM chaining, we survey 107 papers across the crowdsou… ▽ More

    Submitted 6 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  5. arXiv:2310.17814  [pdf, other

    cs.HC

    DIVI: Dynamically Interactive Visualization

    Authors: Luke S. Snyder, Jeffrey Heer

    Abstract: Dynamically Interactive Visualization (DIVI) is a novel approach for orchestrating interactions within and across static visualizations. DIVI deconstructs Scalable Vector Graphics charts at runtime to infer content and coordinate user input, decoupling interaction from specification logic. This decoupling allows interactions to extend and compose freely across different tools, chart types, and ana… ▽ More

    Submitted 4 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 9 pages, 2 pages supplementary material, 10 figures, IEEE TVCG 2024 (Proc. VIS 2023)

  6. arXiv:2310.16262  [pdf, other

    cs.HC cs.AI cs.PL stat.CO

    rTisane: Externalizing conceptual models for data analysis increases engagement with domain knowledge and improves statistical model quality

    Authors: Eunice Jun, Edward Misback, Jeffrey Heer, René Just

    Abstract: Statistical models should accurately reflect analysts' domain knowledge about variables and their relationships. While recent tools let analysts express these assumptions and use them to produce a resulting statistical model, it remains unclear what analysts want to express and how externalization impacts statistical model quality. This paper addresses these gaps. We first conduct an exploratory s… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    ACM Class: H.5.2; D.2.2; H.1.2; D.3.2

  7. arXiv:2309.10108  [pdf, other

    cs.HC

    How Do Data Analysts Respond to AI Assistance? A Wizard-of-Oz Study

    Authors: Ken Gu, Madeleine Grunde-McLaughlin, Andrew M. McNutt, Jeffrey Heer, Tim Althoff

    Abstract: Data analysis is challenging as analysts must navigate nuanced decisions that may yield divergent conclusions. AI assistants have the potential to support analysts in planning their analyses, enabling more robust decision making. Though AI-based assistants that target code execution (e.g., Github Copilot) have received significant attention, limited research addresses assistance for both analysis… ▽ More

    Submitted 4 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted to CHI 2024

  8. arXiv:2308.14241  [pdf, other

    cs.HC

    Too Many Cooks: Exploring How Graphical Perception Studies Influence Visualization Recommendations in Draco

    Authors: Zehua Zeng, Junran Yang, Dominik Moritz, Jeffrey Heer, Leilani Battle

    Abstract: Findings from graphical perception can guide visualization recommendation algorithms in identifying effective visualization designs. However, existing algorithms use knowledge from, at best, a few studies, limiting our understanding of how complementary (or contradictory) graphical perception results influence generated recommendations. In this paper, we present a pipeline of applying a large body… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

  9. arXiv:2308.13024  [pdf, other

    cs.HC

    EVM: Incorporating Model Checking into Exploratory Visual Analysis

    Authors: Alex Kale, Ziyang Guo, Xiao Li Qiao, Jeffrey Heer, Jessica Hullman

    Abstract: Visual analytics (VA) tools support data exploration by hel** analysts quickly and iteratively generate views of data which reveal interesting patterns. However, these tools seldom enable explicit checks of the resulting interpretations of data -- e.g., whether patterns can be accounted for by a model that implies a particular structure in the relationships between variables. We present EVM, a d… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  10. arXiv:2305.08323  [pdf, other

    cs.HC

    Approximation and Progressive Display of Multiverse Analyses

    Authors: Yang Liu, Tim Althoff, Jeffrey Heer

    Abstract: A multiverse analysis evaluates all combinations of "reasonable" analytic decisions to promote robustness and transparency, but can lead to a combinatorial explosion of analyses to compute. Long delays before assessing results prevent users from diagnosing errors and iterating early. We contribute (1) approximation algorithms for estimating multiverse sensitivity and (2) monitoring visualizations… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

  11. arXiv:2303.06777  [pdf

    cs.PL cs.HC cs.SE

    Live, Rich, and Composable: Qualities for Programming Beyond Static Text

    Authors: Joshua Horowitz, Jeffrey Heer

    Abstract: Efforts to push programming beyond static textual code have sought to imbue programming with multiple distinct qualities. One long-acknowledged quality is liveness: providing programmers with in-depth feedback about a program's dynamic behavior as the program is edited. A second quality, long-explored but lacking a shared term of art, is richness: allowing programmers to edit programs though domai… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

    Comments: To appear in the proceedings of PLATEAU 2023

  12. ScatterShot: Interactive In-context Example Curation for Text Transformation

    Authors: Tongshuang Wu, Hua Shen, Daniel S. Weld, Jeffrey Heer, Marco Tulio Ribeiro

    Abstract: The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when "enough" examples have been included even for kno… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: IUI 2023: 28th International Conference on Intelligent User Interfaces

  13. arXiv:2301.03109  [pdf, other

    cs.HC

    Cinematic Techniques in Narrative Visualization

    Authors: Matthew Conlen, Jeffrey Heer, Hillary Mushkin, Scott Davidoff

    Abstract: The many genres of narrative visualization (e.g. data comics, data videos) each offer a unique set of affordances and constraints. To better understand a genre that we call cinematic visualizations-3D visualizations that make highly deliberate use of a camera to convey a narrative-we gathered 50 examples and analyzed their traditional cinematic aspects to identify the benefits and limitations of t… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

    Comments: 10 pages, 7 figures

  14. Turán numbers $T(n,5,3)$ and graphs without induced $5$-cycles

    Authors: Iliya Bluskov, Jan de Heer, Alexander Sidorenko

    Abstract: Turán number $T(n,5,3)$ is the minimum size of a system of triples out of a base set $X$ of $n$ elements such that every quintuple in $X$ contains a triple from the system. The exact values of $T(n,5,3)$ are known for $n \leq 17$. Turán conjectured that $T(2m,5,3) = 2\binom{m}{3}$, and no counterexamples have been found so far. If this conjecture is true, then… ▽ More

    Submitted 17 January, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    MSC Class: 05B07; 05C15; 05C65

    Journal ref: Journal of Graph Theory, vol. 104, no. 3, 2023, pp. 451-460

  15. arXiv:2205.09858  [pdf, other

    cs.HC

    Fidyll: A Compiler for Cross-Format Data Stories & Explorable Explanations

    Authors: Matthew Conlen, Jeffrey Heer

    Abstract: Narrative visualization is a powerful communicative tool that can take on various formats such as interactive articles, slideshows, and data videos. These formats each have their strengths and weaknesses, but existing authoring tools only support one output target. We conducted a series of formative interviews with seven domain experts to understand needs and practices around cross-format data sto… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 10 pages, 6 figures, for associated examples see https://idyll-lang.github.io/fidyll-examples/

  16. arXiv:2201.02705  [pdf, other

    cs.AI cs.HC cs.PL stat.CO stat.OT

    Tisane: Authoring Statistical Models via Formal Reasoning from Conceptual and Data Relationships

    Authors: Eunice Jun, Audrey Seo, Jeffrey Heer, René Just

    Abstract: Proper statistical modeling incorporates domain theory about how concepts relate and details of how data were measured. However, data analysts currently lack tool support for recording and reasoning about domain assumptions, data collection, and modeling choices in an integrated manner, leading to mistakes that can compromise scientific validity. For instance, generalized linear mixed-effects mode… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

  17. arXiv:2108.04385  [pdf, other

    cs.HC

    Gemini2: Generating Keyframe-Oriented Animated Transitions Between Statistical Graphics

    Authors: Younghoon Kim, Jeffrey Heer

    Abstract: Complex animated transitions may be easier to understand when divided into separate, consecutive stages. However, effective staging requires careful attention to both animation semantics and timing parameters. We present Gemini2, a system for creating staged animations from a sequence of chart keyframes. Given only a start state and an end state, Gemini2 can automatically recommend intermediate ke… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  18. arXiv:2104.02712  [pdf, other

    cs.OH cs.HC cs.SE

    Hypothesis Formalization: Empirical Findings, Software Limitations, and Design Implications

    Authors: Eunice Jun, Melissa Birchfield, Nicole de Moura, Jeffrey Heer, Rene Just

    Abstract: Data analysis requires translating higher level questions and hypotheses into computable statistical models. We present a mixed-methods study aimed at identifying the steps, considerations, and challenges involved in operationalizing hypotheses into statistical models, a process we refer to as hypothesis formalization. In a formative content analysis of research papers, we find that researchers hi… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  19. arXiv:2101.00288  [pdf, other

    cs.CL

    Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

    Authors: Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer, Daniel S. Weld

    Abstract: While counterfactual examples are useful for analysis and training of NLP models, current generation methods either rely on manual labor to create very few counterfactuals, or only instantiate limited types of perturbations such as paraphrases or word substitutions. We present Polyjuice, a general-purpose counterfactual generator that allows for control over perturbation types and locations, train… ▽ More

    Submitted 1 June, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: ACL 2021, main conference, long paper

  20. arXiv:2009.01429  [pdf, other

    cs.HC

    Gemini: A Grammar and Recommender System for AnimatedTransitions in Statistical Graphics

    Authors: Younghoon Kim, Jeffrey Heer

    Abstract: Animated transitions help viewers follow changes between related visualizations. Specifying effective animations demands significant effort: authors must select the elements and properties to animate, provide transition parameters, and coordinate the timing of stages. To facilitate this process, we present Gemini, a declarative grammar and recommendation system for animated transitions between sin… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

  21. arXiv:2008.12828  [pdf, other

    cs.LG cs.DL stat.ML

    CORAL: COde RepresentAtion Learning with Weakly-Supervised Transformers for Analyzing Data Analysis

    Authors: Ge Zhang, Mike A. Merrill, Yang Liu, Jeffrey Heer, Tim Althoff

    Abstract: Large scale analysis of source code, and in particular scientific source code, holds the promise of better understanding the data science process, identifying analytical best practices, and providing insights to the builders of scientific toolkits. However, large corpora have remained unanalyzed in depth, as descriptive labels are absent and require expert domain knowledge to generate. We propose… ▽ More

    Submitted 28 August, 2020; originally announced August 2020.

  22. Boba: Authoring and Visualizing Multiverse Analyses

    Authors: Yang Liu, Alex Kale, Tim Althoff, Jeffrey Heer

    Abstract: Multiverse analysis is an approach to data analysis in which all "reasonable" analytic decisions are evaluated in parallel and interpreted collectively, in order to foster robustness and transparency. However, specifying a multiverse is demanding because analysts must manage myriad variants from a cross-product of analytic decisions, and the results require nuanced interpretation. We contribute Bo… ▽ More

    Submitted 30 July, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: submitted to IEEE Transactions on Visualization and Computer Graphics (Proc. VAST)

  23. arXiv:1911.00568  [pdf, other

    cs.HC

    Goals, Process, and Challenges of Exploratory Data Analysis: An Interview Study

    Authors: Kanit Wongsuphasawat, Yang Liu, Jeffrey Heer

    Abstract: How do analysis goals and context affect exploratory data analysis (EDA)? To investigate this question, we conducted semi-structured interviews with 18 data analysts. We characterize common exploration goals: profiling (assessing data quality) and discovery (gaining new insights). Though the EDA literature primarily emphasizes discovery, we observe that discovery only reliably occurs in the contex… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

  24. Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis

    Authors: Yang Liu, Tim Althoff, Jeffrey Heer

    Abstract: Drawing reliable inferences from data involves many, sometimes arbitrary, decisions across phases of data collection, wrangling, and modeling. As different choices can lead to diverging conclusions, understanding how researchers make analytic decisions is important for supporting robust and replicable analysis. In this study, we pore over nine published research studies and conduct semi-structured… ▽ More

    Submitted 8 January, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  25. Critical Reflections on Visualization Authoring Systems

    Authors: Arvind Satyanarayan, Bongshin Lee, Donghao Ren, Jeffrey Heer, John Stasko, John Thompson, Matthew Brehmer, Zhicheng Liu

    Abstract: An emerging generation of visualization authoring systems support expressive information visualization without textual programming. As they vary in their visualization models, system architectures, and user interfaces, it is challenging to directly compare these systems using traditional evaluative methods. Recognizing the value of contextualizing our decisions in the broader design space, we pres… ▽ More

    Submitted 31 July, 2019; originally announced July 2019.

  26. arXiv:1807.06641  [pdf, other

    cs.HC

    Beyond Heuristics: Learning Visualization Design

    Authors: Bahador Saket, Dominik Moritz, Halden Lin, Victor Dibia, Cagatay Demiralp, Jeffrey Heer

    Abstract: In this paper, we describe a research agenda for deriving design principles directly from data. We argue that it is time to go beyond manually curated and applied visualization design guidelines. We propose learning models of visualization design from data collected using graphical perception studies and build tools powered by the learned models. To achieve this vision, we need to 1) develop scala… ▽ More

    Submitted 15 August, 2018; v1 submitted 17 July, 2018; originally announced July 2018.