Skip to main content

Showing 1–50 of 55 results for author: Bigham, J

.
  1. arXiv:2406.09264  [pdf, other

    cs.HC cs.AI cs.CL

    Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions

    Authors: Hua Shen, Tiffany Knearem, Reshmi Ghosh, Kenan Alkiek, Kundan Krishna, Yachuan Liu, Ziqiao Ma, Savvas Petridis, Yi-Hao Peng, Li Qiwei, Sushrita Rakshit, Chenglei Si, Yutong Xie, Jeffrey P. Bigham, Frank Bentley, Joyce Chai, Zachary Lipton, Qiaozhu Mei, Rada Mihalcea, Michael Terry, Diyi Yang, Meredith Ringel Morris, Paul Resnick, David Jurgens

    Abstract: Recent advancements in general-purpose AI have highlighted the importance of guiding AI systems towards the intended goals, ethical principles, and values of individuals and groups, a concept broadly recognized as alignment. However, the lack of clarified definitions and scopes of human-AI alignment poses a significant obstacle, hampering collaborative efforts across research domains to achieve th… ▽ More

    Submitted 17 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 56 pages

  2. arXiv:2406.07739  [pdf, other

    cs.CL cs.HC cs.SE

    UICoder: Finetuning Large Language Models to Generate User Interface Code through Automated Feedback

    Authors: Jason Wu, Eldon Schoop, Alan Leung, Titus Barik, Jeffrey P. Bigham, Jeffrey Nichols

    Abstract: Large language models (LLMs) struggle to consistently generate UI code that compiles and produces visually relevant designs. Existing approaches to improve generation rely on expensive human feedback or distilling a proprietary model. In this paper, we explore the use of automated feedback (compilers and multi-modal models) to guide LLMs to generate high-quality UI code. Our method starts with an… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to NAACL 2024

  3. "This really lets us see the entire world:" Designing a conversational telepresence robot for homebound older adults

    Authors: Yaxin Hu, Laura Stegner, Yasmine Kotturi, Caroline Zhang, Yi-Hao Peng, Faria Huq, Yuhang Zhao, Jeffrey P. Bigham, Bilge Mutlu

    Abstract: In this paper, we explore the design and use of conversational telepresence robots to help homebound older adults interact with the external world. An initial needfinding study (N=8) using video vignettes revealed older adults' experiential needs for robot-mediated remote experiences such as exploration, reminiscence and social participation. We then designed a prototype system to support these go… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: In proceedings of ACM Designing Interactive Systems (DIS) 2024

    MSC Class: 68-06

  4. arXiv:2404.12500  [pdf, other

    cs.HC cs.CL cs.CV

    UIClip: A Data-driven Model for Assessing User Interface Design

    Authors: Jason Wu, Yi-Hao Peng, Amanda Li, Amanda Swearngin, Jeffrey P. Bigham, Jeffrey Nichols

    Abstract: User interface (UI) design is a difficult yet important task for ensuring the usability, accessibility, and aesthetic qualities of applications. In our paper, we develop a machine-learned model, UIClip, for assessing the design quality and visual relevance of a UI given its screenshot and natural language description. To train UIClip, we used a combination of automated crawling, synthetic augmenta… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  5. arXiv:2404.03085  [pdf, other

    cs.HC cs.AI cs.LG

    Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference

    Authors: Fred Hohman, Chaoqun Wang, **mook Lee, Jochen Görtler, Dominik Moritz, Jeffrey P Bigham, Zhile Ren, Cecile Foret, Qi Shan, Xiaoyi Zhang

    Abstract: On-device machine learning (ML) moves computation from the cloud to personal devices, protecting user privacy and enabling intelligent user experiences. However, fitting models on devices with limited resources presents a major technical challenge: practitioners need to optimize models and balance hardware metrics such as model size, latency, and power. To help practitioners create efficient ML mo… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Proceedings of the 2024 ACM CHI Conference on Human Factors in Computing Systems

  6. Deconstructing the Veneer of Simplicity: Co-Designing Introductory Generative AI Workshops with Local Entrepreneurs

    Authors: Yasmine Kotturi, Angel Anderson, Glenn Ford, Michael Skirpan, Jeffrey P. Bigham

    Abstract: Generative AI platforms and features are permeating many aspects of work. Entrepreneurs from lean economies in particular are well positioned to outsource tasks to generative AI given limited resources. In this paper, we work to address a growing disparity in use of these technologies by building on a four-year partnership with a local entrepreneurial hub dedicated to equity in tech and entreprene… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  7. arXiv:2402.12566  [pdf, other

    cs.CL cs.LG

    GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence

    Authors: Kundan Krishna, Sanjana Ramprasad, Prakhar Gupta, Byron C. Wallace, Zachary C. Lipton, Jeffrey P. Bigham

    Abstract: LLMs can generate factually incorrect statements even when provided access to reference documents. Such errors can be dangerous in high-stakes applications (e.g., document-grounded QA for healthcare or finance). We present GenAudit -- a tool intended to assist fact-checking LLM responses for document-grounded tasks. GenAudit suggests edits to the LLM response by revising or removing claims that ar… ▽ More

    Submitted 16 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Code and models available at https://genaudit.org

  8. arXiv:2312.06147  [pdf, other

    cs.CL cs.IR

    "What's important here?": Opportunities and Challenges of Using LLMs in Retrieving Information from Web Interfaces

    Authors: Faria Huq, Jeffrey P. Bigham, Nikolas Martelaro

    Abstract: Large language models (LLMs) that have been trained on a corpus that includes large amount of code exhibit a remarkable ability to understand HTML code. As web interfaces are primarily constructed using HTML, we design an in-depth study to see how LLMs can be used to retrieve and locate important elements for a user given query (i.e. task description) in a web interface. In contrast with prior wor… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to NeurIPS 2023 R0-FoMo Workshop

  9. arXiv:2310.00091  [pdf, other

    cs.HC cs.SE

    Towards Automated Accessibility Report Generation for Mobile Apps

    Authors: Amanda Swearngin, Jason Wu, Xiaoyi Zhang, Esteban Gomez, Jen Coughenour, Rachel Stukenborg, Bhavya Garg, Greg Hughes, Adriana Hilliard, Jeffrey P. Bigham, Jeffrey Nichols

    Abstract: Many apps have basic accessibility issues, like missing labels or low contrast. Automated tools can help app developers catch basic issues, but can be laborious or require writing dedicated tests. We propose a system, motivated by a collaborative process with accessibility stakeholders at a large technology company, to generate whole app accessibility reports by combining varied data collection me… ▽ More

    Submitted 16 October, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 24 pages, 8 figures

  10. arXiv:2308.08726  [pdf, other

    cs.HC

    Never-ending Learning of User Interfaces

    Authors: Jason Wu, Rebecca Krosnick, Eldon Schoop, Amanda Swearngin, Jeffrey P. Bigham, Jeffrey Nichols

    Abstract: Machine learning models have been trained to predict semantic information about user interfaces (UIs) to make apps more accessible, easier to test, and to automate. Currently, most models rely on datasets that are collected and labeled by human crowd-workers, a process that is costly and surprisingly error-prone for certain tasks. For example, it is possible to guess if a UI element is "tappable"… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  11. arXiv:2306.05446  [pdf, other

    eess.AS cs.AI cs.CL cs.LG

    Latent Phrase Matching for Dysarthric Speech

    Authors: Colin Lea, Dianna Yee, Jaya Narain, Zifang Huang, Lauren Tooley, Jeffrey P. Bigham, Leah Findlater

    Abstract: Many consumer speech recognition systems are not tuned for people with speech disabilities, resulting in poor recognition and user experience, especially for severe speech differences. Recent studies have emphasized interest in personalized speech models from people with atypical speech patterns. We propose a query-by-example-based personalized phrase recognition system that is trained using small… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  12. arXiv:2305.14296  [pdf, other

    cs.CL cs.LG

    USB: A Unified Summarization Benchmark Across Tasks and Domains

    Authors: Kundan Krishna, Prakhar Gupta, Sanjana Ramprasad, Byron C. Wallace, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: While the NLP community has produced numerous summarization benchmarks, none provide the rich annotations required to simultaneously address many important problems related to control and reliability. We introduce a Wikipedia-derived benchmark, complemented by a rich set of crowd-sourced annotations, that supports $8$ interrelated tasks: (i) extractive summarization; (ii) abstractive summarization… ▽ More

    Submitted 4 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP Findings 2023 Camera Ready

  13. arXiv:2302.09044  [pdf, other

    cs.HC

    From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition

    Authors: Colin Lea, Zifang Huang, Lauren Tooley, Jaya Narain, Dianna Yee, Panayiotis Georgiou, Tien Dung Tran, Jeffrey P. Bigham, Leah Findlater

    Abstract: Consumer speech recognition systems do not work as well for many people with speech diferences, such as stuttering, relative to the rest of the general population. However, what is not clear is the degree to which these systems do not work, how they can be improved, or how much people want to use them. In this paper, we frst address these questions using results from a 61-person survey from people… ▽ More

    Submitted 27 February, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: CHI 2023

  14. arXiv:2301.13280  [pdf, other

    cs.HC

    WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics

    Authors: Jason Wu, Siyan Wang, Siman Shen, Yi-Hao Peng, Jeffrey Nichols, Jeffrey P. Bigham

    Abstract: Modeling user interfaces (UIs) from visual information allows systems to make inferences about the functionality and semantics needed to support use cases in accessibility, app automation, and testing. Current datasets for training machine learning models are limited in size due to the costly and time-consuming process of manually collecting and annotating UIs. We crawled the web to construct WebU… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to CHI 2023. Dataset, code, and models release coming soon

  15. arXiv:2301.08372  [pdf, other

    cs.HC

    Screen Correspondence: Map** Interchangeable Elements between UIs

    Authors: Jason Wu, Amanda Swearngin, Xiaoyi Zhang, Jeffrey Nichols, Jeffrey P. Bigham

    Abstract: Understanding user interface (UI) functionality is a useful yet challenging task for both machines and people. In this paper, we investigate a machine learning approach for screen correspondence, which allows reasoning about UIs by map** their elements onto previously encountered examples with known functionality and properties. We describe and implement a model that incorporates element semanti… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  16. arXiv:2209.14389  [pdf, other

    cs.CL cs.LG

    Downstream Datasets Make Surprisingly Good Pretraining Corpora

    Authors: Kundan Krishna, Saurabh Garg, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: For most natural language processing tasks, the dominant practice is to finetune large pretrained transformer models (e.g., BERT) using smaller downstream datasets. Despite the success of this approach, it remains unclear to what extent these gains are attributable to the massive background corpora employed for pretraining versus to the pretraining objectives themselves. This paper introduces a la… ▽ More

    Submitted 26 May, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: ACL2023 Camera Ready

  17. arXiv:2207.12551  [pdf, other

    cs.CL

    DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit

    Authors: Jessica Huynh, Ting-Rui Chiang, Jeffrey Bigham, Maxine Eskenazi

    Abstract: Dialog system developers need high-quality data to train, fine-tune and assess their systems. They often use crowdsourcing for this since it provides large quantities of data from many workers. However, the data may not be of sufficiently good quality. This can be due to the way that the requester presents a task and how they interact with the workers. This paper introduces DialCrowd 2.0 to help r… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Published at LREC 2022

  18. arXiv:2207.07712  [pdf, other

    cs.HC

    Reflow: Automatically Improving Touch Interactions in Mobile Applications through Pixel-based Refinements

    Authors: Jason Wu, Titus Barik, Xiaoyi Zhang, Colin Lea, Jeffrey Nichols, Jeffrey P. Bigham

    Abstract: Touch is the primary way that users interact with smartphones. However, building mobile user interfaces where touch interactions work well for all users is a difficult problem, because users have different abilities and preferences. We propose a system, Reflow, which automatically applies small, personalized UI adaptations, called refinements -- to mobile app screens to improve touch efficiency. R… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  19. arXiv:2205.12673  [pdf, other

    cs.CL

    InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning

    Authors: Prakhar Gupta, Cathy Jiao, Yi-Ting Yeh, Shikib Mehri, Maxine Eskenazi, Jeffrey P. Bigham

    Abstract: Instruction tuning is an emergent paradigm in NLP wherein natural language instructions are leveraged with language models to induce zero-shot performance on unseen tasks. Instructions have been shown to enable good performance on unseen tasks and datasets in both large and small language models. Dialogue is an especially interesting area to explore instruction tuning because dialogue systems perf… ▽ More

    Submitted 26 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: EMNLP 2022

  20. arXiv:2205.09314  [pdf, other

    cs.CL

    Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation

    Authors: Prakhar Gupta, Harsh Jhamtani, Jeffrey P. Bigham

    Abstract: Target-guided response generation enables dialogue systems to smoothly transition a conversation from a dialogue context toward a target sentence. Such control is useful for designing dialogue systems that direct a conversation toward specific goals, such as creating non-obtrusive recommendations or introducing new topics in the conversation. In this paper, we introduce a new technique for target-… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted at NAACL 2022 (Findings)

  21. arXiv:2202.07750  [pdf, other

    eess.AS cs.CL cs.SD

    Nonverbal Sound Detection for Disordered Speech

    Authors: Colin Lea, Zifang Huang, Dhruv Jain, Lauren Tooley, Zeinab Liaghat, Shrinath Thelapurath, Leah Findlater, Jeffrey P. Bigham

    Abstract: Voice assistants have become an essential tool for people with various disabilities because they enable complex phone- or tablet-based interactions without the need for fine-grained motor control, such as with touchscreens. However, these systems are not tuned for the unique characteristics of individuals with speech disorders, including many of those who have a motor-speech disorder, are deaf or… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: Accepted at ICASSP 2022

  22. arXiv:2112.09544  [pdf

    cs.CY

    It's Time to Do Something: Mitigating the Negative Impacts of Computing Through a Change to the Peer Review Process

    Authors: Brent Hecht, Lauren Wilcox, Jeffrey P. Bigham, Johannes Schöning, Ehsan Hoque, Jason Ernst, Yonatan Bisk, Luigi De Russis, Lana Yarosh, Bushra Anjum, Danish Contractor, Cathy Wu

    Abstract: The computing research community needs to work much harder to address the downsides of our innovations. Between the erosion of privacy, threats to democracy, and automation's effect on employment (among many other issues), we can no longer simply assume that our research will have a net positive impact on the world. While bending the arc of computing innovation towards societal benefit may at firs… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: First published on the ACM Future of Computing Academy blog on March 29, 2018. This is the archival version

  23. arXiv:2111.05241  [pdf, other

    cs.CL

    A Survey of NLP-Related Crowdsourcing HITs: what works and what does not

    Authors: Jessica Huynh, Jeffrey Bigham, Maxine Eskenazi

    Abstract: Crowdsourcing requesters on Amazon Mechanical Turk (AMT) have raised questions about the reliability of the workers. The AMT workforce is very diverse and it is not possible to make blanket assumptions about them as a group. Some requesters now reject work en mass when they do not get the results they expect. This has the effect of giving each worker (good or bad) a lower Human Intelligence Task (… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

  24. Screen Parsing: Towards Reverse Engineering of UI Models from Screenshots

    Authors: Jason Wu, Xiaoyi Zhang, Jeff Nichols, Jeffrey P. Bigham

    Abstract: Automated understanding of user interfaces (UIs) from their pixels can improve accessibility, enable task automation, and facilitate interface design without relying on developers to comprehensively provide metadata. A first step is to infer what UI elements exist on a screen, but current approaches are limited in how they infer how those elements are semantically grouped into structured interface… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

  25. arXiv:2109.04953  [pdf, other

    cs.CL cs.LG

    Does Pretraining for Summarization Require Knowledge Transfer?

    Authors: Kundan Krishna, Jeffrey Bigham, Zachary C. Lipton

    Abstract: Pretraining techniques leveraging enormous datasets have driven recent advances in text summarization. While folk explanations suggest that knowledge transfer accounts for pretraining's benefits, little is known about why it works or what makes a pretraining task or dataset suitable. In this paper, we challenge the knowledge transfer story, showing that pretraining on documents consisting of chara… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: Camera-ready for Findings of EMNLP 2021

  26. arXiv:2106.11759  [pdf, other

    eess.AS cs.AI cs.CL cs.CV cs.LG cs.SD

    Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

    Authors: Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

    Abstract: Dysfluencies and variations in speech pronunciation can severely degrade speech recognition performance, and for many individuals with moderate-to-severe speech disorders, voice operated systems do not work. Current speech recognition systems are trained primarily with data from fluent speakers and as a consequence do not generalize well to speech with dysfluencies such as sound or word repetition… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: 5 pages, 1 page reference, 2 figures

  27. arXiv:2106.05894  [pdf, other

    cs.CL

    Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation

    Authors: Prakhar Gupta, Yulia Tsvetkov, Jeffrey P. Bigham

    Abstract: Open-domain neural dialogue models have achieved high performance in response ranking and evaluation tasks. These tasks are formulated as a binary classification of responses given in a dialogue context, and models generally learn to make predictions based on context-response content similarity. However, over-reliance on content similarity makes the models less sensitive to the presence of inconsi… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted to Findings of ACL 2021

  28. When Can Accessibility Help?: An Exploration of Accessibility Feature Recommendation on Mobile Devices

    Authors: Jason Wu, Gabriel Reyes, Sam C. White, Xiaoyi Zhang, Jeffrey P. Bigham

    Abstract: Numerous accessibility features have been developed and included in consumer operating systems to provide people with a variety of disabilities additional ways to access computing devices. Unfortunately, many users, especially older adults who are more likely to experience ability changes, are not aware of these features or do not know which combination to use. In this paper, we first quantify thi… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: Accepted to Web4All 2021 (W4A '21)

  29. arXiv:2103.14491  [pdf

    cs.HC

    Say It All: Feedback for Improving Non-Visual Presentation Accessibility

    Authors: Yi-Hao Peng, JiWoong Jang, Jeffrey P. Bigham, Amy Pavel

    Abstract: Presenters commonly use slides as visual aids for informative talks. When presenters fail to verbally describe the content on their slides, blind and visually impaired audience members lose access to necessary content, making the presentation difficult to follow. Our analysis of 90 presentation videos revealed that 72% of 610 visual elements (e.g., images, text) were insufficiently described. To h… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

  30. arXiv:2102.12394  [pdf, other

    eess.AS cs.SD

    SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

    Authors: Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham

    Abstract: The ability to automatically detect stuttering events in speech could help speech pathologists track an individual's fluency over time or help improve speech recognition systems for people with atypical speech patterns. Despite increasing interest in this area, existing public datasets are too small to build generalizable dysfluency detection systems and lack sufficient annotations. In this work,… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021

  31. arXiv:2101.04893  [pdf, other

    cs.HC

    Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels

    Authors: Xiaoyi Zhang, Lilian de Greef, Amanda Swearngin, Samuel White, Kyle Murray, Lisa Yu, Qi Shan, Jeffrey Nichols, Jason Wu, Chris Fleizach, Aaron Everitt, Jeffrey P. Bigham

    Abstract: Many accessibility features available on mobile platforms require applications (apps) to provide complete and accurate metadata describing user interface (UI) components. Unfortunately, many apps do not provide sufficient metadata for accessibility features to work as expected. In this paper, we explore inferring accessibility metadata for mobile apps from their pixels, as the visual interfaces of… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  32. arXiv:2012.15211  [pdf, other

    cs.HC

    The Challenges of Crowd Workers in Rural and Urban America

    Authors: Claudia Flores-Saviaga, Yuwen Li, Benjamin V. Hanrahan, Jeffrey Bigham, Saiph Savage

    Abstract: Crowd work has the potential of hel** the financial recovery of regions traditionally plagued by a lack of economic opportunities, e.g., rural areas. However, we currently have limited information about the challenges facing crowd work-ers from rural and super rural areas as they struggle to make a living through crowd work sites. This paper examines the challenges and advantages of rural and su… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

    Journal ref: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing 2020

  33. arXiv:2012.02748  [pdf, other

    cs.LG cs.CY cs.HC

    Challenging common interpretability assumptions in feature attribution explanations

    Authors: Jonathan Dinu, Jeffrey Bigham, J. Zico Kolter

    Abstract: As machine learning and algorithmic decision making systems are increasingly being leveraged in high-stakes human-in-the-loop settings, there is a pressing need to understand the rationale of their predictions. Researchers have responded to this need with explainable AI (XAI), but often proclaim interpretability axiomatically without evaluation. When these systems are evaluated, they are often tes… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: Presented at the NeurIPS 2020 ML-Retrospectives, Surveys & Meta-Analyses Workshop

    ACM Class: J.4; I.5.1; K.4

  34. Making Mobile Augmented Reality Applications Accessible

    Authors: Jaylin Herskovitz, Jason Wu, Samuel White, Amy Pavel, Gabriel Reyes, Anhong Guo, Jeffrey P. Bigham

    Abstract: Augmented Reality (AR) technology creates new immersive experiences in entertainment, games, education, retail, and social media. AR content is often primarily visual and it is challenging to enable access to it non-visually due to the mix of virtual and real-world content. In this paper, we identify common constituent tasks in AR by analyzing existing mobile AR applications for iOS, and character… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: 14 pages. 6 figures. Published in The 22nd International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '20)

  35. arXiv:2010.03667  [pdf

    cs.HC

    Rescribe: Authoring and Automatically Editing Audio Descriptions

    Authors: Amy Pavel, Gabriel Reyes, Jeffrey P. Bigham

    Abstract: Audio descriptions make videos accessible to those who cannot see them by describing visual content in audio. Producing audio descriptions is challenging due to the synchronous nature of the audio description that must fit into gaps of other video content. An experienced audio description author will produce content that fits narration necessary to understand, enjoy, or experience the video conten… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  36. arXiv:2008.09075  [pdf

    cs.CL cs.AI

    Controlling Dialogue Generation with Semantic Exemplars

    Authors: Prakhar Gupta, Jeffrey P. Bigham, Yulia Tsvetkov, Amy Pavel

    Abstract: Dialogue systems pretrained with large language models generate locally coherent responses, but lack the fine-grained control over responses necessary to achieve specific goals. A promising method to control response generation is exemplar-based generation, in which models edit exemplar responses that are retrieved from training data, or hand-written to strategically address discourse-level goals,… ▽ More

    Submitted 25 March, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: Accepted at NAACL 2021

  37. arXiv:2007.07151  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Extracting Structured Data from Physician-Patient Conversations By Predicting Noteworthy Utterances

    Authors: Kundan Krishna, Amy Pavel, Benjamin Schloss, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: Despite diverse efforts to mine various modalities of medical data, the conversations between physicians and patients at the time of care remain an untapped source of insights. In this paper, we leverage this data to extract structured information that might assist physicians with post-visit documentation in electronic health records, potentially lightening the clerical burden. In this exploratory… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  38. Becoming the Super Turker: Increasing Wages via a Strategy from High Earning Workers

    Authors: Saiph Savage, Chun-Wei Chiang, Susumu Saito, Carlos Toxtli, Jeffrey Bigham

    Abstract: Crowd markets have traditionally limited workers by not providing transparency information concerning which tasks pay fairly or which requesters are unreliable. Researchers believe that a key reason why crowd workers earn low wages is due to this lack of transparency. As a result, tools have been developed to provide more transparency within crowd markets to help workers. However, while most worke… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: 12.pages, 8 figures, The Web Conference 202, ACM WWW 2020

    ACM Class: I.2.7

  39. arXiv:2005.01795  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques

    Authors: Kundan Krishna, Sopan Khosla, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: Following each patient visit, physicians draft long semi-structured clinical summaries called SOAP notes. While invaluable to clinicians and researchers, creating digital SOAP notes is burdensome, contributing to physician burnout. In this paper, we introduce the first complete pipelines to leverage deep summarization models to generate these notes based on transcripts of conversations between phy… ▽ More

    Submitted 2 June, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Published at ACL 2021 Main Conference

  40. InstructableCrowd: Creating IF-THEN Rules for Smartphones via Conversations with the Crowd

    Authors: Ting-Hao 'Kenneth' Huang, Amos Azaria, Oscar J. Romero, Jeffrey P. Bigham

    Abstract: Natural language interfaces have become a common part of modern digital life. Chatbots utilize text-based conversations to communicate with users; personal assistants on smartphones such as Google Assistant take direct speech commands from their users; and speech-controlled devices such as Amazon Echo use voice as their only input mode. In this paper, we introduce InstructableCrowd, a crowd-powere… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: Published at Human Computation (2019) 6:1:113-146

    Journal ref: Human Computation (2019) 6:1:113-146

  41. StateLens: A Reverse Engineering Solution for Making Existing Dynamic Touchscreens Accessible

    Authors: Anhong Guo, Junhan Kong, Michael Rivera, Frank F. Xu, Jeffrey P. Bigham

    Abstract: Blind people frequently encounter inaccessible dynamic touchscreens in their everyday lives that are difficult, frustrating, and often impossible to use independently. Touchscreens are often the only way to control everything from coffee machines and payment terminals, to subway ticket machines and in-flight entertainment systems. Interacting with dynamic touchscreens is difficult non-visually bec… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: ACM UIST 2019

  42. arXiv:1907.10568  [pdf, other

    cs.CL

    Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple References

    Authors: Prakhar Gupta, Shikib Mehri, Tiancheng Zhao, Amy Pavel, Maxine Eskenazi, Jeffrey P. Bigham

    Abstract: The aim of this paper is to mitigate the shortcomings of automatic evaluation of open-domain dialog systems through multi-reference evaluation. Existing metrics have been shown to correlate poorly with human judgement, particularly in open-domain dialog. One alternative is to collect human annotations for evaluation, which can be expensive and time consuming. To demonstrate the effectiveness of mu… ▽ More

    Submitted 8 September, 2019; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: SIGDIAL 2019

  43. Predicting risk of dyslexia with an online gamified test

    Authors: Luz Rello, Ricardo Baeza-Yates, Abdullah Ali, Jeffrey P. Bigham, Miquel Serra

    Abstract: Dyslexia is a specific learning disorder related to school failure. Detection is both crucial and challenging, especially in languages with transparent orthographies, such as Spanish. To make detecting dyslexia easier, we designed an online gamified test and a predictive machine learning model. In a study with more than 3,600 participants, our model correctly detected over 80% of the participants… ▽ More

    Submitted 9 December, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  44. TurkScanner: Predicting the Hourly Wage of Microtasks

    Authors: Susumu Saito, Chun-Wei Chiang, Saiph Savage, Teppei Nakano, Tetsunori Kobayashi, Jeffrey Bigham

    Abstract: Workers in crowd markets struggle to earn a living. One reason for this is that it is difficult for workers to accurately gauge the hourly wages of microtasks, and they consequently end up performing labor with little pay. In general, workers are provided with little information about tasks, and are left to rely on noisy signals, such as textual description of the task or rating of the requester.… ▽ More

    Submitted 17 March, 2019; originally announced March 2019.

    Comments: Proceedings of the 28th International Conference on World Wide Web (WWW '19), San Francisco, CA, USA, May 13-17, 2019

  45. arXiv:1802.08218  [pdf, other

    cs.CV cs.CL cs.HC

    VizWiz Grand Challenge: Answering Visual Questions from Blind People

    Authors: Danna Gurari, Qing Li, Abigale J. Stangl, Anhong Guo, Chi Lin, Kristen Grauman, Jiebo Luo, Jeffrey P. Bigham

    Abstract: The study of algorithms to automatically answer visual questions currently is motivated by visual question answering (VQA) datasets constructed in artificial VQA settings. We propose VizWiz, the first goal-oriented VQA dataset arising from a natural VQA setting. VizWiz consists of over 31,000 visual questions originating from blind people who each took a picture using a mobile phone and recorded a… ▽ More

    Submitted 9 May, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

  46. arXiv:1801.02668  [pdf, other

    cs.HC cs.AI cs.CL

    Evorus: A Crowd-powered Conversational Assistant Built to Automate Itself Over Time

    Authors: Ting-Hao 'Kenneth' Huang, Joseph Chee Chang, Jeffrey P. Bigham

    Abstract: Crowd-powered conversational assistants have been shown to be more robust than automated systems, but do so at the cost of higher response latency and monetary costs. A promising direction is to combine the two approaches for high quality, low latency, and low cost solutions. In this paper, we introduce Evorus, a crowd-powered conversational assistant built to automate itself over time by (i) allo… ▽ More

    Submitted 9 January, 2018; v1 submitted 8 January, 2018; originally announced January 2018.

    Comments: 10 pages. To appear in the Proceedings of the Conference on Human Factors in Computing Systems 2018 (CHI'18)

    ACM Class: H.5.m

  47. arXiv:1712.05796  [pdf

    cs.CY cs.HC

    A Data-Driven Analysis of Workers' Earnings on Amazon Mechanical Turk

    Authors: Kotaro Hara, Abi Adams, Kristy Milland, Saiph Savage, Chris Callison-Burch, Jeffrey Bigham

    Abstract: A growing number of people are working as part of on-line crowd work, which has been characterized by its low wages; yet, we know little about wage distribution and causes of low/high earnings. We recorded 2,676 workers performing 3.8 million tasks on Amazon Mechanical Turk. Our task-level analysis revealed that workers earned a median hourly wage of only ~\$2/h, and only 4% earned more than \$7.2… ▽ More

    Submitted 28 December, 2017; v1 submitted 14 December, 2017; originally announced December 2017.

    Comments: Conditionally accepted for inclusion in the 2018 ACM Conference on Human Factors in Computing Systems (CHI'18) Papers program

  48. arXiv:1708.03044  [pdf, other

    cs.HC cs.AI cs.CL

    "Is there anything else I can help you with?": Challenges in Deploying an On-Demand Crowd-Powered Conversational Agent

    Authors: Ting-Hao Kenneth Huang, Walter S. Lasecki, Amos Azaria, Jeffrey P. Bigham

    Abstract: Intelligent conversational assistants, such as Apple's Siri, Microsoft's Cortana, and Amazon's Echo, have quickly become a part of our digital life. However, these assistants have major limitations, which prevents users from conversing with them as they would with human dialog partners. This limits our ability to observe how users really want to interact with the underlying system. To address this… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

    Comments: 10 pages. In Proceedings of Conference on Human Computation & Crowdsourcing (HCOMP 2016), 2016, Austin, TX, USA

  49. arXiv:1704.03627  [pdf, other

    cs.HC cs.AI cs.CL

    Real-time On-Demand Crowd-powered Entity Extraction

    Authors: Ting-Hao 'Kenneth' Huang, Yun-Nung Chen, Jeffrey P. Bigham

    Abstract: Output-agreement mechanisms such as ESP Game have been widely used in human computation to obtain reliable human-generated labels. In this paper, we argue that a "time-limited" output-agreement mechanism can be used to create a fast and robust crowd-powered component in interactive systems, particularly dialogue systems, to extract key information from user utterances on the fly. Our experiments o… ▽ More

    Submitted 6 December, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: Accepted by the 5th Edition Of The Collective Intelligence Conference (CI 2017) as an oral presentation. Interface code and data are available at: https://github.com/windx0303/dialogue-esp-game

  50. arXiv:1508.02982  [pdf

    cs.HC

    WearWrite: Orchestrating the Crowd to Complete Complex Tasks from Wearables (We Wrote This Paper on a Watch)

    Authors: Michael Nebeling, Anhong Guo, Kyle Murray, Annika Tostengard, Angelos Giannopoulos, Martin Mihajlov, Steven Dow, Jaime Teevan, Jeffrey P. Bigham

    Abstract: In this paper we introduce a paradigm for completing complex tasks from wearable devices by leveraging crowdsourcing, and demonstrate its validity for academic writing. We explore this paradigm using a collaborative authoring system, called WearWrite, which is designed to enable authors and crowd workers to work together using an Android smartwatch and Google Docs to produce academic papers, inclu… ▽ More

    Submitted 25 July, 2015; originally announced August 2015.