Skip to main content

Showing 1–50 of 52 results for author: Ting-Hao

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12787  [pdf, other

    cs.CL cs.HC

    Generating Educational Materials with Different Levels of Readability using LLMs

    Authors: Chieh-Yang Huang, **g Wei, Ting-Hao 'Kenneth' Huang

    Abstract: This study introduces the leveled-text generation task, aiming to rewrite educational materials to specific readability levels while preserving meaning. We assess the capability of GPT-3.5, LLaMA-2 70B, and Mixtral 8x7B, to generate content at various readability levels through zero-shot and few-shot prompting. Evaluating 100 processed educational materials reveals that few-shot prompting signific… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: In2Writing 2024

  2. arXiv:2404.17025  [pdf, other

    cs.HC

    How Does Conversation Length Impact User's Satisfaction? A Case Study of Length-Controlled Conversations with LLM-Powered Chatbots

    Authors: Shih-Hong Huang, Ya-Fang Lin, Zeyu He, Chieh-Yang Huang, Ting-Hao 'Kenneth' Huang

    Abstract: Users can discuss a wide range of topics with large language models (LLMs), but they do not always prefer solving problems or getting information through lengthy conversations. This raises an intriguing HCI question: How does instructing LLMs to engage in longer or shorter conversations affect conversation quality? In this paper, we developed two Slack chatbots using GPT-4 with the ability to vary… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  3. SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Crafting effective captions for figures is important. Readers heavily depend on these captions to grasp the figure's message. However, despite a well-developed set of AI technologies for figures and captions, these have rarely been tested for usefulness in aiding caption writing. This paper introduces SciCapenter, an interactive system that puts together cutting-edge AI technologies for scientific… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CHI EA '24: Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems

  4. arXiv:2402.16795  [pdf, other

    cs.HC cs.AI cs.CL cs.LG

    If in a Crowdsourced Data Annotation Pipeline, a GPT-4

    Authors: Zeyu He, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Shaurya Rohatgi, Ting-Hao 'Kenneth' Huang

    Abstract: Recent studies indicated GPT-4 outperforms online crowd workers in data labeling accuracy, notably workers from Amazon Mechanical Turk (MTurk). However, these studies were criticized for deviating from standard crowdsourcing practices and emphasizing individual workers' performances over the whole data-annotation process. This paper compared GPT-4 and an ethical and well-executed MTurk pipeline, w… ▽ More

    Submitted 28 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted By CHI 2024

  5. arXiv:2311.16521  [pdf, other

    cs.HC

    Inspo: Writing Stories with a Flock of AIs and Humans

    Authors: Chieh-Yang Huang, Sanjana Gautam, Shannon McClellan Brooks, Ya-Fang Lin, Ting-Hao 'Kenneth' Huang

    Abstract: Large Language Models (LLMs) have advanced automated writing assistance, enabling complex tasks like co-writing novels and poems. However, real-world writing typically requires various support and collaboration across stages and scenarios. Existing research mainly examines how writers utilize single text generators, neglecting this broader context. This paper introduces Inspo, a web-based editor t… ▽ More

    Submitted 12 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  6. arXiv:2310.15405  [pdf, other

    cs.CL

    GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Ryan Rossi, Sungchul Kim, C. Lee Giles, Ting-Hao K. Huang

    Abstract: There is growing interest in systems that generate captions for scientific figures. However, assessing these systems output poses a significant challenge. Human evaluation requires academic expertise and is costly, while automatic evaluation depends on often low-quality author-written captions. This paper investigates using large language models (LLMs) as a cost-effective, reference-free method fo… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To Appear in EMNLP 2023 Findings

  7. arXiv:2310.15129  [pdf, other

    cs.CL cs.LG

    Location-Aware Visual Question Generation with Lightweight Models

    Authors: Nicholas Collin Suwono, Justin Chih-Yao Chen, Tun Min Hung, Ting-Hao Kenneth Huang, I-Bin Liao, Yung-Hui Li, Lun-Wei Ku, Shao-Hua Sun

    Abstract: This work introduces a novel task, location-aware visual question generation (LocaVQG), which aims to generate engaging questions from data relevant to a particular geographical location. Specifically, we represent such location-aware information with surrounding images and a GPS coordinate. To tackle this task, we present a dataset generation pipeline that leverages GPT-4 to produce diverse and s… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  8. arXiv:2310.07649  [pdf, other

    cs.RO eess.SY

    Automated Layout Design and Control of Robust Cooperative Grasped-Load Aerial Transportation Systems

    Authors: Carlo Bosio, Jerry Tang, Ting-Hao Wang, Mark W. Mueller

    Abstract: We present a novel approach to cooperative aerial transportation through a team of drones, using optimal control theory and a hierarchical control strategy. We assume the drones are connected to the payload through rigid attachments, essentially transforming the whole system into a larger flying object with "thrust modules" at the attachment locations of the drones. We investigate the optimal arra… ▽ More

    Submitted 28 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 7 pages, 7 figures, conference paper

  9. arXiv:2308.04346  [pdf, other

    cs.CL cs.CY

    Unmasking Nationality Bias: A Study of Human Perception of Nationalities in AI-Generated Articles

    Authors: Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao `Kenneth' Huang, Shomir Wilson

    Abstract: We investigate the potential for nationality biases in natural language processing (NLP) models using human evaluation methods. Biased NLP models can perpetuate stereotypes and lead to algorithmic discrimination, posing a significant challenge to the fairness and justice of AI systems. Our study employs a two-step mixed-methods approach that includes both quantitative and qualitative analysis to i… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  10. arXiv:2306.04820  [pdf, other

    cs.CL

    Good Data, Large Data, or No Data? Comparing Three Approaches in Develo** Research Aspect Classifiers for Biomedical Papers

    Authors: Shreya Chandrasekhar, Chieh-Yang Huang, Ting-Hao 'Kenneth' Huang

    Abstract: The rapid growth of scientific publications, particularly during the COVID-19 pandemic, emphasizes the need for tools to help researchers efficiently comprehend the latest advancements. One essential part of understanding scientific literature is research aspect classification, which categorizes sentences in abstracts to Background, Purpose, Method, and Finding. In this study, we investigate the i… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: BioNLP workshop 2023

  11. arXiv:2305.09770  [pdf, other

    cs.HC cs.AI cs.CL

    ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing

    Authors: Hua Shen, Chieh-Yang Huang, Tongshuang Wu, Ting-Hao 'Kenneth' Huang

    Abstract: Despite a surge collection of XAI methods, users still struggle to obtain required AI explanations. Previous research suggests chatbots as dynamic solutions, but the effective design of conversational XAI agents for practical human needs remains under-explored. This paper focuses on Conversational XAI for AI-assisted scientific writing tasks. Drawing from human linguistic theories and formative st… ▽ More

    Submitted 27 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: CSCW 2023 Demo. ConvXAI system code: https://github.com/huashen218/convxai.git

  12. arXiv:2304.01002  [pdf, other

    cs.CL cs.AI cs.HC

    Does Human Collaboration Enhance the Accuracy of Identifying LLM-Generated Deepfake Texts?

    Authors: Adaku Uchendu, Jooyoung Lee, Hua Shen, Thai Le, Ting-Hao 'Kenneth' Huang, Dongwon Lee

    Abstract: Advances in Large Language Models (e.g., GPT-4, LLaMA) have improved the generation of coherent sentences resembling human writing on a large scale, resulting in the creation of so-called deepfake texts. However, this progress poses security and privacy concerns, necessitating effective solutions for distinguishing deepfake texts from human-written ones. Although prior works studied humans' abilit… ▽ More

    Submitted 9 October, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted at The 11th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2023)

  13. arXiv:2303.17710  [pdf, other

    cs.HC cs.CL

    What Types of Questions Require Conversation to Answer? A Case Study of AskReddit Questions

    Authors: Shih-Hong Huang, Chieh-Yang Huang, Ya-Fang Lin, Ting-Hao 'Kenneth' Huang

    Abstract: The proliferation of automated conversational systems such as chatbots, spoken-dialogue systems, and smart speakers, has significantly impacted modern digital life. However, these systems are primarily designed to provide answers to well-defined questions rather than to support users in exploring complex, ill-defined questions. In this paper, we aim to push the boundaries of conversational systems… ▽ More

    Submitted 3 April, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: To appear in CHI 2023 Late-Breaking Work

  14. arXiv:2302.12324  [pdf, other

    cs.CL

    Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization

    Authors: Chieh-Yang Huang, Ting-Yao Hsu, Ryan Rossi, Ani Nenkova, Sungchul Kim, Gromit Yeuk-Yin Chan, Eunyee Koh, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Good figure captions help paper readers understand complex scientific figures. Unfortunately, even published papers often have poorly written captions. Automatic caption generation could aid paper writers by providing good starting captions that can be refined for better quality. Prior work often treated figure caption generation as a vision-to-language task. In this paper, we show that it can be… ▽ More

    Submitted 11 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by INLG-2023

  15. arXiv:2302.09122  [pdf, other

    cs.CL cs.HC

    Conveying the Predicted Future to Users: A Case Study of Story Plot Prediction

    Authors: Chieh-Yang Huang, Saniya Naphade, Kavya Laalasa Karanam, Ting-Hao 'Kenneth' Huang

    Abstract: Creative writing is hard: Novelists struggle with writer's block daily. While automatic story generation has advanced recently, it is treated as a "toy task" for advancing artificial intelligence rather than hel** people. In this paper, we create a system that produces a short description that narrates a predicted plot using existing story generation approaches. Our goal is to assist writers in… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: To appear in the AAAI 2023 Workshop- Creative AI Across Modalities

  16. arXiv:2302.02463  [pdf, other

    cs.CL cs.AI

    Nationality Bias in Text Generation

    Authors: Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao 'Kenneth' Huang, Shomir Wilson

    Abstract: Little attention is placed on analyzing nationality bias in language models, especially when nationality is highly used as a factor in increasing the performance of social NLP models. This paper examines how a text generation model, GPT-2, accentuates pre-existing societal biases about country-based demonyms. We generate stories using GPT-2 for various nationalities and use sensitivity analysis to… ▽ More

    Submitted 14 February, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Paper accepted in the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL2023)

  17. arXiv:2212.03969  [pdf, other

    cs.HC

    Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers

    Authors: Shih-Hong Huang, Chieh-Yang Huang, Yuxin Deng, Hua Shen, Szu-Chi Kuan, Ting-Hao 'Kenneth' Huang

    Abstract: Real-time crowd-powered systems, such as Chorus/Evorus, VizWiz, and Apparition, have shown how incorporating humans into automated systems could supplement where the automatic solutions fall short. However, one unspoken bottleneck of applying such architectures to more scenarios is the longer latency of including humans in the loop of automated systems. For the applications that have hard constrai… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: This document is the extended technical report of the "Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers" paper by the authors. The paper was accepted by the Works-in-Progress and Demonstration track of the 10th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2022 WiP/Demo) https://youtu.be/iMDsX52VWGY

  18. arXiv:2211.07441  [pdf, other

    cs.CL cs.CV cs.LG

    Multi-VQG: Generating Engaging Questions for Multiple Images

    Authors: Min-Hsuan Yeh, Vicent Chen, Ting-Hao 'Kenneth' Haung, Lun-Wei Ku

    Abstract: Generating engaging content has drawn much recent attention in the NLP community. Asking questions is a natural way to respond to photos and promote awareness. However, most answers to questions in traditional question-answering (QA) datasets are factoids, which reduce individuals' willingness to answer. Furthermore, traditional visual question generation (VQG) confines the source data for questio… ▽ More

    Submitted 17 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

  19. arXiv:2205.09327  [pdf, other

    cs.AI cs.CL cs.CV

    Let's Talk! Striking Up Conversations via Conversational Visual Question Generation

    Authors: Shih-Han Chan, Tsai-Lun Yang, Yun-Wei Chu, Chi-Yang Hsu, Ting-Hao Huang, Yu-Shian Chiu, Lun-Wei Ku

    Abstract: An engaging and provocative question can open up a great conversation. In this work, we explore a novel scenario: a conversation agent views a set of the user's photos (for example, from social media platforms) and asks an engaging question to initiate a conversation with the user. The existing vision-to-question models mostly generate tedious and obvious questions, which might not be ideals conve… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted as a full talk paper on AAAI-DEEPDIAL'21

  20. arXiv:2204.06382  [pdf, ps, other

    cs.HC

    Empathy-Centric Design At Scale

    Authors: Andrea Mauri, Yen-Chia Hsu, Marco Brambilla, Aisling Ann O'Kane, Ting-Hao 'Kenneth' Huang, Himanshu Verma

    Abstract: EmpathiCH aims at bringing together and blend different expertise to develop new research agenda in the context of "Empathy-Centric Design at Scale". The main research question is to investigate how new technologies can contribute to the elicitation of empathy across and within multiple stakeholders at scale; and how empathy can be used to design solutions to societal problems that are not only ef… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: accepted at Workshops at the 2022 CHI Conference on Human Factors in Computing Systems (CHI 2022)

  21. arXiv:2203.08788  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Are Shortest Rationales the Best Explanations for Human Understanding?

    Authors: Hua Shen, Tongshuang Wu, Wenbo Guo, Ting-Hao 'Kenneth' Huang

    Abstract: Existing self-explaining models typically favor extracting the shortest possible rationales - snippets of an input text "responsible for" corresponding output - to explain the model prediction, with the assumption that shorter rationales are more intuitive to humans. However, this assumption has yet to be validated. Is the shortest rationale indeed the most human-understandable? To answer this que… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: To appear in ACL 2022 main conference

  22. arXiv:2110.11624  [pdf, other

    cs.CL cs.AI cs.CV

    SciCap: Generating Captions for Scientific Figures

    Authors: Ting-Yao Hsu, C. Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Researchers use figures to communicate rich, complex information in scientific papers. The captions of these figures are critical to conveying effective messages. However, low-quality figure captions commonly occur in scientific articles and may decrease understanding. In this paper, we propose an end-to-end neural framework to automatically generate informative, high-quality captions for scientif… ▽ More

    Submitted 25 October, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: To Appear in EMNLP 2021 Findings. The dataset is available at: https://github.com/tingyaohsu/SciCap

  23. Empowering Local Communities Using Artificial Intelligence

    Authors: Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang, Himanshu Verma, Andrea Mauri, Illah Nourbakhsh, Alessandro Bozzon

    Abstract: Artificial Intelligence (AI) is increasingly used to analyze large amounts of data in various practices, such as object recognition. We are specifically interested in using AI-powered systems to engage local communities in develo** plans or solutions for pressing societal and environmental concerns. Such local contexts often involve multiple stakeholders with different and even contradictory age… ▽ More

    Submitted 26 April, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: This manuscript is peer-reviewed and accepted by the Patterns journal

  24. arXiv:2109.00122  [pdf, other

    cs.CL

    FinQA: A Dataset of Numerical Reasoning over Financial Data

    Authors: Zhiyu Chen, Wenhu Chen, Charese Smiley, Sameena Shah, Iana Borova, Dylan Langdon, Reema Moussa, Matt Beane, Ting-Hao Huang, Bryan Routledge, William Yang Wang

    Abstract: The sheer volume of financial statements makes it difficult for humans to access and analyze a business's financials. Robust numerical reasoning likewise faces unique challenges in this domain. In this work, we focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents. In contrast to existing tasks on general domain, the finance… ▽ More

    Submitted 7 May, 2022; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  25. arXiv:2106.12027  [pdf, other

    cs.CL cs.AI cs.LG

    ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences

    Authors: Yanjun Gao, Ting-hao Huang, Rebecca J. Passonneau

    Abstract: Atomic clauses are fundamental text units for understanding complex sentences. Identifying the atomic sentences within complex sentences is important for applications such as summarization, argument mining, discourse analysis, discourse parsing, and question answering. Previous work mainly relies on rule-based methods dependent on parsing. We propose a new task to decompose each complex sentence i… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: To appear in the proceeding of 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021) Main Conference

  26. arXiv:2105.06950  [pdf, other

    cs.CL cs.AI

    Plot and Rework: Modeling Storylines for Visual Storytelling

    Authors: Chi-Yang Hsu, Yun-Wei Chu, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: Writing a coherent and engaging story is not easy. Creative writers use their knowledge and worldview to put disjointed elements together to form a coherent storyline, and work and rework iteratively toward perfection. Automated visual storytelling (VIST) models, however, make poor use of external knowledge and iterative generation when attempting to create stories. This paper introduces PR-VIST,… ▽ More

    Submitted 7 July, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: 9 pages, ACL-IJCNLP 2021 Findings

  27. arXiv:2104.05604  [pdf, other

    cs.CL

    Semantic Frame Forecast

    Authors: Chieh-Yang Huang, Ting-Hao 'Kenneth' Huang

    Abstract: This paper introduces semantic frame forecast, a task that predicts the semantic frames that will occur in the next 10, 100, or even 1,000 sentences in a running story. Prior work focused on predicting the immediate future of a story, such as one to a few sentences ahead. However, when novelists write long stories, generating a few sentences is not enough to help them gain high-level insight to de… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 9 pages, NAACL 2021

  28. arXiv:2103.14973  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Explaining the Road Not Taken

    Authors: Hua Shen, Ting-Hao 'Kenneth' Huang

    Abstract: It is unclear if existing interpretations of deep neural network models respond effectively to the needs of users. This paper summarizes the common forms of explanations (such as feature attribution, decision rules, or probes) used in over 200 recent papers about natural language processing (NLP), and compares them against user questions collected in the XAI Question Bank. We found that although u… ▽ More

    Submitted 30 March, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

    Comments: Accepted by The 2021 ACM CHI Workshop on Operationalizing Human-Centered Perspectives in Explainable AI (CHI 2021 HCXAI Workshop). For associated website, see https://human-centered-exnlp.github.io

  29. arXiv:2010.02179  [pdf, other

    cs.CL

    Assessing the Helpfulness of Learning Materials with Inference-Based Learner-Like Agent

    Authors: Yun-Hsuan Jen, Chieh-Yang Huang, Mei-Hua Chen, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: Many English-as-a-second language learners have trouble using near-synonym words (e.g., small vs.little; briefly vs.shortly) correctly, and often look for example sentences to learn how two nearly synonymous terms differ. Prior work uses hand-crafted scores to recommend sentences but has difficulty in adopting such scores to all the near-synonyms as near-synonyms differ in various ways. We notice… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, to appear in EMNLP 2020 as a long paper

  30. arXiv:2008.11721  [pdf, other

    cs.HC cs.AI cs.LG stat.ML

    How Useful Are the Machine-Generated Interpretations to General Users? A Human Evaluation on Guessing the Incorrectly Predicted Labels

    Authors: Hua Shen, Ting-Hao Kenneth Huang

    Abstract: Explaining to users why automated systems make certain mistakes is important and challenging. Researchers have proposed ways to automatically produce interpretations for deep neural network models. However, it is unclear how useful these interpretations are in hel** users figure out why they are getting an error. If an interpretation effectively explains to users how the underlying deep neural n… ▽ More

    Submitted 27 August, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: Accepted by The 8th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2020) https://github.com/huashen218/GuessWrongLabel

  31. arXiv:2005.06111  [pdf, other

    cs.CV

    Project RISE: Recognizing Industrial Smoke Emissions

    Authors: Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang, Ting-Yao Hu, Paul Dille, Sean Prendi, Ryan Hoffman, Anastasia Tsuhlares, Jessica Pachuta, Randy Sargent, Illah Nourbakhsh

    Abstract: Industrial smoke emissions pose a significant concern to human health. Prior works have shown that using Computer Vision (CV) techniques to identify smoke as visual evidence can influence the attitude of regulators and empower citizens to pursue environmental justice. However, existing datasets are not of sufficient quality nor quantity to train the robust CV models needed to support air quality a… ▽ More

    Submitted 29 April, 2024; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted by AAAI 2021

  32. arXiv:2005.02367  [pdf, other

    cs.CL cs.HC

    CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000+ Abstracts in the COVID-19 Open Research Dataset

    Authors: Ting-Hao 'Kenneth' Huang, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Yen-Chia Hsu, C. Lee Giles

    Abstract: This paper introduces CODA-19, a human-annotated dataset that codes the Background, Purpose, Method, Finding/Contribution, and Other sections of 10,966 English abstracts in the COVID-19 Open Research Dataset. CODA-19 was created by 248 crowd workers from Amazon Mechanical Turk within 10 days, and achieved labeling quality comparable to that of experts. Each abstract was annotated by nine different… ▽ More

    Submitted 17 September, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: Accepted by the NLP COVID-19 Workshop at ACL 2020. (The data, code, and model are available at: https://github.com/windx0303/CODA-19)

  33. Heteroglossia: In-Situ Story Ideation with the Crowd

    Authors: Chieh-Yang Huang, Shih-Hong Huang, Ting-Hao 'Kenneth' Huang

    Abstract: Ideation is essential for creative writing. Many authors struggle to come up with ideas throughout the writing process, yet modern writing tools fail to provide on-the-spot assistance for writers when they get stuck. This paper introduces Heteroglossia, an add-on for Google Docs that allows writers to elicit story ideas from the online crowd using their text editors. Writers can share snippets of… ▽ More

    Submitted 15 January, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: Accepted by CHI 2020. Video Promotion: https://www.youtube.com/watch?v=i0G-tq3d8c0

    ACM Class: H.5; H.4; I.7

  34. arXiv:1912.11936  [pdf, other

    cs.HC cs.AI cs.SI

    Smell Pittsburgh: Engaging Community Citizen Science for Air Quality

    Authors: Yen-Chia Hsu, Jennifer Cross, Paul Dille, Michael Tasota, Beatrice Dias, Randy Sargent, Ting-Hao 'Kenneth' Huang, Illah Nourbakhsh

    Abstract: Urban air pollution has been linked to various human health concerns, including cardiopulmonary diseases. Communities who suffer from poor air quality often rely on experts to identify pollution sources due to the lack of accessible tools. Taking this into account, we developed Smell Pittsburgh, a system that enables community members to report odors and track where these odors are frequently conc… ▽ More

    Submitted 20 November, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

    Comments: Accepted by ACM Transactions on Interactive Intelligent Systems on 2020. This is an extended version of the arXiv:1810.11143, which was accepted by the ACM IUI 2019 conference. arXiv admin note: substantial text overlap with arXiv:1810.11143

  35. arXiv:1912.01496  [pdf, other

    cs.CL

    Knowledge-Enriched Visual Storytelling

    Authors: Chao-Chun Hsu, Zi-Yuan Chen, Chi-Yang Hsu, Chih-Chia Li, Tzu-Yuan Lin, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: Stories are diverse and highly personalized, resulting in a large possible output space for story generation. Existing end-to-end approaches produce monotonous stories because they are limited to the vocabulary and knowledge in a single training dataset. This paper introduces KG-Story, a three-stage framework that allows the story generation model to take advantage of external Knowledge Graphs to… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: AAAI 2020

  36. arXiv:1910.09621  [pdf, other

    cs.HC cs.CL

    On Automating Conversations

    Authors: Ting-Hao 'Kenneth' Huang

    Abstract: From 2016 to 2018, we developed and deployed Chorus, a system that blends real-time human computation with artificial intelligence (AI) and has real-world, open conversations with users. We took a top-down approach that started with a working crowd-powered system, Chorus, and then created a framework, Evorus, that enables Chorus to automate itself over time. Over our two-year deployment, more than… ▽ More

    Submitted 24 October, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: An invited position paper at the "Artificial Intelligence and Work: AAAI 2019 Fall Symposium" (AAAI-FSS 2019), Washington, DC, November 7-9, 2019

  37. arXiv:1910.08814  [pdf, ps, other

    cs.HC

    On Using Chatbots to Promote Smoking Cessation Among Adolescents of Low Socioeconomic Status

    Authors: Patricia Simon, Suchitra Krishnan-Sarin, Ting-Hao 'Kenneth' Huang

    Abstract: Reducing youth tobacco use is critical for improving child health since tobacco use is associated with respiratory problems, and nicotine may interfere with healthy brain development. While tobacco regulation has contributed to declines in cigarette use among youth, these declines have occurred more quickly for youth of high socioeconomic status (SES) compared to youth of low SES. A major barrier… ▽ More

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: Selected for round-table discussion in Artificial Intelligence and Work: AAAI 2019 Fall Symposium (AAAI FSS 2019)

  38. InstructableCrowd: Creating IF-THEN Rules for Smartphones via Conversations with the Crowd

    Authors: Ting-Hao 'Kenneth' Huang, Amos Azaria, Oscar J. Romero, Jeffrey P. Bigham

    Abstract: Natural language interfaces have become a common part of modern digital life. Chatbots utilize text-based conversations to communicate with users; personal assistants on smartphones such as Google Assistant take direct speech commands from their users; and speech-controlled devices such as Amazon Echo use voice as their only input mode. In this paper, we introduce InstructableCrowd, a crowd-powere… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: Published at Human Computation (2019) 6:1:113-146

    Journal ref: Human Computation (2019) 6:1:113-146

  39. arXiv:1906.01764  [pdf, other

    cs.CL cs.AI cs.HC

    Visual Story Post-Editing

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang

    Abstract: We introduce the first dataset for human edits of machine-generated visual stories and explore how these collected edits may be used for the visual story post-editing task. The dataset, VIST-Edit, includes 14,905 human edited versions of 2,981 machine-generated visual stories. The stories were generated by two state-of-the-art visual storytelling models, each aligned to 5 human-edited versions. We… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted by ACL 2019

  40. Dixit: Interactive Visual Storytelling via Term Manipulation

    Authors: Chao-Chun Hsu, Yu-Hua Chen, Zi-Yuan Chen, Hsin-Yu Lin, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: In this paper, we introduce Dixit, an interactive visual storytelling system that the user interacts with iteratively to compose a short story for a photo sequence. The user initiates the process by uploading a sequence of photos. Dixit first extracts text terms from each photo which describe the objects (e.g., boy, bike) or actions (e.g., sleep) in the photo, and then allows the user to add new t… ▽ More

    Submitted 31 May, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: WWW'19 Demo, demo video: https://www.youtube.com/watch?v=CUu1MOwnveI

  41. arXiv:1902.08327  [pdf, other

    cs.HC cs.CL

    On How Users Edit Computer-Generated Visual Stories

    Authors: Ting-Yao Hsu, Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang

    Abstract: A significant body of research in Artificial Intelligence (AI) has focused on generating stories automatically, either based on prior story plots or input images. However, literature has little to say about how users would receive and use these stories. Given the quality of stories generated by modern AI algorithms, users will nearly inevitably have to edit these stories before putting them to rea… ▽ More

    Submitted 8 March, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: To appear in CHI'19 Late-Breaking Work on Human Factors in Computing Systems (CHI LBW 2019), 2019

  42. arXiv:1810.11143  [pdf, other

    cs.HC

    Smell Pittsburgh: Community-Empowered Mobile Smell Reporting System

    Authors: Yen-Chia Hsu, Jennifer Cross, Paul Dille, Michael Tasota, Beatrice Dias, Randy Sargent, Ting-Hao 'Kenneth' Huang, Illah Nourbakhsh

    Abstract: Urban air pollution has been linked to various human health considerations, including cardiopulmonary diseases. Communities who suffer from poor air quality often rely on experts to identify pollution sources due to the lack of accessible tools. Taking this into account, we developed Smell Pittsburgh, a system that enables community members to report odors and track where these odors are frequentl… ▽ More

    Submitted 1 July, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

    Comments: Accepted by ACM IUI 2019 conference, with error corrections

  43. arXiv:1802.08379  [pdf, other

    cs.CL

    EmotionLines: An Emotion Corpus of Multi-Party Conversations

    Authors: Sheng-Yeh Chen, Chao-Chun Hsu, Chuan-Chun Kuo, Ting-Hao, Huang, Lun-Wei Ku

    Abstract: Feeling emotion is a critical characteristic to distinguish people from machines. Among all the multi-modal resources for emotion detection, textual datasets are those containing the least additional information in addition to semantics, and hence are adopted widely for testing the developed systems. However, most of the textual emotional datasets consist of emotion labels of only individual words… ▽ More

    Submitted 30 May, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

    Comments: LREC2018

  44. arXiv:1801.02668  [pdf, other

    cs.HC cs.AI cs.CL

    Evorus: A Crowd-powered Conversational Assistant Built to Automate Itself Over Time

    Authors: Ting-Hao 'Kenneth' Huang, Joseph Chee Chang, Jeffrey P. Bigham

    Abstract: Crowd-powered conversational assistants have been shown to be more robust than automated systems, but do so at the cost of higher response latency and monetary costs. A promising direction is to combine the two approaches for high quality, low latency, and low cost solutions. In this paper, we introduce Evorus, a crowd-powered conversational assistant built to automate itself over time by (i) allo… ▽ More

    Submitted 9 January, 2018; v1 submitted 8 January, 2018; originally announced January 2018.

    Comments: 10 pages. To appear in the Proceedings of the Conference on Human Factors in Computing Systems 2018 (CHI'18)

    ACM Class: H.5.m

  45. arXiv:1708.03044  [pdf, other

    cs.HC cs.AI cs.CL

    "Is there anything else I can help you with?": Challenges in Deploying an On-Demand Crowd-Powered Conversational Agent

    Authors: Ting-Hao Kenneth Huang, Walter S. Lasecki, Amos Azaria, Jeffrey P. Bigham

    Abstract: Intelligent conversational assistants, such as Apple's Siri, Microsoft's Cortana, and Amazon's Echo, have quickly become a part of our digital life. However, these assistants have major limitations, which prevents users from conversing with them as they would with human dialog partners. This limits our ability to observe how users really want to interact with the underlying system. To address this… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

    Comments: 10 pages. In Proceedings of Conference on Human Computation & Crowdsourcing (HCOMP 2016), 2016, Austin, TX, USA

  46. arXiv:1707.07191  [pdf, other

    cs.CL cs.HC

    MoodSwipe: A Soft Keyboard that Suggests Messages Based on User-Specified Emotions

    Authors: Chieh-Yang Huang, Tristan Labetoulle, Ting-Hao Kenneth Huang, Yi-Pei Chen, Hung-Chen Chen, Vallari Srivastava, Lun-Wei Ku

    Abstract: We present MoodSwipe, a soft keyboard that suggests text messages given the user-specified emotions utilizing the real dialog data. The aim of MoodSwipe is to create a convenient user interface to enjoy the technology of emotion classification and text suggestion, and at the same time to collect labeled data automatically for develo** more advanced technologies. While users select the MoodSwipe… ▽ More

    Submitted 22 July, 2017; originally announced July 2017.

    Comments: 6 pages (including references), EMNLP 2017 Demo paper

    ACM Class: H.5.2; H.5.3; I.2.7

  47. arXiv:1704.03627  [pdf, other

    cs.HC cs.AI cs.CL

    Real-time On-Demand Crowd-powered Entity Extraction

    Authors: Ting-Hao 'Kenneth' Huang, Yun-Nung Chen, Jeffrey P. Bigham

    Abstract: Output-agreement mechanisms such as ESP Game have been widely used in human computation to obtain reliable human-generated labels. In this paper, we argue that a "time-limited" output-agreement mechanism can be used to create a fast and robust crowd-powered component in interactive systems, particularly dialogue systems, to extract key information from user utterances on the fly. Our experiments o… ▽ More

    Submitted 6 December, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: Accepted by the 5th Edition Of The Collective Intelligence Conference (CI 2017) as an oral presentation. Interface code and data are available at: https://github.com/windx0303/dialogue-esp-game

  48. arXiv:1702.02736  [pdf, other

    cs.CL cs.HC

    Challenges in Providing Automatic Affective Feedback in Instant Messaging Applications

    Authors: Chieh-Yang Huang, Ting-Hao, Huang, Lun-Wei Ku

    Abstract: Instant messaging is one of the major channels of computer mediated communication. However, humans are known to be very limited in understanding others' emotions via text-based communication. Aiming on introducing emotion sensing technologies to instant messaging, we developed EmotionPush, a system that automatically detects the emotions of the messages end-users received on Facebook Messenger and… ▽ More

    Submitted 9 February, 2017; originally announced February 2017.

    Comments: 7 pages, 2017 AAAI Spring Symposia

    ACM Class: H.5.2; H.5.3; I.2.7

  49. arXiv:1610.04758  [pdf, other

    cs.HC

    Sensing Emotions in Text Messages: An Application and Deployment Study of EmotionPush

    Authors: Shih-Ming Wang, Chun-Hui Li, Yu-Chun Lo, Ting-Hao K. Huang, Lun-Wei Ku

    Abstract: Instant messaging and push notifications play important roles in modern digital life. To enable robust sense-making and rich context awareness in computer mediated communications, we introduce EmotionPush, a system that automatically conveys the emotion of received text with a colored push notification on mobile devices. EmotionPush is powered by state-of-the-art emotion classifiers and is deploye… ▽ More

    Submitted 15 October, 2016; originally announced October 2016.

    Comments: 4 pages. COLING 2016 Demo paper

    ACM Class: H.5.2; H.5.3

  50. arXiv:1604.03968  [pdf, other

    cs.CL cs.AI cs.CV

    Visual Storytelling

    Authors: Ting-Hao, Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Aishwarya Agrawal, Jacob Devlin, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell

    Abstract: We introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling. The first release of this dataset, SIND v.1, includes 81,743 unique photos in 20,211 sequences, aligned to both descriptive (caption) and story language. We establish several strong baselines for the storytelling task, and motivate an automatic metric to benc… ▽ More

    Submitted 13 April, 2016; originally announced April 2016.

    Comments: to appear in NAACL 2016