Skip to main content

Showing 1–3 of 3 results for author: Peña, J D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02632  [pdf, other

    cs.HC cs.FL

    STL: Still Tricky Logic (for System Validation, Even When Showing Your Work)

    Authors: Isabelle Hurley, Rohan Paleja, Ashley Suh, Jaime D. Peña, Ho Chit Siu

    Abstract: As learned control policies become increasingly common in autonomous systems, there is increasing need to ensure that they are interpretable and can be checked by human stakeholders. Formal specifications have been proposed as ways to produce human-interpretable policies for autonomous systems that can still be learned from examples. Previous work showed that despite claims of interpretability, hu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.02018  [pdf, other

    cs.CL cs.AI cs.HC

    Why Would You Suggest That? Human Trust in Language Model Responses

    Authors: Manasi Sharma, Ho Chit Siu, Rohan Paleja, Jaime D. Peña

    Abstract: The emergence of Large Language Models (LLMs) has revealed a growing need for human-AI collaboration, especially in creative decision-making scenarios where trust and reliance are paramount. Through human studies and model evaluations on the open-ended News Headline Generation task from the LaMP benchmark, we analyze how the framing and presence of explanations affect user trust and model performa… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2107.07630  [pdf, other

    cs.AI cs.HC

    Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi

    Authors: Ho Chit Siu, Jaime D. Pena, Edenna Chen, Yutai Zhou, Victor J. Lopez, Kyle Palko, Kimberlee C. Chang, Ross E. Allen

    Abstract: Deep reinforcement learning has generated superhuman AI in competitive games such as Go and StarCraft. Can similar learning techniques create a superior AI teammate for human-machine collaborative games? Will humans prefer AI teammates that improve objective team performance or those that improve subjective metrics of trust? In this study, we perform a single-blind evaluation of teams of humans an… ▽ More

    Submitted 21 October, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at NeurIPS 2021