Showing 1–2 of 2 results for author: Ma, W A

Searching in archive cs.
  1. arXiv:2406.10215

    cs.CL cs.LG

    DevBench: A multimodal developmental benchmark for language learning

    Authors: Alvin Wei Ming Tan, Sunny Yu, Bria Long, Wanjing Anya Ma, Tonya Murray, Rebecca D. Silverman, Jason D. Yeatman, Michael C. Frank

    Abstract: How (dis)similar are the learning trajectories of vision-language models and children? Recent modeling work has attempted to understand the gap between models' and humans' data efficiency by constructing models trained on less data, especially multimodal naturalistic data. However, such models are often evaluated on adult-level benchmarks, with limited breadth in language abilities tested, and wit…

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2310.06837

    cs.CL cs.LG

    Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency

    Authors: Eric Zelikman, Wanjing Anya Ma, Jasmine E. Tran, Diyi Yang, Jason D. Yeatman, Nick Haber

    Abstract: Developing an educational test can be expensive and time-consuming, as each item must be written by experts and then evaluated by collecting hundreds of student responses. Moreover, many tests require multiple distinct sets of questions administered throughout the school year to closely monitor students' progress, known as parallel tests. In this study, we focus on tests of silent sentence reading…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Main)