Skip to main content

Showing 1–1 of 1 results for author: Mulvehill, A M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2203.12184  [pdf, other

    cs.CL

    A Theoretically Grounded Benchmark for Evaluating Machine Commonsense

    Authors: Henrique Santos, Ke Shen, Alice M. Mulvehill, Yasaman Razeghi, Deborah L. McGuinness, Mayank Kejriwal

    Abstract: Programming machines with commonsense reasoning (CSR) abilities is a longstanding challenge in the Artificial Intelligence community. Current CSR benchmarks use multiple-choice (and in relatively fewer cases, generative) question-answering instances to evaluate machine commonsense. Recent progress in transformer-based language representation models suggest that considerable progress has been made… ▽ More

    Submitted 14 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.