Showing 1–1 of 1 results for author: Mareedu, L

  1. arXiv:2302.13814  [pdf, other]

    cs.CL cs.AI cs.LG

    An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP)

    Authors: Paulo Shakarian, Abhinav Koyyalamudi, Noel Ngu, Lakshmivihari Mareedu

    Abstract: We study the performance of a commercially available large language model (LLM) known as ChatGPT on math word problems (MWPs) from the dataset DRAW-1K. To our knowledge, this is the first independent evaluation of ChatGPT. We found that ChatGPT's performance changes dramatically based on the requirement to show its work, failing 20% of the time when it provides work compared with 84% when it does…

    Submitted 27 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Journal ref: AAAI Spring Symposium 2023 (MAKE)