Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving

Wang, Karen D.; Burkholder, Eric; Wieman, Carl; Salehi, Shima; Haber, Nick

Computer Science > Artificial Intelligence

arXiv:2310.08773 (cs)

[Submitted on 12 Oct 2023 (v1), last revised 28 Oct 2023 (this version, v2)]

Title:Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving

Authors:Karen D. Wang, Eric Burkholder, Carl Wieman, Shima Salehi, Nick Haber

View PDF

Abstract:The study explores the capabilities of OpenAI's ChatGPT in solving different types of physics problems. ChatGPT (with GPT-4) was queried to solve a total of 40 problems from a college-level engineering physics course. These problems ranged from well-specified problems, where all data required for solving the problem was provided, to under-specified, real-world problems where not all necessary data were given. Our findings show that ChatGPT could successfully solve 62.5% of the well-specified problems, but its accuracy drops to 8.3% for under-specified problems. Analysis of the model's incorrect solutions revealed three distinct failure modes: 1) failure to construct accurate models of the physical world, 2) failure to make reasonable assumptions about missing data, and 3) calculation errors. The study offers implications for how to leverage LLM-augmented instructional materials to enhance STEM education. The insights also contribute to the broader discourse on AI's strengths and limitations, serving both educators aiming to leverage the technology and researchers investigating human-AI collaboration frameworks for problem-solving and decision-making.

Comments:	12 pages, 2 figures
Subjects:	Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE)
Cite as:	arXiv:2310.08773 [cs.AI]
	(or arXiv:2310.08773v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2310.08773

Submission history

From: Karen D. Wang [view email]
[v1] Thu, 12 Oct 2023 23:39:28 UTC (55 KB)
[v2] Sat, 28 Oct 2023 00:24:57 UTC (55 KB)

Computer Science > Artificial Intelligence

Title:Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators