-
Experiences from Integrating Large Language Model Chatbots into the Classroom
Authors:
Arto Hellas,
Juho Leinonen,
Leo Leppänen
Abstract:
In the present study, we provided students an unfiltered access to a state-of-the-art large language model (LLM) chatbot. The chatbot was intentionally designed to mimic proprietary commercial chatbots such as ChatGPT where the chatbot has not been tailored for the educational context; the underlying engine was OpenAI GPT-4. The chatbot was integrated into online learning materials of three course…
▽ More
In the present study, we provided students an unfiltered access to a state-of-the-art large language model (LLM) chatbot. The chatbot was intentionally designed to mimic proprietary commercial chatbots such as ChatGPT where the chatbot has not been tailored for the educational context; the underlying engine was OpenAI GPT-4. The chatbot was integrated into online learning materials of three courses. One of the courses focused on software engineering with LLMs, while the two other courses were not directly related to LLMs. Our results suggest that only a minority of students engage with the chatbot in the courses that do not relate to LLMs. At the same time, unsurprisingly, nearly all students in the LLM-focused course leveraged the chatbot. In all courses, the majority of the LLM usage came from a few superusers, whereas the majority of the students did not heavily use the chatbot even though it was readily available and effectively provided a free access to the OpenAI GPT-4 model. We also observe that in addition to students using the chatbot for course-specific purposes, many use the chatbot for their own purposes. These results suggest that the worst fears of educators -- all students overrelying on LLMs -- did not materialize even when the chatbot access was unfiltered. We finally discuss potential reasons for the low usage, suggesting the need for more tailored and scaffolded LLM experiences targeted for specific types of student use cases.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Underreporting of errors in NLG output, and what to do about it
Authors:
Emiel van Miltenburg,
Miruna-Adriana Clinciu,
Ondřej Dušek,
Dimitra Gkatzia,
Stephanie Inglis,
Leo Leppänen,
Saad Mahamood,
Emma Manning,
Stephanie Schoch,
Craig Thomson,
Luou Wen
Abstract:
We observe a severe under-reporting of the different kinds of errors that Natural Language Generation systems make. This is a problem, because mistakes are an important indicator of where systems should still be improved. If authors only report overall performance metrics, the research community is left in the dark about the specific weaknesses that are exhibited by `state-of-the-art' research. Ne…
▽ More
We observe a severe under-reporting of the different kinds of errors that Natural Language Generation systems make. This is a problem, because mistakes are an important indicator of where systems should still be improved. If authors only report overall performance metrics, the research community is left in the dark about the specific weaknesses that are exhibited by `state-of-the-art' research. Next to quantifying the extent of error under-reporting, this position paper provides recommendations for error identification, analysis and reporting.
△ Less
Submitted 8 August, 2021; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Let Us Dance Just a Little Bit More --- On the Information Capacity of the Human Motor System
Authors:
Teemu Roos,
Antti Oulasvirta,
Laura Leppänen,
Arttu Modig
Abstract:
Fitts' law is a fundamental tool in measuring the capacity of the human motor system. However, it is, by definition, limited to aimed movements toward spatially expanded targets. We revisit its information-theoretic basis with the goal of generalizing it into unconstrained trained movement such as dance and sports. The proposed new measure is based on a subject's ability to accurately reproduce a…
▽ More
Fitts' law is a fundamental tool in measuring the capacity of the human motor system. However, it is, by definition, limited to aimed movements toward spatially expanded targets. We revisit its information-theoretic basis with the goal of generalizing it into unconstrained trained movement such as dance and sports. The proposed new measure is based on a subject's ability to accurately reproduce a complex movement pattern. We demonstrate our framework using motion-capture data from professional dance performances.
△ Less
Submitted 13 February, 2012; v1 submitted 25 February, 2011;
originally announced February 2011.