Showing 1–23 of 23 results for author: Hellas, A

  1. arXiv:2407.04873  [pdf, ps, other]

    cs.AI cs.CY

    Evaluating Language Models for Generating and Judging Programming Feedback

    Authors: Charles Koutcheme, Nicola Dainese, Arto Hellas, Sami Sarsa, Juho Leinonen, Syed Ashraf, Paul Denny

    Abstract: The emergence of large language models (LLMs) has transformed research and practice in a wide range of domains. Within the computing education research (CER) domain, LLMs have received plenty of attention especially in the context of learning programming. Much of the work on LLMs in CER has however focused on applying and evaluating proprietary models. In this article, we evaluate the efficiency o…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2406.04817  [pdf, other]

    cs.CY cs.HC

    Experiences from Integrating Large Language Model Chatbots into the Classroom

    Authors: Arto Hellas, Juho Leinonen, Leo Leppänen

    Abstract: In the present study, we provided students with unfiltered access to a state-of-the-art large language model (LLM) chatbot. The chatbot was intentionally designed to mimic proprietary commercial chatbots such as ChatGPT where the chatbot has not been tailored for the educational context; the underlying engine was OpenAI GPT-4. The chatbot was integrated into online learning materials of three course…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 7 pages, 1 figure, 5 tables

  3. arXiv:2405.05347  [pdf, other]

    cs.SE cs.AI cs.CL cs.CY

    Benchmarking Educational Program Repair

    Authors: Charles Koutcheme, Nicola Dainese, Sami Sarsa, Juho Leinonen, Arto Hellas, Paul Denny

    Abstract: The emergence of large language models (LLMs) has sparked enormous interest due to their potential application across a range of educational tasks. For example, recent work in programming education has used LLMs to generate learning resources, improve error messages, and provide feedback on code. However, one factor that limits progress within the field is that much of the research uses bespoke da…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 15 pages, 2 figures, 3 tables. Non-archival report presented at the NeurIPS'23 Workshop on Generative AI for Education (GAIED)

  4. arXiv:2405.05253  [pdf, other]

    cs.CL cs.AI cs.CY

    Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge

    Authors: Charles Koutcheme, Nicola Dainese, Sami Sarsa, Arto Hellas, Juho Leinonen, Paul Denny

    Abstract: Large language models (LLMs) have shown great potential for the automatic generation of feedback in a wide range of computing contexts. However, concerns have been voiced around the privacy and ethical implications of sending student work to proprietary models. This has sparked considerable interest in the use of open source LLMs in education, but the quality of the feedback that such open models…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 7 pages, 4 figures, 2 tables. Accepted for publication at the 29th annual ACM conference on Innovation and Technology in Computer Science Education (ITiCSE 2024)

  5. Let's Ask AI About Their Programs: Exploring ChatGPT's Answers To Program Comprehension Questions

    Authors: Teemu Lehtinen, Charles Koutcheme, Arto Hellas

    Abstract: Recent research has explored the creation of questions from code submitted by students. These Questions about Learners' Code (QLCs) are created through program analysis, exploring execution paths, and then creating code comprehension questions from these paths and the broader code structure. Responding to the questions requires reading and tracing the code, which is known to support students' lear…

    Submitted 17 April, 2024; originally announced April 2024.

  6. arXiv:2403.09409  [pdf, ps, other]

    cs.HC cs.AI cs.CL

    "Like a Nesting Doll": Analyzing Recursion Analogies Generated by CS Students using Large Language Models

    Authors: Seth Bernstein, Paul Denny, Juho Leinonen, Lauren Kan, Arto Hellas, Matt Littlefield, Sami Sarsa, Stephen MacNeil

    Abstract: Grasping complex computing concepts often poses a challenge for students who struggle to anchor these new ideas to familiar experiences and understandings. To help with this, a good analogy can bridge the gap between unfamiliar concepts and familiar ones, providing an engaging way to aid understanding. However, creating effective educational analogies is difficult even for experienced instructors.…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 7 pages, 2 figures, ITiCSE 2024 preprint

  7. arXiv:2311.16017  [pdf, other]

    cs.HC cs.AI

    Decoding Logic Errors: A Comparative Study on Bug Detection by Students and Large Language Models

    Authors: Stephen MacNeil, Paul Denny, Andrew Tran, Juho Leinonen, Seth Bernstein, Arto Hellas, Sami Sarsa, Joanne Kim

    Abstract: Identifying and resolving logic errors can be one of the most frustrating challenges for novice programmers. Unlike syntax errors, for which a compiler or interpreter can issue a message, logic errors can be subtle. In certain conditions, buggy code may even exhibit correct behavior -- in other cases, the issue might be about how a problem statement has been interpreted. Such errors can be hard t…

    Submitted 27 November, 2023; originally announced November 2023.

  8. arXiv:2309.05669  [pdf, other]

    cs.HC cs.DC cs.PF cs.SE

    Implications of Edge Computing for Static Site Generation

    Authors: Juho Vepsäläinen, Arto Hellas, Petri Vuorimaa

    Abstract: Static site generation (SSG) is a common technique in the web development space to create performant websites that are easy to host. Numerous SSG tools exist, and the approach has been complemented by newer approaches, such as Jamstack, that extend its usability. Edge computing represents a new option to extend the usefulness of SSG further by allowing the creation of dynamic sites on top of a sta…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 14 pages, 3 figures, 1 table, approved for WEBIST 2023

    ACM Class: D.2.11

  9. arXiv:2309.04188  [pdf, other]

    cs.SE

    The State of Disappearing Frameworks in 2023

    Authors: Juho Vepsäläinen, Arto Hellas, Petri Vuorimaa

    Abstract: Disappearing frameworks represent a new type of thinking for web development. In the current mainstream JavaScript frameworks, the focus has been on developer experience at the cost of user experience. Disappearing frameworks shift the focus by aiming to deliver as little, even zero, JavaScript to the client. In this paper, we look at the options available in the ecosystem in mid-2023 and characte…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 15 pages, 1 figure, 2 tables, approved for WEBIST 2023

    ACM Class: D.2.11

  10. arXiv:2306.10509  [pdf, other]

    cs.HC cs.AI

    Can We Trust AI-Generated Educational Content? Comparative Analysis of Human and AI-Generated Learning Resources

    Authors: Paul Denny, Hassan Khosravi, Arto Hellas, Juho Leinonen, Sami Sarsa

    Abstract: As an increasing number of students move to online learning platforms that deliver personalized learning experiences, there is a great need for the production of high-quality educational content. Large language models (LLMs) appear to offer a promising solution to the rapid creation of learning materials at scale, reducing the burden on instructors. In this study, we investigated the potential for…

    Submitted 3 July, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

  11. arXiv:2306.05715  [pdf, ps, other]

    cs.CY cs.AI cs.CL cs.HC cs.SE

    Exploring the Responses of Large Language Models to Beginner Programmers' Help Requests

    Authors: Arto Hellas, Juho Leinonen, Sami Sarsa, Charles Koutcheme, Lilja Kujanpää, Juha Sorva

    Abstract: Background and Context: Over the past year, large language models (LLMs) have taken the world by storm. In computing education, like in other walks of life, many opportunities and threats have emerged as a consequence. Objectives: In this article, we explore such opportunities and threats in a specific area: responding to student programmers' help requests. More specifically, we assess how good…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 13 pages, 1 figure. To be published in Proceedings of the 2023 ACM Conference on International Computing Education Research V.1 (ICER '23 V1)

  12. arXiv:2306.02608  [pdf, other]

    cs.CY cs.AI cs.HC

    Computing Education in the Era of Generative AI

    Authors: Paul Denny, James Prather, Brett A. Becker, James Finnie-Ansley, Arto Hellas, Juho Leinonen, Andrew Luxton-Reilly, Brent N. Reeves, Eddie Antonio Santos, Sami Sarsa

    Abstract: The computing education community has a rich history of pedagogical innovation designed to support students in introductory courses, and to support teachers in facilitating student learning. Very recent advances in artificial intelligence have resulted in code generation models that can produce source code from natural language problem descriptions -- with impressive accuracy in many cases. The wi…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted for publication as a Contributed Article in Communications of the ACM (CACM)

  13. arXiv:2304.03938  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC cs.SE

    Comparing Code Explanations Created by Students and Large Language Models

    Authors: Juho Leinonen, Paul Denny, Stephen MacNeil, Sami Sarsa, Seth Bernstein, Joanne Kim, Andrew Tran, Arto Hellas

    Abstract: Reasoning about code and explaining its purpose are fundamental skills for computer scientists. There has been extensive research in the field of computing education on the relationship between a student's ability to explain code and other skills such as writing and tracing code. In particular, the ability to describe at a high level of abstraction how code will behave over all possible inputs cor…

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: 8 pages, 3 figures. To be published in Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1

  14. The Rise of Disappearing Frameworks in Web Development

    Authors: Juho Vepsäläinen, Arto Hellas, Petri Vuorimaa

    Abstract: The evolution of the web can be characterized as an emergence of frameworks paving the way from static websites to dynamic web applications. As the scope of web applications has grown, new technical challenges have emerged, leading to the need for new solutions. The latest of these developments is the rise of so-called disappearing web frameworks that question the axioms of earlier generations of…

    Submitted 15 June, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 9 pages, 1 figure, ICWE 2023

    ACM Class: D.2.11

    Journal ref: ICWE 2023. Lecture Notes in Computer Science, vol 13893. Springer, Cham

  15. arXiv:2301.07509  [pdf, other]

    cs.HC cs.CY

    Coverage of Course Topics in Learnersourced SQL Exercises

    Authors: Nea Pirttinen, Arto Hellas, Juho Leinonen

    Abstract: Learnersourcing is a common task in modern computing classrooms, where it is used, for example, for the creation of educational resources such as multiple-choice questions and programming exercises. One less studied type of learnersourced artefact is SQL exercises. In this work, we explore how well different SQL topics are covered by learnersourced SQL exercises. Covering most course topics would…

    Submitted 18 January, 2023; originally announced January 2023.

  16. arXiv:2212.07763  [pdf, ps, other]

    cs.SE cs.HC

    Synthesizing Research on Programmers' Mental Models of Programs, Tasks and Concepts -- a Systematic Literature Review

    Authors: Ava Heinonen, Bettina Lehtelä, Arto Hellas, Fabian Fagerholm

    Abstract: Programmers' mental models represent their knowledge and understanding of programs, programming concepts, and programming in general. They guide programmers' work and influence their task performance. Understanding mental models is important for designing work systems and practices that support programmers. Although the importance of programmers' mental models is widely acknowledged, research on m…

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Submitted to Information and Software Technology

    ACM Class: F.3.2; F.3.3

  17. Automatically Generating CS Learning Materials with Large Language Models

    Authors: Stephen MacNeil, Andrew Tran, Juho Leinonen, Paul Denny, Joanne Kim, Arto Hellas, Seth Bernstein, Sami Sarsa

    Abstract: Recent breakthroughs in Large Language Models (LLMs), such as GPT-3 and Codex, now enable software developers to generate code based on a natural language prompt. Within computer science education, researchers are exploring the potential for LLMs to generate code explanations and programming assignments using carefully crafted prompts. These advances may enable students to interact with code in ne…

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: In Proceedings of the 54th ACM Technical Symposium on Computing Science Education

  18. arXiv:2211.04715  [pdf, other]

    cs.HC

    Robosourcing Educational Resources -- Leveraging Large Language Models for Learnersourcing

    Authors: Paul Denny, Sami Sarsa, Arto Hellas, Juho Leinonen

    Abstract: In this article, we introduce and evaluate the concept of robosourcing for creating educational content. Robosourcing lies in the intersection of crowdsourcing and large language models, where instead of a crowd of humans, requests to large language models replace some of the work traditionally performed by the crowd. Robosourcing includes a human-in-the-loop to provide priming (input) as well as…

    Submitted 9 November, 2022; originally announced November 2022.

  19. arXiv:2211.02265  [pdf, other]

    cs.SE cs.HC

    Experiences from Using Code Explanations Generated by Large Language Models in a Web Software Development E-Book

    Authors: Stephen MacNeil, Andrew Tran, Arto Hellas, Joanne Kim, Sami Sarsa, Paul Denny, Seth Bernstein, Juho Leinonen

    Abstract: Advances in natural language processing have resulted in large language models (LLMs) that are capable of generating understandable and sensible written text. Recent versions of these models, such as OpenAI Codex and GPT-3, can generate code and code explanations. However, it is unclear whether and how students might engage with such explanations. In this paper, we report on our experiences genera…

    Submitted 4 November, 2022; originally announced November 2022.

  20. arXiv:2210.11630  [pdf, ps, other]

    cs.HC cs.AI cs.CL cs.SE

    Using Large Language Models to Enhance Programming Error Messages

    Authors: Juho Leinonen, Arto Hellas, Sami Sarsa, Brent Reeves, Paul Denny, James Prather, Brett A. Becker

    Abstract: A key part of learning to program is learning to understand programming error messages. They can be hard to interpret and identifying the cause of errors can be time-consuming. One factor in this challenge is that the messages are typically intended for an audience that already knows how to program, or even for programming environments that then use the information to highlight areas in code. Rese…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 7 pages, accepted for publication at SIGCSE TS 2023

  21. arXiv:2206.11861  [pdf, other]

    cs.SE cs.AI cs.CL

    Automatic Generation of Programming Exercises and Code Explanations using Large Language Models

    Authors: Sami Sarsa, Paul Denny, Arto Hellas, Juho Leinonen

    Abstract: This article explores the natural language generation capabilities of large language models with application to the production of two types of learning resources common in programming courses. Using OpenAI Codex as the large language model, we create programming exercises (including sample solutions and test cases) and code explanations, assessing these qualitatively and quantitatively. Our result…

    Submitted 26 June, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: 18 pages, 1 figure, accepted in ICER

  22. arXiv:2112.15072  [pdf, other]

    cs.LG

    Empirical Evaluation of Deep Learning Models for Knowledge Tracing: Of Hyperparameters and Metrics on Performance and Replicability

    Authors: Sami Sarsa, Juho Leinonen, Arto Hellas

    Abstract: We review and evaluate a body of deep learning knowledge tracing (DLKT) models with openly available and widely-used data sets, and with a novel data set of students learning to program. The evaluated knowledge tracing models include Vanilla-DKT, two Long Short-Term Memory Deep Knowledge Tracing (LSTM-DKT) variants, two Dynamic Key-Value Memory Network (DKVMN) variants, and Self-Attentive Knowledg…

    Submitted 5 April, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

    Comments: 70 pages, 8 figures, submitted to JEDM, added acknowledgments, modified after first round of review

    ACM Class: K.3; I.2

  23. arXiv:2103.01752  [pdf, other]

    cs.CY

    Morning or Evening? An Examination of Circadian Rhythms of CS1 Students

    Authors: Albina Zavgorodniaia, Raj Shrestha, Juho Leinonen, Arto Hellas, John Edwards

    Abstract: Circadian rhythms are the cycles of our internal clock that play a key role in governing when we sleep and when we are active. A related concept is chronotype, which is a person's natural tendency toward activity at certain times of day and typically governs when the individual is most alert and productive. In this work we investigate chronotypes in the setting of an Introductory Computer Programm…

    Submitted 1 March, 2021; originally announced March 2021.