Skip to main content

Showing 1–23 of 23 results for author: Käser, T

.
  1. arXiv:2406.07420  [pdf, other

    cs.IR

    Graph Reasoning for Explainable Cold Start Recommendation

    Authors: Jibril Frej, Marta Knezevic, Tanja Kaser

    Abstract: The cold start problem, where new users or items have no interaction history, remains a critical challenge in recommender systems (RS). A common solution involves using Knowledge Graphs (KG) to train entity embeddings or Graph Neural Networks (GNNs). Since KGs incorporate auxiliary data and not just user/item interactions, these methods can make relevant recommendations for cold users or items. Gr… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    ACM Class: H.3.3

  2. arXiv:2405.20079  [pdf, other

    cs.CL cs.CY cs.LG

    Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning

    Authors: Elena Grazia Gado, Tommaso Martorella, Luca Zunino, Paola Mejia-Domenzain, Vinitra Swamy, Jibril Frej, Tanja Käser

    Abstract: Intelligent Tutoring Systems (ITS) enhance personalized learning by predicting student answers to provide immediate and customized instruction. However, recent research has primarily focused on the correctness of the answer rather than the student's performance on specific answer choices, limiting insights into students' thought processes and potential misconceptions. To address this gap, we prese… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA

  3. arXiv:2404.18978  [pdf, other

    cs.LG cs.AI cs.CY

    Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs

    Authors: Bahar Radmehr, Adish Singla, Tanja Käser

    Abstract: There has been a growing interest in develo** learner models to enhance learning and teaching experiences in educational environments. However, existing works have primarily focused on structured environments relying on meticulously crafted representations of tasks, thereby limiting the agent's ability to generalize skills across tasks. In this paper, we aim to enhance the generalization capabil… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted as a full paper at EDM 2024: The 17th International Conference on Educational Data Mining, 14-17 of July 2024, Atlanta

  4. Course Recommender Systems Need to Consider the Job Market

    Authors: Jibril Frej, Anna Dai, Syrielle Montariol, Antoine Bosselut, Tanja Käser

    Abstract: Current course recommender systems primarily leverage learner-course interactions, course content, learner preferences, and supplementary course details like instructor, institution, ratings, and reviews, to make their recommendation. However, these systems often overlook a critical aspect: the evolving skill demand of the job market. This paper focuses on the perspective of academic researchers,… ▽ More

    Submitted 1 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: accepted at SIGIR 2024 as a perspective paper. Camera Ready will come soon

    ACM Class: H.3.3

  5. arXiv:2403.14661  [pdf, other

    cs.CY cs.CL cs.LG

    Towards Modeling Learner Performance with Large Language Models

    Authors: Seyed Parsa Neshaei, Richard Lee Davis, Adam Hazimeh, Bojan Lazarevski, Pierre Dillenbourg, Tanja Käser

    Abstract: Recent work exploring the capabilities of pre-trained large language models (LLMs) has demonstrated their ability to act as general pattern machines by completing complex token sequences representing a wide array of tasks, including time-series prediction and robot control. This paper investigates whether the pattern recognition and sequence modeling capabilities of LLMs can be extended to the dom… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 12 pages, 4 figures

  6. arXiv:2402.02933  [pdf, other

    cs.LG cs.CY cs.HC

    InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts

    Authors: Vinitra Swamy, Syrielle Montariol, Julian Blackwell, Jibril Frej, Martin Jaggi, Tanja Käser

    Abstract: Interpretability for neural networks is a trade-off between three key requirements: 1) faithfulness of the explanation (i.e., how perfectly it explains the prediction), 2) understandability of the explanation by humans, and 3) model performance. Most existing methods compromise one or more of these requirements; e.g., post-hoc approaches provide limited faithfulness, automatically identified featu… ▽ More

    Submitted 29 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2402.01580  [pdf, other

    cs.CY cs.AI

    Generative AI for Education (GAIED): Advances, Opportunities, and Challenges

    Authors: Paul Denny, Sumit Gulwani, Neil T. Heffernan, Tanja Käser, Steven Moore, Anna N. Rafferty, Adish Singla

    Abstract: This survey article has grown out of the GAIED (pronounced "guide") workshop organized by the authors at the NeurIPS 2023 conference. We organized the GAIED workshop as part of a community-building effort to bring together researchers, educators, and practitioners to explore the potential of generative AI for enhancing education. This article aims to provide an overview of the workshop activities… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  8. Finding Paths for Explainable MOOC Recommendation: A Learner Perspective

    Authors: Jibril Frej, Neel Shah, Marta Knežević, Tanya Nazaretsky, Tanja Käser

    Abstract: The increasing availability of Massive Open Online Courses (MOOCs) has created a necessity for personalized course recommendation systems. These systems often combine neural networks with Knowledge Graphs (KGs) to achieve richer representations of learners and courses. While these enriched representations allow more accurate and personalized recommendations, explainability remains a significant ch… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  9. arXiv:2311.03311  [pdf, other

    cs.CL cs.CY

    Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance

    Authors: Thiemo Wambsganss, Xiaotian Su, Vinitra Swamy, Seyed Parsa Neshaei, Roman Rietsche, Tanja Käser

    Abstract: Large Language Models (LLMs) are increasingly utilized in educational tasks such as providing writing suggestions to students. Despite their potential, LLMs are known to harbor inherent biases which may negatively impact learners. Previous studies have investigated bias in models and data representations separately, neglecting the potential impact of LLM bias on human writing. In this paper, we in… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted as a full paper at EMNLP Findings 2023

  10. arXiv:2309.14118  [pdf, other

    cs.LG

    MultiModN- Multimodal, Multi-Task, Interpretable Modular Networks

    Authors: Vinitra Swamy, Malika Satayeva, Jibril Frej, Thierry Bossy, Thijs Vogels, Martin Jaggi, Tanja Käser, Mary-Anne Hartley

    Abstract: Predicting multiple real-world tasks in a single model often requires a particularly diverse feature space. Multimodal (MM) models aim to extract the synergistic predictive potential of multiple data types to create a shared feature space with aligned semantic meaning across inputs of drastically varying sizes (i.e. images, text, sound). Most current MM architectures fuse these representations in… ▽ More

    Submitted 6 November, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted as a full paper at NeurIPS 2023 in New Orleans, USA

  11. arXiv:2307.00364  [pdf, other

    cs.LG cs.AI cs.CY cs.HC

    The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations

    Authors: Vinitra Swamy, Jibril Frej, Tanja Käser

    Abstract: Explainable Artificial Intelligence (XAI) plays a crucial role in enabling human understanding and trust in deep learning systems. As models get larger, more ubiquitous, and pervasive in aspects of daily life, explainability is necessary to minimize adverse effects of model mistakes. Unfortunately, current approaches in human-centric XAI (e.g. predictive tasks in healthcare, education, or personal… ▽ More

    Submitted 28 May, 2024; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: Viewpoint paper, under review at JAIR

  12. arXiv:2307.00279  [pdf, other

    cs.CL

    Let Me Teach You: Pedagogical Foundations of Feedback for Language Models

    Authors: Beatriz Borges, Niket Tandon, Tanja Käser, Antoine Bosselut

    Abstract: Natural Language Feedback (NLF) is an increasingly popular mechanism for aligning Large Language Models (LLMs) to human preferences. Despite the diversity of the information it can convey, NLF methods are often hand-designed and arbitrary, with little systematic grounding. At the same time, research in learning sciences has long established several effective feedback models. In this opinion piece,… ▽ More

    Submitted 18 June, 2024; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: 8 pages, 2 figures

  13. Understanding Revision Behavior in Adaptive Writing Support Systems for Education

    Authors: Luca Mouchel, Thiemo Wambsganss, Paola Mejia-Domenzain, Tanja Käser

    Abstract: Revision behavior in adaptive writing support systems is an important and relatively new area of research that can improve the design and effectiveness of these tools, and promote students' self-regulated learning (SRL). Understanding how these tools are used is key to improving them to better support learners in their writing and learning processes. In this paper, we present a novel pipeline with… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: 8 pages, Conference Paper

  14. arXiv:2305.16851  [pdf, other

    cs.HC cs.CY

    Visualizing Self-Regulated Learner Profiles in Dashboards: Design Insights from Teachers

    Authors: Paola Mejia-Domenzain, Eva Laini, Seyed Parsa Neshaei, Thiemo Wambsganss, Tanja Käser

    Abstract: Flipped Classrooms (FC) are a promising teaching strategy, where students engage with the learning material before attending face-to-face sessions. While pre-class activities are critical for course success, many students struggle to engage effectively in them due to inadequate of self-regulated learning (SRL) skills. Thus, tools enabling teachers to monitor students' SRL and provide personalized… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted as a poster paper at AIED 2023: The 24th International Conference on Artificial Intelligence in Education, 3-7 of July 2023, Tokyo

  15. Protected Attributes Tell Us Who, Behavior Tells Us How: A Comparison of Demographic and Behavioral Oversampling for Fair Student Success Modeling

    Authors: Jade Maï Cock, Muhammad Bilal, Richard Davis, Mirko Marras, Tanja Käser

    Abstract: Algorithms deployed in education can shape the learning experience and success of a student. It is therefore important to understand whether and how such algorithms might create inequalities or amplify existing biases. In this paper, we analyze the fairness of models which use behavioral data to identify at-risk students and suggest two novel pre-processing approaches for bias mitigation. Based on… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted as a full paper at LAK 2023: The 13th International Learning Analytics and Knowledge Conference, 13-17 of March 2023, Arlington

  16. arXiv:2212.08955  [pdf, other

    cs.CY cs.HC cs.LG

    Trusting the Explainers: Teacher Validation of Explainable Artificial Intelligence for Course Design

    Authors: Vinitra Swamy, Sijia Du, Mirko Marras, Tanja Käser

    Abstract: Deep learning models for learning analytics have become increasingly popular over the last few years; however, these approaches are still not widely adopted in real-world settings, likely due to a lack of trust and transparency. In this paper, we tackle this issue by implementing explainable AI methods for black-box neural networks. This work focuses on the context of online and blended learning a… ▽ More

    Submitted 6 March, 2023; v1 submitted 17 December, 2022; originally announced December 2022.

    Comments: Accepted as a full paper (Best Paper nominee) at LAK 2023: The 13th International Learning Analytics and Knowledge Conference, March 13-17, 2023, Arlington, Texas, USA

  17. Do Not Trust a Model Because It is Confident: Uncovering and Characterizing Unknown Unknowns to Student Success Predictors in Online-Based Learning

    Authors: Roberta Galici, Tanja Käser, Gianni Fenu, Mirko Marras

    Abstract: Student success models might be prone to develop weak spots, i.e., examples hard to accurately classify due to insufficient representation during model creation. This weakness is one of the main factors undermining users' trust, since model predictions could for instance lead an instructor to not intervene on a student in need. In this paper, we unveil the need of detecting and characterizing unkn… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: Accepted as a full paper at the International Conference on Learning Analytics & Knowledge (LAK23)

  18. arXiv:2212.01133  [pdf, other

    cs.LG cs.CY

    RIPPLE: Concept-Based Interpretation for Raw Time Series Models in Education

    Authors: Mohammad Asadi, Vinitra Swamy, Jibril Frej, Julien Vignoud, Mirko Marras, Tanja Käser

    Abstract: Time series is the most prevalent form of input data for educational prediction tasks. The vast majority of research using time series data focuses on hand-crafted features, designed by experts for predictive performance and interpretability. However, extracting these features is labor-intensive for humans and computers. In this paper, we propose an approach that utilizes irregular multivariate ti… ▽ More

    Submitted 28 February, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted as a full paper at AAAI 2023: 37th AAAI Conference on Artificial Intelligence (EAAI: AI for Education Special Track), 7-14 of February 2023, Washington DC, USA

  19. arXiv:2209.10335  [pdf, other

    cs.CL cs.CY

    Bias at a Second Glance: A Deep Dive into Bias for German Educational Peer-Review Data Modeling

    Authors: Thiemo Wambsganss, Vinitra Swamy, Roman Rietsche, Tanja Käser

    Abstract: Natural Language Processing (NLP) has become increasingly utilized to provide adaptivity in educational applications. However, recent research has highlighted a variety of biases in pre-trained language models. While existing studies investigate bias in different domains, they are limited in addressing fine-grained analysis on educational and multilingual corpora. In this work, we analyze bias acr… ▽ More

    Submitted 22 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted as a full paper at COLING 2022: The 29th International Conference on Computational Linguistics, 12-17 of October 2022, Gyeongju, Republic of Korea

  20. arXiv:2207.01457  [pdf, other

    cs.CY cs.LG

    Generalisable Methods for Early Prediction in Interactive Simulations for Education

    Authors: Jade Maï Cock, Mirko Marras, Christian Giang, Tanja Käser

    Abstract: Interactive simulations allow students to discover the underlying principles of a scientific phenomenon through their own exploration. Unfortunately, students often struggle to learn effectively in these environments. Classifying students' interaction data in the simulations based on their expected performance has the potential to enable adaptive guidance and consequently improve students' learnin… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted as a full paper at EDM 2022: The 15th International Conference on Educational Data Mining, 24 -27 of July 2022, Durham

  21. arXiv:2207.00551  [pdf, other

    cs.LG cs.CY

    Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs

    Authors: Vinitra Swamy, Bahar Radmehr, Natasa Krco, Mirko Marras, Tanja Käser

    Abstract: Neural networks are ubiquitous in applied machine learning for education. Their pervasive success in predictive performance comes alongside a severe weakness, the lack of explainability of their decisions, especially relevant in human-centric fields. We implement five state-of-the-art methodologies for explaining black-box machine learning models (LIME, PermutationSHAP, KernelSHAP, DiCE, CEM) and… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted as a full paper at EDM 2022: The 15th International Conference on Educational Data Mining, 24-27 of July 2022, Durham

  22. arXiv:2205.01064  [pdf, other

    cs.CY cs.LG

    Meta Transfer Learning for Early Success Prediction in MOOCs

    Authors: Vinitra Swamy, Mirko Marras, Tanja Käser

    Abstract: Despite the increasing popularity of massive open online courses (MOOCs), many suffer from high dropout and low success rates. Early prediction of student success for targeted intervention is therefore essential to ensure no student is left behind in a course. There exists a large body of research in success prediction for MOOCs, focusing mainly on training models from scratch for individual cours… ▽ More

    Submitted 25 April, 2022; originally announced May 2022.

    Comments: Accepted at the 2022 ACM Conference on Learning at Scale (L@S 2022)

  23. arXiv:1806.03257  [pdf, other

    cs.CY cs.HC

    Ten Years of Research on Intelligent Educational Games for Learning Spelling and Mathematics

    Authors: Barbara Solenthaler, Severin Klingler, Tanja Käser, Markus Gross

    Abstract: In this article, we present our findings from ten years of research on intelligent educational games. We discuss the architecture of our training environments for learning spelling and mathematics, and specifically focus on the representation of the content and the controller that enables personalized trainings. We first show the multi-modal representation that reroutes information through multipl… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.