Skip to main content

Showing 1–31 of 31 results for author: Piech, C

.
  1. arXiv:2406.03030  [pdf, other

    cs.CL cs.LG

    From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation

    Authors: Ali Malik, Stephen Mayhew, Chris Piech, Klinton Bicknell

    Abstract: We study the problem of controlling the difficulty level of text generated by Large Language Models (LLMs) for contexts where end-users are not fully proficient, such as language learners. Using a novel framework, we evaluate the effectiveness of several key approaches for this task, including few-shot prompting, supervised finetuning, and reinforcement learning (RL), utilising both GPT-4 and open… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Journal ref: In Findings of the Association for Computational Linguistics (ACL 2024)

  2. TeachNow: Enabling Teachers to Provide Spontaneous, Realtime 1:1 Help in Massive Online Courses

    Authors: Ali Malik, Juliette Woodrow, Chao Wang, Chris Piech

    Abstract: One-on-one help from a teacher is highly impactful for students, yet extremely challenging to support in massive online courses (MOOCs). In this work, we present TeachNow: a novel system that lets volunteer teachers from anywhere in the world instantly provide 1:1 help sessions to students in MOOCs, without any scheduling or coordination overhead. TeachNow works by quickly finding an online studen… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Journal ref: In Proceedings of the 2024 Innovation and Technology in Computer Science Education (ITiCSE 2024)

  3. AI Teaches the Art of Elegant Coding: Timely, Fair, and Helpful Style Feedback in a Global Course

    Authors: Juliette Woodrow, Ali Malik, Chris Piech

    Abstract: Teaching students how to write code that is elegant, reusable, and comprehensible is a fundamental part of CS1 education. However, providing this "style feedback" in a timely manner has proven difficult to scale. In this paper, we present our experience deploying a novel, real-time style feedback tool in Code in Place, a large-scale online CS1 course. Our tool is based on the latest breakthroughs… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Journal ref: Proceedings of the 55th ACM Technical Symposium on Computer Science Education (SIGCSE); March 2024 (1442-1448)

  4. Learners Teaching Novices: An Uplifting Alternative Assessment

    Authors: Ali Malik, Juliette Woodrow, Chris Piech

    Abstract: We propose and carry-out a novel method of formative assessment called Assessment via Teaching (AVT), in which learners demonstrate their understanding of CS1 topics by tutoring more novice students. AVT has powerful benefits over traditional forms of assessment: it is centered around service to others and is highly rewarding for the learners who teach. Moreover, teaching greatly improves the lear… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Journal ref: Proceedings of the 55th ACM Technical Symposium on Computer Science Education (SIGCSE); March 2024 (785-791)

  5. arXiv:2403.14637  [pdf, other

    cs.CY

    SimGrade: Using Code Similarity Measures for More Accurate Human Grading

    Authors: Sonja Johnson-Yu, Nicholas Bowman, Mehran Sahami, Chris Piech

    Abstract: While the use of programming problems on exams is a common form of summative assessment in CS courses, grading such exam problems can be a difficult and inconsistent process. Through an analysis of historical grading patterns we show that inaccurate and inconsistent grading of free-response programming problems is widespread in CS1 courses. These inconsistencies necessitate the development of meth… ▽ More

    Submitted 19 February, 2024; originally announced March 2024.

    Comments: Educational Data Mining 2021

  6. arXiv:2311.08594  [pdf, other

    cs.LG stat.ML

    Variational Temporal IRT: Fast, Accurate, and Explainable Inference of Dynamic Learner Proficiency

    Authors: Yunsung Kim, Sreechan Sankaranarayanan, Chris Piech, Candace Thille

    Abstract: Dynamic Item Response Models extend the standard Item Response Theory (IRT) to capture temporal dynamics in learner ability. While these models have the potential to allow instructional systems to actively monitor the evolution of learner proficiency in real time, existing dynamic item response models rely on expensive inference algorithms that scale poorly to massive datasets. In this work, we pr… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 9 pages, 16th International Conference on Educational Data Mining (EDM'23)

  7. arXiv:2310.19677  [pdf, other

    cs.CL

    MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

    Authors: Allen Nie, Yuhui Zhang, Atharva Amdekar, Chris Piech, Tatsunori Hashimoto, Tobias Gerstenberg

    Abstract: Human commonsense understanding of the physical and social world is organized around intuitive theories. These theories support making causal and moral judgments. When something bad happens, we naturally ask: who did what, and why? A rich literature in cognitive science has studied people's causal and moral intuitions. This work has revealed a number of factors that systematically influence people… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 34 pages, 7 figures. NeurIPS 2023

  8. arXiv:2310.18844  [pdf, other

    cs.LG cs.AI

    BanditPAM++: Faster $k$-medoids Clustering

    Authors: Mo Tiwari, Ryan Kang, Donghyun Lee, Sebastian Thrun, Chris Piech, Ilan Shomorony, Martin **ye Zhang

    Abstract: Clustering is a fundamental task in data science with wide-ranging applications. In $k$-medoids clustering, cluster centers must be actual datapoints and arbitrary distance metrics may be used; these features allow for greater interpretability of the cluster centers and the clustering of exotic objects in $k$-medoids clustering, respectively. $k$-medoids clustering has recently grown in popularity… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

    MSC Class: 68 ACM Class: I.m; I.2.0; I.2.6; K.3.2; I.2.m

  9. arXiv:2310.15612  [pdf, other

    cs.CL cs.CY cs.HC cs.LG

    Machine Translation for Nko: Tools, Corpora and Baseline Results

    Authors: Moussa Koulako Bala Doumbouya, Baba Mamadi Diané, Solo Farabado Cissé, Djibrila Diané, Abdoulaye Sow, Séré Moussa Doumbouya, Daouda Bangoura, Fodé Moriba Bayo, Ibrahima Sory 2. Condé, Kalo Mory Diané, Chris Piech, Christopher Manning

    Abstract: Currently, there is no usable machine translation system for Nko, a language spoken by tens of millions of people across multiple West African countries, which holds significant cultural and educational value. To address this issue, we present a set of tools, resources, and baseline results aimed towards the development of usable machine translation systems for Nko and other languages that do no… ▽ More

    Submitted 15 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    ACM Class: I.2.6; I.2.7

  10. arXiv:2306.06941  [pdf, other

    cs.CL

    The BEA 2023 Shared Task on Generating AI Teacher Responses in Educational Dialogues

    Authors: Anaïs Tack, Ekaterina Kochmar, Zheng Yuan, Serge Bibauw, Chris Piech

    Abstract: This paper describes the results of the first shared task on the generation of teacher responses in educational dialogues. The goal of the task was to benchmark the ability of generative language models to act as AI teachers, replying to a student in a teacher-student dialogue. Eight teams participated in the competition hosted on CodaLab. They experimented with a wide variety of state-of-the-art… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: to appear in the Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications, ACL 2023, Toronto, Canada

    ACM Class: I.2.7; K.3

  11. arXiv:2302.07407  [pdf, ps, other

    cs.LG cs.AI

    Bayesian Decision Trees via Tractable Priors and Probabilistic Context-Free Grammars

    Authors: Colin Sullivan, Mo Tiwari, Sebastian Thrun, Chris Piech

    Abstract: Decision Trees are some of the most popular machine learning models today due to their out-of-the-box performance and interpretability. Often, Decision Trees models are constructed greedily in a top-down fashion via heuristic search criteria, such as Gini impurity or entropy. However, trees constructed in this manner are sensitive to minor fluctuations in training data and are prone to overfitting… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 10 pages, 1 figure

    ACM Class: I.2.m; I.2.6; I.2.0

  12. arXiv:2212.07551  [pdf, ps, other

    cs.LG cs.AI

    Faster Maximum Inner Product Search in High Dimensions

    Authors: Mo Tiwari, Ryan Kang, Je-Yong Lee, Donghyun Lee, Chris Piech, Sebastian Thrun, Ilan Shomorony, Martin **ye Zhang

    Abstract: Maximum Inner Product Search (MIPS) is a ubiquitous task in machine learning applications such as recommendation systems. Given a query vector and $n$ atom vectors in $d$-dimensional space, the goal of MIPS is to find the atom that has the highest inner product with the query vector. Existing MIPS algorithms scale at least as $O(\sqrt{d})$, which becomes computationally prohibitive in high-dimensi… ▽ More

    Submitted 26 June, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: 24 pages

  13. arXiv:2212.07473  [pdf, ps, other

    cs.LG cs.DS

    MABSplit: Faster Forest Training Using Multi-Armed Bandits

    Authors: Mo Tiwari, Ryan Kang, Je-Yong Lee, Sebastian Thrun, Chris Piech, Ilan Shomorony, Martin **ye Zhang

    Abstract: Random forests are some of the most widely used machine learning models today, especially in domains that necessitate interpretability. We present an algorithm that accelerates the training of random forests and other popular tree-based learning methods. At the core of our algorithm is a novel node-splitting subroutine, dubbed MABSplit, used to efficiently find split points when constructing decis… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Published at NeurIPS 2022, 30 pages

    ACM Class: I.2.8

  14. arXiv:2211.08802  [pdf, other

    cs.LG cs.AI stat.ML

    Giving Feedback on Interactive Student Programs with Meta-Exploration

    Authors: Evan Zheran Liu, Moritz Stephan, Allen Nie, Chris Piech, Emma Brunskill, Chelsea Finn

    Abstract: Develo** interactive software, such as websites or games, is a particularly engaging way to learn computer science. However, teaching and giving feedback on such software is time-consuming -- standard approaches require instructors to manually grade student-implemented interactive programs. As a result, online platforms that serve millions, like Code.org, are unable to provide any feedback on as… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2022). Selected as Oral

  15. arXiv:2205.07540  [pdf, other

    cs.CL cs.AI

    The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

    Authors: Anaïs Tack, Chris Piech

    Abstract: How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports on a first attempt at an AI teacher test. We built a… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: to be published in the Proceedings of the 15th International Conference on Educational Data Mining; 8 pages, 5 figures, 3 tables

    ACM Class: I.2.7; K.3

  16. arXiv:2110.14615  [pdf, other

    cs.AI cs.CY cs.LG

    Play to Grade: Testing Coding Games as Classifying Markov Decision Process

    Authors: Allen Nie, Emma Brunskill, Chris Piech

    Abstract: Contemporary coding education often presents students with the task of develo** programs that have user interaction and complex dynamic systems, such as mouse based games. While pedagogically compelling, there are no contemporary autonomous methods for providing feedback. Notably, interactive programs are impossible to grade by traditional unit tests. In this paper we formalize the challenge of… ▽ More

    Submitted 14 December, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021, 16 pages, 7 figures

  17. arXiv:2108.11579  [pdf, other

    cs.LG stat.ML

    Modeling Item Response Theory with Stochastic Variational Inference

    Authors: Mike Wu, Richard L. Davis, Benjamin W. Domingue, Chris Piech, Noah Goodman

    Abstract: Item Response Theory (IRT) is a ubiquitous model for understanding human behaviors and attitudes based on their responses to questions. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving psychometric modeling leading to improved scientific understanding and public policy. However, while larger datasets allow for more flexible approaches, many… ▽ More

    Submitted 28 July, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: version two includes added experiments; 33 pages of content; 6 pages appendix; figures at the bottom. arXiv admin note: text overlap with arXiv:2002.00276

  18. arXiv:2108.07258  [pdf, other

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh , et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap… ▽ More

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  19. arXiv:2107.14035  [pdf, other

    cs.CY cs.LG

    ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback

    Authors: Mike Wu, Noah Goodman, Chris Piech, Chelsea Finn

    Abstract: High-quality computer science education is limited by the difficulty of providing instructor feedback to students at scale. While this feedback could in principle be automated, supervised approaches to predicting the correct feedback are bottlenecked by the intractability of annotating large quantities of student code. In this paper, we instead frame the problem of providing feedback as few-shot c… ▽ More

    Submitted 4 October, 2021; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: 9 pages content; 6 pages supplement

  20. arXiv:2104.13083  [pdf, other

    cs.LG cs.AI

    Using Radio Archives for Low-Resource Speech Recognition: Towards an Intelligent Virtual Assistant for Illiterate Users

    Authors: Moussa Doumbouya, Lisa Einstein, Chris Piech

    Abstract: For many of the 700 million illiterate people around the world, speech recognition technology could provide a bridge to valuable information and services. Yet, those most in need of this technology are often the most underserved by it. In many countries, illiterate people tend to speak only low-resource languages, for which the datasets necessary for speech technology development are scarce. In th… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  21. arXiv:2006.06856  [pdf, other

    cs.LG cs.AI stat.ML

    BanditPAM: Almost Linear Time $k$-Medoids Clustering via Multi-Armed Bandits

    Authors: Mo Tiwari, Martin **ye Zhang, James Mayclin, Sebastian Thrun, Chris Piech, Ilan Shomorony

    Abstract: Clustering is a ubiquitous task in data science. Compared to the commonly used $k$-means clustering, $k$-medoids clustering requires the cluster centers to be actual data points and support arbitrary distance metrics, which permits greater interpretability and the clustering of structured objects. Current state-of-the-art $k$-medoids clustering algorithms, such as Partitioning Around Medoids (PAM)… ▽ More

    Submitted 6 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 21 pages, NeurIPS 2020

  22. arXiv:2002.00276  [pdf, other

    cs.LG stat.ML

    Variational Item Response Theory: Fast, Accurate, and Expressive

    Authors: Mike Wu, Richard L. Davis, Benjamin W. Domingue, Chris Piech, Noah Goodman

    Abstract: Item Response Theory (IRT) is a ubiquitous model for understanding humans based on their responses to questions, used in fields as diverse as education, medicine and psychology. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving test scoring and better informing public policy. Yet larger datasets pose a difficult speed / accuracy challenge to… ▽ More

    Submitted 16 March, 2020; v1 submitted 1 February, 2020; originally announced February 2020.

    Comments: 10 pages of content

  23. arXiv:1909.04556  [pdf, other

    cs.CL

    Human Languages in Source Code: Auto-Translation for Localized Instruction

    Authors: Chris Piech, Sami Abu-El-Haija

    Abstract: Computer science education has promised open access around the world, but access is largely determined by what human language you speak. As younger students learn computer science it is less appropriate to assume that they should learn English beforehand. To that end we present CodeInternational, the first tool to translate code between human languages. To develop a theory of non-English code, and… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

  24. arXiv:1906.01811  [pdf, other

    cs.AI stat.AP

    The Stanford Acuity Test: A Precise Vision Test Using Bayesian Techniques and a Discovery in Human Visual Response

    Authors: Chris Piech, Ali Malik, Laura M Scott, Robert T Chang, Charles Lin

    Abstract: Chart-based visual acuity measurements are used by billions of people to diagnose and guide treatment of vision impairment. However, the ubiquitous eye exam has no mechanism for reasoning about uncertainty and as such, suffers from a well-documented reproducibility problem. In this paper we make two core contributions. First, we uncover a new parametric probabilistic model of visual acuity respons… ▽ More

    Submitted 21 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 34th AAAI Conference on Artificial Intelligence, New York, USA. 2020

  25. arXiv:1905.13383  [pdf, other

    cs.LG stat.ML

    Using Latent Variable Models to Observe Academic Pathways

    Authors: Nate Gruver, Ali Malik, Brahm Capoor, Chris Piech, Mitchell L. Stevens, Andreas Paepcke

    Abstract: Understanding large-scale patterns in student course enrollment is a problem of great interest to university administrators and educational researchers. Yet important decisions are often made without a good quantitative framework of the process underlying student choices. We propose a probabilistic approach to modelling course enrollment decisions, drawing inspiration from multilabel classificatio… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: Twelfth International Conference on Educational Data Mining

  26. arXiv:1905.09916  [pdf, other

    cs.LG cs.CY stat.ML

    Generative Grading: Near Human-level Accuracy for Automated Feedback on Richly Structured Problems

    Authors: Ali Malik, Mike Wu, Vrinda Vasavada, **peng Song, Madison Coots, John Mitchell, Noah Goodman, Chris Piech

    Abstract: Access to high-quality education at scale is limited by the difficulty of providing student feedback on open-ended assignments in structured domains like computer programming, graphics, and short response questions. This problem has proven to be exceptionally difficult: for humans, it requires large amounts of manual work, and for computers, until recently, achieving anything near human-level accu… ▽ More

    Submitted 23 March, 2021; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: 10 pages of content

  27. arXiv:1809.01357  [pdf, other

    cs.LG cs.CY stat.ML

    Zero Shot Learning for Code Education: Rubric Sampling with Deep Learning Inference

    Authors: Mike Wu, Milan Mosse, Noah Goodman, Chris Piech

    Abstract: In modern computer science education, massive open online courses (MOOCs) log thousands of hours of data about how students solve coding challenges. Being so rich in data, these platforms have garnered the interest of the machine learning community, with many new algorithms attempting to autonomously provide feedback to help future students learn. But what about those first hundred thousand studen… ▽ More

    Submitted 16 December, 2018; v1 submitted 5 September, 2018; originally announced September 2018.

    Comments: To appear at AAAI 2019; 9 pages

  28. arXiv:1807.00199  [pdf, other

    cs.LG stat.ML

    Achieving Fairness through Adversarial Learning: an Application to Recidivism Prediction

    Authors: Christina Wadsworth, Francesca Vera, Chris Piech

    Abstract: Recidivism prediction scores are used across the USA to determine sentencing and supervision for hundreds of thousands of inmates. One such generator of recidivism prediction scores is Northpointe's Correctional Offender Management Profiling for Alternative Sanctions (COMPAS) score, used in states like California and Florida, which past research has shown to be biased against black inmates accordi… ▽ More

    Submitted 30 June, 2018; originally announced July 2018.

    Comments: To be published in FAT/ML, 2018, Stockholm, Sweden

  29. arXiv:1506.05908  [pdf, other

    cs.AI cs.CY cs.LG

    Deep Knowledge Tracing

    Authors: Chris Piech, Jonathan Spencer, Jonathan Huang, Surya Ganguli, Mehran Sahami, Leonidas Guibas, Jascha Sohl-Dickstein

    Abstract: Knowledge tracing---where a machine models the knowledge of a student as they interact with coursework---is a well established problem in computer supported education. Though effectively modeling student knowledge would have high educational impact, the task has many inherent challenges. In this paper we explore the utility of using Recurrent Neural Networks (RNNs) to model student learning. The R… ▽ More

    Submitted 19 June, 2015; originally announced June 2015.

    ACM Class: K.3.1

  30. arXiv:1505.05969  [pdf, other

    cs.LG cs.NE cs.SE

    Learning Program Embeddings to Propagate Feedback on Student Code

    Authors: Chris Piech, Jonathan Huang, Andy Nguyen, Mike Phulsuksombati, Mehran Sahami, Leonidas Guibas

    Abstract: Providing feedback, both assessing final work and giving hints to stuck students, is difficult for open-ended assignments in massive online classes which can range from thousands to millions of students. We introduce a neural network method to encode programs as a linear map** from an embedded precondition space to an embedded postcondition space and propose an algorithm for feedback at scale us… ▽ More

    Submitted 22 May, 2015; originally announced May 2015.

    Comments: Accepted to International Conference on Machine Learning (ICML 2015)

  31. arXiv:1307.2579  [pdf, other

    cs.LG cs.AI cs.HC stat.AP stat.ML

    Tuned Models of Peer Assessment in MOOCs

    Authors: Chris Piech, Jonathan Huang, Zhenghao Chen, Chuong Do, Andrew Ng, Daphne Koller

    Abstract: In massive open online courses (MOOCs), peer grading serves as a critical tool for scaling the grading of complex, open-ended assignments to courses with tens or hundreds of thousands of students. But despite promising initial trials, it does not always deliver accurate results compared to human experts. In this paper, we develop algorithms for estimating and correcting for grader biases and relia… ▽ More

    Submitted 9 July, 2013; originally announced July 2013.

    Comments: Proceedings of The 6th International Conference on Educational Data Mining (EDM 2013)