Showing 1–22 of 22 results for author: Serban, I V

Searching in archive cs.
  1. arXiv:2207.14003  [pdf, other]

    cs.CL cs.AI cs.CY cs.HC cs.LG

    Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits

    Authors: Robert Belfer, Ekaterina Kochmar, Iulian Vlad Serban

    Abstract: We present an adaptive learning Intelligent Tutoring System, which uses model-based reinforcement learning in the form of contextual bandits to assign learning activities to students. The model is trained on the trajectories of thousands of students in order to maximize their exercise completion rates and continues to learn online, automatically adjusting itself to new activities. A randomized con…

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: 6 pages, 1 figure, To appear in the Proceedings of the 23rd International Conference on Artificial Intelligence in Education (AIED 2022)

    ACM Class: I.2.6; I.2.7; K.3.1; K.3.2

  2. arXiv:2206.04187  [pdf, other]

    cs.CL

    Few-shot Question Generation for Personalized Feedback in Intelligent Tutoring Systems

    Authors: Devang Kulshreshtha, Muhammad Shayan, Robert Belfer, Siva Reddy, Iulian Vlad Serban, Ekaterina Kochmar

    Abstract: Existing work on generating hints in Intelligent Tutoring Systems (ITS) focuses mostly on manual and non-personalized feedback. In this work, we explore automatically generated questions as personalized feedback in an ITS. Our personalized feedback can pinpoint correct and incorrect or missing phrases in student answers as well as guide them towards correct answer by asking a question in natural l…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: PAIS 2022

  3. arXiv:2203.03724  [pdf, other]

    cs.CY cs.AI cs.HC cs.LG

    A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions

    Authors: Francois St-Hilaire, Dung Do Vu, Antoine Frau, Nathan Burns, Farid Faraji, Joseph Potochny, Stephane Robert, Arnaud Roussel, Selene Zheng, Taylor Glazier, Junfel Vincent Romano, Robert Belfer, Muhammad Shayan, Ariella Smofsky, Tommy Delarosbil, Seulmin Ahn, Simon Eden-Walker, Kritika Sony, Ansona Onyi Ching, Sabina Elkins, Anush Stepanyan, Adela Matajova, Victor Chen, Hossein Sahraei, Robert Larson, et al. (6 additional authors not shown)

    Abstract: Despite artificial intelligence (AI) having transformed major aspects of our society, less than a fraction of its potential has been explored, let alone deployed, for education. AI-powered learning can provide millions of learners with a highly personalized, active and practical learning experience, which is key to successful learning. This is especially relevant in the context of online learning…

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: 9 pages, 6 figures

    ACM Class: I.2.0; K.3.1; K.4.0

  4. arXiv:2104.08801  [pdf, other]

    cs.CL cs.AI cs.LG

    Back-Training excels Self-Training at Unsupervised Domain Adaptation of Question Generation and Passage Retrieval

    Authors: Devang Kulshreshtha, Robert Belfer, Iulian Vlad Serban, Siva Reddy

    Abstract: In this work, we introduce back-training, an alternative to self-training for unsupervised domain adaptation (UDA) from source to target domain. While self-training generates synthetic training data where natural inputs are aligned with noisy outputs, back-training results in natural outputs aligned with noisy inputs. This significantly reduces the gap between the target domain and synthetic data…

    Submitted 8 September, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021

  5. arXiv:2104.07763  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC

    Comparative Study of Learning Outcomes for Online Learning Platforms

    Authors: Francois St-Hilaire, Nathan Burns, Robert Belfer, Muhammad Shayan, Ariella Smofsky, Dung Do Vu, Antoine Frau, Joseph Potochny, Farid Faraji, Vincent Pavero, Neroli Ko, Ansona Onyi Ching, Sabina Elkins, Anush Stepanyan, Adela Matajova, Laurent Charlin, Yoshua Bengio, Iulian Vlad Serban, Ekaterina Kochmar

    Abstract: Personalization and active learning are key aspects to successful learning. These aspects are important to address in intelligent educational applications, as they help systems to adapt and close the gap between students with varying abilities, which becomes increasingly important in the context of online and distance learning. We run a comparative head-to-head study of learning outcomes for two p…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: 14 pages, 3 figures, 2 tables, accepted at AIED 2021 (2021 Conference on Artificial Intelligence in Education)

    ACM Class: I.2.0; I.2.1; I.2.7; K.3.1; G.4

  6. arXiv:2103.07785  [pdf, other]

    cs.CL

    Deep Discourse Analysis for Generating Personalized Feedback in Intelligent Tutor Systems

    Authors: Matt Grenander, Robert Belfer, Ekaterina Kochmar, Iulian V. Serban, François St-Hilaire, Jackie C. K. Cheung

    Abstract: We explore creating automated, personalized feedback in an intelligent tutoring system (ITS). Our goal is to pinpoint correct and incorrect concepts in student answers in order to achieve better student learning gains. Although automatic methods for providing personalized feedback exist, they do not explicitly inform students about which concepts in their answers are correct or incorrect. Our appr…

    Submitted 13 March, 2021; originally announced March 2021.

    Comments: Accepted at EAAI 2021

  7. arXiv:2005.06616  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC cs.LG

    A Large-Scale, Open-Domain, Mixed-Interface Dialogue-Based ITS for STEM

    Authors: Iulian Vlad Serban, Varun Gupta, Ekaterina Kochmar, Dung D. Vu, Robert Belfer, Joelle Pineau, Aaron Courville, Laurent Charlin, Yoshua Bengio

    Abstract: We present Korbit, a large-scale, open-domain, mixed-interface, dialogue-based intelligent tutoring system (ITS). Korbit uses machine learning, natural language processing and reinforcement learning to provide interactive, personalized learning online. Korbit has been designed to easily scale to thousands of subjects, by automating, standardizing and simplifying the content creation process. Unlik…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 6 pages, 1 figure, 1 table, accepted for publication in the 21st International Conference on Artificial Intelligence in Education (AIED 2020)

    ACM Class: I.2.0; I.2.1; I.2.7; K.3.1; G.4

  8. arXiv:2005.02431  [pdf, other]

    cs.CL cs.AI

    Automated Personalized Feedback Improves Learning Gains in an Intelligent Tutoring System

    Authors: Ekaterina Kochmar, Dung Do Vu, Robert Belfer, Varun Gupta, Iulian Vlad Serban, Joelle Pineau

    Abstract: We investigate how automated, data-driven, personalized feedback in a large-scale intelligent tutoring system (ITS) improves student learning outcomes. We propose a machine learning approach to generate personalized feedback, which takes individual needs of students into account. We utilize state-of-the-art machine learning and natural language processing techniques to provide the students with pe…

    Submitted 7 May, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: To be published in Proceedings of the 21st International Conference on Artificial Intelligence in Education (AIED 2020)

  9. arXiv:1807.04723  [pdf, ps, other]

    cs.LG cs.AI cs.CL cs.NE stat.ML

    The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach

    Authors: Iulian Vlad Serban, Chinnadhurai Sankar, Michael Pieper, Joelle Pineau, Yoshua Bengio

    Abstract: Deep reinforcement learning has recently shown many impressive successes. However, one major obstacle towards applying such methods to real-world problems is their lack of data-efficiency. To this end, we propose the Bottleneck Simulator: a model-based reinforcement learning method which combines a learned, factorized transition model of the environment with rollout simulations to learn an effecti…

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: 26 pages, 2 figures, 4 tables

    ACM Class: I.5.1; I.2.7

  10. arXiv:1801.06700  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot (Short Version)

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including neural network and template-based…

    Submitted 20 January, 2018; originally announced January 2018.

    Comments: 9 pages, 1 figure, 2 tables; presented at NIPS 2017, Conversational AI: "Today's Practice and Tomorrow's Potential" Workshop

    ACM Class: I.5.1; I.2.7

  11. arXiv:1709.02349  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeshwar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including template-based models, bag-of-wor…

    Submitted 5 November, 2017; v1 submitted 7 September, 2017; originally announced September 2017.

    Comments: 40 pages, 9 figures, 11 tables

    ACM Class: I.5.1; I.2.7

  12. arXiv:1708.07149  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses

    Authors: Ryan Lowe, Michael Noseworthy, Iulian V. Serban, Nicolas Angelard-Gontier, Yoshua Bengio, Joelle Pineau

    Abstract: Automatically evaluating the quality of dialogue responses for unstructured domains is a challenging problem. Unfortunately, existing automatic evaluation metrics are biased and correlate very poorly with human judgements of response quality. Yet having an accurate automatic evaluation procedure is crucial for dialogue research, as it allows rapid prototyping and testing of new models with fewer e…

    Submitted 16 January, 2018; v1 submitted 23 August, 2017; originally announced August 2017.

    Comments: ACL 2017

    Journal ref: Proceedings of the 55th annual meeting on Association for Computational Linguistics (2017), pp. 1116-1126

  13. arXiv:1612.00377  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Piecewise Latent Variables for Neural Variational Text Processing

    Authors: Iulian V. Serban, Alexander G. Ororbia II, Joelle Pineau, Aaron Courville

    Abstract: Advances in neural variational inference have facilitated the learning of powerful directed graphical models with continuous latent variables, such as variational autoencoders. The hope is that such models will learn to represent rich, multi-modal latent factors in real-world data, such as natural language text. However, current models often assume simplistic priors on the latent variables - such…

    Submitted 23 September, 2017; v1 submitted 1 December, 2016; originally announced December 2016.

    Comments: 19 pages, 2 figures, 8 tables; EMNLP 2017

    ACM Class: I.5.1; I.2.7

  14. arXiv:1611.06216  [pdf, other]

    cs.CL cs.AI cs.NE

    Generative Deep Neural Networks for Dialogue: A Short Review

    Authors: Iulian Vlad Serban, Ryan Lowe, Laurent Charlin, Joelle Pineau

    Abstract: Researchers have recently started investigating deep neural networks for dialogue applications. In particular, generative sequence-to-sequence (Seq2Seq) models have shown promising results for unstructured tasks, such as word-level dialogue response generation. The hope is that such models will be able to leverage massive amounts of data to learn meaningful natural language representations and res…

    Submitted 18 November, 2016; originally announced November 2016.

    Comments: 6 pages, 1 figure, 3 tables; NIPS 2016 workshop on Learning Methods for Dialogue

    ACM Class: I.5.1; I.2.7

  15. arXiv:1606.00776  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

    Authors: Iulian Vlad Serban, Tim Klinger, Gerald Tesauro, Kartik Talamadupula, Bowen Zhou, Yoshua Bengio, Aaron Courville

    Abstract: We introduce the multiresolution recurrent neural network, which extends the sequence-to-sequence framework to model natural language generation as two parallel discrete stochastic processes: a sequence of high-level coarse tokens, and a sequence of natural language tokens. There are many ways to estimate or learn the high-level coarse tokens, but we argue that a simple extraction procedure is suf…

    Submitted 13 June, 2016; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: 21 pages, 2 figures, 10 tables

    ACM Class: I.5.1; I.2.7

  16. arXiv:1605.06069  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

    Authors: Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron Courville, Yoshua Bengio

    Abstract: Sequential data often possesses a hierarchical structure with complex dependencies between subsequences, such as found between the utterances in a dialogue. In an effort to model this kind of generative process, we propose a neural network-based generative architecture, with latent stochastic variables that span a variable number of time steps. We apply the proposed model to the task of dialogue r…

    Submitted 13 June, 2016; v1 submitted 19 May, 2016; originally announced May 2016.

    Comments: 15 pages, 5 tables, 4 figures

    ACM Class: I.5.1; I.2.7

  17. arXiv:1605.05414  [pdf, other]

    cs.CL cs.LG

    On the Evaluation of Dialogue Systems with Next Utterance Classification

    Authors: Ryan Lowe, Iulian V. Serban, Mike Noseworthy, Laurent Charlin, Joelle Pineau

    Abstract: An open challenge in constructing dialogue systems is developing methods for automatically learning dialogue strategies from large amounts of unlabelled data. Recent work has proposed Next-Utterance-Classification (NUC) as a surrogate task for building dialogue systems from text data. In this paper we investigate the performance of humans on this task to validate the relevance of NUC as a method o…

    Submitted 22 July, 2016; v1 submitted 17 May, 2016; originally announced May 2016.

    Comments: Accepted to SIGDIAL 2016 (short paper). 5 pages

  18. arXiv:1605.02688  [pdf, other]

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu…

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  19. arXiv:1603.08023  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation

    Authors: Chia-Wei Liu, Ryan Lowe, Iulian V. Serban, Michael Noseworthy, Laurent Charlin, Joelle Pineau

    Abstract: We investigate evaluation metrics for dialogue response generation systems where supervised labels, such as task completion, are not available. Recent works in response generation have adopted metrics from machine translation to compare a model's generated response to a single target response. We show that these metrics correlate very weakly with human judgements in the non-technical Twitter domai…

    Submitted 3 January, 2017; v1 submitted 25 March, 2016; originally announced March 2016.

    Comments: First 4 authors had equal contribution. 13 pages, 5 tables, 6 figures. EMNLP 2016

  20. arXiv:1603.06807  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus

    Authors: Iulian Vlad Serban, Alberto García-Durán, Caglar Gulcehre, Sungjin Ahn, Sarath Chandar, Aaron Courville, Yoshua Bengio

    Abstract: Over the past decade, large-scale supervised learning corpora have enabled machine learning researchers to make substantial advances. However, to this date, there are no large-scale question-answer corpora available. In this paper we present the 30M Factoid Question-Answer Corpus, an enormous question answer pair corpus produced by applying a novel neural network architecture on the knowledge base…

    Submitted 29 May, 2016; v1 submitted 22 March, 2016; originally announced March 2016.

    Comments: 13 pages, 1 figure, 7 tables

    ACM Class: H.3.4; I.5.1; I.2.6; I.2.7

  21. arXiv:1512.05742  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG stat.ML

    A Survey of Available Corpora for Building Data-Driven Dialogue Systems

    Authors: Iulian Vlad Serban, Ryan Lowe, Peter Henderson, Laurent Charlin, Joelle Pineau

    Abstract: During the past decade, several areas of speech and language understanding have witnessed substantial breakthroughs from the use of data-driven models. In the area of dialogue systems, the trend is less obvious, and most practical systems are still built through significant engineering and expert knowledge. Nevertheless, several recent results suggest that data-driven approaches are feasible and q…

    Submitted 20 March, 2017; v1 submitted 17 December, 2015; originally announced December 2015.

    Comments: 56 pages including references and appendix, 5 tables and 1 figure; Under review for the Dialogue & Discourse journal. Update: paper has been rewritten and now includes several new datasets

    MSC Class: 68T01; 68T05; 68T35; 68T50

    ACM Class: I.2.6; I.2.7; I.2.1

  22. arXiv:1507.04808  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models

    Authors: Iulian V. Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courville, Joelle Pineau

    Abstract: We investigate the task of building open domain, conversational dialogue systems based on large dialogue corpora using generative models. Generative models produce system responses that are autonomously generated word-by-word, opening up the possibility for realistic, flexible interactions. In support of this goal, we extend the recently proposed hierarchical recurrent encoder-decoder neural netwo…

    Submitted 6 April, 2016; v1 submitted 16 July, 2015; originally announced July 2015.

    Comments: 8 pages with references; Published in AAAI 2016 (Special Track on Cognitive Systems)

    ACM Class: I.5.1; I.2.7