Skip to main content

Showing 1–4 of 4 results for author: Kaden, Z

.
  1. arXiv:1910.06393  [pdf, other

    cs.CL cs.LG

    In-training Matrix Factorization for Parameter-frugal Neural Machine Translation

    Authors: Zachary Kaden, Teven Le Scao, Raphael Olivier

    Abstract: In this paper, we propose the use of in-training matrix factorization to reduce the model size for neural machine translation. Using in-training matrix factorization, parameter matrices may be decomposed into the products of smaller matrices, which can compress large machine translation architectures by vastly reducing the number of learnable parameters. We apply in-training matrix factorization t… ▽ More

    Submitted 23 March, 2020; v1 submitted 27 September, 2019; originally announced October 2019.

  2. arXiv:1812.01260  [pdf, other

    cs.CL cs.AI

    Tartan: A retrieval-based socialbot powered by a dynamic finite-state machine architecture

    Authors: George Larionov, Zachary Kaden, Hima Varsha Dureddy, Gabriel Bayomi T. Kalejaiye, Mihir Kale, Srividya Pranavi Potharaju, Ankit Parag Shah, Alexander I Rudnicky

    Abstract: This paper describes the Tartan conversational agent built for the 2018 Alexa Prize Competition. Tartan is a non-goal-oriented socialbot focused around providing users with an engaging and fluent casual conversation. Tartan's key features include an emphasis on structured conversation based on flexible finite-state models and an approach focused on understanding and using conversational acts. To p… ▽ More

    Submitted 4 December, 2018; originally announced December 2018.

  3. arXiv:1811.00260  [pdf, other

    cs.LG cs.AI stat.ML

    Horizon: Facebook's Open Source Applied Reinforcement Learning Platform

    Authors: Jason Gauci, Edoardo Conti, Yitao Liang, Kittipat Virochsiri, Yuchen He, Zachary Kaden, Vivek Narayanan, Xiaohui Ye, Zhengxing Chen, Scott Fujimoto

    Abstract: In this paper we present Horizon, Facebook's open source applied reinforcement learning (RL) platform. Horizon is an end-to-end platform designed to solve industry applied RL problems where datasets are large (millions to billions of observations), the feedback loop is slow (vs. a simulator), and experiments must be done with care because they don't run in a simulator. Unlike other RL platforms, w… ▽ More

    Submitted 4 September, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: 10 pages

  4. arXiv:1806.06192  [pdf, other

    cs.IR cs.AI cs.LG

    Handling Cold-Start Collaborative Filtering with Reinforcement Learning

    Authors: Hima Varsha Dureddy, Zachary Kaden

    Abstract: A major challenge in recommender systems is handling new users, whom are also called $\textit{cold-start}$ users. In this paper, we propose a novel approach for learning an optimal series of questions with which to interview cold-start users for movie recommender systems. We propose learning interview questions using Deep Q Networks to create user profiles to make better recommendations to cold-st… ▽ More

    Submitted 16 June, 2018; originally announced June 2018.