Skip to main content

Showing 1–13 of 13 results for author: Józefowicz, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:1912.06680  [pdf, other

    cs.LG stat.ML

    Dota 2 with Large Scale Deep Reinforcement Learning

    Authors: OpenAI, :, Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Rafal Józefowicz, Scott Gray, Catherine Olsson, Jakub Pachocki, Michael Petrov, Henrique P. d. O. Pinto, Jonathan Raiman, Tim Salimans, Jeremy Schlatter, Jonas Schneider, Szymon Sidor, Ilya Sutskever, Jie Tang , et al. (2 additional authors not shown)

    Abstract: On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems. OpenAI Five leveraged existing reinforcement learnin… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  2. arXiv:1808.00177  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Learning Dexterous In-Hand Manipulation

    Authors: OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba

    Abstract: We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies which can perform vision-based object reorientation on a physical Shadow Dexterous Hand. The training is performed in a simulated environment in which we randomize many of the physical properties of the system like friction coefficients and an object's appearance. Our policies transfer to the physical robot despite… ▽ More

    Submitted 18 January, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: Making OpenAI the first author. We wish this paper to be cited as "Learning Dexterous In-Hand Manipulation" by OpenAI et al. We are replicating the approach from the physics community: arXiv:1812.06489

  3. arXiv:1704.01444  [pdf, other

    cs.LG cs.CL cs.NE

    Learning to Generate Reviews and Discovering Sentiment

    Authors: Alec Radford, Rafal Jozefowicz, Ilya Sutskever

    Abstract: We explore the properties of byte-level recurrent language models. When given sufficient amounts of capacity, training data, and compute time, the representations learned by these models include disentangled features corresponding to high-level concepts. Specifically, we find a single unit which performs sentiment analysis. These representations, learned in an unsupervised manner, achieve state of… ▽ More

    Submitted 6 April, 2017; v1 submitted 5 April, 2017; originally announced April 2017.

  4. arXiv:1702.05800   

    cs.DC cs.AI cs.LG

    Revisiting Distributed Synchronous SGD

    Authors: Xinghao Pan, Jianmin Chen, Rajat Monga, Samy Bengio, Rafal Jozefowicz

    Abstract: Distributed training of deep learning models on large-scale training data is typically conducted with asynchronous stochastic optimization to maximize the rate of updates, at the cost of additional noise introduced from asynchrony. In contrast, the synchronous approach is often thought to be impractical due to idle time wasted on waiting for straggling workers. We revisit these conventional belief… ▽ More

    Submitted 18 March, 2017; v1 submitted 19 February, 2017; originally announced February 2017.

    Comments: This article will be superseded by arXiv:1604.00981

  5. arXiv:1608.06315  [pdf, other

    cs.LG q-bio.NC stat.ML

    LFADS - Latent Factor Analysis via Dynamical Systems

    Authors: David Sussillo, Rafal Jozefowicz, L. F. Abbott, Chethan Pandarinath

    Abstract: Neuroscience is experiencing a data revolution in which many hundreds or thousands of neurons are recorded simultaneously. Currently, there is little consensus on how such data should be analyzed. Here we introduce LFADS (Latent Factor Analysis via Dynamical Systems), a method to infer latent dynamics from simultaneously recorded, single-trial, high-dimensional neural spiking data. LFADS is a sequ… ▽ More

    Submitted 22 August, 2016; originally announced August 2016.

    Comments: 16 pages, 11 figures

  6. arXiv:1606.04934  [pdf, other

    cs.LG stat.ML

    Improving Variational Inference with Inverse Autoregressive Flow

    Authors: Diederik P. Kingma, Tim Salimans, Rafal Jozefowicz, Xi Chen, Ilya Sutskever, Max Welling

    Abstract: The framework of normalizing flows provides a general strategy for flexible variational inference of posteriors over latent variables. We propose a new type of normalizing flow, inverse autoregressive flow (IAF), that, in contrast to earlier published flows, scales well to high-dimensional latent spaces. The proposed flow consists of a chain of invertible transformations, where each transformation… ▽ More

    Submitted 30 January, 2017; v1 submitted 15 June, 2016; originally announced June 2016.

  7. arXiv:1604.00981  [pdf, other

    cs.LG cs.DC cs.NE

    Revisiting Distributed Synchronous SGD

    Authors: Jianmin Chen, Xinghao Pan, Rajat Monga, Samy Bengio, Rafal Jozefowicz

    Abstract: Distributed training of deep learning models on large-scale training data is typically conducted with asynchronous stochastic optimization to maximize the rate of updates, at the cost of additional noise introduced from asynchrony. In contrast, the synchronous approach is often thought to be impractical due to idle time wasted on waiting for straggling workers. We revisit these conventional belief… ▽ More

    Submitted 21 March, 2017; v1 submitted 4 April, 2016; originally announced April 2016.

    Comments: 10 pages

  8. arXiv:1603.04467  [pdf, other

    cs.DC cs.LG

    TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

    Authors: Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mane, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah , et al. (15 additional authors not shown)

    Abstract: TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms. A computation expressed using TensorFlow can be executed with little or no change on a wide variety of heterogeneous systems, ranging from mobile devices such as phones and tablets up to large-scale distributed systems of hundreds of machines and thousands of computational de… ▽ More

    Submitted 16 March, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

    Comments: Version 2 updates only the metadata, to correct the formatting of Martín Abadi's name

  9. arXiv:1602.02410  [pdf, other

    cs.CL

    Exploring the Limits of Language Modeling

    Authors: Rafal Jozefowicz, Oriol Vinyals, Mike Schuster, Noam Shazeer, Yonghui Wu

    Abstract: In this work we explore recent advances in Recurrent Neural Networks for large scale Language Modeling, a task central to language understanding. We extend current models to deal with two key challenges present in this task: corpora and vocabulary sizes, and complex, long term structure of language. We perform an exhaustive study on techniques such as character Convolutional Neural Networks or Lon… ▽ More

    Submitted 11 February, 2016; v1 submitted 7 February, 2016; originally announced February 2016.

  10. arXiv:1511.06440  [pdf, other

    cs.LG

    Towards Principled Unsupervised Learning

    Authors: Ilya Sutskever, Rafal Jozefowicz, Karol Gregor, Danilo Rezende, Tim Lillicrap, Oriol Vinyals

    Abstract: General unsupervised learning is a long-standing conceptual problem in machine learning. Supervised learning is successful because it can be solved by the minimization of the training error cost function. Unsupervised learning is not as successful, because the unsupervised objective may be unrelated to the supervised task of interest. For an example, density modelling and reconstruction have often… ▽ More

    Submitted 3 December, 2015; v1 submitted 19 November, 2015; originally announced November 2015.

  11. arXiv:1511.06349  [pdf, other

    cs.LG cs.CL

    Generating Sentences from a Continuous Space

    Authors: Samuel R. Bowman, Luke Vilnis, Oriol Vinyals, Andrew M. Dai, Rafal Jozefowicz, Samy Bengio

    Abstract: The standard recurrent neural network language model (RNNLM) generates sentences one word at a time and does not work from an explicit global sentence representation. In this work, we introduce and study an RNN-based variational autoencoder generative model that incorporates distributed latent representations of entire sentences. This factorization allows it to explicitly model holistic properties… ▽ More

    Submitted 12 May, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: First two authors contributed equally. Work was done when all authors were at Google, Inc

    Journal ref: SIGNLL Conference on Computational Natural Language Learning (CONLL), 2016

  12. Fast optimization of Multithreshold Entropy Linear Classifier

    Authors: Rafal Jozefowicz, Wojciech Marian Czarnecki

    Abstract: Multithreshold Entropy Linear Classifier (MELC) is a density based model which searches for a linear projection maximizing the Cauchy-Schwarz Divergence of dataset kernel density estimation. Despite its good empirical results, one of its drawbacks is the optimization speed. In this paper we analyze how one can speed it up through solving an approximate problem. We analyze two methods, both similar… ▽ More

    Submitted 18 April, 2015; originally announced April 2015.

    Comments: Presented at Theoretical Foundations of Machine Learning 2015 (http://tfml.gmum.net), final version published in Schedae Informaticae Journal

  13. arXiv:1504.02622  [pdf, other

    cs.LG

    Maximum Entropy Linear Manifold for Learning Discriminative Low-dimensional Representation

    Authors: Wojciech Marian Czarnecki, Rafał Józefowicz, Jacek Tabor

    Abstract: Representation learning is currently a very hot topic in modern machine learning, mostly due to the great success of the deep learning methods. In particular low-dimensional representation which discriminates classes can not only enhance the classification procedure, but also make it faster, while contrary to the high-dimensional embeddings can be efficiently used for visual based exploratory data… ▽ More

    Submitted 10 April, 2015; originally announced April 2015.

    Comments: submitted to ECMLPKDD 2015