Skip to main content

Showing 1–10 of 10 results for author: Nishihara, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:1904.03646  [pdf, other

    cs.LG stat.ML

    Policy Gradient Search: Online Planning and Expert Iteration without Search Trees

    Authors: Thomas Anthony, Robert Nishihara, Philipp Moritz, Tim Salimans, John Schulman

    Abstract: Monte Carlo Tree Search (MCTS) algorithms perform simulation-based search to improve policies online. During search, the simulation policy is adapted to explore the most promising lines of play. MCTS has been used by state-of-the-art programs for many problems, however a disadvantage to MCTS is that it estimates the values of states with Monte Carlo averages, stored in a search tree; this does not… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.

  2. arXiv:1807.05118  [pdf, other

    cs.LG cs.DC stat.ML

    Tune: A Research Platform for Distributed Model Selection and Training

    Authors: Richard Liaw, Eric Liang, Robert Nishihara, Philipp Moritz, Joseph E. Gonzalez, Ion Stoica

    Abstract: Modern machine learning algorithms are increasingly computationally demanding, requiring specialized hardware and distributed computation to achieve high performance in a reasonable time frame. Many hyperparameter search algorithms have been proposed for improving the efficiency of model selection, however their adaptation to the distributed compute environment is often ad-hoc. We propose Tune, a… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 8 Pages, Presented at the 2018 ICML AutoML workshop

  3. arXiv:1712.05889  [pdf, other

    cs.DC cs.AI cs.LG stat.ML

    Ray: A Distributed Framework for Emerging AI Applications

    Authors: Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I. Jordan, Ion Stoica

    Abstract: The next generation of AI applications will continuously interact with the environment and learn from these interactions. These applications impose new and demanding systems requirements, both in terms of performance and flexibility. In this paper, we consider these requirements and present Ray---a distributed system to address them. Ray implements a unified interface that can express both task-pa… ▽ More

    Submitted 29 September, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

    Comments: 17 pages, 14 figures, 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

  4. arXiv:1607.08882  [pdf, other

    stat.ME stat.AP

    The competing risks Cox model with and without auxiliary case covariates under weaker or no missing-at-random cause of failure

    Authors: Daniel Nevo, Reiko Nishihara, Shuji Ogino, Molin Wang

    Abstract: In the analysis of time-to-event data with multiple causes using a competing risks Cox model, often the cause of failure is unknown for some of the cases. The probability of a missing cause is typically assumed to be independent of the cause given the time of the event and covariates measured before the event occurred. In practice, however, the underlying missing-at-random assumption does not nece… ▽ More

    Submitted 29 July, 2016; originally announced July 2016.

    Comments: 17 pages

  5. arXiv:1605.08179  [pdf, other

    stat.ML cs.CV

    Discovering Causal Signals in Images

    Authors: David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, Léon Bottou

    Abstract: This paper establishes the existence of observable footprints that reveal the "causal dispositions" of the object categories appearing in collections of images. We achieve this goal in two steps. First, we take a learning approach to observational causal discovery, and build a classifier that achieves state-of-the-art performance on finding the causal direction between pairs of random variables, g… ▽ More

    Submitted 31 October, 2017; v1 submitted 26 May, 2016; originally announced May 2016.

  6. arXiv:1511.06051  [pdf, other

    stat.ML cs.DC cs.LG cs.NE math.OC

    SparkNet: Training Deep Networks in Spark

    Authors: Philipp Moritz, Robert Nishihara, Ion Stoica, Michael I. Jordan

    Abstract: Training deep networks is a time-consuming process, with networks for object recognition often requiring multiple days to train. For this reason, leveraging the resources of a cluster to speed up training is an important area of work. However, widely-popular batch-processing computational frameworks like MapReduce and Spark were not designed to support the asynchronous and communication-intensive… ▽ More

    Submitted 28 February, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

    Comments: 12 pages, 7 figures

  7. arXiv:1508.02933  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    No Regret Bound for Extreme Bandits

    Authors: Robert Nishihara, David Lopez-Paz, Léon Bottou

    Abstract: Algorithms for hyperparameter optimization abound, all of which work well under different and often unverifiable assumptions. Motivated by the general challenge of sequentially choosing which algorithm to use, we study the more specific task of choosing among distributions to use for random hyperparameter optimization. This work is naturally framed in the extreme bandit setting, which deals with s… ▽ More

    Submitted 11 April, 2016; v1 submitted 12 August, 2015; originally announced August 2015.

    Comments: 11 pages, International Conference on Artificial Intelligence and Statistics, 2016

  8. arXiv:1508.02087  [pdf, other

    math.OC cs.LG math.NA stat.CO stat.ML

    A Linearly-Convergent Stochastic L-BFGS Algorithm

    Authors: Philipp Moritz, Robert Nishihara, Michael I. Jordan

    Abstract: We propose a new stochastic L-BFGS algorithm and prove a linear convergence rate for strongly convex and smooth functions. Our algorithm draws heavily from a recent stochastic variant of L-BFGS proposed in Byrd et al. (2014) as well as a recent approach to variance reduction for stochastic gradient descent from Johnson and Zhang (2013). We demonstrate experimentally that our algorithm performs wel… ▽ More

    Submitted 13 April, 2016; v1 submitted 9 August, 2015; originally announced August 2015.

    Comments: 10 pages, 3 figures in International Conference on Artificial Intelligence and Statistics, 2016

  9. arXiv:1312.5386  [pdf, ps, other

    stat.ML

    Detecting Parameter Symmetries in Probabilistic Models

    Authors: Robert Nishihara, Thomas Minka, Daniel Tarlow

    Abstract: Probabilistic models often have parameters that can be translated, scaled, permuted, or otherwise transformed without changing the model. These symmetries can lead to strong correlation and multimodality in the posterior distribution over the model's parameters, which can pose challenges both for performing inference and interpreting the results. In this work, we address the automatic detection of… ▽ More

    Submitted 18 December, 2013; originally announced December 2013.

    Comments: 24 pages, 8 figures

  10. arXiv:1210.7477  [pdf, other

    stat.CO stat.ML

    Parallel MCMC with Generalized Elliptical Slice Sampling

    Authors: Robert Nishihara, Iain Murray, Ryan P. Adams

    Abstract: Probabilistic models are conceptually powerful tools for finding structure in data, but their practical effectiveness is often limited by our ability to perform inference in them. Exact inference is frequently intractable, so approximate inference is often performed using Markov chain Monte Carlo (MCMC). To achieve the best possible results from MCMC, we want to efficiently simulate many steps of… ▽ More

    Submitted 23 July, 2014; v1 submitted 28 October, 2012; originally announced October 2012.

    Comments: 19 pages, 8 figures, 3 algorithms

    Journal ref: Journal of Machine Learning Research 15:2087-2112, 2014