Skip to main content

Showing 1–4 of 4 results for author: Salakhutdinov, R R

.
  1. arXiv:1309.6865  [pdf

    cs.LG cs.IR stat.ML

    Modeling Documents with Deep Boltzmann Machines

    Authors: Nitish Srivastava, Ruslan R Salakhutdinov, Geoffrey E. Hinton

    Abstract: We introduce a Deep Boltzmann Machine model suitable for modeling and extracting latent semantic representations from a large unstructured collection of documents. We overcome the apparent difficulty of training a DBM with judicious parameter tying. This parameter tying enables an efficient pretraining algorithm and a state initialization scheme that aids inference. The model can be trained just a… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-616-624

  2. arXiv:1212.2490  [pdf

    cs.LG stat.ML

    On the Convergence of Bound Optimization Algorithms

    Authors: Ruslan R Salakhutdinov, Sam T Roweis, Zoubin Ghahramani

    Abstract: Many practitioners who use the EM algorithm complain that it is sometimes slow. When does this happen, and what can be done about it? In this paper, we study the general class of bound optimization algorithms - including Expectation-Maximization, Iterative Scaling and CCCP - and their relationship to direct optimization algorithms such as gradient-based methods for parameter learning. We d… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-509-516

  3. arXiv:1210.4856  [pdf

    cs.LG stat.ML

    Exploiting compositionality to explore a large space of model structures

    Authors: Roger Grosse, Ruslan R Salakhutdinov, William T. Freeman, Joshua B. Tenenbaum

    Abstract: The recent proliferation of richly structured probabilistic models raises the question of how to automatically determine an appropriate model for a dataset. We investigate this question for a space of matrix decomposition models which can express a variety of widely used models from unsupervised learning. To enable model selection, we organize these models into a context-free grammar which generat… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-306-315

  4. arXiv:1207.0580  [pdf, other

    cs.NE cs.CV cs.LG

    Improving neural networks by preventing co-adaptation of feature detectors

    Authors: Geoffrey E. Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov

    Abstract: When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data. This "overfitting" is greatly reduced by randomly omitting half of the feature detectors on each training case. This prevents complex co-adaptations in which a feature detector is only helpful in the context of several other specific feature detectors. Instead, each neuro… ▽ More

    Submitted 3 July, 2012; originally announced July 2012.