Showing 1–5 of 5 results for author: Zoghi, M

  1. arXiv:1806.05819  [pdf, other]

    cs.LG stat.ML

    BubbleRank: Safe Online Learning to Re-Rank via Implicit Click Feedback

    Authors: Chang Li, Branislav Kveton, Tor Lattimore, Ilya Markov, Maarten de Rijke, Csaba Szepesvari, Masrour Zoghi

    Abstract: In this paper, we study the problem of safe online learning to re-rank, where user feedback is used to improve the quality of displayed lists. Learning to rank has traditionally been studied in two settings. In the offline setting, rankers are typically learned from relevance labels created by judges. This approach has generally become standard in industrial applications of ranking, such as search…

    Submitted 29 June, 2019; v1 submitted 15 June, 2018; originally announced June 2018.

  2. arXiv:1703.02527  [pdf, other]

    cs.LG stat.ML

    Online Learning to Rank in Stochastic Click Models

    Authors: Masrour Zoghi, Tomas Tunys, Mohammad Ghavamzadeh, Branislav Kveton, Csaba Szepesvari, Zheng Wen

    Abstract: Online learning to rank is a core problem in information retrieval and machine learning. Many provably efficient algorithms have been recently proposed for this problem in specific click models. The click model is a model of how the user interacts with a list of documents. Though these results are significant, their impact on practice is limited, because all proposed algorithms are designed for sp…

    Submitted 20 June, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: Proceedings of the 34th International Conference on Machine Learning

  3. arXiv:1301.1942  [pdf, other]

    stat.ML cs.LG

    Bayesian Optimization in a Billion Dimensions via Random Embeddings

    Authors: Ziyu Wang, Frank Hutter, Masrour Zoghi, David Matheson, Nando de Freitas

    Abstract: Bayesian optimization techniques have been successfully applied to robotics, planning, sensor placement, recommendation, advertising, intelligent user interfaces and automatic algorithm configuration. Despite these successes, the approach is restricted to problems of moderate dimension, and several workshops on Bayesian optimization have identified its scaling to high-dimensions as one of the holy…

    Submitted 10 January, 2016; v1 submitted 9 January, 2013; originally announced January 2013.

    Comments: 33 pages

  4. arXiv:1206.6457  [pdf]

    cs.LG stat.ML

    Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations

    Authors: Nando de Freitas, Alex Smola, Masrour Zoghi

    Abstract: This paper analyzes the problem of Gaussian process (GP) bandits with deterministic observations. The analysis uses a branch and bound algorithm that is related to the UCB algorithm of (Srinivas et al, 2010). For GPs with Gaussian observation noise, with variance strictly greater than zero, Srinivas et al proved that the regret vanishes at the approximate rate of $O(1/\sqrt{t})$, where t is the nu…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012). arXiv admin note: substantial text overlap with arXiv:1203.2177

  5. arXiv:1203.2177  [pdf, other]

    cs.LG stat.ML

    Regret Bounds for Deterministic Gaussian Process Bandits

    Authors: Nando de Freitas, Alex Smola, Masrour Zoghi

    Abstract: This paper analyses the problem of Gaussian process (GP) bandits with deterministic observations. The analysis uses a branch and bound algorithm that is related to the UCB algorithm of (Srinivas et al., 2010). For GPs with Gaussian observation noise, with variance strictly greater than zero, (Srinivas et al., 2010) proved that the regret vanishes at the approximate rate of $O(\frac{1}{\sqrt{t}})$,…

    Submitted 9 March, 2012; originally announced March 2012.

    Comments: 17 pages, 5 figures