Skip to main content

Showing 1–14 of 14 results for author: Ustimenko, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.10024  [pdf, other

    cs.LG cs.IR

    $Δ\text{-}{\rm OPE}$: Off-Policy Estimation with Pairs of Policies

    Authors: Olivier Jeunen, Aleksei Ustimenko

    Abstract: The off-policy paradigm casts recommendation as a counterfactual decision-making task, allowing practitioners to unbiasedly estimate online metrics using offline data. This leads to effective evaluation metrics, as well as learning procedures that directly optimise online success. Nevertheless, the high variance that comes with unbiasedness is typically the crux that complicates practical applicat… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2405.02141  [pdf, other

    cs.IR cs.LG

    Multi-Objective Recommendation via Multivariate Policy Learning

    Authors: Olivier Jeunen, Jatin Mandav, Ivan Potapov, Nakul Agarwal, Sourabh Vaid, Wenzhe Shi, Aleksei Ustimenko

    Abstract: Real-world recommender systems often need to balance multiple objectives when deciding which recommendations to present to users. These include behavioural signals (e.g. clicks, shares, dwell time), as well as broader objectives (e.g. diversity, fairness). Scalarisation methods are commonly used to handle this balancing task, where a weighted average of per-objective reward signals determines the… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2402.03915  [pdf, other

    cs.LG cs.IR stat.AP stat.ML

    Learning Metrics that Maximise Power for Accelerated A/B-Tests

    Authors: Olivier Jeunen, Aleksei Ustimenko

    Abstract: Online controlled experiments are a crucial tool to allow for confident decision-making in technology companies. A North Star metric is defined (such as long-term revenue or user retention), and system variants that statistically significantly improve on this metric in an A/B-test can be considered superior. North Star metrics are typically delayed and insensitive. As a result, the cost of experim… ▽ More

    Submitted 13 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in the Applied Data Science track at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24)

  4. arXiv:2401.04062  [pdf, ps, other

    cs.LG cs.IR stat.AP

    Variance Reduction in Ratio Metrics for Efficient Online Experiments

    Authors: Shubham Baweja, Neeti Pokharna, Aleksei Ustimenko, Olivier Jeunen

    Abstract: Online controlled experiments, such as A/B-tests, are commonly used by modern tech companies to enable continuous system improvements. Despite their paramount importance, A/B-tests are expensive: by their very definition, a percentage of traffic is assigned an inferior system variant. To ensure statistical significance on top-level metrics, online experiments typically run for several weeks. Even… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted at the European Conference on Information Retrieval (ECIR '24) Industry Day

  5. arXiv:2401.04053  [pdf, other

    cs.IR

    Learning-to-Rank with Nested Feedback

    Authors: Hitesh Sagtani, Olivier Jeunen, Aleksei Ustimenko

    Abstract: Many platforms on the web present ranked lists of content to users, typically optimized for engagement-, satisfaction- or retention- driven metrics. Advances in the Learning-to-Rank (LTR) research literature have enabled rapid growth in this application area. Several popular interfaces now include nested lists, where users can enter a 2nd-level feed via any given 1st-level item. Naturally, this ha… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted at the European Conference on Information Retrieval (ECIR '24)

  6. On Gradient Boosted Decision Trees and Neural Rankers: A Case-Study on Short-Video Recommendations at ShareChat

    Authors: Olivier Jeunen, Hitesh Sagtani, Himanshu Doi, Rasul Karimov, Neeti Pokharna, Danish Kalim, Aleksei Ustimenko, Christopher Green, Wenzhe Shi, Rishabh Mehrotra

    Abstract: Practitioners who wish to build real-world applications that rely on ranking models, need to decide which modelling paradigm to follow. This is not an easy choice to make, as the research literature on this topic has been shifting in recent years. In particular, whilst Gradient Boosted Decision Trees (GBDTs) have reigned supreme for more than a decade, the flexibility of neural networks has allowe… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Appearing in the Industry Track Proceedings of the Forum for Information Retrieval Evaluation (FIRE '23)

  7. arXiv:2310.06081  [pdf, ps, other

    math.OC cs.LG math.PR stat.ML

    Ito Diffusion Approximation of Universal Ito Chains for Sampling, Optimization and Boosting

    Authors: Aleksei Ustimenko, Aleksandr Beznosikov

    Abstract: In this work, we consider rather general and broad class of Markov chains, Ito chains, that look like Euler-Maryama discretization of some Stochastic Differential Equation. The chain we study is a unified framework for theoretical analysis. It comes with almost arbitrary isotropic and state-dependent noise instead of normal and state-independent one as in most related papers. Moreover, in our chai… ▽ More

    Submitted 30 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Appears in: The Twelfth International Conference on Learning Representations (ICLR 2024). 27 pages, 3 tables. Reference: https://openreview.net/forum?id=fjpfCOV4ru

  8. arXiv:2307.15053  [pdf, other

    cs.IR cs.LG

    On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-$n$ Recommendation

    Authors: Olivier Jeunen, Ivan Potapov, Aleksei Ustimenko

    Abstract: Approaches to recommendation are typically evaluated in one of two ways: (1) via a (simulated) online experiment, often seen as the gold standard, or (2) via some offline evaluation procedure, where the goal is to approximate the outcome of an online experiment. Several offline evaluation metrics have been adopted in the literature, inspired by ranking metrics prevalent in the field of Information… ▽ More

    Submitted 12 June, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: To appear in the research track at the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24)

  9. arXiv:2305.19685  [pdf, other

    cs.LG quant-ph stat.ML

    Deep Stochastic Mechanics

    Authors: Elena Orlova, Aleksei Ustimenko, Ruoxi Jiang, Peter Y. Lu, Rebecca Willett

    Abstract: This paper introduces a novel deep-learning-based approach for numerical simulation of a time-evolving Schrödinger equation inspired by stochastic mechanics and generative diffusion models. Unlike existing approaches, which exhibit computational complexity that scales exponentially in the problem dimension, our method allows us to adapt to the latent low-dimensional structure of the wave function… ▽ More

    Submitted 4 June, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

  10. arXiv:2206.05608  [pdf, other

    cs.LG stat.ML

    Gradient Boosting Performs Gaussian Process Inference

    Authors: Aleksei Ustimenko, Artem Beliakov, Liudmila Prokhorenkova

    Abstract: This paper shows that gradient boosting based on symmetric decision trees can be equivalently reformulated as a kernel method that converges to the solution of a certain Kernel Ridge Regression problem. Thus, we obtain the convergence to a Gaussian Process' posterior mean, which, in turn, allows us to easily transform gradient boosting into a sampler from the posterior to provide better knowledge… ▽ More

    Submitted 13 March, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

  11. arXiv:2204.01500  [pdf, other

    cs.LG

    Which Tricks Are Important for Learning to Rank?

    Authors: Ivan Lyzhin, Aleksei Ustimenko, Andrey Gulin, Liudmila Prokhorenkova

    Abstract: Nowadays, state-of-the-art learning-to-rank methods are based on gradient-boosted decision trees (GBDT). The most well-known algorithm is LambdaMART which was proposed more than a decade ago. Recently, several other GBDT-based ranking algorithms were proposed. In this paper, we thoroughly analyze these methods in a unified setup. In particular, we address the following questions. Is direct optimiz… ▽ More

    Submitted 6 October, 2023; v1 submitted 4 April, 2022; originally announced April 2022.

  12. arXiv:2006.10562  [pdf, other

    cs.LG stat.ML

    Uncertainty in Gradient Boosting via Ensembles

    Authors: Andrey Malinin, Liudmila Prokhorenkova, Aleksei Ustimenko

    Abstract: For many practical, high-risk applications, it is essential to quantify uncertainty in a model's predictions to avoid costly mistakes. While predictive uncertainty is widely studied for neural networks, the topic seems to be under-explored for models based on gradient boosting. However, gradient boosting often achieves state-of-the-art results on tabular data. This work examines a probabilistic en… ▽ More

    Submitted 2 April, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

  13. arXiv:2003.02122  [pdf, ps, other

    cs.LG stat.ML

    StochasticRank: Global Optimization of Scale-Free Discrete Functions

    Authors: Aleksei Ustimenko, Liudmila Prokhorenkova

    Abstract: In this paper, we introduce a powerful and efficient framework for direct optimization of ranking metrics. The problem is ill-posed due to the discrete structure of the loss, and to deal with that, we introduce two important techniques: stochastic smoothing and novel gradient estimate based on partial integration. We show that classic smoothing approaches may introduce bias and present a universal… ▽ More

    Submitted 20 August, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

  14. arXiv:2001.07248  [pdf, other

    cs.LG stat.ML

    SGLB: Stochastic Gradient Langevin Boosting

    Authors: Aleksei Ustimenko, Liudmila Prokhorenkova

    Abstract: This paper introduces Stochastic Gradient Langevin Boosting (SGLB) - a powerful and efficient machine learning framework that may deal with a wide range of loss functions and has provable generalization guarantees. The method is based on a special form of the Langevin diffusion equation specifically designed for gradient boosting. This allows us to theoretically guarantee the global convergence ev… ▽ More

    Submitted 16 January, 2022; v1 submitted 20 January, 2020; originally announced January 2020.