Skip to main content

Showing 1–4 of 4 results for author: Singal, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15120  [pdf, ps, other

    cs.LG

    A Counterfactual Analysis of the Dishonest Casino

    Authors: Martin Haugh, Raghav Singal

    Abstract: The dishonest casino is a well-known hidden Markov model (HMM) used in educational settings to introduce HMMs and graphical models. Here, a sequence of die rolls is observed, with the casino switching between a fair and a loaded die. Typically, the goal is to use the observed rolls to infer the pattern of fair and loaded dice, leading to filtering, smoothing, and Viterbi algorithms. This paper, ho… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2205.13832

  2. arXiv:2401.06710  [pdf, other

    cs.LG cs.IR

    Model-Free Approximate Bayesian Learning for Large-Scale Conversion Funnel Optimization

    Authors: Garud Iyengar, Raghav Singal

    Abstract: The flexibility of choosing the ad action as a function of the consumer state is critical for modern-day marketing campaigns. We study the problem of identifying the optimal sequential personalized interventions that maximize the adoption probability for a new product. We model consumer behavior by a conversion funnel that captures the state of each consumer (e.g., interaction history with the fir… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  3. arXiv:2205.13832  [pdf, ps, other

    cs.LG

    Counterfactual Analysis in Dynamic Latent State Models

    Authors: Martin Haugh, Raghav Singal

    Abstract: We provide an optimization-based framework to perform counterfactual analysis in a dynamic model with hidden states. Our framework is grounded in the ``abduction, action, and prediction'' approach to answer counterfactual queries and handles two key challenges where (1) the states are hidden and (2) the model is dynamic. Recognizing the lack of knowledge on the underlying causal mechanism and the… ▽ More

    Submitted 5 May, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: 32 pages, 10 figures

  4. arXiv:1806.02450  [pdf, ps, other

    cs.LG stat.ML

    A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation

    Authors: Jalaj Bhandari, Daniel Russo, Raghav Singal

    Abstract: Temporal difference learning (TD) is a simple iterative algorithm used to estimate the value function corresponding to a given policy in a Markov decision process. Although TD is one of the most widely used algorithms in reinforcement learning, its theoretical analysis has proved challenging and few guarantees on its statistical efficiency are available. In this work, we provide a simple and expli… ▽ More

    Submitted 6 November, 2018; v1 submitted 6 June, 2018; originally announced June 2018.