Skip to main content

Showing 1–7 of 7 results for author: Muralidharan, O

.
  1. arXiv:1911.05970  [pdf, other

    stat.ME

    Empirical Bayes mean estimation with nonparametric errors via order statistic regression on replicated data

    Authors: Nikolaos Ignatiadis, Sujayam Saha, Dennis L. Sun, Omkar Muralidharan

    Abstract: We study empirical Bayes estimation of the effect sizes of $N$ units from $K$ noisy observations on each unit. We show that it is possible to achieve near-Bayes optimal mean squared error, without any assumptions or knowledge about the effect size distribution or the noise. The noise distribution can be heteroskedastic and vary arbitrarily from unit to unit. Our proposal, which we call Aurora, lev… ▽ More

    Submitted 10 August, 2021; v1 submitted 14 November, 2019; originally announced November 2019.

  2. arXiv:1510.08437  [pdf, other

    stat.AP stat.ME

    Second Order Calibration: A Simple Way to Get Approximate Posteriors

    Authors: Omkar Muralidharan, Amir Najmi

    Abstract: Many large-scale machine learning problems involve estimating an unknown parameter $θ_{i}$ for each of many items. For example, a key problem in sponsored search is to estimate the click through rate (CTR) of each of billions of query-ad pairs. Most common methods, though, only give a point estimate of each $θ_{i}$. A posterior distribution for each $θ_{i}$ is usually more useful but harder to get… ▽ More

    Submitted 28 October, 2015; originally announced October 2015.

  3. arXiv:1508.01278  [pdf, other

    stat.AP

    Teaching Statistics at Google Scale

    Authors: Nicholas Chamandy, Omkar Muralidharan, Stefan Wager

    Abstract: Modern data and applications pose very different challenges from those of the 1950s or even the 1980s. Students contemplating a career in statistics or data science need to have the tools to tackle problems involving massive, heavy-tailed data, often interacting with live, complex systems. However, despite the deepening connections between engineering and modern data science, we argue that trainin… ▽ More

    Submitted 16 August, 2015; v1 submitted 6 August, 2015; originally announced August 2015.

    Comments: To appear in The American Statistician

  4. arXiv:1310.2931  [pdf, other

    stat.ME cs.LG stat.ML

    Feedback Detection for Live Predictors

    Authors: Stefan Wager, Nick Chamandy, Omkar Muralidharan, Amir Najmi

    Abstract: A predictor that is deployed in a live production system may perturb the features it uses to make predictions. Such a feedback loop can occur, for example, when a model that predicts a certain type of behavior ends up causing the behavior it predicts, thus creating a self-fulfilling prophecy. In this paper we analyze predictor feedback detection as a causal inference problem, and introduce a local… ▽ More

    Submitted 31 October, 2014; v1 submitted 10 October, 2013; originally announced October 2013.

    Comments: Advances in Neural Information Processing Systems (NIPS), 2014

  5. arXiv:1211.3955  [pdf, ps, other

    cs.GT cs.LG

    On Calibrated Predictions for Auction Selection Mechanisms

    Authors: H. Brendan McMahan, Omkar Muralidharan

    Abstract: Calibration is a basic property for prediction systems, and algorithms for achieving it are well-studied in both statistics and machine learning. In many applications, however, the predictions are used to make decisions that select which observations are made. This makes calibration difficult, as adjusting predictions to achieve calibration changes future data. We focus on click-through-rate (CTR)… ▽ More

    Submitted 16 November, 2012; originally announced November 2012.

  6. Detecting mutations in mixed sample sequencing data using empirical Bayes

    Authors: Omkar Muralidharan, Georges Natsoulis, John Bell, Hanlee Ji, Nancy R. Zhang

    Abstract: We develop statistically based methods to detect single nucleotide DNA mutations in next generation sequencing data. Sequencing generates counts of the number of times each base was observed at hundreds of thousands to billions of genome positions in each sample. Using these counts to detect mutations is challenging because mutations may have very low prevalence and sequencing error rates vary dra… ▽ More

    Submitted 28 September, 2012; originally announced September 2012.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOAS538 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS538

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 3, 1047-1067

  7. An empirical Bayes mixture method for effect size and false discovery rate estimation

    Authors: Omkar Muralidharan

    Abstract: Many statistical problems involve data from thousands of parallel cases. Each case has some associated effect size, and most cases will have no effect. It is often important to estimate the effect size and the local or tail-area false discovery rate for each case. Most current methods do this separately, and most are designed for normal data. This paper uses an empirical Bayes mixture model approa… ▽ More

    Submitted 7 October, 2010; originally announced October 2010.

    Comments: Published in at http://dx.doi.org/10.1214/09-AOAS276 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS276

    Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 1, 422-438