Skip to main content

Showing 1–6 of 6 results for author: Shen, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.00139  [pdf, other

    stat.ME stat.AP

    A Calibrated Sensitivity Analysis for Weighted Causal Decompositions

    Authors: Andy Shen, Elina Visoki, Ran Barzilay, Samuel D. Pimentel

    Abstract: Disparities in health or well-being experienced by minority groups can be difficult to study using the traditional exposure-outcome paradigm in causal inference, since potential outcomes in variables such as race or sexual minority status are challenging to interpret. Causal decomposition analysis addresses this gap by positing causal effects on disparities under interventions to other, intervenab… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2402.05330  [pdf, other

    stat.ML cs.LG

    Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free Inference

    Authors: Luca Masserano, Alex Shen, Michele Doro, Tommaso Dorigo, Rafael Izbicki, Ann B. Lee

    Abstract: An open scientific challenge is how to classify events with reliable measures of uncertainty, when we have a mechanistic model of the data-generating process but the distribution over both labels and latent nuisance parameters is different between train and target data. We refer to this type of distributional shift as generalized label shift (GLS). Direct classification using observed data… ▽ More

    Submitted 1 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 26 pages, 19 figures, code available at https://github.com/lee-group-cmu/lf2i

  3. arXiv:2306.01911  [pdf, other

    stat.ME

    Generalized Bayesian MARS: Tools for Emulating Stochastic Computer Models

    Authors: Kellin Rumsey, Devin Francom, Andy Shen

    Abstract: The multivariate adaptive regression spline (MARS) approach of Friedman (1991) and its Bayesian counterpart (Francom et al. 2018) are effective approaches for the emulation of computer models. The traditional assumption of Gaussian errors limits the usefulness of MARS, and many popular alternatives, when dealing with stochastic computer models. We propose a generalized Bayesian MARS (GBMARS) frame… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

  4. arXiv:1905.07790  [pdf, other

    cs.CL cs.LG stat.ML

    Correlation Coefficients and Semantic Textual Similarity

    Authors: Vitalii Zhelezniak, Aleksandar Savkov, April Shen, Nils Y. Hammerla

    Abstract: A large body of research into semantic textual similarity has focused on constructing state-of-the-art embeddings using sophisticated modelling, careful choice of learning signals and many clever tricks. By contrast, little attention has been devoted to similarity measures between these embeddings, with cosine similarity being used unquestionably in the majority of cases. In this work, we illustra… ▽ More

    Submitted 19 May, 2019; originally announced May 2019.

    Comments: Accepted as a long paper at NAACL-HLT 2019

  5. arXiv:1204.0307  [pdf, other

    stat.AP

    Elections and statistics: the case of "United Russia", 2009-2020

    Authors: Alexander Shen

    Abstract: This survey contains statistics on elections in Russia published in different places and available online. This data is discussed from the viewpoint of statistical model selection. The current version is updated including the materials up to July, 2020 voting on constitutional changes, Belarus 2020 elections and papers that appeared in 2020; most of the data are not consistent with the assumption… ▽ More

    Submitted 7 September, 2020; v1 submitted 1 April, 2012; originally announced April 2012.

    Comments: in Russian

    MSC Class: 91F10

  6. arXiv:0912.4269  [pdf, ps, other

    math.ST stat.ME

    Test Martingales, Bayes Factors and $p$-Values

    Authors: Glenn Shafer, Alexander Shen, Nikolai Vereshchagin, Vladimir Vovk

    Abstract: A nonnegative martingale with initial value equal to one measures evidence against a probabilistic hypothesis. The inverse of its value at some stop** time can be interpreted as a Bayes factor. If we exaggerate the evidence by considering the largest value attained so far by such a martingale, the exaggeration will be limited, and there are systematic ways to eliminate it. The inverse of the exa… ▽ More

    Submitted 16 June, 2011; v1 submitted 21 December, 2009; originally announced December 2009.

    Comments: Published in at http://dx.doi.org/10.1214/10-STS347 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS347

    Journal ref: Statistical Science 2011, Vol. 26, No. 1, 84-101