Skip to main content

Showing 1–12 of 12 results for author: Erosheva, E A

.
  1. arXiv:2406.19563  [pdf, other

    stat.ME stat.AP

    Bayesian Rank-Clustering

    Authors: Michael Pearce, Elena A. Erosheva

    Abstract: In a traditional analysis of ordinal comparison data, the goal is to infer an overall ranking of objects from best to worst with each object having a unique rank. However, the ranks of some objects may not be statistically distinguishable. This could happen due to insufficient data or to the true underlying abilities or qualities being equal for some objects. In such cases, practitioners may prefe… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 36 pages, 20 figures, 2 tables

  2. arXiv:2301.09755  [pdf, other

    stat.ME

    Modeling Preferences: A Bayesian Mixture of Finite Mixtures for Rankings and Ratings

    Authors: Michael Pearce, Elena A. Erosheva

    Abstract: Rankings and ratings are commonly used to express preferences but provide distinct and complementary information. Rankings give ordinal and scale-free comparisons but lack granularity; ratings provide cardinal and granular assessments but may be highly subjective or inconsistent. Collecting and analyzing rankings and ratings jointly has not been performed until recently due to a lack of principled… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: 41 pages, 16 figures

  3. arXiv:2208.03252  [pdf, other

    stat.ME stat.AP

    Partial-Mastery Cognitive Diagnosis Models

    Authors: Zhuoran Shang, Elena A. Erosheva, Gongjun Xu

    Abstract: Cognitive diagnosis models (CDMs) are a family of discrete latent attribute models that serve as statistical basis in educational and psychological cognitive diagnosis assessments. CDMs aim to achieve fine-grained inference on individuals' latent attributes, based on their observed responses to a set of designed diagnostic items. In the literature, CDMs usually assume that items require mastery of… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

    Journal ref: This work has been published in Ann. Appl. Stat. 15(3): 1529-1555 (September 2021)

  4. arXiv:2206.12365  [pdf, ps, other

    math.ST

    On the validity of bootstrap uncertainty estimates in the Mallows-Binomial model

    Authors: Michael Pearce, Elena A. Erosheva

    Abstract: The Mallows-Binomial distribution is the first joint statistical model for rankings and ratings (Pearce and Erosheva, 2022). Because frequentist estimation of the model parameters and their uncertainty is challenging, it is natural to consider the nonparametric bootstrap. However, it is not clear that the nonparametric bootstrap is asymptotically valid in this setting. This is because the Mallows-… ▽ More

    Submitted 20 July, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: 9 pages

  5. arXiv:2201.02539  [pdf, other

    stat.ME stat.ML

    A Unified Statistical Learning Model for Rankings and Scores with Application to Grant Panel Review

    Authors: Michael Pearce, Elena A. Erosheva

    Abstract: Rankings and scores are two common data types used by judges to express preferences and/or perceptions of quality in a collection of objects. Numerous models exist to study data of each type separately, but no unified statistical model captures both data types simultaneously without first performing data conversion. We propose the Mallows-Binomial model to close this gap, which combines a Mallows'… ▽ More

    Submitted 24 June, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: 36 pages, 8 figures

    Journal ref: JMLR 23(210): 1-33, 2022

  6. arXiv:2109.11705  [pdf, other

    stat.ME

    Dimension-Grouped Mixed Membership Models for Multivariate Categorical Data

    Authors: Yuqi Gu, Elena A. Erosheva, Gongjun Xu, David B. Dunson

    Abstract: Mixed Membership Models (MMMs) are a popular family of latent structure models for complex multivariate data. Instead of forcing each subject to belong to a single cluster, MMMs incorporate a vector of subject-specific weights characterizing partial membership across clusters. With this flexibility come challenges in uniquely identifying, estimating, and interpreting the parameters. In this articl… ▽ More

    Submitted 14 February, 2023; v1 submitted 23 September, 2021; originally announced September 2021.

  7. arXiv:1909.01284  [pdf, other

    stat.AP

    Gender-based homophily in collaborations across a heterogeneous scholarly landscape

    Authors: Y. Samuel Wang, Carole J. Lee, Jevin D. West, Carl T. Bergstrom, Elena A. Erosheva

    Abstract: In this article, we investigate the role of gender in collaboration patterns by analyzing gender-based homophily -- the tendency for researchers to co-author with individuals of the same gender. We develop and apply novel methodology to the corpus of JSTOR articles, a broad scholarly landscape, which we analyze at various levels of granularity. Most notably, for a precise analysis of gender homoph… ▽ More

    Submitted 16 June, 2022; v1 submitted 3 September, 2019; originally announced September 2019.

  8. arXiv:1711.11057  [pdf, other

    stat.ME stat.AP stat.ML

    On the use of bootstrap with variational inference: Theory, interpretation, and a two-sample test example

    Authors: Yen-Chi Chen, Y. Samuel Wang, Elena A. Erosheva

    Abstract: Variational inference is a general approach for approximating complex density functions, such as those arising in latent variable models, popular in machine learning. It has been applied to approximate the maximum likelihood estimator and to carry out Bayesian inference, however, quantification of uncertainty with variational inference remains challenging from both theoretical and practical perspe… ▽ More

    Submitted 17 April, 2018; v1 submitted 29 November, 2017; originally announced November 2017.

    Comments: Accepted to the Annals of Applied Statistics; 34 pages, 8 pages

    MSC Class: 62G09 (Primary); 62G15; 62H99 (Secondary)

  9. arXiv:1610.09026  [pdf, ps, other

    cs.DL physics.soc-ph stat.ME

    On the relationship between set-based and network-based measures of gender homophily in scholarly publications

    Authors: Y. Samuel Wang, Elena A. Erosheva

    Abstract: There is an increased interest in the scientific community in the problem of measuring gender homophily in co-authorship on scholarly publications (Eisen, 2016). For a given set of publications and co-authorships, we assume that author identities have not been disambiguated in that we do not know when one person is an author on more than one paper. In this case, one way to think about measuring ge… ▽ More

    Submitted 11 November, 2016; v1 submitted 27 October, 2016; originally announced October 2016.

    Comments: University of Washington; Center for Statistics and Social Sciences; WP 157

  10. arXiv:1512.08731  [pdf, other

    stat.ME stat.AP

    A Variational EM Method for Mixed Membership Models with Multivariate Rank Data: an Analysis of Public Policy Preferences

    Authors: Y. Samuel Wang, Ross Matsueda, Elena A. Erosheva

    Abstract: In this article, we consider modeling ranked responses from a heterogeneous population. Specifically, we analyze data from the Eurobarometer 34.1 survey regarding public policy preferences towards drugs, alcohol and AIDS. Such policy preferences are likely to exhibit substantial differences within as well as across European nations reflecting a wide variety of cultures, political affiliations, ide… ▽ More

    Submitted 24 February, 2017; v1 submitted 29 December, 2015; originally announced December 2015.

    Comments: 24 pages; 7 figures

  11. A semiparametric approach to mixed outcome latent variable models: Estimating the association between cognition and regional brain volumes

    Authors: Jonathan Gruhl, Elena A. Erosheva, Paul K. Crane

    Abstract: Multivariate data that combine binary, categorical, count and continuous outcomes are common in the social and health sciences. We propose a semiparametric Bayesian latent variable model for multivariate data of arbitrary type that does not require specification of conditional distributions. Drawing on the extended rank likelihood method by Hoff [Ann. Appl. Stat. 1 (2007) 265-283], we develop a se… ▽ More

    Submitted 13 January, 2014; originally announced January 2014.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOAS675 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS675

    Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 4, 2361-2383

  12. Describing disability through individual-level mixture models for multivariate binary data

    Authors: Elena A. Erosheva, Stephen E. Fienberg, Cyrille Joutard

    Abstract: Data on functional disability are of widespread policy interest in the United States, especially with respect to planning for Medicare and Social Security for a growing population of elderly adults. We consider an extract of functional disability data from the National Long Term Care Survey (NLTCS) and attempt to develop disability profiles using variations of the Grade of Membership (GoM) model… ▽ More

    Submitted 13 December, 2007; originally announced December 2007.

    Comments: Published in at http://dx.doi.org/10.1214/07-AOAS126 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS126

    Journal ref: Annals of Applied Statistics 2007, Vol. 1, No. 2, 502-537