
Showing 1–15 of 15 results for author: Schein, A

  1. arXiv:2405.16762  [pdf, other]

    cs.CY cs.LG

    Addressing Discretization-Induced Bias in Demographic Prediction

    Authors: Evan Dong, Aaron Schein, Yixin Wang, Nikhil Garg

    Abstract: Racial and other demographic imputation is necessary for many applications, especially in auditing disparities and outreach targeting in political campaigns. The canonical approach is to construct continuous predictions -- e.g., based on name and geography -- and then to $\textit{discretize}$ the predictions by selecting the most likely class (argmax). We study how this practice produces…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: A version of this paper was accepted to the 2024 ACM Conference on Fairness, Accountability, and Transparency

    ACM Class: K.4.0

  2. arXiv:2404.04633  [pdf, other]

    cs.CL

    Context versus Prior Knowledge in Language Models

    Authors: Kevin Du, Vésteinn Snæbjarnarson, Niklas Stoehr, Jennifer C. White, Aaron Schein, Ryan Cotterell

    Abstract: To answer a question, language models often need to integrate prior knowledge learned during pretraining and new information presented in context. We hypothesize that models perform this integration in a predictable way across different questions and contexts: models will rely more on prior knowledge for questions about entities (e.g., persons, places, etc.) that they are more familiar with due to…

    Submitted 16 June, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Long paper accepted at ACL 2024

  3. arXiv:2403.06153  [pdf, other]

    stat.ML cs.LG

    The AL$\ell_0$CORE Tensor Decomposition for Sparse Count Data

    Authors: John Hood, Aaron Schein

    Abstract: This paper introduces AL$\ell_0$CORE, a new form of probabilistic non-negative tensor decomposition. AL$\ell_0$CORE is a Tucker decomposition where the number of non-zero elements (i.e., the $\ell_0$-norm) of the core tensor is constrained to a preset value $Q$ much smaller than the size of the core. While the user dictates the total budget $Q$, the locations and values of the non-zero elements ar…

    Submitted 12 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  4. arXiv:2312.09203  [pdf, other]

    cs.CL

    Measurement in the Age of LLMs: An Application to Ideological Scaling

    Authors: Sean O'Hagan, Aaron Schein

    Abstract: Much of social science is centered around terms like "ideology" or "power", which generally elude precise definition, and whose contextual meanings are trapped in surrounding language. This paper explores the use of large language models (LLMs) to flexibly navigate the conceptual clutter inherent to social scientific measurement tasks. We rely on LLMs' remarkable linguistic fluency to elicit i…

    Submitted 7 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Under review at Harvard Data Science Review. Previously presented at the 4th International Conference of Social Computing in Beijing, China, September 2023, the New Directions in Analyzing Text as Data (TADA) meeting in Amherst, MA, USA, November 2023, and the NeurIPS workshop titled "I Can't Believe It's Not Better!" Failure Modes in the Age of Foundation Models in New Orleans, LA, December 2023

  5. arXiv:2212.04130  [pdf, other]

    stat.ML cs.LG cs.SI stat.AP

    The Ordered Matrix Dirichlet for State-Space Models

    Authors: Niklas Stoehr, Benjamin J. Radford, Ryan Cotterell, Aaron Schein

    Abstract: Many dynamical systems in the real world are naturally described by latent states with intrinsic orderings, such as "ally", "neutral", and "enemy" relationships in international relations. These latent states manifest through countries' cooperative versus conflictual interactions over time. State-space models (SSMs) explicitly relate the dynamics of observed measurements to transitions in latent s…

    Submitted 25 February, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: Presented at the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023

  6. arXiv:2210.03971  [pdf, other]

    cs.LG stat.AP

    An Ordinal Latent Variable Model of Conflict Intensity

    Authors: Niklas Stoehr, Lucas Torroba Hennigen, Josef Valvoda, Robert West, Ryan Cotterell, Aaron Schein

    Abstract: Measuring the intensity of events is crucial for monitoring and tracking armed conflict. Advances in automated event extraction have yielded massive data sets of "who did what to whom" micro-records that enable data-driven approaches to monitoring conflict. The Goldstein scale is a widely-used expert-based measure that scores events on a conflictual-cooperative scale. It is based only on the actio…

    Submitted 4 June, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Long Paper at ACL 2023

  7. arXiv:2106.06691  [pdf, other]

    stat.ML cs.LG q-bio.GN stat.AP

    Doubly Non-Central Beta Matrix Factorization for DNA Methylation Data

    Authors: Aaron Schein, Anjali Nagulpally, Hanna Wallach, Patrick Flaherty

    Abstract: We present a new non-negative matrix factorization model for $(0,1)$ bounded-support data based on the doubly non-central beta (DNCB) distribution, a generalization of the beta distribution. The expressiveness of the DNCB distribution is particularly useful for modeling DNA methylation datasets, which are typically highly dispersed and multi-modal; however, the model structure is sufficiently gene…

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: To appear in the Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI) 2021

  8. Preserving general physical properties in model reduction of dynamical systems via constrained-optimization projection

    Authors: A. Schein, K. T. Carlberg, M. J. Zahr

    Abstract: Model-reduction techniques aim to reduce the computational complexity of simulating dynamical systems by applying a (Petrov-)Galerkin projection process that enforces the dynamics to evolve in a low-dimensional subspace of the original state space. Frequently, the resulting reduced-order model (ROM) violates intrinsic physical properties of the original full-order model (FOM) (e.g., global conserv…

    Submitted 1 April, 2021; v1 submitted 27 November, 2020; originally announced November 2020.

  9. arXiv:1910.12991  [pdf, other]

    stat.ML cs.LG

    Poisson-Randomized Gamma Dynamical Systems

    Authors: Aaron Schein, Scott W. Linderman, Mingyuan Zhou, David M. Blei, Hanna Wallach

    Abstract: This paper presents the Poisson-randomized gamma dynamical system (PRGDS), a model for sequentially observed count tensors that encodes a strong inductive bias toward sparsity and burstiness. The PRGDS is based on a new motif in Bayesian latent variable modeling, an alternating chain of discrete Poisson and continuous gamma latent states that is analytically convenient and computationally tractabl…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: To appear in the Proceedings of the 32nd Advances in Neural Information Processing Systems (NeurIPS 2019)

  10. arXiv:1807.08225  [pdf, other]

    stat.ME

    The Hyperedge Event Model

    Authors: Bomin Kim, Aaron Schein, Bruce A. Desmarais, Hanna Wallach

    Abstract: We introduce the hyperedge event model (HEM)---a generative model for events that can be represented as directed edges with one sender and one or more receivers or one receiver and one or more senders. We integrate a dynamic version of the exponential random graph model (ERGM) of edge structure with a survival model for event timing to jointly understand who interacts with whom, and when. The HEM…

    Submitted 21 July, 2018; originally announced July 2018.

  11. arXiv:1803.08471  [pdf, other]

    stat.ML cs.CL cs.CR cs.LG cs.SI

    Locally Private Bayesian Inference for Count Models

    Authors: Aaron Schein, Zhiwei Steven Wu, Alexandra Schofield, Mingyuan Zhou, Hanna Wallach

    Abstract: We present a general method for privacy-preserving Bayesian inference in Poisson factorization, a broad class of models that includes some of the most widely used models in the social sciences. Our method satisfies limited precision local privacy, a generalization of local differential privacy, which we introduce to formulate privacy guarantees appropriate for sparse count data. We develop an MCMC…

    Submitted 21 February, 2019; v1 submitted 22 March, 2018; originally announced March 2018.

  12. arXiv:1701.05573  [pdf, other]

    stat.ML cs.LG

    Poisson--Gamma Dynamical Systems

    Authors: Aaron Schein, Mingyuan Zhou, Hanna Wallach

    Abstract: We introduce a new dynamical system for sequentially observed multivariate count data. This model is based on the gamma--Poisson construction---a natural choice for count data---and relies on a novel Bayesian nonparametric prior that ties and shrinks the model parameters, thus avoiding overfitting. We present an efficient MCMC inference algorithm that advances recent work on augmentation schemes f…

    Submitted 19 January, 2017; originally announced January 2017.

    Comments: Appeared in the Proceedings of the 29th Advances in Neural Information Processing Systems (NIPS 2016)

  13. arXiv:1606.01855  [pdf, other]

    stat.ML cs.AI cs.LG cs.SI stat.AP

    Bayesian Poisson Tucker Decomposition for Learning the Structure of International Relations

    Authors: Aaron Schein, Mingyuan Zhou, David M. Blei, Hanna Wallach

    Abstract: We introduce Bayesian Poisson Tucker decomposition (BPTD) for modeling country--country interaction event data. These data consist of interaction events of the form "country $i$ took action $a$ toward country $j$ at time $t$." BPTD discovers overlapping country--community memberships, including the number of latent communities. In addition, it discovers directed community--community interaction ne…

    Submitted 6 June, 2016; originally announced June 2016.

    Comments: To appear in Proceedings of the 33rd International Conference on Machine Learning (ICML 2016)

  14. arXiv:1506.03493  [pdf, other]

    stat.ML cs.AI cs.LG cs.SI stat.AP

    Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts

    Authors: Aaron Schein, John Paisley, David M. Blei, Hanna Wallach

    Abstract: We present a Bayesian tensor factorization model for inferring latent group structures from dynamic pairwise interaction patterns. For decades, political scientists have collected and analyzed records of the form "country $i$ took action $a$ toward country $j$ at time $t$"---known as dyadic events---in order to form and test theories of international relations. We represent these event data as a t…

    Submitted 10 June, 2015; originally announced June 2015.

    Comments: To appear in Proceedings of the 21st ACM SIGKDD Conference of Knowledge Discovery and Data Mining (KDD 2015)

  15. arXiv:1311.3982  [pdf, other]

    cs.AI cs.SI

    Inferring Multilateral Relations from Dynamic Pairwise Interactions

    Authors: Aaron Schein, Juston Moore, Hanna Wallach

    Abstract: Correlations between anomalous activity patterns can yield pertinent information about complex social processes: a significant deviation from normal behavior, exhibited simultaneously by multiple pairs of actors, provides evidence for some underlying relationship involving those pairs---i.e., a multilateral relation. We introduce a new nonparametric Bayesian latent variable model that explicitly c…

    Submitted 15 November, 2013; originally announced November 2013.

    Comments: NIPS 2013 Workshop on Frontiers of Network Analysis