
Showing 1–27 of 27 results for author: Deshpande, S K

  1. arXiv:2405.03538  [pdf, other]

    stat.AP

    Adolescent sports participation and health in early adulthood: An observational study

    Authors: A**kya H. Kokandakar, Yuzhou Lin, Steven **, Jordan Weiss, Amanda R. Rabinowitz, Reuben A. Buford May, Dylan Small, Sameer K. Deshpande

    Abstract: We study the impact of teenage sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status. To pro…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: The pre-analysis protocol for this study is available at arXiv:2211.02104

  2. arXiv:2402.13961  [pdf, other]

    math.ST

    New directions in algebraic statistics: Three challenges from 2023

    Authors: Yulia Alexandr, Miles Bakenhus, Mark Curiel, Sameer K. Deshpande, Elizabeth Gross, Yuqi Gu, Max Hill, Joseph Johnson, Bryson Kagy, Vishesh Karwa, Jiayi Li, Hanbaek Lyu, Sonja Petrović, Jose Israel Rodriguez

    Abstract: In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences. Naturally…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: This research was performed while the authors were visiting the Institute for Mathematical and Statistical Innovation (IMSI), which is supported by the National Science Foundation (Grant No. DMS-1929348). We participated in the long program "Algebraic Statistics and Our Changing World"

    MSC Class: 62R01

  3. Evaluating plate discipline in Major League Baseball with Bayesian Additive Regression Trees

    Authors: Ryan Yee, Sameer K. Deshpande

    Abstract: We introduce a three-step framework to determine at which pitches Major League batters should swing. Unlike traditional plate discipline metrics, which implicitly assume that all batters should always swing at (resp. take) pitches inside (resp. outside) the strike zone, our approach explicitly accounts not only for the players and umpires involved in the pitch but also in-game contextual informati…

    Submitted 20 September, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  4. arXiv:2212.00219  [pdf, other]

    stat.ML cs.LG stat.OT

    Are you using test log-likelihood correctly?

    Authors: Sameer K. Deshpande, Soumya Ghosh, Tin D. Nguyen, Tamara Broderick

    Abstract: Test log-likelihood is commonly used to compare different models of the same data or different approximate inference algorithms for fitting the same probabilistic model. We present simple examples demonstrating how comparisons based on test log-likelihood can contradict comparisons according to other objectives. Specifically, our examples show that (i) approximate Bayesian inference algorithms tha…

    Submitted 18 January, 2024; v1 submitted 30 November, 2022; originally announced December 2022.

    Comments: Presented at the ICBINB Workshop at NeurIPS 2022. This version accepted at TMLR, available at https://openreview.net/forum?id=n2YifD4Dxo

  5. arXiv:2211.04459  [pdf, other]

    stat.ME stat.ML

    flexBART: Flexible Bayesian regression trees with categorical predictors

    Authors: Sameer K. Deshpande

    Abstract: Most implementations of Bayesian additive regression trees (BART) one-hot encode categorical predictors, replacing each one with several binary indicators, one for every level or category. Regression trees built with these indicators partition the discrete set of categorical levels by repeatedly removing one level at a time. Unfortunately, the vast majority of partitions cannot be built with this…

    Submitted 21 June, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Software available at https://github.com/skdeshpande91/flexBART

  6. arXiv:2211.02104  [pdf, other]

    stat.AP

    Pre-analysis protocol for an observational study on the effects of adolescent sports participation on health in early adulthood

    Authors: A**kya H Kokandakar, Yuzhou Lin, Steven **, Jordan Weiss, Amanda R Rabinowitz, Reuben A Buford May, Dylan Small, Sameer K Deshpande

    Abstract: We will study the impact of adolescent sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status…

    Submitted 30 November, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  7. arXiv:2211.02020  [pdf, other]

    stat.AP

    Bayesian Causal Forests & the 2022 ACIC Data Challenge: Scalability and Sensitivity

    Authors: A**kya H. Kokandakar, Hyunseung Kang, Sameer K. Deshpande

    Abstract: We demonstrate how Hahn et al.'s Bayesian Causal Forests model (BCF) can be used to estimate conditional average treatment effects for the longitudinal dataset in the 2022 American Causal Inference Conference Data Challenge. Unfortunately, existing implementations of BCF do not scale to the size of the challenge data. Therefore, we developed flexBCF -- a more scalable and flexible implementation o…

    Submitted 11 May, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Journal ref: Observational Studies 9(3), 29-41 (2023). https://www.muse.jhu.edu/article/895651

  8. A Bayesian analysis of the time through the order penalty in baseball

    Authors: Ryan S. Brill, Sameer K. Deshpande, Abraham J. Wyner

    Abstract: As a baseball game progresses, batters appear to perform better the more times they face a particular pitcher. The apparent drop-off in pitcher performance from one time through the order to the next, known as the Time Through the Order Penalty (TTOP), is often attributed to within-game batter learning. Although the TTOP has largely been accepted within baseball and influences many managers' in-ga…

    Submitted 31 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to JQAS

  9. arXiv:2209.04389  [pdf, other]

    math.ST

    Posterior contraction and uncertainty quantification for the multivariate spike-and-slab LASSO

    Authors: Yunyi Shen, Sameer K. Deshpande

    Abstract: We study the asymptotic properties of Deshpande et al. (2019)'s multivariate spike-and-slab LASSO (mSSL) procedure for simultaneous variable and covariance selection in the sparse multivariate linear regression problem. In that problem, $q$ correlated responses are regressed onto $p$ covariates and the mSSL works by placing separate spike-and-slab priors on the entries in the matrix of marginal c…

    Submitted 22 May, 2024; v1 submitted 9 September, 2022; originally announced September 2022.

  10. arXiv:2207.07020  [pdf, other]

    stat.ME

    Estimating sparse direct effects in multivariate regression with the spike-and-slab LASSO

    Authors: Yunyi Shen, Claudia Solís-Lemus, Sameer K. Deshpande

    Abstract: The multivariate regression interpretation of the Gaussian chain graph model simultaneously parametrizes (i) the direct effects of $p$ predictors on $q$ outcomes and (ii) the residual partial covariances between pairs of outcomes. We introduce a new method for fitting sparse Gaussian chain graph models with spike-and-slab LASSO (SSL) priors. We develop an Expectation Conditional Maximization algor…

    Submitted 26 March, 2024; v1 submitted 14 July, 2022; originally announced July 2022.

  11. arXiv:2201.04957  [pdf]

    cond-mat.mtrl-sci

    Dielectric Properties of Polysulfone Carbon Nanotube Composite Membranes

    Authors: Bhakti Hirani, P. S. Goyal, Deepali Shrivastava, S. K. Deshpande

    Abstract: Polymeric membranes, including Polysulfone (PSf) membranes, are routinely used for water treatment. To enhance water permeation of the above membranes, it is common to synthesize polymeric membranes with carbon nanotubes (CNTs) embedded in them. It is seen that water permeability of membranes having vertically aligned CNTs is higher, as compared to those where CNTs are not aligned. It is of interest t…

    Submitted 6 January, 2022; originally announced January 2022.

    Comments: Conference on Technologies for Future Cities 2021

  12. arXiv:2106.06510  [pdf, other]

    stat.ML cs.LG stat.CO

    Measuring the robustness of Gaussian processes to kernel choice

    Authors: William T. Stephenson, Soumya Ghosh, Tin D. Nguyen, Mikhail Yurochkin, Sameer K. Deshpande, Tamara Broderick

    Abstract: Gaussian processes (GPs) are used to make medical and scientific decisions, including in cardiac care and monitoring of atmospheric carbon dioxide levels. Notably, the choice of GP kernel is often somewhat arbitrary. In particular, uncountably many kernels typically align with qualitative prior knowledge (e.g. function smoothness or stationarity). But in practice, data analysts choose among a han…

    Submitted 12 March, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: AISTATS 2022

  13. arXiv:2102.09705  [pdf, other]

    stat.ME

    Confidently Comparing Estimators with the c-value

    Authors: Brian L. Trippe, Sameer K. Deshpande, Tamara Broderick

    Abstract: Modern statistics provides an ever-expanding toolkit for estimating unknown parameters. Consequently, applied statisticians frequently face a difficult decision: retain a parameter estimate from a familiar method or replace it with an estimate from a newer or more complex one. While it is traditional to compare estimates using risk, such comparisons are rarely conclusive in realistic settings. I…

    Submitted 19 December, 2022; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted for publication in the Journal of the American Statistical Association

  14. arXiv:2006.12669  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Approximate Cross-Validation for Structured Models

    Authors: Soumya Ghosh, William T. Stephenson, Tin D. Nguyen, Sameer K. Deshpande, Tamara Broderick

    Abstract: Many modern data analyses benefit from explicitly modeling dependence structure in data -- such as measurements across time or space, ordered words in a sentence, or genes in a genome. A gold standard evaluation technique is structured cross-validation (CV), which leaves out some data subset (such as data within a time interval or data in a geographic region) in each fold. But CV here can be prohi…

    Submitted 1 December, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: 25 pages, 8 figures. NeurIPS 2020 camera ready. v2 fixes typos and provides additional empirical results. Code: https://github.com/SoumyaTGhosh/structured-infinitesimal-jackknife

  15. arXiv:2003.06416  [pdf, other]

    stat.ME

    VCBART: Bayesian trees for varying coefficients

    Authors: Sameer K. Deshpande, Ray Bai, Cecilia Balocchi, Jennifer E. Starling, Jordan Weiss

    Abstract: The linear varying coefficient model posits a linear relationship between an outcome and covariates in which the covariate effects are modeled as functions of additional effect modifiers. Despite a long history of study and use in statistics and econometrics, state-of-the-art varying coefficient modeling methods cannot accommodate multivariate effect modifiers without imposing restrictive functio…

    Submitted 13 May, 2024; v1 submitted 13 March, 2020; originally announced March 2020.

  16. arXiv:1912.00111  [pdf, other]

    stat.AP stat.ME

    Crime in Philadelphia: Bayesian Clustering with Particle Optimization

    Authors: Cecilia Balocchi, Sameer K. Deshpande, Edward I. George, Shane T. Jensen

    Abstract: Accurate estimation of the change in crime over time is a critical first step towards better understanding of public safety in large urban environments. Bayesian hierarchical modeling is a natural way to study spatial variation in urban crime dynamics at the neighborhood level, since it facilitates principled "sharing of information" between spatially adjacent neighborhoods. Typically, however,…

    Submitted 21 June, 2022; v1 submitted 29 November, 2019; originally announced December 2019.

  17. arXiv:1910.12337  [pdf, other]

    stat.AP

    Expected Hypothetical Completion Probability

    Authors: Sameer K. Deshpande, Katherine Evans

    Abstract: Using high-resolution player tracking data made available by the National Football League (NFL) for their 2019 Big Data Bowl competition, we introduce the Expected Hypothetical Completion Probability (EHCP), an objective framework for evaluating plays. At the heart of EHCP is the question "on a given passing play, did the quarterback throw the pass to the receiver who was most likely to catch it?"…

    Submitted 27 October, 2019; originally announced October 2019.

    Comments: This paper elaborates on work done for the NFL 2019 Big Data Bowl contest. Manuscript accepted at the Journal of Quantitative Analysis in Sports

  18. arXiv:1902.10106  [pdf, ps, other]

    stat.AP

    Protocol for an Observational Study of the Association of High School Football Participation on Health in Late Adulthood

    Authors: Timothy G. Gaulton, Sameer K. Deshpande, Dylan S. Small, Mark D. Neuman

    Abstract: American football is the most popular high school sport and is among the leading causes of injury among adolescents. While there has been considerable recent attention on the link between football and cognitive decline, there is also evidence of higher than expected rates of pain, obesity, and lower quality of life among former professional players, either as a result of repetitive head injury or t…

    Submitted 26 February, 2019; originally announced February 2019.

  19. arXiv:1808.03934  [pdf, other]

    stat.AP

    Protocol for an observational study on the effects of playing football in adolescence on mental health in early adulthood

    Authors: Sameer K. Deshpande, Raiden B. Hasegawa, Jordan Weiss, Dylan S. Small

    Abstract: More than 1 million students play high school American football annually, but many health professionals have recently questioned its safety or called for its ban. These concerns have been partially driven by reports of chronic traumatic encephalopathy (CTE), increased risks of neurodegenerative disease, and associations between concussion history and later-life cognitive impairment and depression…

    Submitted 9 November, 2018; v1 submitted 12 August, 2018; originally announced August 2018.

    Comments: Updated tables summarizing the matches constructed

  20. arXiv:1807.10558  [pdf]

    stat.AP

    Protocol for an Observational Study on the Effects of Early-Life Participation in Contact Sports on Later-Life Cognition in a Sample of Monozygotic and Dizygotic Swedish Twins Reared Together and Twins Reared Apart

    Authors: Jordan Weiss, Amanda R. Rabinowitz, Sameer K. Deshpande, Raiden B. Hasegawa, Dylan S. Small

    Abstract: A large body of work links traumatic brain injury (TBI) in adulthood to the onset of Alzheimer's disease (AD). AD is the chief cause of dementia, leading to reduced cognitive capacity and autonomy and increased mortality risk. More recently, researchers have sought to investigate whether TBI experienced in early-life may influence trajectories of cognitive dysfunction in adulthood. It has been spe…

    Submitted 16 April, 2020; v1 submitted 27 July, 2018; originally announced July 2018.

    Comments: Updated methodology and tables

  21. Simultaneous Variable and Covariance Selection with the Multivariate Spike-and-Slab Lasso

    Authors: Sameer K. Deshpande, Veronika Rockova, Edward I. George

    Abstract: We propose a Bayesian procedure for simultaneous variable and covariance selection using continuous spike-and-slab priors in multivariate linear regression models where q possibly correlated responses are regressed onto p predictors. Rather than relying on a stochastic search through the high-dimensional model space, we develop an ECM algorithm similar to the EMVS procedure of Rockova & George (20…

    Submitted 24 July, 2018; v1 submitted 29 August, 2017; originally announced August 2017.

  22. arXiv:1705.03918  [pdf, other]

    stat.ME

    Causal Inference with Two Versions of Treatment

    Authors: Raiden B. Hasegawa, Sameer K. Deshpande, Dylan S. Small, Paul R. Rosenbaum

    Abstract: Causal effects are commonly defined as comparisons of the potential outcomes under treatment and control, but this definition is threatened by the possibility that the treatment or control condition is not well-defined, existing instead in more than one version. A simple, widely applicable analysis is proposed to address the possibility that the treatment or control condition exists in two version…

    Submitted 24 April, 2019; v1 submitted 10 May, 2017; originally announced May 2017.

  23. A Hierarchical Bayesian Model of Pitch Framing

    Authors: Sameer K. Deshpande, Abraham J. Wyner

    Abstract: Since the advent of high-resolution pitch tracking data (PITCHf/x), many in the sabermetrics community have attempted to quantify a Major League Baseball catcher's ability to "frame" a pitch (i.e. increase the chance that a pitch is called as a strike). Especially in the last three years, there has been an explosion of interest in the "art of pitch framing" in the popular press as well as signs th…

    Submitted 9 September, 2017; v1 submitted 3 April, 2017; originally announced April 2017.

    Journal ref: Journal of Quantitative Analysis in Sports. 13(3): 95--112. (2017)

  24. arXiv:1607.01756  [pdf, other]

    stat.AP

    Protocol for an Observational Study on the Effects of Playing High School Football on Later Life Cognitive Functioning and Mental Health

    Authors: Sameer K. Deshpande, Raiden B. Hasegawa, Amanda R. Rabinowitz, John Whyte, Carol L. Roan, Andrew Tabatabaei, Michael Baiocchi, Jason H. Karlawish, Christina L. Master, Dylan S. Small

    Abstract: A potential causal relationship between head injuries sustained by NFL players and later-life neurological decline may have broad implications for participants in youth and high school football programs. However, brain trauma risk at the professional level may be different than that at the youth and high school levels and the long-term effects of participation at these levels are as yet unclear. To…

    Submitted 6 July, 2016; originally announced July 2016.

    Comments: Prior to performing the proposed analysis, we will register this pre-analysis plan on clinicaltrials.gov

  25. Estimating an NBA player's impact on his team's chances of winning

    Authors: Sameer K. Deshpande, Shane T. Jensen

    Abstract: Traditional NBA player evaluation metrics are based on scoring differential or some pace-adjusted linear combination of box score statistics like points, rebounds, assists, etc. These measures treat performances with the outcome of the game still in question (e.g. tie score with five minutes left) in exactly the same way as they treat performances with the outcome virtually decided (e.g. when one…

    Submitted 11 April, 2016; originally announced April 2016.

    Comments: To appear in the Journal of Quantitative Analysis in Sports

    Journal ref: Journal of Quantitative Analysis in Sports. 12(2): 51-72 (2016)

  26. arXiv:1009.5460  [pdf]

    cond-mat.mtrl-sci

    The Dielectric Response of La0.5Ca0.5-xSrxMnO3 (0.1 <= x <= 0.4) Manganites with Different Magnetic Ground States

    Authors: Indu Dhiman, S. K. Deshpande, A. Das

    Abstract: The dielectric behavior of half doped manganites La0.5Ca0.5-xSrxMnO3 ($0.1 \leq x \leq 0.4$) with varying magnetic ground states has been studied. The real part of relative permittivity as a function of temperature, $\varepsilon'(T)$, exhibits a maximum around the ferromagnetic (TC) and charge ordering transition (TCO) temperatures accompanied with high dielectric losses. The activation energies obtained f…

    Submitted 28 September, 2010; originally announced September 2010.

    Comments: 14 pages, 5 figures, To appear in Journal of Applied Physics

  27. Study of ion beam induced mixing in nano-layered Si/C multilayer structures

    Authors: Ram Prakash, S. Amirthapandian, D. M. Phase, S. K. Deshpande, R. Kesavamoorthy, K. G. M. Nair

    Abstract: The effects of ion beam induced atomic mixing and subsequent thermal treatment in Si/C multilayer structures are investigated by use of the technique of grazing incidence X-ray diffraction (GIXRD) and Raman spectroscopy. The [Si (3.0 nm) / C (2.5 nm)]x10 /Si multilayer films were prepared by electron beam evaporation under ultra high vacuum (UHV) environment. The layer thicknesses were measured…

    Submitted 11 January, 2006; originally announced January 2006.

    Comments: 16 pages, 3 figures. To appear soon in NIM B (article in press)

    Journal ref: Nuclear Instruments and Methods B 244 (2006) 283-288