-
Adolescent sports participation and health in early adulthood: An observational study
Authors:
A**kya H. Kokandakar,
Yuzhou Lin,
Steven **,
Jordan Weiss,
Amanda R. Rabinowitz,
Reuben A. Buford May,
Dylan Small,
Sameer K. Deshpande
Abstract:
We study the impact of teenage sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status. To pro…
▽ More
We study the impact of teenage sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status. To probe the possibility that certain types of sports participation may have larger effects on health than others, we conduct a matched observational study at each level within a hierarchy of exposures. Our hierarchy ranges from broadly defined exposures (e.g., participation in any organized after-school activity) to narrow (e.g., participation in collision sports). We deployed an ordered testing approach that exploits the hierarchical relationships between our exposure definitions to perform our analyses while maintaining a fixed family-wise error rate. Compared to teenagers who did not participate in any after-school activities, those who participated in sports had statistically significantly better self-rated and mental health outcomes in early adulthood.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
New directions in algebraic statistics: Three challenges from 2023
Authors:
Yulia Alexandr,
Miles Bakenhus,
Mark Curiel,
Sameer K. Deshpande,
Elizabeth Gross,
Yuqi Gu,
Max Hill,
Joseph Johnson,
Bryson Kagy,
Vishesh Karwa,
Jiayi Li,
Hanbaek Lyu,
Sonja Petrović,
Jose Israel Rodriguez
Abstract:
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally…
▽ More
In the last quarter of a century, algebraic statistics has established itself as an expanding field which uses multilinear algebra, commutative algebra, computational algebra, geometry, and combinatorics to tackle problems in mathematical statistics. These developments have found applications in a growing number of areas, including biology, neuroscience, economics, and social sciences.
Naturally, new connections continue to be made with other areas of mathematics and statistics. This paper outlines three such connections: to statistical models used in educational testing, to a classification problem for a family of nonparametric regression models, and to phase transition phenomena under uniform sampling of contingency tables. We illustrate the motivating problems, each of which is for algebraic statistics a new direction, and demonstrate an enhancement of related methodologies.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Aikyam: A Video Conferencing Utility for Deaf and Dumb
Authors:
Kshitij Deshpande,
Varad Mashalkar,
Kaustubh Mhaisekar,
Amaan Naikwadi,
Archana Ghotkar
Abstract:
With the advent of the pandemic, the use of video conferencing platforms as a means of communication has greatly increased and with it, so have the remote opportunities. The deaf and dumb have traditionally faced several issues in communication, but now the effect is felt more severely. This paper proposes an all-encompassing video conferencing utility that can be used with existing video conferen…
▽ More
With the advent of the pandemic, the use of video conferencing platforms as a means of communication has greatly increased and with it, so have the remote opportunities. The deaf and dumb have traditionally faced several issues in communication, but now the effect is felt more severely. This paper proposes an all-encompassing video conferencing utility that can be used with existing video conferencing platforms to address these issues. Appropriate semantically correct sentences are generated from the signer's gestures which would be interpreted by the system. Along with an audio to emit this sentence, the user's feed is also used to annotate the sentence. This can be viewed by all participants, thus aiding smooth communication with all parties involved. This utility utilizes a simple LSTM model for classification of gestures. The sentences are constructed by a t5 based model. In order to achieve the required data flow, a virtual camera is used.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Empathy and Distress Detection using Ensembles of Transformer Models
Authors:
Tanmay Chavan,
Kshitij Deshpande,
Sheetal Sonawane
Abstract:
This paper presents our approach for the WASSA 2023 Empathy, Emotion and Personality Shared Task. Empathy and distress are human feelings that are implicitly expressed in natural discourses. Empathy and distress detection are crucial challenges in Natural Language Processing that can aid our understanding of conversations. The provided dataset consists of several long-text examples in the English…
▽ More
This paper presents our approach for the WASSA 2023 Empathy, Emotion and Personality Shared Task. Empathy and distress are human feelings that are implicitly expressed in natural discourses. Empathy and distress detection are crucial challenges in Natural Language Processing that can aid our understanding of conversations. The provided dataset consists of several long-text examples in the English language, with each example associated with a numeric score for empathy and distress. We experiment with several BERT-based models as a part of our approach. We also try various ensemble methods. Our final submission has a Pearson's r score of 0.346, placing us third in the empathy and distress detection subtask.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Study and Survey on Gesture Recognition Systems
Authors:
Kshitij Deshpande,
Varad Mashalkar,
Kaustubh Mhaisekar,
Amaan Naikwadi,
Archana Ghotkar
Abstract:
In recent years, there has been a considerable amount of research in the Gesture Recognition domain, mainly owing to the technological advancements in Computer Vision. Various new applications have been conceptualised and developed in this field. This paper discusses the implementation of gesture recognition systems in multiple sectors such as gaming, healthcare, home appliances, industrial robots…
▽ More
In recent years, there has been a considerable amount of research in the Gesture Recognition domain, mainly owing to the technological advancements in Computer Vision. Various new applications have been conceptualised and developed in this field. This paper discusses the implementation of gesture recognition systems in multiple sectors such as gaming, healthcare, home appliances, industrial robots, and virtual reality. Different methodologies for capturing gestures are compared and contrasted throughout this survey. Various data sources and data acquisition techniques have been discussed. The role of gestures in sign language has been studied and existing approaches have been reviewed. Common challenges faced while building gesture recognition systems have also been explored.
△ Less
Submitted 1 December, 2023;
originally announced December 2023.
-
Mavericks at BLP-2023 Task 1: Ensemble-based Approach Using Language Models for Violence Inciting Text Detection
Authors:
Saurabh Page,
Sudeep Mangalvedhekar,
Kshitij Deshpande,
Tanmay Chavan,
Sheetal Sonawane
Abstract:
This paper presents our work for the Violence Inciting Text Detection shared task in the First Workshop on Bangla Language Processing. Social media has accelerated the propagation of hate and violence-inciting speech in society. It is essential to develop efficient mechanisms to detect and curb the propagation of such texts. The problem of detecting violence-inciting texts is further exacerbated i…
▽ More
This paper presents our work for the Violence Inciting Text Detection shared task in the First Workshop on Bangla Language Processing. Social media has accelerated the propagation of hate and violence-inciting speech in society. It is essential to develop efficient mechanisms to detect and curb the propagation of such texts. The problem of detecting violence-inciting texts is further exacerbated in low-resource settings due to sparse research and less data. The data provided in the shared task consists of texts in the Bangla language, where each example is classified into one of the three categories defined based on the types of violence-inciting texts. We try and evaluate several BERT-based models, and then use an ensemble of the models as our final submission. Our submission is ranked 10th in the final leaderboard of the shared task with a macro F1 score of 0.737.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
Mavericks at NADI 2023 Shared Task: Unravelling Regional Nuances through Dialect Identification using Transformer-based Approach
Authors:
Vedant Deshpande,
Yash Patwardhan,
Kshitij Deshpande,
Sudeep Mangalvedhekar,
Ravindra Murumkar
Abstract:
In this paper, we present our approach for the "Nuanced Arabic Dialect Identification (NADI) Shared Task 2023". We highlight our methodology for subtask 1 which deals with country-level dialect identification. Recognizing dialects plays an instrumental role in enhancing the performance of various downstream NLP tasks such as speech recognition and translation. The task uses the Twitter dataset (TW…
▽ More
In this paper, we present our approach for the "Nuanced Arabic Dialect Identification (NADI) Shared Task 2023". We highlight our methodology for subtask 1 which deals with country-level dialect identification. Recognizing dialects plays an instrumental role in enhancing the performance of various downstream NLP tasks such as speech recognition and translation. The task uses the Twitter dataset (TWT-2023) that encompasses 18 dialects for the multi-class classification problem. Numerous transformer-based models, pre-trained on Arabic language, are employed for identifying country-level dialects. We fine-tune these state-of-the-art models on the provided dataset. The ensembling method is leveraged to yield improved performance of the system. We achieved an F1-score of 76.65 (11th rank on the leaderboard) on the test dataset.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
Mavericks at ArAIEval Shared Task: Towards a Safer Digital Space -- Transformer Ensemble Models Tackling Deception and Persuasion
Authors:
Sudeep Mangalvedhekar,
Kshitij Deshpande,
Yash Patwardhan,
Vedant Deshpande,
Ravindra Murumkar
Abstract:
In this paper, we highlight our approach for the "Arabic AI Tasks Evaluation (ArAiEval) Shared Task 2023". We present our approaches for task 1-A and task 2-A of the shared task which focus on persuasion technique detection and disinformation detection respectively. Detection of persuasion techniques and disinformation has become imperative to avoid distortion of authentic information. The tasks u…
▽ More
In this paper, we highlight our approach for the "Arabic AI Tasks Evaluation (ArAiEval) Shared Task 2023". We present our approaches for task 1-A and task 2-A of the shared task which focus on persuasion technique detection and disinformation detection respectively. Detection of persuasion techniques and disinformation has become imperative to avoid distortion of authentic information. The tasks use multigenre snippets of tweets and news articles for the given binary classification problem. We experiment with several transformer-based models that were pre-trained on the Arabic language. We fine-tune these state-of-the-art models on the provided dataset. Ensembling is employed to enhance the performance of the systems. We achieved a micro F1-score of 0.742 on task 1-A (8th rank on the leaderboard) and 0.901 on task 2-A (7th rank on the leaderboard) respectively.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
Evaluating plate discipline in Major League Baseball with Bayesian Additive Regression Trees
Authors:
Ryan Yee,
Sameer K. Deshpande
Abstract:
We introduce a three-step framework to determine at which pitches Major League batters should swing. Unlike traditional plate discipline metrics, which implicitly assume that all batters should always swing at (resp. take) pitches inside (resp. outside) the strike zone, our approach explicitly accounts not only for the players and umpires involved in the pitch but also in-game contextual informati…
▽ More
We introduce a three-step framework to determine at which pitches Major League batters should swing. Unlike traditional plate discipline metrics, which implicitly assume that all batters should always swing at (resp. take) pitches inside (resp. outside) the strike zone, our approach explicitly accounts not only for the players and umpires involved in the pitch but also in-game contextual information like the number of outs, the count, baserunners, and score. We first fit flexible Bayesian nonparametric models to estimate (i) the probability that the pitch is called a strike if the batter takes the pitch; (ii) the probability that the batter makes contact if he swings; and (iii) the number of runs the batting team is expected to score following each pitch outcome (e.g. swing and miss, take a called strike, etc.). We then combine these intermediate estimates to determine whether swinging increases the batting team's run expectancy. Our approach enables natural uncertainty propagation so that we can not only determine the optimal swing/take decision but also quantify our confidence in that decision. We illustrate our framework using a case study of pitches faced by Mike Trout in 2019.
△ Less
Submitted 20 September, 2023; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Are you using test log-likelihood correctly?
Authors:
Sameer K. Deshpande,
Soumya Ghosh,
Tin D. Nguyen,
Tamara Broderick
Abstract:
Test log-likelihood is commonly used to compare different models of the same data or different approximate inference algorithms for fitting the same probabilistic model. We present simple examples demonstrating how comparisons based on test log-likelihood can contradict comparisons according to other objectives. Specifically, our examples show that (i) approximate Bayesian inference algorithms tha…
▽ More
Test log-likelihood is commonly used to compare different models of the same data or different approximate inference algorithms for fitting the same probabilistic model. We present simple examples demonstrating how comparisons based on test log-likelihood can contradict comparisons according to other objectives. Specifically, our examples show that (i) approximate Bayesian inference algorithms that attain higher test log-likelihoods need not also yield more accurate posterior approximations and (ii) conclusions about forecast accuracy based on test log-likelihood comparisons may not agree with conclusions based on root mean squared error.
△ Less
Submitted 18 January, 2024; v1 submitted 30 November, 2022;
originally announced December 2022.
-
flexBART: Flexible Bayesian regression trees with categorical predictors
Authors:
Sameer K. Deshpande
Abstract:
Most implementations of Bayesian additive regression trees (BART) one-hot encode categorical predictors, replacing each one with several binary indicators, one for every level or category. Regression trees built with these indicators partition the discrete set of categorical levels by repeatedly removing one level at a time. Unfortunately, the vast majority of partitions cannot be built with this…
▽ More
Most implementations of Bayesian additive regression trees (BART) one-hot encode categorical predictors, replacing each one with several binary indicators, one for every level or category. Regression trees built with these indicators partition the discrete set of categorical levels by repeatedly removing one level at a time. Unfortunately, the vast majority of partitions cannot be built with this strategy, severely limiting BART's ability to partially pool data across groups of levels. Motivated by analyses of baseball data and neighborhood-level crime dynamics, we overcame this limitation by re-implementing BART with regression trees that can assign multiple levels to both branches of a decision tree node. To model spatial data aggregated into small regions, we further proposed a new decision rule prior that creates spatially contiguous regions by deleting a random edge from a random spanning tree of a suitably defined network. Our re-implementation, which is available in the flexBART package, often yields improved out-of-sample predictive performance and scales better to larger datasets than existing implementations of BART.
△ Less
Submitted 21 June, 2023; v1 submitted 8 November, 2022;
originally announced November 2022.
-
Pre-analysis protocol for an observational study on the effects of adolescent sports participation on health in early adulthood
Authors:
A**kya H Kokandakar,
Yuzhou Lin,
Steven **,
Jordan Weiss,
Amanda R Rabinowitz,
Reuben A Buford May,
Dylan Small,
Sameer K Deshpande
Abstract:
We will study the impact of adolescent sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status…
▽ More
We will study the impact of adolescent sports participation on early-adulthood health using longitudinal data from the National Study of Youth and Religion. We focus on two primary outcomes measured at ages 23--28 -- self-rated health and total score on the PHQ9 Patient Depression Questionnaire -- and control for several potential confounders related to demographics and family socioeconomic status. Comparing outcomes between sports participants and matched non-sports participants with similar confounders is straightforward. Unfortunately, an analysis based on such a broad exposure cannot probe the possibility that participation in certain types of sports (e.g., collision sports like football or soccer) may have larger effects on health than others.
In this study, we introduce a hierarchy of exposure definitions, ranging from broad (participation in any after-school organized activity) to narrow (e.g., participation in limited-contact sports). We will perform separate matched observational studies, one for each definition, to estimate the health effects of several levels of sports participation. In order to conduct these studies while maintaining a fixed family-wise error rate, we deployed an ordered testing approach that exploits the logical relationships between exposure definitions. Our study will also consider several secondary outcomes including body mass index, life satisfaction, and problematic drinking behavior.
△ Less
Submitted 30 November, 2023; v1 submitted 3 November, 2022;
originally announced November 2022.
-
Bayesian Causal Forests & the 2022 ACIC Data Challenge: Scalability and Sensitivity
Authors:
A**kya H. Kokandakar,
Hyunseung Kang,
Sameer K. Deshpande
Abstract:
We demonstrate how Hahn et al.'s Bayesian Causal Forests model (BCF) can be used to estimate conditional average treatment effects for the longitudinal dataset in the 2022 American Causal Inference Conference Data Challenge. Unfortunately, existing implementations of BCF do not scale to the size of the challenge data. Therefore, we developed flexBCF -- a more scalable and flexible implementation o…
▽ More
We demonstrate how Hahn et al.'s Bayesian Causal Forests model (BCF) can be used to estimate conditional average treatment effects for the longitudinal dataset in the 2022 American Causal Inference Conference Data Challenge. Unfortunately, existing implementations of BCF do not scale to the size of the challenge data. Therefore, we developed flexBCF -- a more scalable and flexible implementation of BCF -- and used it in our challenge submission. We investigate the sensitivity of our results to the choice of propensity score estimation method and the use of sparsity-inducing regression tree priors. While we found that our overall point predictions were not especially sensitive to these modeling choices, we did observe that running BCF with flexibly estimated propensity scores often yielded better-calibrated uncertainty intervals.
△ Less
Submitted 11 May, 2023; v1 submitted 3 November, 2022;
originally announced November 2022.
-
A Bayesian analysis of the time through the order penalty in baseball
Authors:
Ryan S. Brill,
Sameer K. Deshpande,
Abraham J. Wyner
Abstract:
As a baseball game progresses, batters appear to perform better the more times they face a particular pitcher. The apparent drop-off in pitcher performance from one time through the order to the next, known as the Time Through the Order Penalty (TTOP), is often attributed to within-game batter learning. Although the TTOP has largely been accepted within baseball and influences many managers' in-ga…
▽ More
As a baseball game progresses, batters appear to perform better the more times they face a particular pitcher. The apparent drop-off in pitcher performance from one time through the order to the next, known as the Time Through the Order Penalty (TTOP), is often attributed to within-game batter learning. Although the TTOP has largely been accepted within baseball and influences many managers' in-game decision making, we argue that existing approaches of estimating the size of the TTOP cannot disentangle continuous evolution in pitcher performance over the course of the game from discontinuities between successive times through the order. Using a Bayesian multinomial regression model, we find that, after adjusting for confounders like batter and pitcher quality, handedness, and home field advantage, there is little evidence of strong discontinuity in pitcher performance between times through the order. Our analysis suggests that the start of the third time through the order should not be viewed as a special cutoff point in deciding whether to pull a starting pitcher.
△ Less
Submitted 31 May, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Posterior contraction and uncertainty quantification for the multivariate spike-and-slab LASSO
Authors:
Yunyi Shen,
Sameer K. Deshpande
Abstract:
We study the asymptotic properties of Deshpande et al.\ (2019)'s multivariate spike-and-slab LASSO (mSSL) procedure for simultaneous variable and covariance selection in the sparse multivariate linear regression problem. In that problem, $q$ correlated responses are regressed onto $p$ covariates and the mSSL works by placing separate spike-and-slab priors on the entries in the matrix of marginal c…
▽ More
We study the asymptotic properties of Deshpande et al.\ (2019)'s multivariate spike-and-slab LASSO (mSSL) procedure for simultaneous variable and covariance selection in the sparse multivariate linear regression problem. In that problem, $q$ correlated responses are regressed onto $p$ covariates and the mSSL works by placing separate spike-and-slab priors on the entries in the matrix of marginal covariate effects and off-diagonal elements in the upper triangle of the residual precision matrix. Under mild assumptions about these matrices, we establish the posterior contraction rate for the mSSL posterior in the asymptotic regime where both $p$ and $q$ diverge with $n.$ By ``de-biasing'' the corresponding MAP estimates, we obtain confidence intervals for each covariate effect and residual partial correlation. In extensive simulation studies, these intervals displayed close-to-nominal frequentist coverage in finite sample settings but tended to be substantially longer than those obtained using a version of the Bayesian bootstrap that randomly re-weights the prior. We further show that the de-biased intervals for individual covariate effects are asymptotically valid.
△ Less
Submitted 22 May, 2024; v1 submitted 9 September, 2022;
originally announced September 2022.
-
Development and Validation of ML-DQA -- a Machine Learning Data Quality Assurance Framework for Healthcare
Authors:
Mark Sendak,
Gaurav Sirdeshmukh,
Timothy Ochoa,
Hayley Premo,
Linda Tang,
Kira Niederhoffer,
Sarah Reed,
Kaivalya Deshpande,
Emily Sterrett,
Melissa Bauer,
Laurie Snyder,
Afreen Shariff,
David Whellan,
Jeffrey Riggio,
David Gaieski,
Kristin Corey,
Megan Richards,
Michael Gao,
Marshall Nichols,
Bradley Heintze,
William Knechtle,
William Ratliff,
Suresh Balu
Abstract:
The approaches by which the machine learning and clinical research communities utilize real world data (RWD), including data captured in the electronic health record (EHR), vary dramatically. While clinical researchers cautiously use RWD for clinical investigations, ML for healthcare teams consume public datasets with minimal scrutiny to develop new algorithms. This study bridges this gap by devel…
▽ More
The approaches by which the machine learning and clinical research communities utilize real world data (RWD), including data captured in the electronic health record (EHR), vary dramatically. While clinical researchers cautiously use RWD for clinical investigations, ML for healthcare teams consume public datasets with minimal scrutiny to develop new algorithms. This study bridges this gap by develo** and validating ML-DQA, a data quality assurance framework grounded in RWD best practices. The ML-DQA framework is applied to five ML projects across two geographies, different medical conditions, and different cohorts. A total of 2,999 quality checks and 24 quality reports were generated on RWD gathered on 247,536 patients across the five projects. Five generalizable practices emerge: all projects used a similar method to group redundant data element representations; all projects used automated utilities to build diagnosis and medication data elements; all projects used a common library of rules-based transformations; all projects used a unified approach to assign data quality checks to data elements; and all projects used a similar approach to clinical adjudication. An average of 5.8 individuals, including clinicians, data scientists, and trainees, were involved in implementing ML-DQA for each project and an average of 23.4 data elements per project were either transformed or removed in response to ML-DQA. This study demonstrates the importance role of ML-DQA in healthcare projects and provides teams a framework to conduct these essential activities.
△ Less
Submitted 4 August, 2022;
originally announced August 2022.
-
Estimating sparse direct effects in multivariate regression with the spike-and-slab LASSO
Authors:
Yunyi Shen,
Claudia Solís-Lemus,
Sameer K. Deshpande
Abstract:
The multivariate regression interpretation of the Gaussian chain graph model simultaneously parametrizes (i) the direct effects of $p$ predictors on $q$ outcomes and (ii) the residual partial covariances between pairs of outcomes. We introduce a new method for fitting sparse Gaussian chain graph models with spike-and-slab LASSO (SSL) priors. We develop an Expectation Conditional Maximization algor…
▽ More
The multivariate regression interpretation of the Gaussian chain graph model simultaneously parametrizes (i) the direct effects of $p$ predictors on $q$ outcomes and (ii) the residual partial covariances between pairs of outcomes. We introduce a new method for fitting sparse Gaussian chain graph models with spike-and-slab LASSO (SSL) priors. We develop an Expectation Conditional Maximization algorithm to obtain sparse estimates of the $p \times q$ matrix of direct effects and the $q \times q$ residual precision matrix. Our algorithm iteratively solves a sequence of penalized maximum likelihood problems with self-adaptive penalties that gradually filter out negligible regression coefficients and partial covariances. Because it adaptively penalizes individual model parameters, our method is seen to outperform fixed-penalty competitors on simulated data. We establish the posterior contraction rate for our model, buttressing our method's excellent empirical performance with strong theoretical guarantees. Using our method, we estimated the direct effects of diet and residence type on the composition of the gut microbiome of elderly adults.
△ Less
Submitted 26 March, 2024; v1 submitted 14 July, 2022;
originally announced July 2022.
-
Anomaly detection in surveillance videos using transformer based attention model
Authors:
Kapil Deshpande,
Narinder Singh Punn,
Sanjay Kumar Sonbhadra,
Sonali Agarwal
Abstract:
Surveillance footage can catch a wide range of realistic anomalies. This research suggests using a weakly supervised strategy to avoid annotating anomalous segments in training videos, which is time consuming. In this approach only video level labels are used to obtain frame level anomaly scores. Weakly supervised video anomaly detection (WSVAD) suffers from the wrong identification of abnormal an…
▽ More
Surveillance footage can catch a wide range of realistic anomalies. This research suggests using a weakly supervised strategy to avoid annotating anomalous segments in training videos, which is time consuming. In this approach only video level labels are used to obtain frame level anomaly scores. Weakly supervised video anomaly detection (WSVAD) suffers from the wrong identification of abnormal and normal instances during the training process. Therefore it is important to extract better quality features from the available videos. WIth this motivation, the present paper uses better quality transformer-based features named Videoswin Features followed by the attention layer based on dilated convolution and self attention to capture long and short range dependencies in temporal domain. This gives us a better understanding of available videos. The proposed framework is validated on real-world dataset i.e. ShanghaiTech Campus dataset which results in competitive performance than current state-of-the-art methods. The model and the code are available at https://github.com/kapildeshpande/Anomaly-Detection-in-Surveillance-Videos
△ Less
Submitted 6 June, 2022; v1 submitted 3 June, 2022;
originally announced June 2022.
-
Dielectric Properties of Polysulfone Carbon Nanotube Composite Membranes
Authors:
Bhakti Hirani,
P. S. Goyal,
Deepali Shrivastava,
S. K. Deshpande
Abstract:
Polymeric membranes, including Polysulfone (PSf) membranes, are routinely used for water treatment. To enhance water permeation of above membranes, it is common to synthesize polymeric membranes with carbon nanotubes (CNTs) embedded in them. It is seen that water permeability of membranes having vertically aligned CNTs is higher, as compared to those where CNTs are not aligned. It is of interest t…
▽ More
Polymeric membranes, including Polysulfone (PSf) membranes, are routinely used for water treatment. To enhance water permeation of above membranes, it is common to synthesize polymeric membranes with carbon nanotubes (CNTs) embedded in them. It is seen that water permeability of membranes having vertically aligned CNTs is higher, as compared to those where CNTs are not aligned. It is of interest to examine if the dielectric constant of a CNT based nanocomposite membrane is sensitive to alignment of CNTs or not. This paper reports dielectric properties of PSf-MWCNT membranes, both, for aligned and unaligned MWCNTs. Multi Walled Carbon Nanotubes (MWCNTs) based polysulfone membranes were synthesized using standard methods. MWCNTs in above membranes were aligned by casting the membrane in presence of magnetic field. The present paper, for the first time, shows that the above result is valid for membranes also.
△ Less
Submitted 6 January, 2022;
originally announced January 2022.
-
Measuring the robustness of Gaussian processes to kernel choice
Authors:
William T. Stephenson,
Soumya Ghosh,
Tin D. Nguyen,
Mikhail Yurochkin,
Sameer K. Deshpande,
Tamara Broderick
Abstract:
Gaussian processes (GPs) are used to make medical and scientific decisions, including in cardiac care and monitoring of atmospheric carbon dioxide levels. Notably, the choice of GP kernel is often somewhat arbitrary. In particular, uncountably many kernels typically align with qualitative prior knowledge (e.g.\ function smoothness or stationarity). But in practice, data analysts choose among a han…
▽ More
Gaussian processes (GPs) are used to make medical and scientific decisions, including in cardiac care and monitoring of atmospheric carbon dioxide levels. Notably, the choice of GP kernel is often somewhat arbitrary. In particular, uncountably many kernels typically align with qualitative prior knowledge (e.g.\ function smoothness or stationarity). But in practice, data analysts choose among a handful of convenient standard kernels (e.g.\ squared exponential). In the present work, we ask: Would decisions made with a GP differ under other, qualitatively interchangeable kernels? We show how to answer this question by solving a constrained optimization problem over a finite-dimensional space. We can then use standard optimizers to identify substantive changes in relevant decisions made with a GP. We demonstrate in both synthetic and real-world examples that decisions made with a GP can exhibit non-robustness to kernel choice, even when prior draws are qualitatively interchangeable to a user.
△ Less
Submitted 12 March, 2022; v1 submitted 11 June, 2021;
originally announced June 2021.
-
Imaging and Spectral Observations of a Type-II Radio Burst Revealing the Section of the CME-Driven Shock that Accelerates Electrons
Authors:
Satabdwa Majumdar,
Srikar Paavan Tadepalli,
Samriddhi Sankar Maity,
Ketaki Deshpande,
Anshu Kumari,
Ritesh Patel,
Nat Gopalswamy
Abstract:
We report on a multi-wavelength analysis of the 26 January 2014 solar eruption involving a coronal mass ejection (CME) and a Type-II radio burst, performed by combining data from various space-and ground-based instruments. An increasing standoff distance with height shows the presence of a strong shock, which further manifests itself in the continuation of the metric Type-II burst into the decamet…
▽ More
We report on a multi-wavelength analysis of the 26 January 2014 solar eruption involving a coronal mass ejection (CME) and a Type-II radio burst, performed by combining data from various space-and ground-based instruments. An increasing standoff distance with height shows the presence of a strong shock, which further manifests itself in the continuation of the metric Type-II burst into the decameter-hectometric (DH) domain. A plot of speed versus position angle (PA) shows different points on the CME leading edge travelled with different speeds. From the starting frequency of the Type-II burst and white-light data, we find that the shock signature producing the Type-II burst might be coming from the flanks of the CME. Measuring the speeds of the CME flanks, we find the southern flank to be at a higher speed than the northern flank; further the radio contours from Type-II imaging data showed that the burst source was coming from the southern flank of the CME. From the standoff distance at the CME nose, we find that the local Alfven speed is close to the white-light shock speed, thus causing the Mach number to be small there. Also, the presence of a streamer near the southern flank appears to have provided additional favorable conditions for the generation of shock-associated radio emission. These results provide conclusive evidence that the Type-II emission could originate from the flanks of the CME, which in our study is from the the southern flank of the CME.
△ Less
Submitted 17 March, 2021;
originally announced March 2021.
-
Confidently Comparing Estimators with the c-value
Authors:
Brian L. Trippe,
Sameer K. Deshpande,
Tamara Broderick
Abstract:
Modern statistics provides an ever-expanding toolkit for estimating unknown parameters. Consequently, applied statisticians frequently face a difficult decision: retain a parameter estimate from a familiar method or replace it with an estimate from a newer or more complex one. While it is traditional to compare estimates using risk, such comparisons are rarely conclusive in realistic settings.
I…
▽ More
Modern statistics provides an ever-expanding toolkit for estimating unknown parameters. Consequently, applied statisticians frequently face a difficult decision: retain a parameter estimate from a familiar method or replace it with an estimate from a newer or more complex one. While it is traditional to compare estimates using risk, such comparisons are rarely conclusive in realistic settings.
In response, we propose the "c-value" as a measure of confidence that a new estimate achieves smaller loss than an old estimate on a given dataset. We show that it is unlikely that a large c-value coincides with a larger loss for the new estimate. Therefore, just as a small p-value supports rejecting a null hypothesis, a large c-value supports using a new estimate in place of the old. For a wide class of problems and estimates, we show how to compute a c-value by first constructing a data-dependent high-probability lower bound on the difference in loss. The c-value is frequentist in nature, but we show that it can provide validation of shrinkage estimates derived from Bayesian models in real data applications involving hierarchical models and Gaussian processes.
△ Less
Submitted 19 December, 2022; v1 submitted 18 February, 2021;
originally announced February 2021.
-
TwInflation
Authors:
Kaustubh Deshpande,
Soubhik Kumar,
Raman Sundrum
Abstract:
The general structure of Hybrid Inflation remains a very well-motivated mechanism for lower-scale cosmic inflation in the face of improving constraints on the tensor-to-scalar ratio. However, as originally modeled, the "waterfall" field in this mechanism gives rise to a hierarchy problem ($η-$problem) for the inflaton after demanding standard effective field theory (EFT) control. We modify the hyb…
▽ More
The general structure of Hybrid Inflation remains a very well-motivated mechanism for lower-scale cosmic inflation in the face of improving constraints on the tensor-to-scalar ratio. However, as originally modeled, the "waterfall" field in this mechanism gives rise to a hierarchy problem ($η-$problem) for the inflaton after demanding standard effective field theory (EFT) control. We modify the hybrid mechanism and incorporate a discrete "twin" symmetry, thereby yielding a viable, natural and EFT-controlled model of non-supersymmetric low-scale inflation, "Twinflation". Analogously to Twin Higgs models, the discrete exchange-symmetry with a "twin" sector reduces quadratic sensitivity in the inflationary potential to ultra-violet physics, at the root of the hierarchy problem. The observed phase of inflation takes place on a hilltop-like potential but without fine-tuning of the initial inflaton position in field-space. We also show that all parameters of the model can take natural values, below any associated EFT-cutoff mass scales and field values, thus ensuring straightforward theoretical control. We discuss the basic phenomenological considerations and constraints, as well as possible future directions.
△ Less
Submitted 21 July, 2021; v1 submitted 15 January, 2021;
originally announced January 2021.
-
The Large Hadron-Electron Collider at the HL-LHC
Authors:
P. Agostini,
H. Aksakal,
S. Alekhin,
P. P. Allport,
N. Andari,
K. D. J. Andre,
D. Angal-Kalinin,
S. Antusch,
L. Aperio Bella,
L. Apolinario,
R. Apsimon,
A. Apyan,
G. Arduini,
V. Ari,
A. Armbruster,
N. Armesto,
B. Auchmann,
K. Aulenbacher,
G. Azuelos,
S. Backovic,
I. Bailey,
S. Bailey,
F. Balli,
S. Behera,
O. Behnke
, et al. (312 additional authors not shown)
Abstract:
The Large Hadron electron Collider (LHeC) is designed to move the field of deep inelastic scattering (DIS) to the energy and intensity frontier of particle physics. Exploiting energy recovery technology, it collides a novel, intense electron beam with a proton or ion beam from the High Luminosity--Large Hadron Collider (HL-LHC). The accelerator and interaction region are designed for concurrent el…
▽ More
The Large Hadron electron Collider (LHeC) is designed to move the field of deep inelastic scattering (DIS) to the energy and intensity frontier of particle physics. Exploiting energy recovery technology, it collides a novel, intense electron beam with a proton or ion beam from the High Luminosity--Large Hadron Collider (HL-LHC). The accelerator and interaction region are designed for concurrent electron-proton and proton-proton operation. This report represents an update of the Conceptual Design Report (CDR) of the LHeC, published in 2012. It comprises new results on parton structure of the proton and heavier nuclei, QCD dynamics, electroweak and top-quark physics. It is shown how the LHeC will open a new chapter of nuclear particle physics in extending the accessible kinematic range in lepton-nucleus scattering by several orders of magnitude. Due to enhanced luminosity, large energy and the cleanliness of the hadronic final states, the LHeC has a strong Higgs physics programme and its own discovery potential for new physics. Building on the 2012 CDR, the report represents a detailed updated design of the energy recovery electron linac (ERL) including new lattice, magnet, superconducting radio frequency technology and further components. Challenges of energy recovery are described and the lower energy, high current, 3-turn ERL facility, PERLE at Orsay, is presented which uses the LHeC characteristics serving as a development facility for the design and operation of the LHeC. An updated detector design is presented corresponding to the acceptance, resolution and calibration goals which arise from the Higgs and parton density function physics programmes. The paper also presents novel results on the Future Circular Collider in electron-hadron mode, FCC-eh, which utilises the same ERL technology to further extend the reach of DIS to even higher centre-of-mass energies.
△ Less
Submitted 12 April, 2021; v1 submitted 28 July, 2020;
originally announced July 2020.
-
Approximate Cross-Validation for Structured Models
Authors:
Soumya Ghosh,
William T. Stephenson,
Tin D. Nguyen,
Sameer K. Deshpande,
Tamara Broderick
Abstract:
Many modern data analyses benefit from explicitly modeling dependence structure in data -- such as measurements across time or space, ordered words in a sentence, or genes in a genome. A gold standard evaluation technique is structured cross-validation (CV), which leaves out some data subset (such as data within a time interval or data in a geographic region) in each fold. But CV here can be prohi…
▽ More
Many modern data analyses benefit from explicitly modeling dependence structure in data -- such as measurements across time or space, ordered words in a sentence, or genes in a genome. A gold standard evaluation technique is structured cross-validation (CV), which leaves out some data subset (such as data within a time interval or data in a geographic region) in each fold. But CV here can be prohibitively slow due to the need to re-run already-expensive learning algorithms many times. Previous work has shown approximate cross-validation (ACV) methods provide a fast and provably accurate alternative in the setting of empirical risk minimization. But this existing ACV work is restricted to simpler models by the assumptions that (i) data across CV folds are independent and (ii) an exact initial model fit is available. In structured data analyses, both these assumptions are often untrue. In the present work, we address (i) by extending ACV to CV schemes with dependence structure between the folds. To address (ii), we verify -- both theoretically and empirically -- that ACV quality deteriorates smoothly with noise in the initial fit. We demonstrate the accuracy and computational benefits of our proposed methods on a diverse set of real-world applications.
△ Less
Submitted 1 December, 2020; v1 submitted 22 June, 2020;
originally announced June 2020.
-
VCBART: Bayesian trees for varying coefficients
Authors:
Sameer K. Deshpande,
Ray Bai,
Cecilia Balocchi,
Jennifer E. Starling,
Jordan Weiss
Abstract:
The linear varying coefficient models posits a linear relationship between an outcome and covariates in which the covariate effects are modeled as functions of additional effect modifiers. Despite a long history of study and use in statistics and econometrics, state-of-the-art varying coefficient modeling methods cannot accommodate multivariate effect modifiers without imposing restrictive functio…
▽ More
The linear varying coefficient models posits a linear relationship between an outcome and covariates in which the covariate effects are modeled as functions of additional effect modifiers. Despite a long history of study and use in statistics and econometrics, state-of-the-art varying coefficient modeling methods cannot accommodate multivariate effect modifiers without imposing restrictive functional form assumptions or involving computationally intensive hyperparameter tuning. In response, we introduce VCBART, which flexibly estimates the covariate effect in a varying coefficient model using Bayesian Additive Regression Trees. With simple default settings, VCBART outperforms existing varying coefficient methods in terms of covariate effect estimation, uncertainty quantification, and outcome prediction. We illustrate the utility of VCBART with two case studies: one examining how the association between later-life cognition and measures of socioeconomic position vary with respect to age and socio-demographics and another estimating how temporal trends in urban crime vary at the neighborhood level. An R package implementing VCBART is available at https://github.com/skdeshpande91/VCBART
△ Less
Submitted 13 May, 2024; v1 submitted 13 March, 2020;
originally announced March 2020.
-
Crime in Philadelphia: Bayesian Clustering with Particle Optimization
Authors:
Cecilia Balocchi,
Sameer K. Deshpande,
Edward I. George,
Shane T. Jensen
Abstract:
Accurate estimation of the change in crime over time is a critical first step towards better understanding of public safety in large urban environments. Bayesian hierarchical modeling is a natural way to study spatial variation in urban crime dynamics at the neighborhood level, since it facilitates principled ``sharing of information'' between spatially adjacent neighborhoods. Typically, however,…
▽ More
Accurate estimation of the change in crime over time is a critical first step towards better understanding of public safety in large urban environments. Bayesian hierarchical modeling is a natural way to study spatial variation in urban crime dynamics at the neighborhood level, since it facilitates principled ``sharing of information'' between spatially adjacent neighborhoods. Typically, however, cities contain many physical and social boundaries that may manifest as spatial discontinuities in crime patterns. In this situation, standard prior choices often yield overly-smooth parameter estimates, which can ultimately produce mis-calibrated forecasts. To prevent potential over-smoothing, we introduce a prior that partitions the set of neighborhoods into several clusters and encourages spatial smoothness within each cluster. In terms of model implementation, conventional stochastic search techniques are computationally prohibitive, as they must traverse a combinatorially vast space of partitions. We introduce an ensemble optimization procedure that simultaneously identifies several high probability partitions by solving one optimization problem using a new local search strategy. We then use the identified partitions to estimate crime trends in Philadelphia between 2006 and 2017. On simulated and real data, our proposed method demonstrates good estimation and partition selection performance.
△ Less
Submitted 21 June, 2022; v1 submitted 29 November, 2019;
originally announced December 2019.
-
Expected Hypothetical Completion Probability
Authors:
Sameer K. Deshpande,
Katherine Evans
Abstract:
Using high-resolution player tracking data made available by the National Football League (NFL) for their 2019 Big Data Bowl competition, we introduce the Expected Hypothetical Completion Probability (EHCP), a objective framework for evaluating plays. At the heart of EHCP is the question "on a given passing play, did the quarterback throw the pass to the receiver who was most likely to catch it?"…
▽ More
Using high-resolution player tracking data made available by the National Football League (NFL) for their 2019 Big Data Bowl competition, we introduce the Expected Hypothetical Completion Probability (EHCP), a objective framework for evaluating plays. At the heart of EHCP is the question "on a given passing play, did the quarterback throw the pass to the receiver who was most likely to catch it?" To answer this question, we first built a Bayesian non-parametric catch probability model that automatically accounts for complex interactions between inputs like the receiver's speed and distances to the ball and nearest defender. While building such a model is, in principle, straightforward, using it to reason about a hypothetical pass is challenging because many of the model inputs corresponding to a hypothetical are necessarily unobserved. To wit, it is impossible to observe how close an un-targeted receiver would be to his nearest defender had the pass been thrown to him instead of the receiver who was actually targeted. To overcome this fundamental difficulty, we propose imputing the unobservable inputs and averaging our model predictions across these imputations to derive EHCP. In this way, EHCP can track how the completion probability evolves for each receiver over the course of a play in a way that accounts for the uncertainty about missing inputs.
△ Less
Submitted 27 October, 2019;
originally announced October 2019.
-
Protocol for an Observational Study of the Association of High School Football Participation on Health in Late Adulthood
Authors:
Timothy G. Gaulton,
Sameer K. Deshpande,
Dylan S. Small,
Mark D. Neuman
Abstract:
American football is the most popular high school sport and is among the leading cause of injury among adolescents. While there has been considerable recent attention on the link between football and cognitive decline, there is also evidence of higher than expected rates of pain, obesity, and lower quality of life among former professional players, either as a result of repetitive head injury or t…
▽ More
American football is the most popular high school sport and is among the leading cause of injury among adolescents. While there has been considerable recent attention on the link between football and cognitive decline, there is also evidence of higher than expected rates of pain, obesity, and lower quality of life among former professional players, either as a result of repetitive head injury or through different mechanisms. Previously hidden downstream effects of playing football may have far-reaching public health implications for participants in youth and high school football programs.
Our proposed study is a retrospective observational study that compares 1,153 high school males who played varsity football with 2,751 male students who did not. 1,951 of the control subjects did not play any sport and the remaining 800 controls played a non-contact sport. Our primary outcome is self-rated health measured at age 65. To control for potential confounders, we adjust for pre-exposure covariates with matching and model-based covariance adjustment. We will conduct an ordered testing procedure designed to use the full pool of 2,751 controls while also controlling for possible unmeasured differences between students who played sports and those who did not. We will quantitatively assess the sensitivity of the results to potential unmeasured confounding. The study will also assess secondary outcomes of pain, difficulty with activities of daily living, and obesity, as these are both important to individual well-being and have public health relevance.
△ Less
Submitted 26 February, 2019;
originally announced February 2019.
-
Supersymmetric Inflation from the Fifth Dimension
Authors:
Kaustubh Deshpande,
Raman Sundrum
Abstract:
We develop a supersymmetric bi-axion model of high-scale inflation coupled to supergravity, in which the axionic structure originates from, and is protected by, gauge symmetry in an extra dimension. While local supersymmetry (SUSY) is necessarily Higgsed at high scales during inflation we show that it can naturally survive down to the $\sim$ TeV scale in the current era in order to resolve the ele…
▽ More
We develop a supersymmetric bi-axion model of high-scale inflation coupled to supergravity, in which the axionic structure originates from, and is protected by, gauge symmetry in an extra dimension. While local supersymmetry (SUSY) is necessarily Higgsed at high scales during inflation we show that it can naturally survive down to the $\sim$ TeV scale in the current era in order to resolve the electroweak hierarchy problem. We show how a suitable inflationary effective potential for the axions can be generated at tree-level by charged fields under the higher-dimensional gauge symmetry. The inflationary trajectory lies along the lightest direction in the bi-axion field space, with periodic effective potential and an effective super-Planckian field range emerging from fundamentally sub-Planckian dynamics. The heavier direction in the field space is shown to also play an important role, as the dominant source of super-Higgsing during inflation. This model presents an interesting interplay of tuning considerations relating the electroweak hierarchy, cosmological constant and inflationary superpotential, where maximal naturalness favors SUSY breaking near the electroweak scale after inflation. The scalar superpartner of the axionic inflaton, the "sinflaton", can naturally have $\sim$ Hubble mass during inflation and sufficiently strong coupling to the inflaton to mediate primordial non-Gaussianities of observable strength in future 21-cm surveys. Non-minimal charged fields under the higher-dimensional gauge symmetry can contribute to periodic modulations in the CMB, within the sensitivity of ongoing measurements.
△ Less
Submitted 30 June, 2021; v1 submitted 14 February, 2019;
originally announced February 2019.
-
Beyond the Standard Model Physics at the HL-LHC and HE-LHC
Authors:
X. Cid Vidal,
M. D'Onofrio,
P. J. Fox,
R. Torre,
K. A. Ulmer,
A. Aboubrahim,
A. Albert,
J. Alimena,
B. C. Allanach,
C. Alpigiani,
M. Altakach,
S. Amoroso,
J. K. Anders,
J. Y. Araz,
A. Arbey,
P. Azzi,
I. Babounikau,
H. Baer,
M. J. Baker,
D. Barducci,
V. Barger,
O. Baron,
L. Barranco Navarro,
M. Battaglia,
A. Bay
, et al. (272 additional authors not shown)
Abstract:
This is the third out of five chapters of the final report [1] of the Workshop on Physics at HL-LHC, and perspectives on HE-LHC [2]. It is devoted to the study of the potential, in the search for Beyond the Standard Model (BSM) physics, of the High Luminosity (HL) phase of the LHC, defined as $3~\mathrm{ab}^{-1}$ of data taken at a centre-of-mass energy of $14~\mathrm{TeV}$, and of a possible futu…
▽ More
This is the third out of five chapters of the final report [1] of the Workshop on Physics at HL-LHC, and perspectives on HE-LHC [2]. It is devoted to the study of the potential, in the search for Beyond the Standard Model (BSM) physics, of the High Luminosity (HL) phase of the LHC, defined as $3~\mathrm{ab}^{-1}$ of data taken at a centre-of-mass energy of $14~\mathrm{TeV}$, and of a possible future upgrade, the High Energy (HE) LHC, defined as $15~\mathrm{ab}^{-1}$ of data at a centre-of-mass energy of $27~\mathrm{TeV}$. We consider a large variety of new physics models, both in a simplified model fashion and in a more model-dependent one. A long list of contributions from the theory and experimental (ATLAS, CMS, LHCb) communities have been collected and merged together to give a complete, wide, and consistent view of future prospects for BSM physics at the considered colliders. On top of the usual standard candles, such as supersymmetric simplified models and resonances, considered for the evaluation of future collider potentials, this report contains results on dark matter and dark sectors, long lived particles, leptoquarks, sterile neutrinos, axion-like particles, heavy scalars, vector-like quarks, and more. Particular attention is placed, especially in the study of the HL-LHC prospects, to the detector upgrades, the assessment of the future systematic uncertainties, and new experimental techniques. The general conclusion is that the HL-LHC, on top of allowing to extend the present LHC mass and coupling reach by $20-50\%$ on most new physics scenarios, will also be able to constrain, and potentially discover, new physics that is presently unconstrained. Moreover, compared to the HL-LHC, the reach in most observables will generally more than double at the HE-LHC, which may represent a good candidate future facility for a final test of TeV-scale new physics.
△ Less
Submitted 13 August, 2019; v1 submitted 19 December, 2018;
originally announced December 2018.
-
Performance Evaluation of Cryptographic Ciphers on IoT Devices
Authors:
Praneet Singh,
Kedar Deshpande
Abstract:
With the advent of Internet of Things (IoT) and the increasing use of application-based processors, security infrastructure needs to be examined on some widely-used IoT hardware architectures. Applications in today's world are moving towards IoT concepts as this makes them fast, efficient, modular and future-proof. However, this leads to a greater security risk as IoT devices thrive in an ecosyste…
▽ More
With the advent of Internet of Things (IoT) and the increasing use of application-based processors, security infrastructure needs to be examined on some widely-used IoT hardware architectures. Applications in today's world are moving towards IoT concepts as this makes them fast, efficient, modular and future-proof. However, this leads to a greater security risk as IoT devices thrive in an ecosystem of co-existence and interconnection. As a result of these security risks, it is of utmost importance to test the existing cryptographic ciphers on such devices and determine if they are viable in terms of swiftness of execution time and memory consumption efficiency. It is also important to determine if there is a requirement to develop new lightweight cryptographic ciphers for these devices. This paper hopes to accomplish the above-mentioned objective by testing various encryption-decryption techniques on different IoT based devices and creating a comparison of execution speeds between these devices for a variety of different data sizes. Keywords-Internet of things(IoT), application-based processors, security, encryption-decryption, speed, efficiency
△ Less
Submitted 5 December, 2018;
originally announced December 2018.
-
Closing the light gluino gap with electron-proton colliders
Authors:
David Curtin,
Kaustubh Deshpande,
Oliver Fischer,
Jose Zurita
Abstract:
The future electron-proton collider proposals, LHeC and FCC-he, can deliver $\mathcal{O}$(TeV) center-of-mass energy collisions, higher than most of the proposed lepton accelerators, with $\mathcal{O}$(ab$^{-1}$) luminosity, while maintaining a much cleaner experimental environment as compared to the hadron machines. This unique capability of $e^- p$ colliders can be harnessed in probing BSM scena…
▽ More
The future electron-proton collider proposals, LHeC and FCC-he, can deliver $\mathcal{O}$(TeV) center-of-mass energy collisions, higher than most of the proposed lepton accelerators, with $\mathcal{O}$(ab$^{-1}$) luminosity, while maintaining a much cleaner experimental environment as compared to the hadron machines. This unique capability of $e^- p$ colliders can be harnessed in probing BSM scenarios giving final states that look like hadronic noise at $pp$ machines. In the present study, we explore the prospects of detecting such a prompt signal having multiple soft jets at the LHeC. Such a signal can come from the decay of gluino in RPV or Stealth SUSY, where there exists a gap in the current experimental search with $m_{\tilde{g}} \approx 50 - 70$ GeV. We perform a simple analysis to demonstrate that, with simple signal selection cuts, we can close this gap at the LHeC at 95 % confidence level, even in the presence of a reasonable systematic error. More sophisticated signal selection strategies and detailed knowledge of the detector can be used to improve the prospects of signal detection.
△ Less
Submitted 4 December, 2018;
originally announced December 2018.
-
Protocol for an observational study on the effects of playing football in adolescence on mental health in early adulthood
Authors:
Sameer K. Deshpande,
Raiden B. Hasegawa,
Jordan Weiss,
Dylan S. Small
Abstract:
More than 1 million students play high school American football annually, but many health professionals have recently questioned its safety or called for its ban. These concerns have been partially driven by reports of chronic traumatic encephalopathy (CTE), increased risks of neurodegenerative disease, and associations between concussion history and later-life cognitive impairment and depression…
▽ More
More than 1 million students play high school American football annually, but many health professionals have recently questioned its safety or called for its ban. These concerns have been partially driven by reports of chronic traumatic encephalopathy (CTE), increased risks of neurodegenerative disease, and associations between concussion history and later-life cognitive impairment and depression among retired professional football players.
A recent observational study of a cohort of men who graduated from a Wisconsin high school in 1957 found no statistically significant harmful effects of playing high school football on a range of cognitive, psychological, and socio-economic outcomes measured at ages 35, 54, 65, and 72. Unfortunately, these findings may not generalize to younger populations, thanks to changes and improvements in football helmet technology and training techniques. In particular, these changes may have led to increased perceptions of safety but ultimately more dangerous styles of play, characterized by the frequent sub-concussive impacts thought to be associated with later-life neurological decline.
In this work, we replicate the methodology of that earlier matched observational study using data from the National Longitudinal Study of Adolescent to Adult Health (Add Health). These include adolescent and family co-morbidities, academic experience, self-reported levels of general health and physical activity, and the score on the Add Health Picture Vocabulary Test. Our primary outcome is the CES-D score measured in 2008 when subjects were aged 24 -- 34 and settling into early adulthood. We also examine several secondary outcomes related to physical and psychological health, including suicidality. Our results can provide insight into the natural history of potential football-related decline and dysfunction.
△ Less
Submitted 9 November, 2018; v1 submitted 12 August, 2018;
originally announced August 2018.
-
Protocol for an Observational Study on the Effects of Early-Life Participation in Contact Sports on Later-Life Cognition in a Sample of Monozygotic and Dizygotic Swedish Twins Reared Together and Twins Reared Apart
Authors:
Jordan Weiss,
Amanda R. Rabinowitz,
Sameer K. Deshpande,
Raiden B. Hasegawa,
Dylan S. Small
Abstract:
A large body of work links traumatic brain injury (TBI) in adulthood to the onset of Alzheimer's disease (AD). AD is the chief cause of dementia, leading to reduced cognitive capacity and autonomy and increased mortality risk. More recently, researchers have sought to investigate whether TBI experienced in early-life may influence trajectories of cognitive dysfunction in adulthood. It has been spe…
▽ More
A large body of work links traumatic brain injury (TBI) in adulthood to the onset of Alzheimer's disease (AD). AD is the chief cause of dementia, leading to reduced cognitive capacity and autonomy and increased mortality risk. More recently, researchers have sought to investigate whether TBI experienced in early-life may influence trajectories of cognitive dysfunction in adulthood. It has been speculated that early-life participation in collision sports may lead to poor cognitive and mental health outcomes. However, to date, the few studies to investigate this relationship have produced mixed results. We propose to extend this literature by conducting a prospective study on the effects of early-life participation in collision sports on later-life cognitive health using the Swedish Adoption/Twin Study on Aging (SATSA). The SATSA is unique in its sampling of monozygotic and dizygotic twins reared together (respectively MZT, DZT) and twins reared apart (respectively MZA, DZA). The proposed analysis is a prospective study of 660 individuals comprised of 270 twin pairs and 120 singletons. Seventy-eight (11.8% individuals reported participation in collision sports. Our primary outcome will be an indicator of cognitive impairment determined by scores on the Mini-Mental State Examination (MMSE). We will also consider several secondary cognitive outcomes including verbal and spatial ability, memory, and processing speed. Our sample will be restricted to individuals with at least one MMSE score out of seven repeated assessments spaced approximately three years apart. We will adjust for age, sex, and education in each of our models.
△ Less
Submitted 16 April, 2020; v1 submitted 27 July, 2018;
originally announced July 2018.
-
Probing BSM physics with electron-proton colliders
Authors:
David Curtin,
Kaustubh Deshpande,
Oliver Fischer,
Jose Zurita
Abstract:
In this talk I will illustrate with two examples (Higgsino dark matter and Exotic Higgs decays) how electron-proton colliders present unique opportunities to probe BSM scenarios where proton-proton colliders fall short due to the experimental difficulties in reconstructing the signal due to the large hadronic backgrounds. The leit-motiv of these examples are long-lived particles (LLPs), which have…
▽ More
In this talk I will illustrate with two examples (Higgsino dark matter and Exotic Higgs decays) how electron-proton colliders present unique opportunities to probe BSM scenarios where proton-proton colliders fall short due to the experimental difficulties in reconstructing the signal due to the large hadronic backgrounds. The leit-motiv of these examples are long-lived particles (LLPs), which have received recently a lot of attention from both the experimental and theoretical communities. We find that the proposed $e^-p$ colliders can be competitive against their more energetic $pp$ incarnations for lifetimes between a millimeter and a micron, depending on the physics scenario under consideration.
△ Less
Submitted 4 July, 2018; v1 submitted 31 May, 2018;
originally announced May 2018.
-
New Physics Opportunities for Long-Lived Particles at Electron-Proton Colliders
Authors:
David Curtin,
Kaustubh Deshpande,
Oliver Fischer,
Jose Zurita
Abstract:
Future electron-proton collider proposals like the LHeC or the FCC-eh can supply 1/ab of collisions with a center-of-mass energy in the TeV range, while maintaining a clean experimental environment more commonly associated with lepton colliders. We point out that this makes electron-proton colliders ideally suited to probe BSM signatures with final states that look like "hadronic noise" in the hig…
▽ More
Future electron-proton collider proposals like the LHeC or the FCC-eh can supply 1/ab of collisions with a center-of-mass energy in the TeV range, while maintaining a clean experimental environment more commonly associated with lepton colliders. We point out that this makes electron-proton colliders ideally suited to probe BSM signatures with final states that look like "hadronic noise" in the high-energy, pile-up-rich environment of hadron colliders. We focus on the generic vector boson fusion production mechanism, which is available for all BSM particles with electroweak charges at mass scales far above the reach of most lepton colliders. This is in contrast to previous BSM studies at these machines, which focused on BSM processes with large production rates from the asymmetric initial state. We propose to exploit the unique experimental environment in the search for long-lived particle signals arising from Higgsinos or exotic Higgs decays. At electron-proton colliders, the soft decay products of long-lived Higgsinos can be explicitly reconstructed ("displaced single pion"), and very short lifetimes can be probed. We find that electron-proton colliders can explore significant regions of BSM parameter space inaccessible to other collider searches, with important implications for the design of such machines.
△ Less
Submitted 19 December, 2017;
originally announced December 2017.
-
Simultaneous Variable and Covariance Selection with the Multivariate Spike-and-Slab Lasso
Authors:
Sameer K. Deshpande,
Veronika Rockova,
Edward I. George
Abstract:
We propose a Bayesian procedure for simultaneous variable and covariance selection using continuous spike-and-slab priors in multivariate linear regression models where q possibly correlated responses are regressed onto p predictors. Rather than relying on a stochastic search through the high-dimensional model space, we develop an ECM algorithm similar to the EMVS procedure of Rockova & George (20…
▽ More
We propose a Bayesian procedure for simultaneous variable and covariance selection using continuous spike-and-slab priors in multivariate linear regression models where q possibly correlated responses are regressed onto p predictors. Rather than relying on a stochastic search through the high-dimensional model space, we develop an ECM algorithm similar to the EMVS procedure of Rockova & George (2014) targeting modal estimates of the matrix of regression coefficients and residual precision matrix. Varying the scale of the continuous spike densities facilitates dynamic posterior exploration and allows us to filter out negligible regression coefficients and partial covariances gradually. Our method is seen to substantially outperform regularization competitors on simulated data. We demonstrate our method with a re-examination of data from a recent observational study of the effect of playing high school football on several later-life cognition, psychological, and socio-economic outcomes.
△ Less
Submitted 24 July, 2018; v1 submitted 29 August, 2017;
originally announced August 2017.
-
Causal Inference with Two Versions of Treatment
Authors:
Raiden B. Hasegawa,
Sameer K. Deshpande,
Dylan S. Small,
Paul R. Rosenbaum
Abstract:
Causal effects are commonly defined as comparisons of the potential outcomes under treatment and control, but this definition is threatened by the possibility that the treatment or control condition is not well-defined, existing instead in more than one version. A simple, widely applicable analysis is proposed to address the possibility that the treatment or control condition exists in two version…
▽ More
Causal effects are commonly defined as comparisons of the potential outcomes under treatment and control, but this definition is threatened by the possibility that the treatment or control condition is not well-defined, existing instead in more than one version. A simple, widely applicable analysis is proposed to address the possibility that the treatment or control condition exists in two versions with two different treatment effects. This analysis loses no power in the main comparison of treatment and control, provides additional information about version effects, and controls the family-wise error rate in several comparisons. The method is motivated and illustrated using an on-going study of the possibility that repeated head trauma in high school football causes an increase in risk of early on-set dementia.
△ Less
Submitted 24 April, 2019; v1 submitted 10 May, 2017;
originally announced May 2017.
-
A Hierarchical Bayesian Model of Pitch Framing
Authors:
Sameer K. Deshpande,
Abraham J. Wyner
Abstract:
Since the advent of high-resolution pitch tracking data (PITCHf/x), many in the sabermetrics community have attempted to quantify a Major League Baseball catcher's ability to "frame" a pitch (i.e. increase the chance that a pitch is called as a strike). Especially in the last three years, there has been an explosion of interest in the "art of pitch framing" in the popular press as well as signs th…
▽ More
Since the advent of high-resolution pitch tracking data (PITCHf/x), many in the sabermetrics community have attempted to quantify a Major League Baseball catcher's ability to "frame" a pitch (i.e. increase the chance that a pitch is called as a strike). Especially in the last three years, there has been an explosion of interest in the "art of pitch framing" in the popular press as well as signs that teams are considering framing when making roster decisions.
We introduce a Bayesian hierarchical model to estimate each umpire's probability of calling a strike, adjusting for pitch participants, pitch location, and contextual information like the count. Using our model, we can estimate each catcher's effect on an umpire's chance of calling a strike.We are then able to translate these estimated effects into average runs saved across a season. We also introduce a new metric, analogous to Jensen, Shirley, and Wyner's Spatially Aggregate Fielding Evaluation metric, which provides a more honest assessment of the impact of framing.
△ Less
Submitted 9 September, 2017; v1 submitted 3 April, 2017;
originally announced April 2017.
-
Protocol for an Observational Study on the Effects of Playing High School Football on Later Life Cognitive Functioning and Mental Health
Authors:
Sameer K. Deshpande,
Raiden B. Hasegawa,
Amanda R. Rabinowitz,
John Whyte,
Carol L. Roan,
Andrew Tabatabaei,
Michael Baiocchi,
Jason H. Karlawish,
Christina L. Master,
Dylan S. Small
Abstract:
A potential causal relationship between head injuries sustained by NFL players and later-life neurological decline may have broad implications for participants in youth and high school football programs. However, brain trauma risk at the professional level may be different than that at the youth and high school levels and the long-term effects of participation at these levels is as-yet unclear. To…
▽ More
A potential causal relationship between head injuries sustained by NFL players and later-life neurological decline may have broad implications for participants in youth and high school football programs. However, brain trauma risk at the professional level may be different than that at the youth and high school levels and the long-term effects of participation at these levels is as-yet unclear. To investigate the effect of playing high school football on later life depression and cognitive functioning, we propose a retrospective observational study using data from the Wisconsin Longitudinal Study (WLS) of graduates from Wisconsin high schools in 1957.
We compare 1,153 high school males who played varsity football to 2,751 male students who did not. 1,951 of the control subjects did not play any sport and the remaining 800 controls played a non-contact sport. We focus on two primary outcomes measured at age 65: a composite cognitive outcome measuring verbal fluency and memory and the modified CES-D depression score. To control for potential confounders we adjust for pre-exposure covariates such as IQ with matching and model-based covariate adjustment. We will conduct an ordered testing procedure that uses all 2,751 controls while controlling for possible unmeasured differences between students who played sports and those who did not. We will quantitatively assess the sensitivity of the results to potential unmeasured confounding. The study will also consider several secondary outcomes of clinical interest such as aggression and heavy drinking. The rich set of pre-exposure variables, relatively unbiased sampling, and longitudinal nature of the WLS dataset make the proposed analysis unique among related studies that rely primarily on convenience samples of football players with reported neurological symptoms.
△ Less
Submitted 6 July, 2016;
originally announced July 2016.
-
Estimating an NBA player's impact on his team's chances of winning
Authors:
Sameer K. Deshpande,
Shane T. Jensen
Abstract:
Traditional NBA player evaluation metrics are based on scoring differential or some pace-adjusted linear combination of box score statistics like points, rebounds, assists, etc. These measures treat performances with the outcome of the game still in question (e.g. tie score with five minutes left) in exactly the same way as they treat performances with the outcome virtually decided (e.g. when one…
▽ More
Traditional NBA player evaluation metrics are based on scoring differential or some pace-adjusted linear combination of box score statistics like points, rebounds, assists, etc. These measures treat performances with the outcome of the game still in question (e.g. tie score with five minutes left) in exactly the same way as they treat performances with the outcome virtually decided (e.g. when one team leads by 30 points with one minute left). Because they ignore the context in which players perform, these measures can result in misleading estimates of how players help their teams win. We instead use a win probability framework for evaluating the impact NBA players have on their teams' chances of winning. We propose a Bayesian linear regression model to estimate an individual player's impact, after controlling for the other players on the court. We introduce several posterior summaries to derive rank-orderings of players within their team and across the league. This allows us to identify highly paid players with low impact relative to their teammates, as well as players whose high impact is not captured by existing metrics.
△ Less
Submitted 11 April, 2016;
originally announced April 2016.
-
On geodesic deviation in Schwarzschild spacetime
Authors:
Dennis Philipp,
Volker Perlick,
Claus Laemmerzahl,
Kaustubh Deshpande
Abstract:
For metrology, geodesy and gravimetry in space, satellite based instruments and measurement techniques are used and the orbits of the satellites as well as possible deviations between nearby ones are of central interest. The measurement of this deviation itself gives insight into the underlying structure of the spacetime geometry, which is curved and therefore described by the theory of general re…
▽ More
For metrology, geodesy and gravimetry in space, satellite based instruments and measurement techniques are used and the orbits of the satellites as well as possible deviations between nearby ones are of central interest. The measurement of this deviation itself gives insight into the underlying structure of the spacetime geometry, which is curved and therefore described by the theory of general relativity (GR). In the context of GR, the deviation of nearby geodesics can be described by the Jacobi equation that is a result of linearizing the geodesic equation around a known reference geodesic with respect to the deviation vector and the relative velocity. We review the derivation of this Jacobi equation and restrict ourselves to the simple case of the spacetime outside a spherically symmetric mass distribution and circular reference geodesics to find solutions by projecting the Jacobi equation on a parallel propagated tetrad as done by Fuchs. Using his results, we construct solutions of the Jacobi equation for different physical initial scenarios inspired by satellite gravimetry missions and give a set of parameter together with their precise impact on satellite orbit deviation. We further consider the Newtonian analog and construct the full solution, that exhibits a similar structure, within this theory.
△ Less
Submitted 26 August, 2015;
originally announced August 2015.
-
Action-at-a-distance electrodynamics in Quasi-steady-state cosmology
Authors:
Kaustubh Sudhir Deshpande
Abstract:
Action-at-a-distance electrodynamics - alternative approach to field theory - can be extended to cosmological models using conformal symmetry. An advantage of this is that the origin of arrow of time in electromagnetism can be attributed to the cosmological structure. Different cosmological models can be investigated, based on Wheeler-Feynman absorber theory, and only those models can be considere…
▽ More
Action-at-a-distance electrodynamics - alternative approach to field theory - can be extended to cosmological models using conformal symmetry. An advantage of this is that the origin of arrow of time in electromagnetism can be attributed to the cosmological structure. Different cosmological models can be investigated, based on Wheeler-Feynman absorber theory, and only those models can be considered viable for our universe which have net full retarded electromagnetic interactions i.e. forward direction of time. This work evaluates quasi-steady-state model and demonstrates that it admits full retarded and not advanced solution. Thus QSSC satisfies this necessary condition for a correct cosmological model, based on action-at-a-distance formulation.
△ Less
Submitted 4 February, 2014; v1 submitted 16 November, 2013;
originally announced November 2013.
-
The Dielectric Response of La0.5Ca0.5-xSrxMnO3 (0.1 <= x <= 0.4) Manganites with Different Magnetic Ground States
Authors:
Indu Dhiman,
S. K. Deshpande,
A. Das
Abstract:
The dielectric behavior of half doped manganites La0.5Ca0.5-xSrxMnO3 (0.1 \leq \times \leq 0.4) with varying magnetic ground states has been studied. The real part of relative permittivity as a function of temperature ε^'(T), exhibits a maximum around the ferromagnetic (TC) and charge ordering transition (TCO) temperatures accompanied with high dielectric losses. The activation energies obtained f…
▽ More
The dielectric behavior of half doped manganites La0.5Ca0.5-xSrxMnO3 (0.1 \leq \times \leq 0.4) with varying magnetic ground states has been studied. The real part of relative permittivity as a function of temperature ε^'(T), exhibits a maximum around the ferromagnetic (TC) and charge ordering transition (TCO) temperatures accompanied with high dielectric losses. The activation energies obtained for x = 0.1 and 0.3 samples below TCO are the same ~ 0.12eV, whereas the relaxation time constant varies in the range 2.8 \times 10-9 s - 6.03 \times 10-11 s. In contrast to samples having x \leq 0.3, for x = 0.4 do** the dielectric permittivity exhibits a strong temperature dependence in the vicinity of magnetic phase transitions. This behavior may be correlated with the presence of competing magnetic interactions (magnetic polarons) close to the magnetic transitions.
△ Less
Submitted 28 September, 2010;
originally announced September 2010.
-
Searching for Transient Pulses with the ETA Radio Telescope
Authors:
Cameron D. Patterson,
Steven W. Ellingson,
Brian S. Martin,
Kshitija Deshpande,
John H. Simonetti,
Michael Kavic,
Sean E. Cutchin
Abstract:
Array-based, direct-sampling radio telescopes have computational and communication requirements unsuited to conventional computer and cluster architectures. Synchronization must be strictly maintained across a large number of parallel data streams, from A/D conversion, through operations such as beamforming, to dataset recording. FPGAs supporting multi-gigabit serial I/O are ideally suited to th…
▽ More
Array-based, direct-sampling radio telescopes have computational and communication requirements unsuited to conventional computer and cluster architectures. Synchronization must be strictly maintained across a large number of parallel data streams, from A/D conversion, through operations such as beamforming, to dataset recording. FPGAs supporting multi-gigabit serial I/O are ideally suited to this application. We describe a recently-constructed radio telescope called ETA having all-sky observing capability for detecting low frequency pulses from transient events such as gamma ray bursts and primordial black hole explosions. Signals from 24 dipole antennas are processed by a tiered arrangement of 28 commercial FPGA boards and 4 PCs with FPGA-based data acquisition cards, connected with custom I/O adapter boards supporting InfiniBand and LVDS physical links. ETA is designed for unattended operation, allowing configuration and recording to be controlled remotely.
△ Less
Submitted 5 December, 2008;
originally announced December 2008.
-
Stereography using a single lens
Authors:
Kshitija Deshpande,
Arvind Paranjapye
Abstract:
In this paper we have put forth an innovative method of obtaining a stereographic image on a single frame using a single lens. This method has been verified experimentally. A preliminary prototype of the same is built with an optimized use of the material available in the laboratory. The prospective applications of this technique are also explored in brief. This method once commercialized, will…
▽ More
In this paper we have put forth an innovative method of obtaining a stereographic image on a single frame using a single lens. This method has been verified experimentally. A preliminary prototype of the same is built with an optimized use of the material available in the laboratory. The prospective applications of this technique are also explored in brief. This method once commercialized, will reduce the expenses incurred in the stereo videography. We also propose a simplified method of obtaining anaglyph.
△ Less
Submitted 8 October, 2006; v1 submitted 23 September, 2006;
originally announced September 2006.
-
Study of ion beam induced mixing in nano-layered Si/C multilayer structures
Authors:
Ram Prakash,
S. Amirthapandian,
D. M. Phase,
S. K. Deshpande,
R. Kesavamoorthy,
K. G. M. Nair
Abstract:
The effects of ion beam induced atomic mixing and subsequent thermal treatment in Si/C multilayer structures are investigated by use of the technique of grazing incidence X-ray diffraction (GIXRD) and Raman spectroscopy. The [Si (3.0 nm) / C (2.5 nm)]x10 /Si multilayer films were prepared by electron beam evaporation under ultra high vacuum (UHV) environment. The layer thicknesses were measured…
▽ More
The effects of ion beam induced atomic mixing and subsequent thermal treatment in Si/C multilayer structures are investigated by use of the technique of grazing incidence X-ray diffraction (GIXRD) and Raman spectroscopy. The [Si (3.0 nm) / C (2.5 nm)]x10 /Si multilayer films were prepared by electron beam evaporation under ultra high vacuum (UHV) environment. The layer thicknesses were measured using in-situ quartz crystal oscillator. These multilayer films were subjected to 40 keV Ar+ ion irradiation with fluences 5E-16 (low fluence) and 1E-17 ions / cm2 (high fluence).The as-prepared and irradiated multilayer samples were annealed at 773 K for one hour. The GIXRD and Raman spectroscopy results reveal the formation of different phases of SiC in these multilayer structures. Deposition induced reactions at the nano-structured interface and subsequent room temperature Ar ion irradiation at low fluence result in formation of the hexagonal SiC phase. High fluence Ar+ ion irradiation and subsequent annealing at 773 K for one hour leads to precipitation of the cubic-SiC phase.
△ Less
Submitted 11 January, 2006;
originally announced January 2006.