
Entropy-Based Strategies for Multi-Bracket Pools

Authors: Ryan S. Brill, Abraham J. Wyner, Ian J. Barnett

Abstract: Much work in the parimutuel betting literature has discussed estimating event outcome probabilities or developing optimal wagering strategies, particularly for horse race betting. Some betting pools, however, involve betting not just on a single event, but on a tuple of events. For example, pick six betting in horse racing, March Madness bracket challenges, and predicting a randomly drawn bitstring each involve making a series of individual forecasts. Although traditional optimal wagering strategies work well when the size of the tuple is very small (e.g., betting on the winner of a horse race), they are intractable for more general betting pools in higher dimensions (e.g., March Madness bracket challenges). Hence we pose the multi-brackets problem: supposing we wish to predict a tuple of events and that we know the true probabilities of each potential outcome of each event, what is the best way to tractably generate a set of $n$ predicted tuples? The most general version of this problem is extremely difficult, so we begin with a simpler setting. In particular, we generate $n$ independent predicted tuples according to a distribution having optimal entropy. This entropy-based approach is tractable, scalable, and performs well.
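The abstract does not say how the sampling distribution is parameterized, so the following is only a minimal sketch of the general idea for the bitstring case. It assumes a hypothetical one-parameter "tempered" family in which each bit is predicted as 1 with probability proportional to `p_i ** lam`; `lam` acts as an entropy knob (`lam = 0` gives uniform guessing, `lam = 1` reproduces the true probabilities, larger values concentrate on favorites). The names `tempered_probs`, `entropy_bits`, and `sample_tuples` are illustrative and do not come from the paper.

```python
import math
import random


def tempered_probs(p, lam):
    """Hypothetical family: probability of predicting 1 is p_i**lam, renormalized.

    Dividing by max(p_i, 1 - p_i) first keeps the computation stable for large lam.
    """
    out = []
    for pi in p:
        m = max(pi, 1.0 - pi)
        a = (pi / m) ** lam
        b = ((1.0 - pi) / m) ** lam
        out.append(a / (a + b))
    return out


def entropy_bits(q):
    """Shannon entropy, in bits, of independent Bernoulli(q_i) predictions."""
    h = 0.0
    for qi in q:
        for x in (qi, 1.0 - qi):
            if x > 0.0:
                h -= x * math.log2(x)
    return h


def sample_tuples(q, n, rng):
    """Draw n independent predicted bitstrings, one Bernoulli(q_i) draw per bit."""
    return [tuple(1 if rng.random() < qi else 0 for qi in q) for _ in range(n)]


if __name__ == "__main__":
    true_p = [0.9, 0.7, 0.55, 0.6]  # assumed known per-event probabilities
    for lam in (0.0, 1.0, 3.0):
        q = tempered_probs(true_p, lam)
        print(lam, round(entropy_bits(q), 3))
    print(sample_tuples(tempered_probs(true_p, 1.0), 3, random.Random(0)))
```

Under these assumptions, choosing the entropy would amount to searching over `lam` for the value that scores best for a given number of entries $n$; how the score is defined and optimized is left to the paper.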

Submitted 20 March, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

arXiv:2303.04535 [pdf, other]

doi 10.1093/jamia/ocad140

The impact of the COVID-19 pandemic on daily rhythms

Authors: Nguyen Luong, Ian Barnett, Talayeh Aledavood

Abstract: The COVID-19 pandemic has significantly impacted daily activity rhythms and life routines. Understanding the dynamics of these impacts on different groups of people is essential for creating environments where people's lives and well-being are least disturbed during such circumstances. Starting in June 2021, we conducted a year-long study to collect high-resolution data from fitness trackers as well as answers to monthly questionnaires from 128 working adults. Using questionnaires, we investigate how routines of exercising and working have changed throughout the pandemic for different people. In addition to that, for each person in the study, we build temporal distributions of daily step counts to quantify their daily movement rhythms and use the inverse of the Earth mover's distance between different movement rhythms to quantify the movement consistency over time. Throughout the pandemic, our cohort shows a shift in exercise routines, manifested in a decrease in time spent on non-walking physical exercises as opposed to the unchanged amount of time spent on walking. In terms of daily rhythms of movement, we show that migrants and those who live alone demonstrate a lower level of consistency of daily rhythms of movement compared to their counterparts. We also observe a relationship between movement and on-site work attendance, as participants who go to work (as opposed to working remotely) also tend to maintain more consistent daily rhythms of movement. Men and migrants show a faster pace in going back to work after the decrease in restriction measures that were set in place due to the pandemic. Our results quantitatively demonstrate the unequal effect of the pandemic among different sub-populations and inform organizations and policymakers to provide more adequate support and adapt to the different needs of different groups in the post-pandemic era.
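The consistency measure mentioned in the abstract (inverse Earth mover's distance between daily step-count distributions) can be illustrated with a small sketch. It assumes each day is summarized as a histogram of steps over equally spaced time bins, treats the day as a linear (not circular) axis with unit bin width, and adds a small `eps` to avoid division by zero for identical days. These choices, and the names `emd_1d` and `consistency`, are assumptions made for illustration and are not taken from the paper.

```python
def normalize(hist):
    """Scale a non-negative histogram so that it sums to 1."""
    total = sum(hist)
    if total <= 0:
        raise ValueError("histogram must contain some steps")
    return [h / total for h in hist]


def emd_1d(hist_a, hist_b):
    """Earth mover's distance between two histograms on the same unit-width bins.

    In one dimension the distance equals the summed absolute difference
    between the two cumulative distributions.
    """
    if len(hist_a) != len(hist_b):
        raise ValueError("histograms must use the same bins")
    a, b = normalize(hist_a), normalize(hist_b)
    running = 0.0  # difference between the two cumulative distributions so far
    dist = 0.0
    for x, y in zip(a, b):
        running += x - y
        dist += abs(running)
    return dist


def consistency(hist_a, hist_b, eps=1e-9):
    """Inverse EMD; eps (an assumption here) keeps identical days finite."""
    return 1.0 / (emd_1d(hist_a, hist_b) + eps)


if __name__ == "__main__":
    # Hypothetical step counts in four 6-hour bins for two days.
    monday = [100, 4000, 3000, 500]
    tuesday = [200, 3500, 3500, 400]
    sunday = [0, 500, 2000, 6000]
    print(consistency(monday, tuesday) > consistency(monday, sunday))
```

Two days with similar timing of movement have a small distance and therefore a high consistency score, regardless of the total number of steps, because the histograms are normalized first.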

Submitted 8 March, 2023; originally announced March 2023.

arXiv:2202.12482 [pdf, other]

Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity

Authors: Shiyun Xu, Zhiqi Bu, Pratik Chaudhari, Ian J. Barnett

Abstract: Interpretable machine learning has demonstrated impressive performance while preserving explainability. In particular, neural additive models (NAM) offer the interpretability to the black-box deep learning and achieve state-of-the-art accuracy among the large family of generalized additive models. In order to empower NAM with feature selection and improve the generalization, we propose the sparse neural additive models (SNAM) that employ the group sparsity regularization (e.g. Group LASSO), where each feature is learned by a sub-network whose trainable parameters are clustered as a group. We study the theoretical properties for SNAM with novel techniques to tackle the non-parametric truth, thus extending from classical sparse linear models such as the LASSO, which only works on the parametric truth. Specifically, we show that SNAM with subgradient and proximal gradient descents provably converges to zero training loss as $t\to\infty$, and that the estimation error of SNAM vanishes asymptotically as $n\to\infty$. We also prove that SNAM, similar to LASSO, can have exact support recovery, i.e. perfect feature selection, with appropriate regularization. Moreover, we show that the SNAM can generalize well and preserve the `identifiability', recovering each feature's effect. We validate our theories via extensive experiments and further testify to the good accuracy and efficiency of SNAM.
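As a rough illustration of how a Group LASSO penalty can switch off whole features under proximal gradient descent, the sketch below implements the standard block soft-thresholding operator, i.e. the proximal map of `threshold * ||w||_2` applied to one group. Representing each feature's sub-network as a flat list of weights, and the names `group_soft_threshold` and `prox_group_lasso`, are simplifying assumptions for this example; the sketch is not the authors' implementation.

```python
import math


def group_soft_threshold(weights, threshold):
    """Proximal operator of threshold * ||w||_2 for a single group of weights.

    The whole group is set to zero when its Euclidean norm is at most the
    threshold; otherwise every weight is shrunk by the same factor.
    """
    norm = math.sqrt(sum(w * w for w in weights))
    if norm <= threshold:
        return [0.0] * len(weights)
    scale = 1.0 - threshold / norm
    return [scale * w for w in weights]


def prox_group_lasso(groups, step, lam):
    """Apply block soft-thresholding to every group after a gradient step.

    groups: one flat weight list per feature sub-network (an assumed layout).
    step:   gradient step size; lam: regularization strength.
    """
    return [group_soft_threshold(g, step * lam) for g in groups]


if __name__ == "__main__":
    # Two hypothetical feature groups: one with large weights, one with small.
    groups = [[3.0, 4.0], [0.3, 0.4]]
    print(prox_group_lasso(groups, step=1.0, lam=1.0))
```

In this toy call the second group has norm 0.5, below the threshold of 1.0, so all of its weights become zero together, which corresponds to dropping that feature from the additive model.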

Submitted 24 February, 2022; originally announced February 2022.

arXiv:1610.05868 [pdf, other]

Feature-Based Classification of Networks

Authors: Ian Barnett, Nishant Malik, Marieke L. Kuijjer, Peter J. Mucha, Jukka-Pekka Onnela

Abstract: Network representations of systems from various scientific and societal domains are neither completely random nor fully regular, but instead appear to contain recurring structural building blocks. These features tend to be shared by networks belonging to the same broad class, such as the class of social networks or the class of biological networks. At a finer scale of classification within each such class, networks describing more similar systems tend to have more similar features. This occurs presumably because networks representing similar purposes or constructions would be expected to be generated by a shared set of domain specific mechanisms, and it should therefore be possible to classify these networks into categories based on their features at various structural levels. Here we describe and demonstrate a new, hybrid approach that combines manual selection of features of potential interest with existing automated classification methods. In particular, selecting well-known and well-studied features that have been used throughout social network analysis and network science and then classifying with methods such as random forests that are of special utility in the presence of feature collinearity, we find that we achieve higher accuracy, in shorter computation time, with greater interpretability of the network classification results.

Submitted 19 October, 2016; originally announced October 2016.

Comments: 14 pages including 4 figures and a table. Methods and supplementary material included to the end

arXiv:1605.06898 [pdf, other]

doi 10.1371/journal.pone.0156794

Social and Spatial Clustering of People at Humanity's Largest Gathering

Authors: Ian Barnett, Tarun Khanna, Jukka-Pekka Onnela

Abstract: Macroscopic behavior of scientific and societal systems results from the aggregation of microscopic behaviors of their constituent elements, but connecting the macroscopic with the microscopic in human behavior has traditionally been difficult. Manifestations of homophily, the notion that individuals tend to interact with others who resemble them, have been observed in many small and intermediate size settings. However, whether this behavior translates to truly macroscopic levels, and what its consequences may be, remains unknown. Here, we use call detail records (CDRs) to examine the population dynamics and manifestations of social and spatial homophily at a macroscopic level among the residents of 23 states of India at the Kumbh Mela, a 3-month-long Hindu festival. We estimate that the festival was attended by 61 million people, making it the largest gathering in the history of humanity. While we find strong overall evidence for both types of homophily for residents of different states, participants from low-representation states show considerably stronger propensity for both social and spatial homophily than those from high-representation states. These manifestations of homophily are amplified on crowded days, such as the peak day of the festival, which we estimate was attended by 25 million people. Our findings confirm that homophily, which here likely arises from social influence, permeates all scales of human behavior.

Submitted 23 May, 2016; originally announced May 2016.

Comments: 16 pages, 4 figures

Showing 1–5 of 5 results for author: Barnett, I