-
Learning sources of variability from high-dimensional observational studies
Authors:
Eric W. Bridgeford,
Jaewon Chung,
Brian Gilbert,
Sambit Panda,
Adam Li,
Cencheng Shen,
Alexandra Badea,
Brian Caffo,
Joshua T. Vogelstein
Abstract:
Causal inference studies whether the presence of a variable influences an observed outcome. As measured by quantities such as the "average treatment effect," this paradigm is employed across numerous biological fields, from vaccine and drug development to policy interventions. Unfortunately, the majority of these methods are often limited to univariate outcomes. Our work generalizes causal estiman…
▽ More
Causal inference studies whether the presence of a variable influences an observed outcome. As measured by quantities such as the "average treatment effect," this paradigm is employed across numerous biological fields, from vaccine and drug development to policy interventions. Unfortunately, the majority of these methods are often limited to univariate outcomes. Our work generalizes causal estimands to outcomes with any number of dimensions or any measurable space, and formulates traditional causal estimands for nominal variables as causal discrepancy tests. We propose a simple technique for adjusting universally consistent conditional independence tests and prove that these tests are universally consistent causal discrepancy tests. Numerical experiments illustrate that our method, Causal CDcorr, leads to improvements in both finite sample validity and power when compared to existing strategies. Our methods are all open source and available at github.com/ebridge2/cdcorr.
△ Less
Submitted 28 November, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Polarity is all you need to learn and transfer faster
Authors:
Qingyang Wang,
Michael A. Powell,
Ali Geisa,
Eric W. Bridgeford,
Joshua T. Vogelstein
Abstract:
Natural intelligences (NIs) thrive in a dynamic world - they learn quickly, sometimes with only a few samples. In contrast, artificial intelligences (AIs) typically learn with a prohibitive number of training samples and computational power. What design principle difference between NI and AI could contribute to such a discrepancy? Here, we investigate the role of weight polarity: development proce…
▽ More
Natural intelligences (NIs) thrive in a dynamic world - they learn quickly, sometimes with only a few samples. In contrast, artificial intelligences (AIs) typically learn with a prohibitive number of training samples and computational power. What design principle difference between NI and AI could contribute to such a discrepancy? Here, we investigate the role of weight polarity: development processes initialize NIs with advantageous polarity configurations; as NIs grow and learn, synapse magnitudes update, yet polarities are largely kept unchanged. We demonstrate with simulation and image classification tasks that if weight polarities are adequately set a priori, then networks learn with less time and data. We also explicitly illustrate situations in which a priori setting the weight polarities is disadvantageous for networks. Our work illustrates the value of weight polarities from the perspective of statistical and computational efficiency during learning.
△ Less
Submitted 30 May, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Why do networks have inhibitory/negative connections?
Authors:
Qingyang Wang,
Michael A. Powell,
Ali Geisa,
Eric Bridgeford,
Carey E. Priebe,
Joshua T. Vogelstein
Abstract:
Why do brains have inhibitory connections? Why do deep networks have negative weights? We propose an answer from the perspective of representation capacity. We believe representing functions is the primary role of both (i) the brain in natural intelligence, and (ii) deep networks in artificial intelligence. Our answer to why there are inhibitory/negative weights is: to learn more functions. We pro…
▽ More
Why do brains have inhibitory connections? Why do deep networks have negative weights? We propose an answer from the perspective of representation capacity. We believe representing functions is the primary role of both (i) the brain in natural intelligence, and (ii) deep networks in artificial intelligence. Our answer to why there are inhibitory/negative weights is: to learn more functions. We prove that, in the absence of negative weights, neural networks with non-decreasing activation functions are not universal approximators. While this may be an intuitive result to some, to the best of our knowledge, there is no formal theory, in either machine learning or neuroscience, that demonstrates why negative weights are crucial in the context of representation capacity. Further, we provide insights on the geometric properties of the representation space that non-negative deep networks cannot represent. We expect these insights will yield a deeper understanding of more sophisticated inductive priors imposed on the distribution of weights that lead to more efficient biological and machine learning.
△ Less
Submitted 17 August, 2023; v1 submitted 5 August, 2022;
originally announced August 2022.
-
hyppo: A Multivariate Hypothesis Testing Python Package
Authors:
Sambit Panda,
Satish Palaniappan,
Junhao Xiong,
Eric W. Bridgeford,
Ronak Mehta,
Cencheng Shen,
Joshua T. Vogelstein
Abstract:
We introduce hyppo, a unified library for performing multivariate hypothesis testing, including independence, two-sample, and k-sample testing. While many multivariate independence tests have R packages available, the interfaces are inconsistent and most are not available in Python. hyppo includes many state of the art multivariate testing procedures. The package is easy-to-use and is flexible eno…
▽ More
We introduce hyppo, a unified library for performing multivariate hypothesis testing, including independence, two-sample, and k-sample testing. While many multivariate independence tests have R packages available, the interfaces are inconsistent and most are not available in Python. hyppo includes many state of the art multivariate testing procedures. The package is easy-to-use and is flexible enough to enable future extensions. The documentation and all releases are available at https://hyppo.neurodata.io.
△ Less
Submitted 1 April, 2021; v1 submitted 3 July, 2019;
originally announced July 2019.
-
GraSPy: Graph Statistics in Python
Authors:
Jaewon Chung,
Benjamin D. Pedigo,
Eric W. Bridgeford,
Bijan K. Varjavand,
Hayden S. Helm,
Joshua T. Vogelstein
Abstract:
We introduce GraSPy, a Python library devoted to statistical inference, machine learning, and visualization of random graphs and graph populations. This package provides flexible and easy-to-use algorithms for analyzing and understanding graphs with a scikit-learn compliant API. GraSPy can be downloaded from Python Package Index (PyPi), and is released under the Apache 2.0 open-source license. The…
▽ More
We introduce GraSPy, a Python library devoted to statistical inference, machine learning, and visualization of random graphs and graph populations. This package provides flexible and easy-to-use algorithms for analyzing and understanding graphs with a scikit-learn compliant API. GraSPy can be downloaded from Python Package Index (PyPi), and is released under the Apache 2.0 open-source license. The documentation and all releases are available at https://neurodata.io/graspy.
△ Less
Submitted 14 August, 2019; v1 submitted 29 March, 2019;
originally announced April 2019.
-
On a 'Two Truths' Phenomenon in Spectral Graph Clustering
Authors:
Carey E. Priebe,
Youngser Park,
Joshua T. Vogelstein,
John M. Conroy,
Vince Lyzinski,
Minh Tang,
Avanti Athreya,
Joshua Cape,
Eric Bridgeford
Abstract:
Clustering is concerned with coherently grou** observations without any explicit concept of true grou**s. Spectral graph clustering - clustering the vertices of a graph based on their spectral embedding - is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian or Adjacency spectral embedding (LSE or ASE). Recent theoretical resu…
▽ More
Clustering is concerned with coherently grou** observations without any explicit concept of true grou**s. Spectral graph clustering - clustering the vertices of a graph based on their spectral embedding - is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian or Adjacency spectral embedding (LSE or ASE). Recent theoretical results provide new understanding of the problem and solutions, and lead us to a 'Two Truths' LSE vs. ASE spectral graph clustering phenomenon convincingly illustrated here via a diffusion MRI connectome data set: the different embedding methods yield different clustering results, with LSE capturing left hemisphere/right hemisphere affinity structure and ASE capturing gray matter/white matter core-periphery structure.
△ Less
Submitted 11 February, 2019; v1 submitted 23 August, 2018;
originally announced August 2018.