-
Inferring couplings in networks across order-disorder phase transitions
Authors:
Vudtiwat Ngampruetikorn,
Vedant Sachdeva,
Johanna Torrence,
Jan Humplik,
David J. Schwab,
Stephanie E. Palmer
Abstract:
Statistical inference is central to many scientific endeavors, yet how it works remains unresolved. Answering this requires a quantitative understanding of the intrinsic interplay between statistical models, inference methods and data structure. To this end, we characterize the efficacy of direct coupling analysis (DCA)--a highly successful method for analyzing amino acid sequence data--in inferri…
▽ More
Statistical inference is central to many scientific endeavors, yet how it works remains unresolved. Answering this requires a quantitative understanding of the intrinsic interplay between statistical models, inference methods and data structure. To this end, we characterize the efficacy of direct coupling analysis (DCA)--a highly successful method for analyzing amino acid sequence data--in inferring pairwise interactions from samples of ferromagnetic Ising models on random graphs. Our approach allows for physically motivated exploration of qualitatively distinct data regimes separated by phase transitions. We show that inference quality depends strongly on the nature of generative models: optimal accuracy occurs at an intermediate temperature where the detrimental effects from macroscopic order and thermal noise are minimal. Importantly our results indicate that DCA does not always outperform its local-statistics-based predecessors; while DCA excels at low temperatures, it becomes inferior to simple correlation thresholding at virtually all temperatures when data are limited. Our findings offer new insights into the regime in which DCA operates so successfully and more broadly how inference interacts with data structure.
△ Less
Submitted 25 August, 2021; v1 submitted 4 June, 2021;
originally announced June 2021.
-
Extending Hypothesis Testing with Persistence Homology to Three or More Groups
Authors:
Christopher Cericola,
Inga Johnson,
Joshua Kiers,
Mitchell Krock,
Jordan Purdy,
Johanna Torrence
Abstract:
We extend the work of Robinson and Turner to use hypothesis testing with persistence homology to test for measurable differences in shape between point clouds from three or more groups. Using samples of point clouds from three distinct groups, we conduct a large-scale simulation study to validate our proposed extension. We consider various combinations of groups, samples sizes and measurement erro…
▽ More
We extend the work of Robinson and Turner to use hypothesis testing with persistence homology to test for measurable differences in shape between point clouds from three or more groups. Using samples of point clouds from three distinct groups, we conduct a large-scale simulation study to validate our proposed extension. We consider various combinations of groups, samples sizes and measurement errors in the simulation study, providing for each combination the percentage of $p$-values below an alpha-level of 0.05. Additionally, we apply our method to a Cardiotocography data set and find statistically significant evidence of measurable differences in shape between normal, suspect and pathologic health status groups.
△ Less
Submitted 14 January, 2016;
originally announced February 2016.
-
Identification of RR Lyrae Variables in SDSS from Single-Epoch Photometric and Spectroscopic Observations
Authors:
Ronald Wilhelm,
W. Lee Powell Jr.,
Timothy C. Beers,
Branimir Sesar,
Carlos Alende Prieto,
Kenneth W. Carrell,
Young Sun Lee,
Brian Yanny,
Constance M. Rockosi,
Nathan De Lee,
Gwen Hansford Armstrong,
Stephen J. Torrence
Abstract:
We describe a new RR Lyrae identification technique based on out-of-phase single-epoch photometric and spectroscopic observations contained in SDSS Data Release 6 (DR-6). This technique detects variability by exploiting the large disparity between the g-r color and the strength of the hydrogen Balmer lines when the two observations are made at random phases. Comparison with a large sample of kno…
▽ More
We describe a new RR Lyrae identification technique based on out-of-phase single-epoch photometric and spectroscopic observations contained in SDSS Data Release 6 (DR-6). This technique detects variability by exploiting the large disparity between the g-r color and the strength of the hydrogen Balmer lines when the two observations are made at random phases. Comparison with a large sample of known variables in the SDSS equatorial stripe (Stripe 82) shows that the discovery efficiency for our technique is ~85%. Analysis of stars with multiple spectroscopic observations suggests a similar efficiency throughout the entire DR-6 sample. We also develop a technique to estimate the average g apparent magnitude (over the pulsation cycle) for individual RR Lyrae stars, using the <g-r> for the entire sample and measured colors for each star. The resulting distances are found to have precisions of ~14%. Finally, we explore the properties of our DR-6 sample of N = 1087 variables, and recover portions of the Sagittarius Northern and Southern Stream. Analysis of the distance and velocity for the Southern Stream are consistent with previously published data for blue horizontal-branch stars. In a sample near the North Galactic Polar Cap, we find evidence for the descending leading Northern arm, and a possible detection of the trailing arm.
△ Less
Submitted 5 December, 2007;
originally announced December 2007.