-
Fast Direct Methods for Gaussian Processes
Authors:
Sivaram Ambikasaran,
Daniel Foreman-Mackey,
Leslie Greengard,
David W. Hogg,
Michael O'Neil
Abstract:
A number of problems in probability and statistics can be addressed using the multivariate normal (Gaussian) distribution. In the one-dimensional case, computing the probability for a given mean and variance simply requires the evaluation of the corresponding Gaussian density. In the $n$-dimensional setting, however, it requires the inversion of an $n \times n$ covariance matrix, $C$, as well as t…
▽ More
A number of problems in probability and statistics can be addressed using the multivariate normal (Gaussian) distribution. In the one-dimensional case, computing the probability for a given mean and variance simply requires the evaluation of the corresponding Gaussian density. In the $n$-dimensional setting, however, it requires the inversion of an $n \times n$ covariance matrix, $C$, as well as the evaluation of its determinant, $\det(C)$. In many cases, such as regression using Gaussian processes, the covariance matrix is of the form $C = σ^2 I + K$, where $K$ is computed using a specified covariance kernel which depends on the data and additional parameters (hyperparameters). The matrix $C$ is typically dense, causing standard direct methods for inversion and determinant evaluation to require $\mathcal O(n^3)$ work. This cost is prohibitive for large-scale modeling. Here, we show that for the most commonly used covariance functions, the matrix $C$ can be hierarchically factored into a product of block low-rank updates of the identity matrix, yielding an $\mathcal O (n\log^2 n) $ algorithm for inversion. More importantly, we show that this factorization enables the evaluation of the determinant $\det(C)$, permitting the direct calculation of probabilities in high dimensions under fairly broad assumptions on the kernel defining $K$. Our fast algorithm brings many problems in marginalization and the adaptation of hyperparameters within practical reach using a single CPU core. The combination of nearly optimal scaling in terms of problem size with high-performance computing resources will permit the modeling of previously intractable problems. We illustrate the performance of the scheme on standard covariance kernels.
△ Less
Submitted 4 April, 2015; v1 submitted 24 March, 2014;
originally announced March 2014.
-
A New Approach to Detailed Structural Decomposition from the SPLASH and PHAT Surveys: Kicked-up Disk Stars in the Andromeda Galaxy?
Authors:
Claire E. Dorman,
Lawrence M. Widrow,
Puragra Guhathakurta,
Anil C. Seth,
Daniel Foreman-Mackey,
Eric F. Bell,
Julianne J. Dalcanton,
Karoline M. Gilbert,
Evan D. Skillman,
Benjamin F. Williams
Abstract:
We characterize the bulge, disk, and halo subcomponents in the Andromeda galaxy (M31) over the radial range 4 < R_proj < 225 kpc. The cospatial nature of these subcomponents renders them difficult to disentangle using surface brightness (SB) information alone, especially interior to ~20 kpc. Our new decomposition technique combines information from the luminosity function (LF) of over 1.5 million…
▽ More
We characterize the bulge, disk, and halo subcomponents in the Andromeda galaxy (M31) over the radial range 4 < R_proj < 225 kpc. The cospatial nature of these subcomponents renders them difficult to disentangle using surface brightness (SB) information alone, especially interior to ~20 kpc. Our new decomposition technique combines information from the luminosity function (LF) of over 1.5 million bright (20 < m_814W < 22) stars from the Panchromatic Hubble Andromeda Treasury (PHAT) survey, radial velocities of over 5000 red giant branch stars in the same magnitude range from the Spectroscopic and Photometric Landscape of Andromeda's Stellar Halo (SPLASH) survey, and integrated I-band SB profiles from various sources. We use an affine-invariant Markov chain Monte Carlo algorithm to fit an appropriate toy model to these three data sets. The bulge, disk, and halo SB profiles are modeled as a Sersic, exponential, and cored power-law, respectively, and the LFs are modeled as broken power-laws. We present probability distributions for each of 32 parameters describing the SB profiles and LFs of the three subcomponents. We find that the number of stars with a disk-like LF is ~5% larger than the the number with disk-like (dynamically cold) kinematics, suggesting that some stars born in the disk have been dynamically heated to the point that they are kinematically indistinguishable from halo members. This is the first kinematical evidence for a "kicked-up disk" halo population in M31. The fraction of kicked-up disk stars is consistent with that found in simulations. We also find evidence for a radially varying disk LF, consistent with a negative metallicity gradient in the stellar disk.
△ Less
Submitted 15 October, 2013;
originally announced October 2013.
-
Maximizing Kepler science return per telemetered pixel: Searching the habitable zones of the brightest stars
Authors:
Benjamin T. Montet,
Ruth Angus,
Tom Barclay,
Rebekah Dawson,
Rob Fergus,
Dan Foreman-Mackey,
Stefan Harmeling,
Michael Hirsch,
David W. Hogg,
Dustin Lang,
David Schiminovich,
Bernhard Scholkopf
Abstract:
In today's mailing, Hogg et al. propose image modeling techniques to maintain 10-ppm-level precision photometry in Kepler data with only two working reaction wheels. While these results are relevant to many scientific goals for the repurposed mission, all modeling efforts so far have used a toy model of the Kepler telescope. Because the two-wheel performance of Kepler remains to be determined, we…
▽ More
In today's mailing, Hogg et al. propose image modeling techniques to maintain 10-ppm-level precision photometry in Kepler data with only two working reaction wheels. While these results are relevant to many scientific goals for the repurposed mission, all modeling efforts so far have used a toy model of the Kepler telescope. Because the two-wheel performance of Kepler remains to be determined, we advocate for the consideration of an alternate strategy for a >1 year program that maximizes the science return from the "low-torque" fields across the ecliptic plane. Assuming we can reach the precision of the original Kepler mission, we expect to detect 800 new planet candidates in the first year of such a mission. Our proposed strategy has benefits for transit timing variation and transit duration variation studies, especially when considered in concert with the future TESS mission. We also expect to help address the first key science goal of Kepler: the frequency of planets in the habitable zone as a function of spectral type.
△ Less
Submitted 3 September, 2013;
originally announced September 2013.
-
Maximizing Kepler science return per telemetered pixel: Detailed models of the focal plane in the two-wheel era
Authors:
David W. Hogg,
Ruth Angus,
Tom Barclay,
Rebekah Dawson,
Rob Fergus,
Dan Foreman-Mackey,
Stefan Harmeling,
Michael Hirsch,
Dustin Lang,
Benjamin T. Montet,
David Schiminovich,
Bernhard Schölkopf
Abstract:
Kepler's immense photometric precision to date was maintained through satellite stability and precise pointing. In this white paper, we argue that image modeling--fitting the Kepler-downlinked raw pixel data--can vastly improve the precision of Kepler in pointing-degraded two-wheel mode. We argue that a non-trivial modeling effort may permit continuance of photometry at 10-ppm-level precision. We…
▽ More
Kepler's immense photometric precision to date was maintained through satellite stability and precise pointing. In this white paper, we argue that image modeling--fitting the Kepler-downlinked raw pixel data--can vastly improve the precision of Kepler in pointing-degraded two-wheel mode. We argue that a non-trivial modeling effort may permit continuance of photometry at 10-ppm-level precision. We demonstrate some baby steps towards precise models in both data-driven (flexible) and physics-driven (interpretably parameterized) modes. We demonstrate that the expected drift or jitter in positions in the two-weel era will help with constraining calibration parameters. In particular, we show that we can infer the device flat-field at higher than pixel resolution; that is, we can infer pixel-to-pixel variations in intra-pixel sensitivity. These results are relevant to almost any scientific goal for the repurposed mission; image modeling ought to be a part of any two-wheel repurpose for the satellite. We make other recommendations for Kepler operations, but fundamentally advocate that the project stick with its core mission of finding and characterizing Earth analogs. [abridged]
△ Less
Submitted 3 September, 2013;
originally announced September 2013.
-
The Panchromatic Hubble Andromeda Treasury IV. A Probabilistic Approach to Inferring the High Mass Stellar Initial Mass Function and Other Power-law Functions
Authors:
Daniel R. Weisz,
Morgan Fouesneau,
David W. Hogg,
Hans-Walter Rix,
Andrew E. Dolphin,
Julianne J. Dalcanton,
Daniel T. Foreman-Mackey,
Dustin Lang,
L. Clifton Johnson,
Lori C. Beerman,
Eric F. Bell,
Karl D. Gordon,
Dimitrios Gouliermis,
Jason S. Kalirai,
Evan D. Skillman,
Benjamin F. Williams
Abstract:
We present a probabilistic approach for inferring the parameters of the present day power-law stellar mass function (MF) of a resolved young star cluster. This technique (a) fully exploits the information content of a given dataset; (b) accounts for observational uncertainties in a straightforward way; (c) assigns meaningful uncertainties to the inferred parameters; (d) avoids the pitfalls associa…
▽ More
We present a probabilistic approach for inferring the parameters of the present day power-law stellar mass function (MF) of a resolved young star cluster. This technique (a) fully exploits the information content of a given dataset; (b) accounts for observational uncertainties in a straightforward way; (c) assigns meaningful uncertainties to the inferred parameters; (d) avoids the pitfalls associated with binning data; and (e) is applicable to virtually any resolved young cluster, laying the groundwork for a systematic study of the high mass stellar MF (M > 1 Msun). Using simulated clusters and Markov chain Monte Carlo sampling of the probability distribution functions, we show that estimates of the MF slope, α, are unbiased and that the uncertainty, Δα, depends primarily on the number of observed stars and stellar mass range they span, assuming that the uncertainties on individual masses and the completeness are well-characterized. Using idealized mock data, we compute the lower limit precision on α and provide an analytic approximation for Δα as a function of the observed number of stars and mass range. We find that ~ 3/4 of quoted literature uncertainties are smaller than the theoretical lower limit. By correcting these uncertainties to the theoretical lower limits, we find the literature studies yield <α>=2.46 with a 1-σ dispersion of 0.35 dex. We verify that it is impossible for a power-law MF to obtain meaningful constraints on the upper mass limit of the IMF. We show that avoiding substantial biases in the MF slope requires: (1) including the MF as a prior when deriving individual stellar mass estimates; (2) modeling the uncertainties in the individual stellar masses; and (3) fully characterizing and then explicitly modeling the completeness for stars of a given mass. (abridged)
△ Less
Submitted 26 November, 2012;
originally announced November 2012.
-
Probabilistic Catalogs for Crowded Stellar Fields
Authors:
Brendon J. Brewer,
Daniel Foreman-Mackey,
David W. Hogg
Abstract:
We present and implement a probabilistic (Bayesian) method for producing catalogs from images of stellar fields. The method is capable of inferring the number of sources N in the image and can also handle the challenges introduced by noise, overlap** sources, and an unknown point spread function (PSF). The luminosity function of the stars can also be inferred even when the precise luminosity of…
▽ More
We present and implement a probabilistic (Bayesian) method for producing catalogs from images of stellar fields. The method is capable of inferring the number of sources N in the image and can also handle the challenges introduced by noise, overlap** sources, and an unknown point spread function (PSF). The luminosity function of the stars can also be inferred even when the precise luminosity of each star is uncertain, via the use of a hierarchical Bayesian model. The computational feasibility of the method is demonstrated on two simulated images with different numbers of stars. We find that our method successfully recovers the input parameter values along with principled uncertainties even when the field is crowded. We also compare our results with those obtained from the SExtractor software. While the two approaches largely agree about the fluxes of the bright stars, the Bayesian approach provides more accurate inferences about the faint stars and the number of stars, particularly in the crowded case.
△ Less
Submitted 20 April, 2013; v1 submitted 25 November, 2012;
originally announced November 2012.
-
emcee: The MCMC Hammer
Authors:
Daniel Foreman-Mackey,
David W. Hogg,
Dustin Lang,
Jonathan Goodman
Abstract:
We introduce a stable, well tested Python implementation of the affine-invariant ensemble sampler for Markov chain Monte Carlo (MCMC) proposed by Goodman & Weare (2010). The code is open source and has already been used in several published projects in the astrophysics literature. The algorithm behind emcee has several advantages over traditional MCMC sampling methods and it has excellent performa…
▽ More
We introduce a stable, well tested Python implementation of the affine-invariant ensemble sampler for Markov chain Monte Carlo (MCMC) proposed by Goodman & Weare (2010). The code is open source and has already been used in several published projects in the astrophysics literature. The algorithm behind emcee has several advantages over traditional MCMC sampling methods and it has excellent performance as measured by the autocorrelation time (or function calls per independent sample). One major advantage of the algorithm is that it requires hand-tuning of only 1 or 2 parameters compared to $\sim N^2$ for a traditional algorithm in an N-dimensional parameter space. In this document, we describe the algorithm and the details of our implementation and API. Exploiting the parallelism of the ensemble method, emcee permits any user to take advantage of multiple CPU cores without extra effort. The code is available online at http://dan.iel.fm/emcee under the MIT License.
△ Less
Submitted 25 November, 2013; v1 submitted 16 February, 2012;
originally announced February 2012.