Search | arXiv e-print repository

doi 10.3847/1538-4357/833/2/276

Hydrogen Emission from the Ionized Gaseous Halos of Low Redshift Galaxies

Authors: Huanian Zhang, Dennis Zaritsky, Guangtun Zhu, Brice Ménard, David W. Hogg

Abstract: Using a sample of nearly half million galaxies, intersected by over 7 million lines of sight from the Sloan Digital Sky Survey Data Release 12, we trace H$α$ + [N{\small II}] emission from a galactocentric projected radius, $r_p$, of 5 kpc to more than 100 kpc. The emission flux surface brightness is $\propto r_p^{-1.9 \pm 0.4}$. We obtain consistent results using only the H$α$ or [N{\small II}] f… ▽ More Using a sample of nearly half million galaxies, intersected by over 7 million lines of sight from the Sloan Digital Sky Survey Data Release 12, we trace H$α$ + [N{\small II}] emission from a galactocentric projected radius, $r_p$, of 5 kpc to more than 100 kpc. The emission flux surface brightness is $\propto r_p^{-1.9 \pm 0.4}$. We obtain consistent results using only the H$α$ or [N{\small II}] flux. We measure a stronger signal for the bluer half of the target sample than for the redder half on small scales, $r_p <$ 20 kpc. We obtain a $3σ$ detection of H$α$ + [N{\small II}] emission in the 50 to 100 kpc $r_p$ bin. The mean emission flux within this bin is $(1.10 \pm 0.35) \times 10^{-20}$ erg cm$^{-2}$ s$^{-1}$ Å$^{-1}$, which corresponds to $1.87 \times 10^{-20}$ erg cm$^{-2}$ s$^{-1}$ arcsec$^{-2}$ or 0.0033 Rayleigh. This detection is 34 times fainter than a previous strict limit obtained using deep narrow-band imaging. The faintness of the signal demonstrates why it has been so difficult to trace recombination radiation out to large radii around galaxies. This signal, combined with published estimates of n$_{\rm H}$, lead us to estimate the temperature of the gas to be 12,000 K, consistent with independent empirical estimates based on metal ion absorption lines and expectations from numerical simulations. △ Less

Submitted 31 October, 2016; originally announced November 2016.

Comments: 12 pages, 13 figures

arXiv:1610.07602 [pdf, other]

doi 10.3847/1538-4357/aa5e50

The Joker: A custom Monte Carlo sampler for binary-star and exoplanet radial velocity data

Authors: Adrian M. Price-Whelan, David W. Hogg, Daniel Foreman-Mackey, Hans-Walter Rix

Abstract: Given sparse or low-quality radial-velocity measurements of a star, there are often many qualitatively different stellar or exoplanet companion orbit models that are consistent with the data. The consequent multimodality of the likelihood function leads to extremely challenging search, optimization, and MCMC posterior sampling over the orbital parameters. Here we create a custom Monte Carlo sample… ▽ More Given sparse or low-quality radial-velocity measurements of a star, there are often many qualitatively different stellar or exoplanet companion orbit models that are consistent with the data. The consequent multimodality of the likelihood function leads to extremely challenging search, optimization, and MCMC posterior sampling over the orbital parameters. Here we create a custom Monte Carlo sampler for sparse or noisy radial-velocity measurements of two-body systems that can produce posterior samples for orbital parameters even when the likelihood function is poorly behaved. The six standard orbital parameters for a binary system can be split into four non-linear parameters (period, eccentricity, argument of pericenter, phase) and two linear parameters (velocity amplitude, barycenter velocity). We capitalize on this by building a sampling method in which we densely sample the prior pdf in the non-linear parameters and perform rejection sampling using a likelihood function marginalized over the linear parameters. With sparse or uninformative data, the sampling obtained by this rejection sampling is generally multimodal and dense. With informative data, the sampling becomes effectively unimodal but too sparse: in these cases we follow the rejection sampling with standard MCMC. The method produces correct samplings in orbital parameters for data that include as few as three epochs. The Joker can therefore be used to produce proper samplings of multimodal pdfs, which are still informative and can be used in hierarchical (population) modeling. We give some examples that show how the posterior pdf depends sensitively on the number and time coverage of the observations and their uncertainties. △ Less

Submitted 29 March, 2017; v1 submitted 24 October, 2016; originally announced October 2016.

Comments: published in ApJ

arXiv:1610.05873 [pdf, other]

Do fast stellar centroiding methods saturate the Cramér-Rao lower bound?

Authors: Mohammadjavad Vakili, David W. Hogg

Abstract: One of the most demanding tasks in astronomical image processing---in terms of precision---is the centroiding of stars. Upcoming large surveys are going to take images of billions of point sources, including many faint stars, with short exposure times. Real-time estimation of the centroids of stars is crucial for real-time PSF estimation, and maximal precision is required for measurements of prope… ▽ More One of the most demanding tasks in astronomical image processing---in terms of precision---is the centroiding of stars. Upcoming large surveys are going to take images of billions of point sources, including many faint stars, with short exposure times. Real-time estimation of the centroids of stars is crucial for real-time PSF estimation, and maximal precision is required for measurements of proper motion. The fundamental Cramér-Rao lower bound sets a limit on the root-mean-squared-error achievable by optimal estimators. In this work, we aim to compare the performance of various centroiding methods, in terms of saturating the bound, when they are applied to relatively low signal-to-noise ratio unsaturated stars assuming zero-mean constant Gaussian noise. In order to make this comparison, we present the ratio of the root-mean-squared-errors of these estimators to their corresponding Cramér-Rao bound as a function of the signal-to-noise ratio and the full-width at half-maximum of faint stars. We discuss two general circumstances in centroiding of faint stars: (i) when we have a good estimate of the PSF, (ii) when we do not know the PSF. In the case that we know the PSF, we show that a fast polynomial centroiding after smoothing the image by the PSF can be as efficient as the maximum-likelihood estimator at saturating the bound. In the case that we do not know the PSF, we demonstrate that although polynomial centroiding is not as optimal as PSF profile fitting, it comes very close to saturating the Cramér-Rao lower bound in a wide range of conditions. We also show that the moment-based method of center-of-light never comes close to saturating the bound, and thus it does not deliver reliable estimates of centroids. △ Less

Submitted 19 October, 2016; originally announced October 2016.

Comments: 25 pages, 8 figures

arXiv:1609.03195 [pdf, other]

doi 10.3847/1538-4357/aa6db3

Masses and Ages for 230,000 LAMOST Giants, via Their Carbon and Nitrogen Abundances

Authors: Anna Y. Q. Ho, Hans-Walter Rix, Melissa K. Ness, David W. Hogg, Chao Liu, Yuan-Sen Ting

Abstract: We measure carbon and nitrogen abundances to $\lesssim$ 0.1 dex for 450,000 giant stars from their low-resolution (R$\sim$1800) LAMOST DR2 survey spectra. We use these [C/M] and [N/M] measurements, together with empirical relations based on the APOKASC sample, to infer stellar masses and implied ages for 230,000 of these objects to 0.08 dex and 0.2 dex respectively. We use The Cannon, a data-drive… ▽ More We measure carbon and nitrogen abundances to $\lesssim$ 0.1 dex for 450,000 giant stars from their low-resolution (R$\sim$1800) LAMOST DR2 survey spectra. We use these [C/M] and [N/M] measurements, together with empirical relations based on the APOKASC sample, to infer stellar masses and implied ages for 230,000 of these objects to 0.08 dex and 0.2 dex respectively. We use The Cannon, a data-driven approach to spectral modeling, to construct a predictive model for LAMOST spectra. Our reference set comprises 8125 stars observed in common between the APOGEE and LAMOST surveys, taking seven APOGEE DR12 labels (parameters) as ground truth: Teff, logg, [M/H], [$α$/M], [C/M], [N/M], and Ak. We add seven colors to the Cannon model, based on the g, r, i, J, H, K, W1, and W2 magnitudes from APASS, 2MASS & WISE, which improves our constraints on Teff and logg by up to 20% and on Ak by up to 70%. Cross-validation of the model demonstrates that, for high-SNR objects, our inferred labels agree with the APOGEE values to within 50 K in temperature, 0.04 magnitudes in Ak, and < 0.1 dex in logg, [M/H], [C/M], [N/M], and [$α$/M]. We apply the model to 450,000 giants in LAMOST DR2 that have not been observed by APOGEE. This demonstrates that precise individual abundances can be measured from low-resolution spectra, and represents the largest catalog of [C/M], [N/M], masses and ages to date. As as result, we greatly increase the number and sky coverage of stars with mass and age estimates. △ Less

Submitted 14 June, 2017; v1 submitted 11 September, 2016; originally announced September 2016.

Comments: Accepted for publication in ApJ. Associated code available at https://github.com/annayqho/TheCannon

Journal ref: The Astrophysical Journal, Volume 841, Issue 1, article id. 40, 12 pp. (2017)

arXiv:1609.02914 [pdf, other]

doi 10.3847/1538-4357/aa69c2

The RAVE-on catalog of stellar atmospheric parameters and chemical abundances for chemo-dynamic studies in the Gaia era

Authors: Andrew R. Casey, Keith Hawkins, David W. Hogg, Melissa Ness, Hans Walter-Rix, Georges Kordopatis, Andrea Kunder, Matthias Steinmetz, Sergey Koposov, Harry Enke, Jason Sanders, Gerry Gilmore, Tomaž Zwitter, Kenneth C. Freeman, Luca Casagrande, Gal Matijevič, George Seabroke, Olivier Bienaymé, Joss Bland-Hawthorn, Brad K. Gibson, Eva K. Grebel, Amina Helmi, Ulisse Munari, Julio F. Navarro, Warren Reid , et al. (2 additional authors not shown)

Abstract: The orbits, atmospheric parameters, chemical abundances, and ages of individual stars in the Milky Way provide the most comprehensive illustration of galaxy formation available. The Tycho-Gaia Astrometric Solution (TGAS) will deliver astrometric parameters for the largest ever sample of Milky Way stars, though its full potential cannot be realized without the addition of complementary spectroscopy… ▽ More The orbits, atmospheric parameters, chemical abundances, and ages of individual stars in the Milky Way provide the most comprehensive illustration of galaxy formation available. The Tycho-Gaia Astrometric Solution (TGAS) will deliver astrometric parameters for the largest ever sample of Milky Way stars, though its full potential cannot be realized without the addition of complementary spectroscopy. Among existing spectroscopic surveys, the RAdial Velocity Experiment (RAVE) has the largest overlap with TGAS ($\gtrsim$200,000 stars). We present a data-driven re-analysis of 520,781 RAVE spectra using The Cannon. For red giants, we build our model using high-fidelity APOGEE stellar parameters and abundances for stars that overlap with RAVE. For main-sequence and sub-giant stars, our model uses stellar parameters from the K2/EPIC. We derive and validate effective temperature $T_{\rm eff}$, surface gravity $\log{g}$, and chemical abundances of up to seven elements (O, Mg, Al, Si, Ca, Fe, Ni). We report a total of 1,685,851 elemental abundances with a typical precision of 0.07 dex, a substantial improvement over previous RAVE data releases. The synthesis of RAVE-on and TGAS is the most powerful data set for chemo-dynamic analyses of the Milky Way ever produced. △ Less

Submitted 9 September, 2016; originally announced September 2016.

Comments: Derived labels, associated errors, and relevant metadata are available from the RAVE database (http://www.rave-survey.org) from 19 September 2016

arXiv:1608.02013 [pdf, other]

doi 10.3847/1538-4365/aa8992

The Thirteenth Data Release of the Sloan Digital Sky Survey: First Spectroscopic Data from the SDSS-IV Survey MAp** Nearby Galaxies at Apache Point Observatory

Authors: SDSS Collaboration, Franco D. Albareti, Carlos Allende Prieto, Andres Almeida, Friedrich Anders, Scott Anderson, Brett H. Andrews, Alfonso Aragon-Salamanca, Maria Argudo-Fernandez, Eric Armengaud, Eric Aubourg, Vladimir Avila-Reese, Carles Badenes, Stephen Bailey, Beatriz Barbuy, Kat Barger, Jorge Barrera-Ballesteros, Curtis Bartosz, Sarbani Basu, Dominic Bates, Giuseppina Battaglia, Falk Baumgarten, Julien Baur, Julian Bautista, Timothy C. Beers , et al. (314 additional authors not shown)

Abstract: The fourth generation of the Sloan Digital Sky Survey (SDSS-IV) began observations in July 2014. It pursues three core programs: APOGEE-2, MaNGA, and eBOSS. In addition, eBOSS contains two major subprograms: TDSS and SPIDERS. This paper describes the first data release from SDSS-IV, Data Release 13 (DR13), which contains new data, reanalysis of existing data sets and, like all SDSS data releases,… ▽ More The fourth generation of the Sloan Digital Sky Survey (SDSS-IV) began observations in July 2014. It pursues three core programs: APOGEE-2, MaNGA, and eBOSS. In addition, eBOSS contains two major subprograms: TDSS and SPIDERS. This paper describes the first data release from SDSS-IV, Data Release 13 (DR13), which contains new data, reanalysis of existing data sets and, like all SDSS data releases, is inclusive of previously released data. DR13 makes publicly available 1390 spatially resolved integral field unit observations of nearby galaxies from MaNGA, the first data released from this survey. It includes new observations from eBOSS, completing SEQUELS. In addition to targeting galaxies and quasars, SEQUELS also targeted variability-selected objects from TDSS and X-ray selected objects from SPIDERS. DR13 includes new reductions of the SDSS-III BOSS data, improving the spectrophotometric calibration and redshift classification. DR13 releases new reductions of the APOGEE-1 data from SDSS-III, with abundances of elements not previously included and improved stellar parameters for dwarf stars and cooler stars. For the SDSS imaging data, DR13 provides new, more robust and precise photometric calibrations. Several value-added catalogs are being released in tandem with DR13, in particular target catalogs relevant for eBOSS, TDSS, and SPIDERS, and an updated red-clump catalog for APOGEE. This paper describes the location and format of the data now publicly available, as well as providing references to the important technical papers that describe the targeting, observing, and data reduction. The SDSS website, http://www.sdss.org, provides links to the data, tutorials and examples of data access, and extensive documentation of the reduction and analysis procedures. DR13 is the first of a scheduled set that will contain new data and analyses from the planned ~6-year operations of SDSS-IV. △ Less

Submitted 25 September, 2017; v1 submitted 5 August, 2016; originally announced August 2016.

Comments: Full information on DR13 available at http://www.sdss.org. Comments welcome to [email protected]. To be published in ApJS

arXiv:1607.08237 [pdf, other]

doi 10.3847/0004-6256/152/6/206

The population of long-period transiting exoplanets

Authors: Daniel Foreman-Mackey, Timothy D. Morton, David W. Hogg, Eric Agol, Bernhard Schölkopf

Abstract: The Kepler Mission has discovered thousands of exoplanets and revolutionized our understanding of their population. This large, homogeneous catalog of discoveries has enabled rigorous studies of the occurrence rate of exoplanets and planetary systems as a function of their physical properties. However, transit surveys like Kepler are most sensitive to planets with orbital periods much shorter than… ▽ More The Kepler Mission has discovered thousands of exoplanets and revolutionized our understanding of their population. This large, homogeneous catalog of discoveries has enabled rigorous studies of the occurrence rate of exoplanets and planetary systems as a function of their physical properties. However, transit surveys like Kepler are most sensitive to planets with orbital periods much shorter than the orbital periods of Jupiter and Saturn, the most massive planets in our Solar System. To address this deficiency, we perform a fully automated search for long-period exoplanets with only one or two transits in the archival Kepler light curves. When applied to the $\sim 40,000$ brightest Sun-like target stars, this search produces 16 long-period exoplanet candidates. Of these candidates, 6 are novel discoveries and 5 are in systems with inner short-period transiting planets. Since our method involves no human intervention, we empirically characterize the detection efficiency of our search. Based on these results, we measure the average occurrence rate of exoplanets smaller than Jupiter with orbital periods in the range 2-25 years to be $2.0\pm0.7$ planets per Sun-like star. △ Less

Submitted 6 October, 2016; v1 submitted 27 July, 2016; originally announced July 2016.

Comments: Accepted for publication in AJ

arXiv:1607.01782 [pdf, other]

doi 10.1093/mnras/stx894

Approximate Bayesian Computation in Large Scale Structure: constraining the galaxy-halo connection

Authors: ChangHoon Hahn, Mohammadjavad Vakili, Kilian Walsh, Andrew P. Hearin, David W. Hogg, Duncan Campbell

Abstract: Standard approaches to Bayesian parameter inference in large scale structure assume a Gaussian functional form (chi-squared form) for the likelihood. This assumption, in detail, cannot be correct. Likelihood free inferences such as Approximate Bayesian Computation (ABC) relax these restrictions and make inference possible without making any assumptions on the likelihood. Instead ABC relies on a fo… ▽ More Standard approaches to Bayesian parameter inference in large scale structure assume a Gaussian functional form (chi-squared form) for the likelihood. This assumption, in detail, cannot be correct. Likelihood free inferences such as Approximate Bayesian Computation (ABC) relax these restrictions and make inference possible without making any assumptions on the likelihood. Instead ABC relies on a forward generative model of the data and a metric for measuring the distance between the model and data. In this work, we demonstrate that ABC is feasible for LSS parameter inference by using it to constrain parameters of the halo occupation distribution (HOD) model for populating dark matter halos with galaxies. Using specific implementation of ABC supplemented with Population Monte Carlo importance sampling, a generative forward model using HOD, and a distance metric based on galaxy number density, two-point correlation function, and galaxy group multiplicity function, we constrain the HOD parameters of mock observation generated from selected "true" HOD parameters. The parameter constraints we obtain from ABC are consistent with the "true" HOD parameters, demonstrating that ABC can be reliably used for parameter inference in LSS. Furthermore, we compare our ABC constraints to constraints we obtain using a pseudo-likelihood function of Gaussian form with MCMC and find consistent HOD parameter constraints. Ultimately our results suggest that ABC can and should be applied in parameter inference for LSS analyses. △ Less

Submitted 10 April, 2017; v1 submitted 6 July, 2016; originally announced July 2016.

Comments: 16 pages, 10 figures

arXiv:1606.06182 [pdf, other]

doi 10.3847/0004-637X/826/2/104

The Panchromatic Hubble Andromeda Treasury XV. The BEAST: Bayesian Extinction and Stellar Tool

Authors: Karl D. Gordon, Morgan Fouesneau, Heddy Arab, Kirill Tchernyshyov, Daniel R. Weisz, Julianne J. Dalcanton, Benjamin F. Williams, Eric F. Bell, Luciana Bianchi, Martha Boyer, Yumi Choi, Andrew Dolphin, Leo Girardi, David W. Hogg, Jason S. Kalirai, Maria Kapala, Alexia R. Lewis, Hans-Walter Rix, Karin Sandstrom, Evan D. Skillman

Abstract: We present the Bayesian Extinction And Stellar Tool (BEAST), a probabilistic approach to modeling the dust extinguished photometric spectral energy distribution of an individual star while accounting for observational uncertainties common to large resolved star surveys. Given a set of photometric measurements and an observational uncertainty model, the BEAST infers the physical properties of the s… ▽ More We present the Bayesian Extinction And Stellar Tool (BEAST), a probabilistic approach to modeling the dust extinguished photometric spectral energy distribution of an individual star while accounting for observational uncertainties common to large resolved star surveys. Given a set of photometric measurements and an observational uncertainty model, the BEAST infers the physical properties of the stellar source using stellar evolution and atmosphere models and constrains the line of sight extinction using a newly developed mixture model that encompasses the full range of dust extinction curves seen in the Local Group. The BEAST is specifically formulated for use with large multi-band surveys of resolved stellar populations. Our approach accounts for measurement uncertainties and any covariance between them due to stellar crowding (both systematic biases and uncertainties in the bias) and absolute flux calibration, thereby incorporating the full information content of the measurement. We illustrate the accuracy and precision possible with the BEAST using data from the Panchromatic Hubble Andromeda Treasury. While the BEAST has been developed for this survey, it can be easily applied to similar existing and planned resolved star surveys. △ Less

Submitted 20 June, 2016; originally announced June 2016.

Comments: 20 pages, 19 figures, ApJ, in press

arXiv:1606.05648 [pdf, other]

doi 10.3847/1538-4357/833/1/98

AGNfitter: A Bayesian MCMC approach to fitting spectral energy distributions of AGN

Authors: Gabriela Calistro Rivera, Elisabeta Lusso, Joseph F. Hennawi, David W. Hogg

Abstract: We present AGNfitter, a publicly available open-source algorithm implementing a fully Bayesian Markov Chain Monte Carlo method to fit the spectral energy distributions (SEDs) of active galactic nuclei (AGN) from the sub-mm to the UV, allowing one to robustly disentangle the physical processes responsible for their emission. AGNfitter makes use of a large library of theoretical, empirical, and semi… ▽ More We present AGNfitter, a publicly available open-source algorithm implementing a fully Bayesian Markov Chain Monte Carlo method to fit the spectral energy distributions (SEDs) of active galactic nuclei (AGN) from the sub-mm to the UV, allowing one to robustly disentangle the physical processes responsible for their emission. AGNfitter makes use of a large library of theoretical, empirical, and semi-empirical models to characterize both the nuclear and host galaxy emission simultaneously. The model consists of four physical emission components: an accretion disk, a torus of AGN heated dust, stellar populations, and cold dust in star forming regions. AGNfitter determines the posterior distributions of numerous parameters that govern the physics of AGN with a fully Bayesian treatment of errors and parameter degeneracies, allowing one to infer integrated luminosities, dust attenuation parameters, stellar masses, and star formation rates. We tested AGNfitter's performace on real data by fitting the SEDs of a sample of 714 X-ray selected AGN from the XMM-COSMOS survey, spectroscopically classified as Type1 (unobscured) and Type2 (obscured) AGN by their optical-UV emission lines. We find that two independent model parameters, namely the reddening of the accretion disk and the column density of the dusty torus, are good proxies for AGN obscuration, allowing us to develop a strategy for classifying AGN as Type1 or Type2, based solely on an SED-fitting analysis. Our classification scheme is in excellent agreement with the spectroscopic classification, giving a completeness fraction of $\sim 86\%$ and $\sim 70\%$, and an efficiency of $\sim 80\%$ and $\sim 77\%$, for Type1 and Type2 AGNs, respectively. △ Less

Submitted 17 June, 2016; originally announced June 2016.

Comments: 21 pages, 10 figures, submitted to the ApJ. The AGNfitter python code is publicly available at https://github.com/GabrielaCR/AGNfitter

arXiv:1603.06574 [pdf, other]

doi 10.3847/2041-8205/826/2/L25

Constructing Polynomial Spectral Models for Stars

Authors: Hans-Walter Rix, Yuan-Sen Ting, Charlie Conroy, David W. Hogg

Abstract: Stellar spectra depend on the stellar parameters and on dozens of photospheric elemental abundances. Simultaneous fitting of these $\mathcal{N}\sim 10-40$ model labels to observed spectra has been deemed unfeasible, because the number of ab initio spectral model grid calculations scales exponentially with $\mathcal{N}$. We suggest instead the construction of a polynomial spectral model (PSM) of or… ▽ More Stellar spectra depend on the stellar parameters and on dozens of photospheric elemental abundances. Simultaneous fitting of these $\mathcal{N}\sim 10-40$ model labels to observed spectra has been deemed unfeasible, because the number of ab initio spectral model grid calculations scales exponentially with $\mathcal{N}$. We suggest instead the construction of a polynomial spectral model (PSM) of order $\mathcal{O}$ for the model flux at each wavelength. Building this approximation requires a minimum of only ${\mathcal{N}+\mathcal{O}\choose\mathcal{O}}$ calculations: e.g. a quadratic spectral model ($\mathcal{O}=2$) to fit $\mathcal{N}=20$ labels simultaneously, can be constructed from as few as $231$ ab initio spectral model calculations; in practice, a somewhat larger number ($\sim 300-1000$) of randomly chosen models lead to a better performing PSM. Such a PSM can be a good approximation only over a portion of label space, which will vary case by case. Yet, taking the APOGEE survey as an example, a single quadratic PSM provides a remarkably good approximation to the exact ab initio spectral models across much of this survey: for random labels within that survey the PSM approximates the flux to within $10^{-3}$, and recovers the abundances to within $\sim 0.02$ dex rms of the exact models. This enormous speed-up enables the simultaneous many-label fitting of spectra with computationally expensive ab initio models for stellar spectra, such as non-LTE models. A PSM also enables the simultaneous fitting of observational parameters, such as the spectrum's continuum or line-spread function. △ Less

Submitted 16 May, 2016; v1 submitted 21 March, 2016; originally announced March 2016.

Comments: 4 pages, 2 figures, ApJL (Accepted for publication- 2016 May 9)

Journal ref: ApJ, 826, L25 (2016)

arXiv:1603.03040 [pdf, other]

The Cannon 2: A data-driven model of stellar spectra for detailed chemical abundance analyses

Authors: Andrew R. Casey, David W. Hogg, Melissa Ness, Hans-Walter Rix, Anna Q Y Ho, Gerry Gilmore

Abstract: We have shown that data-driven models are effective for inferring physical attributes of stars (labels; Teff, logg, [M/H]) from spectra, even when the signal-to-noise ratio is low. Here we explore whether this is possible when the dimensionality of the label space is large (Teff, logg, and 15 abundances: C, N, O, Na, Mg, Al, Si, S, K, Ca, Ti, V, Mn, Fe, Ni) and the model is non-linear in its respo… ▽ More We have shown that data-driven models are effective for inferring physical attributes of stars (labels; Teff, logg, [M/H]) from spectra, even when the signal-to-noise ratio is low. Here we explore whether this is possible when the dimensionality of the label space is large (Teff, logg, and 15 abundances: C, N, O, Na, Mg, Al, Si, S, K, Ca, Ti, V, Mn, Fe, Ni) and the model is non-linear in its response to abundance and parameter changes. We adopt ideas from compressed sensing to limit overall model complexity while retaining model freedom. The model is trained with a set of 12,681 red-giant stars with high signal-to-noise spectroscopic observations and stellar parameters and abundances taken from the APOGEE Survey. We find that we can successfully train and use a model with 17 stellar labels. Validation shows that the model does a good job of inferring all 17 labels (typical abundance precision is 0.04 dex), even when we degrade the signal-to-noise by discarding ~50% of the observing time. The model dependencies make sense: the spectral derivatives with respect to abundances correlate with known atomic lines, and we identify elements belonging to atomic lines that were previously unknown. We recover (anti-)correlations in abundance labels for globular cluster stars, consistent with the literature. However we find the intrinsic spread in globular cluster abundances is 3--4 times smaller than previously reported. We deliver 17 labels with associated errors for 87,563 red giant stars, as well as open-source code to extend this work to other spectroscopic surveys. △ Less

Submitted 9 March, 2016; originally announced March 2016.

Comments: Submitted to AAS (ApJ)

arXiv:1602.09010 [pdf, other]

doi 10.1088/1475-7516/2016/11/060

A 14 $h^{-3}$ Gpc$^3$ study of cosmic homogeneity using BOSS DR12 quasar sample

Authors: Pierre Laurent, Jean-Marc Le Goff, Etienne Burtin, Jean-Christophe Hamilton, David W. Hogg, Adam Myers, Pierros Ntelis, Isabelle Pâris, James Rich, Eric Aubourg, Julian Bautista, Timothée Delubac, Hélion du Mas des Bourboux, Sarah Eftekharzadeh, Nathalie Palanque Delabrouille, Patrick Petitjean, Graziano Rossi, Donald P. Schneider, Christophe Yeche

Abstract: The BOSS quasar sample is used to study cosmic homogeneity with a 3D survey in the redshift range $2.2<z<2.8$. We measure the count-in-sphere, $N(<\! r)$, i.e. the average number of objects around a given object, and its logarithmic derivative, the fractal correlation dimension, $D_2(r)$. For a homogeneous distribution $N(<\! r) \propto r^3$ and $D_2(r)=3$. Due to the uncertainty on tracer density… ▽ More The BOSS quasar sample is used to study cosmic homogeneity with a 3D survey in the redshift range $2.2<z<2.8$. We measure the count-in-sphere, $N(<\! r)$, i.e. the average number of objects around a given object, and its logarithmic derivative, the fractal correlation dimension, $D_2(r)$. For a homogeneous distribution $N(<\! r) \propto r^3$ and $D_2(r)=3$. Due to the uncertainty on tracer density evolution, 3D surveys can only probe homogeneity up to a redshift dependence, i.e. they probe so-called "spatial isotropy". Our data demonstrate spatial isotropy of the quasar distribution in the redshift range $2.2<z<2.8$ in a model-independent way, independent of any FLRW fiducial cosmology, resulting in $3-\langle D_2 \rangle < 1.7 \times 10^{-3}$ (2 $σ$) over the range $250<r<1200 \, h^{-1}$Mpc for the quasar distribution. If we assume that quasars do not have a bias much less than unity, this implies spatial isotropy of the matter distribution on large scales. Then, combining with the Copernican principle, we finally get homogeneity of the matter distribution on large scales. Alternatively, using a flat $Λ$CDM fiducial cosmology with CMB-derived parameters, and measuring the quasar bias relative to this $Λ$CDM model, our data provide a consistency check of the model, in terms of how homogeneous the Universe is on different scales. $D_2(r)$ is found to be compatible with our $Λ$CDM model on the whole $10<r<1200 \, h^{-1}$Mpc range. For the matter distribution we obtain $3-\langle D_2 \rangle < 5 \times 10^{-5}$ (2 $σ$) over the range $250<r<1200 \, h^{-1}$Mpc, consistent with homogeneity on large scales. △ Less

Submitted 21 November, 2016; v1 submitted 29 February, 2016; originally announced February 2016.

Comments: version accepted for publication by JCAP

arXiv:1602.07939 [pdf, other]

doi 10.1088/1538-3873/128/964/066001

State of the Field: Extreme Precision Radial Velocities

Authors: Debra Fischer, Guillem Anglada-Escude, Pamela Arriagada, Roman V. Baluev, Jacob L. Bean, Francois Bouchy, Lars A. Buchhave, Thorsten Carroll, Abhijit Chakraborty, Justin R. Crepp, Rebekah I. Dawson, Scott A. Diddams, Xavier Dumusque, Jason D. Eastman, Michael Endl, Pedro Figueira, Eric B. Ford, Daniel Foreman-Mackey, Paul Fournier, Gabor Furesz, B. Scott Gaudi, Philip C. Gregory, Frank Grundahl, Artie P. Hatzes, Guillaume Hebrard , et al. (31 additional authors not shown)

Abstract: The Second Workshop on Extreme Precision Radial Velocities defined circa 2015 the state of the art Doppler precision and identified the critical path challenges for reaching 10 cm/s measurement precision. The presentations and discussion of key issues for instrumentation and data analysis and the workshop recommendations for achieving this precision are summarized here. Beginning with the HARPS… ▽ More The Second Workshop on Extreme Precision Radial Velocities defined circa 2015 the state of the art Doppler precision and identified the critical path challenges for reaching 10 cm/s measurement precision. The presentations and discussion of key issues for instrumentation and data analysis and the workshop recommendations for achieving this precision are summarized here. Beginning with the HARPS spectrograph, technological advances for precision radial velocity measurements have focused on building extremely stable instruments. To reach still higher precision, future spectrometers will need to produce even higher fidelity spectra. This should be possible with improved environmental control, greater stability in the illumination of the spectrometer optics, better detectors, more precise wavelength calibration, and broader bandwidth spectra. Key data analysis challenges for the precision radial velocity community include distinguishing center of mass Keplerian motion from photospheric velocities, and the proper treatment of telluric contamination. Success here is coupled to the instrument design, but also requires the implementation of robust statistical and modeling techniques. Center of mass velocities produce Doppler shifts that affect every line identically, while photospheric velocities produce line profile asymmetries with wavelength and temporal dependencies that are different from Keplerian signals. Exoplanets are an important subfield of astronomy and there has been an impressive rate of discovery over the past two decades. Higher precision radial velocity measurements are required to serve as a discovery technique for potentially habitable worlds and to characterize detections from transit missions. The future of exoplanet science has very different trajectories depending on the precision that can ultimately be achieved with Doppler measurements. △ Less

Submitted 27 February, 2016; v1 submitted 25 February, 2016; originally announced February 2016.

Comments: 45 pages, 23 Figures, workshop summary proceedings

arXiv:1602.00303 [pdf, other]

doi 10.3847/1538-4357/836/1/5

Label Transfer from APOGEE to LAMOST: Precise Stellar Parameters for 450,000 LAMOST Giants

Authors: Anna Y. Q. Ho, Melissa K. Ness, David W. Hogg, Hans-Walter Rix, Chao Liu, Fan Yang, Yong Zhang, Yonghui Hou, Yuefei Wang

Abstract: In this era of large-scale stellar spectroscopic surveys, measurements of stellar attributes ("labels," i.e. parameters and abundances) must be made precise and consistent across surveys. Here, we demonstrate that this can be achieved by a data-driven approach to spectral modeling. With The Cannon, we transfer information from the APOGEE survey to determine precise Teff, log g, [Fe/H], and [$α$/M]… ▽ More In this era of large-scale stellar spectroscopic surveys, measurements of stellar attributes ("labels," i.e. parameters and abundances) must be made precise and consistent across surveys. Here, we demonstrate that this can be achieved by a data-driven approach to spectral modeling. With The Cannon, we transfer information from the APOGEE survey to determine precise Teff, log g, [Fe/H], and [$α$/M] from the spectra of 450,000 LAMOST giants. The Cannon fits a predictive model for LAMOST spectra using 9952 stars observed in common between the two surveys, taking five labels from APOGEE DR12 as ground truth: Teff, log g, [Fe/H], [α/M], and K-band extinction $A_k$. The model is then used to infer Teff, log g, [Fe/H], and [$α$/M] for 454,180 giants, 20% of the LAMOST DR2 stellar sample. These are the first [$α$/M] values for the full set of LAMOST giants, and the largest catalog of [$α$/M] for giant stars to date. Furthermore, these labels are by construction on the APOGEE label scale; for spectra with S/N > 50, cross-validation of the model yields typical uncertainties of 70K in Teff, 0.1 in log g, 0.1 in [Fe/H], and 0.04 in [$α$/M], values comparable to the broadly stated, conservative APOGEE DR12 uncertainties. Thus, by using "label transfer" to tie low-resolution (LAMOST R $\sim$ 1800) spectra to the label scale of a much higher-resolution (APOGEE R $\sim$ 22,500) survey, we substantially reduce the inconsistencies between labels measured by the individual survey pipelines. This demonstrates that label transfer with The Cannon can successfully bring different surveys onto the same physical scale. △ Less

Submitted 13 January, 2017; v1 submitted 31 January, 2016; originally announced February 2016.

Comments: 27 pages, 14 figures. Accepted by ApJ on 16 Dec 2016, implementing suggestions from the referee reports. Associated code available at https://github.com/annayqho/TheCannon

arXiv:1601.05413 [pdf, other]

doi 10.3847/1538-4357/833/2/262

Chemical tagging can work: Identification of stellar phase-space structures purely by chemical-abundance similarity

Authors: David W. Hogg, Andrew R. Casey, Melissa Ness, Hans-Walter Rix, Daniel Foreman-Mackey, Sten Hasselquist, Anna Y. Q. Ho, Jon A. Holtzman, Steven R. Majewski, Sarah L. Martell, Szabolcs Meszaros, David L. NIdever, Matthew Shetrone

Abstract: Chemical tagging promises to use detailed abundance measurements to identify spatially separated stars that were in fact born together (in the same molecular cloud), long ago. This idea has not yielded much practical success, presumably because of the noise and incompleteness in chemical-abundance measurements. We have succeeded in substantially improving spectroscopic measurements with The Cannon… ▽ More Chemical tagging promises to use detailed abundance measurements to identify spatially separated stars that were in fact born together (in the same molecular cloud), long ago. This idea has not yielded much practical success, presumably because of the noise and incompleteness in chemical-abundance measurements. We have succeeded in substantially improving spectroscopic measurements with The Cannon, which has now delivered 15 individual abundances for ~100,000 stars observed as part of the APOGEE spectroscopic survey, with precisions around 0.04 dex. We test the chemical-tagging hypothesis by looking at clusters in abundance space and confirming that they are clustered in phase space. We identify (by the k-means algorithm) overdensities of stars in the 15-dimensional chemical-abundance space delivered by The Cannon, and plot the associated stars in phase space. We use only abundance-space information (no positional information) to identify stellar groups. We find that clusters in abundance space are indeed clusters in phase space. We recover some known phase-space clusters and find other interesting structures. This is the first-ever project to identify phase-space structures at survey-scale by blind search purely in abundance space; it verifies the precision of the abundance measurements delivered by The Cannon; the prospects for future data sets appear very good. △ Less

Submitted 25 August, 2016; v1 submitted 20 January, 2016; originally announced January 2016.

Comments: accepted for publication in the ApJ

arXiv:1601.00266 [pdf, ps, other]

Detecting Diffuse Sources in Astronomical Images

Authors: T. Butler-Yeoman, M. Frean, C. P. Hollitt, D. W. Hogg, M. Johnston-Hollitt

Abstract: We present an algorithm capable of detecting diffuse, dim sources of any size in an astronomical image. These sources often defeat traditional methods for source finding, which expand regions around points of high intensity. Extended sources often have no bright points and are only detectable when viewed as a whole, so a more sophisticated approach is required. Our algorithm operates at all scales… ▽ More We present an algorithm capable of detecting diffuse, dim sources of any size in an astronomical image. These sources often defeat traditional methods for source finding, which expand regions around points of high intensity. Extended sources often have no bright points and are only detectable when viewed as a whole, so a more sophisticated approach is required. Our algorithm operates at all scales simultaneously by considering a tree of nested candidate bounding boxes, and inverts a hierarchical Bayesian generative model to obtain the probability of sources existing at given locations and sizes. This model naturally accommodates the detection of nested sources, and no prior knowledge of the distribution of a source, or even the background, is required. The algorithm scales nearly linear with the number of pixels making it feasible to run on large images, and requires minimal parameter tweaking to be effective. We demonstrate the algorithm on several types of astronomical and artificial images. △ Less

Submitted 3 January, 2016; originally announced January 2016.

Comments: 4 pages, 10 figures, in press in ADASS XXV, edited by N. P. F. Lorente, & K. Shortridge (San Francisco: ASP), ASP Conf. Series

arXiv:1512.09142 [pdf, other]

doi 10.1088/1538-3873/128/970/124401

Campaign 9 of the $K2$ Mission: Observational Parameters, Scientific Drivers, and Community Involvement for a Simultaneous Space- and Ground-based Microlensing Survey

Authors: Calen B. Henderson, Radosław Poleski, Matthew Penny, Rachel A. Street, David P. Bennett, David W. Hogg, B. Scott Gaudi, W. Zhu, T. Barclay, G. Barentsen, S. B. Howell, F. Mullally, A. Udalski, M. K. Szymański, J. Skowron, P. Mróz, S. Kozłowski, Ł. Wyrzykowski, P. Pietrukowicz, I. Soszyński, K. Ulaczyk, M. Pawlak, T. Sumi, F. Abe, Y. Asakura , et al. (96 additional authors not shown)

Abstract: $K2$'s Campaign 9 ($K2$C9) will conduct a $\sim$3.7 deg$^{2}$ survey toward the Galactic bulge from 7/April through 1/July of 2016 that will leverage the spatial separation between $K2$ and the Earth to facilitate measurement of the microlens parallax $π_{\rm E}$ for $\gtrsim… ▽ More $K2$'s Campaign 9 ($K2$C9) will conduct a $\sim$3.7 deg$^{2}$ survey toward the Galactic bulge from 7/April through 1/July of 2016 that will leverage the spatial separation between $K2$ and the Earth to facilitate measurement of the microlens parallax $π_{\rm E}$ for $\gtrsim$127 microlensing events. These will include several that are planetary in nature as well as many short-timescale microlensing events, which are potentially indicative of free-floating planets (FFPs). These satellite parallax measurements will in turn allow for the direct measurement of the masses of and distances to the lensing systems. In this white paper we provide an overview of the $K2$C9 space- and ground-based microlensing survey. Specifically, we detail the demographic questions that can be addressed by this program, including the frequency of FFPs and the Galactic distribution of exoplanets, the observational parameters of $K2$C9, and the array of resources dedicated to concurrent observations. Finally, we outline the avenues through which the larger community can become involved, and generally encourage participation in $K2$C9, which constitutes an important pathfinding mission and community exercise in anticipation of $WFIRST$. △ Less

Submitted 7 March, 2016; v1 submitted 30 December, 2015; originally announced December 2015.

Comments: 19 pages, 11 figures, 3 tables; submitted to PASP

arXiv:1511.08204 [pdf, other]

doi 10.3847/0004-637X/823/2/114

Spectroscopic determination of masses (and implied ages) for red giants

Authors: M. Ness, David W. Hogg, H-W. Rix, M. Martig, Marc H. Pinsonneault, A. Y. Q Ho

Abstract: The mass of a star is arguably its most fundamental parameter. For red giant stars, tracers luminous enough to be observed across the Galaxy, mass implies a stellar evolution age. It has proven to be extremely difficult to infer ages and masses directly from red giant spectra using existing methods. From the KEPLER and APOGEE surveys, samples of several thousand stars exist with high-quality spect… ▽ More The mass of a star is arguably its most fundamental parameter. For red giant stars, tracers luminous enough to be observed across the Galaxy, mass implies a stellar evolution age. It has proven to be extremely difficult to infer ages and masses directly from red giant spectra using existing methods. From the KEPLER and APOGEE surveys, samples of several thousand stars exist with high-quality spectra and asteroseismic masses. Here we show that from these data we can build a data-driven spectral model using The Cannon, which can determine stellar masses to $\sim$ 0.07 dex from APOGEE DR12 spectra of red giants; these imply age estimates accurate to $\sim$ 0.2 dex (40 percent). We show that The Cannon constrains these ages foremost from spectral regions with CN absorption lines, elements whose surface abundances reflect mass-dependent dredge-up. We deliver an unprecedented catalog of 80,000 giants (including 20,000 red-clump stars) with mass and age estimates, spanning the entire disk (from the Galactic center to R $\sim$ 20 kpc). We show that the age information in the spectra is not simply a corollary of the birth-material abundances [Fe/H] and [$α$/Fe], and that even within a mono-abundance population of stars, there are age variations that vary sensibly with Galactic position. Such stellar age constraints across the Milky Way open up new avenues in Galactic archeology. △ Less

Submitted 25 November, 2015; originally announced November 2015.

Comments: Submitted to ApJ 13 October 2015

arXiv:1511.05527 [pdf, other]

doi 10.3847/0004-637X/817/1/73

Finding, characterizing and classifying variable sources in multi-epoch sky surveys: QSOs and RR Lyrae in PS1 3$π$ data

Authors: Nina Hernitschek, Edward F. Schlafly, Branimir Sesar, Hans-Walter Rix, David W. Hogg, Zeljko Ivezic, Eva K. Grebel, Eric F. Bell, Nicolas F. Martin, W. S. Burgett, H. Flewelling, K. W. Hodapp, N. Kaiser, E. A. Magnier, N. Metcalfe, R. J. Wainscoat, C. Waters

Abstract: In area and depth, the Pan-STARRS1 (PS1) 3$π$ survey is unique among many-epoch, multi-band surveys and has enormous potential for all-sky identification of variable sources. PS1 has observed the sky typically seven times in each of its five bands ($grizy$) over 3.5 years, but unlike SDSS not simultaneously across the bands. Here we develop a new approach for quantifying statistical properties of… ▽ More In area and depth, the Pan-STARRS1 (PS1) 3$π$ survey is unique among many-epoch, multi-band surveys and has enormous potential for all-sky identification of variable sources. PS1 has observed the sky typically seven times in each of its five bands ($grizy$) over 3.5 years, but unlike SDSS not simultaneously across the bands. Here we develop a new approach for quantifying statistical properties of non-simultaneous, sparse, multi-color lightcurves through light-curve structure functions, effectively turning PS1 into a $\sim 35$-epoch survey. We use this approach to estimate variability amplitudes and timescales $(ω_r, τ)$ for all point-sources brighter than $r_{\mathrm{P1}}=21.5$ mag in the survey. With PS1 data on SDSS Stripe 82 as ``ground truth", we use a Random Forest Classifier to identify QSOs and RR Lyrae based on their variability and their mean PS1 and WISE colors. We find that, aside from the Galactic plane, QSO and RR Lyrae samples of purity $\sim$75\% and completeness $\sim$92\% can be selected. On this basis we have identified a sample of $\sim 1,000,000$ QSO candidates, as well as an unprecedentedly large and deep sample of $\sim$150,000 RR Lyrae candidates with distances from $\sim$10 kpc to $\sim$120 kpc. Within the Draco dwarf spheroidal, we demonstrate a distance precision of 6\% for RR Lyrae candidates. We provide a catalog of all likely variable point sources and likely QSOs in PS1, a total of $25.8\times 10^6$ sources. △ Less

Submitted 17 November, 2015; originally announced November 2015.

arXiv:1511.01496 [pdf, ps, other]

doi 10.3847/0004-6256/151/1/8

SDSS-IV/MaNGA: Spectrophotometric Calibration Technique

Authors: Renbin Yan, Christy Tremonti, Matthew A. Bershady, David R. Law, David J. Schlegel, Kevin Bundy, Niv Drory, Nicholas MacDonald, Dmitry Bizyaev, Guillermo A. Blanc, Michael R. Blanton, Brian Cherinka, Arthur Eigenbrot, James E. Gunn, Paul Harding, David W. Hogg, José R. Sánchez-Gallego, Sebastian F. Sánchez, David A. Wake, Anne-Marie Weijmans, Ting Xiao, Kai Zhang

Abstract: Map** Nearby Galaxies at Apache Point Observatory (MaNGA), one of three core programs in the Sloan Digital Sky Survey-IV (SDSS-IV), is an integral-field spectroscopic (IFS) survey of roughly 10,000 nearby galaxies. It employs dithered observations using 17 hexagonal bundles of 2 arcsec fibers to obtain resolved spectroscopy over a wide wavelength range of 3,600-10,300A. To map the internal varia… ▽ More Map** Nearby Galaxies at Apache Point Observatory (MaNGA), one of three core programs in the Sloan Digital Sky Survey-IV (SDSS-IV), is an integral-field spectroscopic (IFS) survey of roughly 10,000 nearby galaxies. It employs dithered observations using 17 hexagonal bundles of 2 arcsec fibers to obtain resolved spectroscopy over a wide wavelength range of 3,600-10,300A. To map the internal variations within each galaxy, we need to perform accurate {\it spectral surface photometry}, which is to calibrate the specific intensity at every spatial location sampled by each individual aperture element of the integral field unit. The calibration must correct only for the flux loss due to atmospheric throughput and the instrument response, but not for losses due to the finite geometry of the fiber aperture. This requires the use of standard star measurements to strictly separate these two flux loss factors (throughput versus geometry), a difficult challenge with standard single-fiber spectroscopy techniques due to various practical limitations. Therefore, we developed a technique for spectral surface photometry using multiple small fiber-bundles targeting standard stars simultaneously with galaxy observations. We discuss the principles of our approach and how they compare to previous efforts, and we demonstrate the precision and accuracy achieved. MaNGA's relative calibration between the wavelengths of H$α$ and H$β$ has a root-mean-square (RMS) of 1.7%, while that between [NII] $λ$6583A and [OII] $λ$3727A has an RMS of 4.7%. Using extinction-corrected star formation rates and gas-phase metallicities as an illustration, this level of precision guarantees that flux calibration errors will be sub-dominant when estimating these quantities. The absolute calibration is better than 5% for more than 89% of MaNGA's wavelength range. △ Less

Submitted 4 November, 2015; originally announced November 2015.

Comments: 19 pages, 9 figures. Accepted for publication in AJ

arXiv:1509.09304 [pdf, other]

doi 10.1017/S1743921315006742

Globular Cluster Streams as Galactic High-Precision Scales

Authors: A. H. W. Küpper, E. Balbinot, A. Bonaca, K. V. Johnston, D. W. Hogg, P. Kroupa, B. X. Santiago

Abstract: Tidal streams of globular clusters are ideal tracers of the Galactic gravitational potential. Compared to the few known, complex and diffuse dwarf-galaxy streams, they are kinematically cold, have thin morphologies and are abundant in the halo of the Milky Way. Their coldness and thinness in combination with potential epicyclic substructure in the vicinity of the stream progenitor turns them into… ▽ More Tidal streams of globular clusters are ideal tracers of the Galactic gravitational potential. Compared to the few known, complex and diffuse dwarf-galaxy streams, they are kinematically cold, have thin morphologies and are abundant in the halo of the Milky Way. Their coldness and thinness in combination with potential epicyclic substructure in the vicinity of the stream progenitor turns them into high-precision scales. With the example of Palomar 5, we demonstrate how modeling of a globular cluster stream allows us to simultaneously measure the properties of the disrupting globular cluster, its orbital motion, and the gravitational potential of the Milky Way. △ Less

Submitted 30 September, 2015; v1 submitted 30 September, 2015; originally announced September 2015.

Comments: Proceedings of IAUS317 - "The General Assembly of Galaxy Halos: Structure, Origin and Evolution", 5 pages, 2 figures, replaced eps figures causing problems with pngs

arXiv:1509.06988 [pdf, other]

doi 10.1088/0004-637X/814/1/3

The Panchromatic Hubble Andromeda Treasury VIII: A Wide-Area, High-Resolution Map of Dust Extinction in M31

Authors: Julianne J. Dalcanton, Morgan Fouesneau, David W. Hogg, Dustin Lang, Adam K. Leroy, Karl D. Gordon, Karin Sandstrom, Daniel R. Weisz, Benjamin F. Williams, Eric F. Bell, Hui Dong, Karoline M. Gilbert, Dimitrious A. Gouliermis, Puragra Guhathakurta, Tod R. Lauer, Andreas Schruba, Anil C. Seth, Evan D. Skillman

Abstract: We map the distribution of dust in M31 at 25pc resolution, using stellar photometry from the Panchromatic Hubble Andromeda Treasury. We develop a new map** technique that models the NIR color-magnitude diagram (CMD) of red giant branch (RGB) stars. The model CMDs combine an unreddened foreground of RGB stars with a reddened background population viewed through a log-normal column density distrib… ▽ More We map the distribution of dust in M31 at 25pc resolution, using stellar photometry from the Panchromatic Hubble Andromeda Treasury. We develop a new map** technique that models the NIR color-magnitude diagram (CMD) of red giant branch (RGB) stars. The model CMDs combine an unreddened foreground of RGB stars with a reddened background population viewed through a log-normal column density distribution of dust. Fits to the model constrain the median extinction, the width of the extinction distribution, and the fraction of reddened stars. The resulting extinction map has >4 times better resolution than maps of dust emission, while providing a more direct measurement of the dust column. There is superb morphological agreement between the new map and maps of the extinction inferred from dust emission by Draine et al. 2014. However, the widely-used Draine & Li (2007) dust models overpredict the observed extinction by a factor of ~2.5, suggesting that M31's true dust mass is lower and that dust grains are significantly more emissive than assumed in Draine et al. (2014). The discrepancy we identify is consistent with similar findings in the Milky Way by the Planck Collaboration (2015), but has a more complex dependence on parameters from the Draine & Li (2007) dust models. We also show that the discrepancy with the Draine et al. (2014) map is lowest where the interstellar radiation field has a harder spectrum than average. We discuss possible improvements to the CMD dust map** technique, and explore further applications. △ Less

Submitted 22 September, 2015; originally announced September 2015.

Comments: 52 pages in ApJ format including 39 figures. Accepted to the Astrophysical Journal

arXiv:1508.01853 [pdf, other]

doi 10.1088/1538-3873/128/967/094503

A Causal, Data-Driven Approach to Modeling the Kepler Data

Authors: Dun Wang, David W. Hogg, Dan Foreman-Mackey, Bernhard Schölkopf

Abstract: Astronomical observations are affected by several kinds of noise, each with its own causal source; there is photon noise, stochastic source variability, and residuals coming from imperfect calibration of the detector or telescope. The precision of NASA Kepler photometry for exoplanet science---the most precise photometric measurements of stars ever made---appears to be limited by unknown or untrac… ▽ More Astronomical observations are affected by several kinds of noise, each with its own causal source; there is photon noise, stochastic source variability, and residuals coming from imperfect calibration of the detector or telescope. The precision of NASA Kepler photometry for exoplanet science---the most precise photometric measurements of stars ever made---appears to be limited by unknown or untracked variations in spacecraft pointing and temperature, and unmodeled stellar variability. Here we present the Causal Pixel Model (CPM) for Kepler data, a data-driven model intended to capture variability but preserve transit signals. The CPM works at the pixel level so that it can capture very fine-grained information about the variation of the spacecraft. The CPM predicts each target pixel value from a large number of pixels of other stars sharing the instrument variabilities while not containing any information on possible transits in the target star. In addition, we use the target star's future and past (auto-regression). By appropriately separating, for each data point, the data into training and test sets, we ensure that information about any transit will be perfectly isolated from the model. The method has four hyper-parameters (the number of predictor stars, the auto-regressive window size, and two L2-regularization amplitudes for model components), which we set by cross-validation. We determine a generic set of hyper-parameters that works well for most of the stars and apply the method to a corresponding set of target stars. We find that we can consistently outperform (for the purposes of exoplanet detection) the Kepler Pre-search Data Conditioning (PDC) method for exoplanet discovery. △ Less

Submitted 25 April, 2016; v1 submitted 8 August, 2015; originally announced August 2015.

Comments: Accepted for publication in the PASP

arXiv:1507.08662 [pdf, other]

doi 10.1093/mnras/stv2383

Chaotic Dispersal of Tidal Debris

Authors: Adrian M. Price-Whelan, Kathryn V. Johnston, Monica Valluri, Sarah Pearson, Andreas H. W. Kupper, David W. Hogg

Abstract: Several long, dynamically cold stellar streams have been observed around the Milky Way Galaxy, presumably formed from the tidal disruption of globular clusters. In integrable potentials---where all orbits are regular---tidal debris phase-mixes close to the orbit of the progenitor system. However, the Milky Way's dark matter halo is expected not to be fully integrable; an appreciable fraction of or… ▽ More Several long, dynamically cold stellar streams have been observed around the Milky Way Galaxy, presumably formed from the tidal disruption of globular clusters. In integrable potentials---where all orbits are regular---tidal debris phase-mixes close to the orbit of the progenitor system. However, the Milky Way's dark matter halo is expected not to be fully integrable; an appreciable fraction of orbits will be chaotic. This paper examines the influence of chaos on the phase-space morphology of cold tidal streams. Streams even in weakly chaotic regions look very different from those in regular regions. We find that streams can be sensitive to chaos on a much shorter time-scale than any standard prediction (from the Lyapunov or frequency-diffusion times). For example, on a weakly chaotic orbit with a chaotic timescale predicted to be $>$1000 orbital periods ($>$1000 Gyr), the resulting stellar stream is, after just a few 10's of orbits, substantially more diffuse than any formed on a nearby but regular orbit. We find that the enhanced diffusion of the stream stars can be understood by looking at the variance in orbital frequencies of orbit ensembles centered around the parent (progenitor) orbit. Our results suggest that long, cold streams around our Galaxy must exist only on regular (or very nearly regular) orbits; they potentially provide a map of the regular regions of the Milky Way potential. This suggests a promising new direction for the use of tidal streams to constrain the distribution of dark matter around our Galaxy. △ Less

Submitted 6 April, 2016; v1 submitted 30 July, 2015; originally announced July 2015.

Comments: 46 pages, 14 figures, publshed in MNRAS

arXiv:1505.03036 [pdf, other]

Removing systematic errors for exoplanet search via latent causes

Authors: Bernhard Schölkopf, David W. Hogg, Dun Wang, Daniel Foreman-Mackey, Dominik Janzing, Carl-Johann Simon-Gabriel, Jonas Peters

Abstract: We describe a method for removing the effect of confounders in order to reconstruct a latent quantity of interest. The method, referred to as half-sibling regression, is inspired by recent work in causal inference using additive noise models. We provide a theoretical justification and illustrate the potential of the method in a challenging astronomy application. We describe a method for removing the effect of confounders in order to reconstruct a latent quantity of interest. The method, referred to as half-sibling regression, is inspired by recent work in causal inference using additive noise models. We provide a theoretical justification and illustrate the potential of the method in a challenging astronomy application. △ Less

Submitted 12 May, 2015; originally announced May 2015.

Comments: Extended version of a paper appearing in the Proceedings of the 32nd International Conference on Machine Learning, Lille, France, 2015

ACM Class: G.3; I.2.6; J.2

arXiv:1503.07866 [pdf, other]

doi 10.1088/0004-637X/809/1/25

Stellar and Planetary Properties of K2 Campaign 1 Candidates and Validation of 17 Planets, Including a Planet Receiving Earth-like Insolation

Authors: Benjamin T. Montet, Timothy D. Morton, Daniel Foreman-Mackey, John Asher Johnson, David W. Hogg, Brendan P. Bowler, David W. Latham, Allyson Bieryla, Andrew W. Mann

Abstract: The extended Kepler mission, K2, is now providing photometry of new fields every three months in a search for transiting planets. In a recent study, Foreman-Mackey and collaborators presented a list of 36 planet candidates orbiting 31 stars in K2 Campaign 1. In this contribution, we present stellar and planetary properties for all systems. We combine ground-based seeing-limited survey data and ada… ▽ More The extended Kepler mission, K2, is now providing photometry of new fields every three months in a search for transiting planets. In a recent study, Foreman-Mackey and collaborators presented a list of 36 planet candidates orbiting 31 stars in K2 Campaign 1. In this contribution, we present stellar and planetary properties for all systems. We combine ground-based seeing-limited survey data and adaptive optics imaging with an automated transit analysis scheme to validate 21 candidates as planets, 17 for the first time, and identify 6 candidates as likely false positives. Of particular interest is K2-18 (EPIC 201912552), a bright (K=8.9) M2.8 dwarf hosting a 2.23 \pm 0.25 R_Earth planet with T_eq = 272 \pm 15 K and an orbital period of 33 days. We also present two new open-source software packages which enable this analysis. The first, isochrones, is a flexible tool for fitting theoretical stellar models to observational data to determine stellar properties using a nested sampling scheme to capture the multimodal nature of the posterior distributions of the physical parameters of stars that may plausibly be evolved. The second is vespa, a new general-purpose procedure to calculate false positive probabilities and statistically validate transiting exoplanets. △ Less

Submitted 14 September, 2015; v1 submitted 26 March, 2015; originally announced March 2015.

Comments: 17 pages, 5 figures, 5 tables, accepted for publication in the Astrophysical Journal. Updated to closely reflect published version in ApJ (2015, 809, 25)

Journal ref: Montet, B. T. et al. 2015, ApJ, 809, 25

arXiv:1502.06621 [pdf, other]

The High-Mass Stellar Initial Mass Function in M31 Clusters

Authors: Daniel R. Weisz, L. Clifton Johnson, Daniel Foreman-Mackey, Andrew E. Dolphin, Lori C. Beerman, Benjamin F. Williams, Julianne J. Dalcanton, Hans-Walter Rix, David W. Hogg, Morgan Fouesneau, Benjamin D. Johnson, Eric F. Bell, Martha L. Boyer, Dimitrios Gouliermis, Puragra Guhathakurta, Jason S. Kalirai, Alexia R. Lewis, Anil C. Seth, Evan D. Skillman

Abstract: We have undertaken the largest systematic study of the high-mass stellar initial mass function (IMF) to date using the optical color-magnitude diagrams (CMDs) of 85 resolved, young (4 Myr < t < 25 Myr), intermediate mass star clusters (10^3-10^4 Msun), observed as part of the Panchromatic Hubble Andromeda Treasury (PHAT) program. We fit each cluster's CMD to measure its mass function (MF) slope fo… ▽ More We have undertaken the largest systematic study of the high-mass stellar initial mass function (IMF) to date using the optical color-magnitude diagrams (CMDs) of 85 resolved, young (4 Myr < t < 25 Myr), intermediate mass star clusters (10^3-10^4 Msun), observed as part of the Panchromatic Hubble Andromeda Treasury (PHAT) program. We fit each cluster's CMD to measure its mass function (MF) slope for stars >2 Msun. For the ensemble of clusters, the distribution of stellar MF slopes is best described by $Γ=+1.45^{+0.03}_{-0.06}$ with a very small intrinsic scatter. The data also imply no significant dependencies of the MF slope on cluster age, mass, and size, providing direct observational evidence that the measured MF represents the IMF. This analysis implies that the high-mass IMF slope in M31 clusters is universal with a slope ($Γ=+1.45^{+0.03}_{-0.06}$) that is steeper than the canonical Kroupa (+1.30) and Salpeter (+1.35) values. Using our inference model on select Milky Way (MW) and LMC high-mass IMF studies from the literature, we find $Γ_{\rm MW} \sim+1.15\pm0.1$ and $Γ_{\rm LMC} \sim+1.3\pm0.1$, both with intrinsic scatter of ~0.3-0.4 dex. Thus, while the high-mass IMF in the Local Group may be universal, systematics in literature IMF studies preclude any definitive conclusions; homogenous investigations of the high-mass IMF in the local universe are needed to overcome this limitation. Consequently, the present study represents the most robust measurement of the high-mass IMF slope to date. We have grafted the M31 high-mass IMF slope onto widely used sub-solar mass Kroupa and Chabrier IMFs and show that commonly used UV- and Halpha-based star formation rates should be increased by a factor of ~1.3-1.5 and the number of stars with masses >8 Msun are ~25% fewer than expected for a Salpeter/Kroupa IMF. [abridged] △ Less

Submitted 23 February, 2015; originally announced February 2015.

Comments: 11 pages, 7 Figures, submitted to ApJ. Comments welcome

arXiv:1502.04715 [pdf, other]

A systematic search for transiting planets in the K2 data

Authors: Daniel Foreman-Mackey, Benjamin T. Montet, David W. Hogg, Timothy D. Morton, Dun Wang, Bernhard Schölkopf

Abstract: Photometry of stars from the K2 extension of NASA's Kepler mission is afflicted by systematic effects caused by small (few-pixel) drifts in the telescope pointing and other spacecraft issues. We present a method for searching K2 light curves for evidence of exoplanets by simultaneously fitting for these systematics and the transit signals of interest. This method is more computationally expensive… ▽ More Photometry of stars from the K2 extension of NASA's Kepler mission is afflicted by systematic effects caused by small (few-pixel) drifts in the telescope pointing and other spacecraft issues. We present a method for searching K2 light curves for evidence of exoplanets by simultaneously fitting for these systematics and the transit signals of interest. This method is more computationally expensive than standard search algorithms but we demonstrate that it can be efficiently implemented and used to discover transit signals. We apply this method to the full Campaign 1 dataset and report a list of 36 planet candidates transiting 31 stars, along with an analysis of the pipeline performance and detection efficiency based on artificial signal injections and recoveries. For all planet candidates, we present posterior distributions on the properties of each system based strictly on the transit observables. △ Less

Submitted 2 June, 2015; v1 submitted 16 February, 2015; originally announced February 2015.

Comments: Updated to ApJ accepted version. Code at https://github.com/dfm/ketu & LaTeX source at https://github.com/dfm/k2-paper

arXiv:1502.02658 [pdf, other]

doi 10.1088/0004-637X/803/2/80

Globular Cluster Streams as Galactic High-Precision Scales - The Poster Child Palomar 5

Authors: Andreas H. W. Küpper, Eduardo Balbinot, Ana Bonaca, Kathryn V. Johnston, David W. Hogg, Pavel Kroupa, Basilio X. Santiago

Abstract: Using the example of the tidal stream of the Milky Way globular cluster Palomar 5 (Pal 5), we demonstrate how observational data on streams can be efficiently reduced in dimensionality and modeled in a Bayesian framework. Our approach combines detection of stream overdensities by a Difference-of-Gaussians process with fast streakline models, a continuous likelihood function built from these models… ▽ More Using the example of the tidal stream of the Milky Way globular cluster Palomar 5 (Pal 5), we demonstrate how observational data on streams can be efficiently reduced in dimensionality and modeled in a Bayesian framework. Our approach combines detection of stream overdensities by a Difference-of-Gaussians process with fast streakline models, a continuous likelihood function built from these models, and inference with MCMC. By generating $\approx10^7$ model streams, we show that the geometry of the Pal 5 debris yields powerful constraints on the solar position and motion, the Milky Way and Pal 5 itself. All 10 model parameters were allowed to vary over large ranges without additional prior information. Using only SDSS data and a few radial velocities from the literature, we find that the distance of the Sun from the Galactic Center is $8.30\pm0.25$ kpc, and the transverse velocity is $253\pm16$ km/s. Both estimates are in excellent agreement with independent measurements of these quantities. Assuming a standard disk and bulge model, we determine the Galactic mass within Pal 5's apogalactic radius of 19 kpc to be $(2.1\pm0.4)\times10^{11}$ M$_\odot$. Moreover, we find the potential of the dark halo with a flattening of $q_z = 0.95^{+0.16}_{-0.12}$ to be essentially spherical within the radial range that is effectively probed by Pal 5. We also determine Pal 5's mass, distance and proper motion independently from other methods, which enables us to perform vital cross-checks. We conclude that with more observational data and by using additional prior information, the precision of this method can be significantly increased. △ Less

Submitted 9 February, 2015; originally announced February 2015.

Comments: 28 pages, 14 figures, submitted to ApJ (revised version), comments welcome

arXiv:1501.07604 [pdf, other]

doi 10.1088/0004-637X/808/1/16

The Cannon: A data-driven approach to stellar label determination

Authors: Melissa Ness, David W. Hogg, Hans-Walter Rix, Anna Y. Q. Ho, Gail Zasowski

Abstract: New spectroscopic surveys offer the promise of consistent stellar parameters and abundances ('stellar labels') for hundreds of thousands of stars in the Milky Way: this poses a formidable spectral modeling challenge. In many cases, there is a sub-set of reference objects for which the stellar labels are known with high(er) fidelity. We take advantage of this with The Cannon, a new data-driven appr… ▽ More New spectroscopic surveys offer the promise of consistent stellar parameters and abundances ('stellar labels') for hundreds of thousands of stars in the Milky Way: this poses a formidable spectral modeling challenge. In many cases, there is a sub-set of reference objects for which the stellar labels are known with high(er) fidelity. We take advantage of this with The Cannon, a new data-driven approach for determining stellar labels from spectroscopic data. The Cannon learns from the 'known' labels of reference stars how the continuum-normalized spectra depend on these labels by fitting a flexible model at each wavelength; then, The Cannon uses this model to derive labels for the remaining survey stars. We illustrate The Cannon by training the model on only 542 stars in 19 clusters as reference objects, with Teff, log g and [Fe/H] as the labels, and then applying it to the spectra of 56,000 stars from APOGEE DR10. The Cannon is very accurate. Its stellar labels compare well to the stars for which APOGEE pipeline (ASPCAP) labels are provided in DR10, with rms differences that are basically identical to the stated ASPCAP uncertainties. Beyond the reference labels, The Cannon makes no use of stellar models nor any line-list, but needs a set of reference objects that span label-space. The Cannon performs well at lower signal-to-noise, as it delivers comparably good labels even at one ninth the APOGEE observing time. We discuss the limitations of The Cannon and its future potential, particularly, to bring different spectroscopic surveys onto a consistent scale of stellar labels. △ Less

Submitted 30 August, 2015; v1 submitted 29 January, 2015; originally announced January 2015.

Comments: Published in ApJ

Journal ref: ApJ 808 16 (2015)

arXiv:1501.05251 [pdf, other]

doi 10.1088/0004-637X/810/1/66

Dissecting magnetar variability with Bayesian hierarchical models

Authors: D. Huppenkothen, B. J. Brewer, D. W. Hogg, I. Murray, M. Frean, C. Elenbaas, A. L. Watts, Y. Levin, A. J. van der Horst, C. Kouveliotou

Abstract: Neutron stars are a prime laboratory for testing physical processes under conditions of strong gravity, high density, and extreme magnetic fields. Among the zoo of neutron star phenomena, magnetars stand out for their bursting behaviour, ranging from extremely bright, rare giant flares to numerous, less energetic recurrent bursts. The exact trigger and emission mechanisms for these bursts are not… ▽ More Neutron stars are a prime laboratory for testing physical processes under conditions of strong gravity, high density, and extreme magnetic fields. Among the zoo of neutron star phenomena, magnetars stand out for their bursting behaviour, ranging from extremely bright, rare giant flares to numerous, less energetic recurrent bursts. The exact trigger and emission mechanisms for these bursts are not known; favoured models involve either a crust fracture and subsequent energy release into the magnetosphere, or explosive reconnection of magnetic field lines. In the absence of a predictive model, understanding the physical processes responsible for magnetar burst variability is difficult. Here, we develop an empirical model that decomposes magnetar bursts into a superposition of small spike-like features with a simple functional form, where the number of model components is itself part of the inference problem. The cascades of spikes that we model might be formed by avalanches of reconnection, or crust rupture aftershocks. Using Markov Chain Monte Carlo (MCMC) sampling augmented with reversible jumps between models with different numbers of parameters, we characterise the posterior distributions of the model parameters and the number of components per burst. We relate these model parameters to physical quantities in the system, and show for the first time that the variability within a burst does not conform to predictions from ideas of self-organised criticality. We also examine how well the properties of the spikes fit the predictions of simplified cascade models for the different trigger mechanisms. △ Less

Submitted 29 June, 2015; v1 submitted 21 January, 2015; originally announced January 2015.

Comments: accepted for publication in The Astrophysical Journal; code available at https://bitbucket.org/dhuppenkothen/magnetron, data products at http://figshare.com/articles/SGR_J1550_5418_magnetron_data/1292424

arXiv:1501.00963 [pdf, other]

doi 10.1088/0067-0049/219/1/12

The Eleventh and Twelfth Data Releases of the Sloan Digital Sky Survey: Final Data from SDSS-III

Authors: Shadab Alam, Franco D. Albareti, Carlos Allende Prieto, F. Anders, Scott F. Anderson, Brett H. Andrews, Eric Armengaud, Éric Aubourg, Stephen Bailey, Julian E. Bautista, Rachael L. Beaton, Timothy C. Beers, Chad F. Bender, Andreas A. Berlind, Florian Beutler, Vaishali Bhardwaj, Jonathan C. Bird, Dmitry Bizyaev, Cullen H. Blake, Michael R. Blanton, Michael Blomqvist, John J. Bochanski, Adam S. Bolton, Jo Bovy, A. Shelden Bradley , et al. (249 additional authors not shown)

Abstract: The third generation of the Sloan Digital Sky Survey (SDSS-III) took data from 2008 to 2014 using the original SDSS wide-field imager, the original and an upgraded multi-object fiber-fed optical spectrograph, a new near-infrared high-resolution spectrograph, and a novel optical interferometer. All the data from SDSS-III are now made public. In particular, this paper describes Data Release 11 (DR11… ▽ More The third generation of the Sloan Digital Sky Survey (SDSS-III) took data from 2008 to 2014 using the original SDSS wide-field imager, the original and an upgraded multi-object fiber-fed optical spectrograph, a new near-infrared high-resolution spectrograph, and a novel optical interferometer. All the data from SDSS-III are now made public. In particular, this paper describes Data Release 11 (DR11) including all data acquired through 2013 July, and Data Release 12 (DR12) adding data acquired through 2014 July (including all data included in previous data releases), marking the end of SDSS-III observing. Relative to our previous public release (DR10), DR12 adds one million new spectra of galaxies and quasars from the Baryon Oscillation Spectroscopic Survey (BOSS) over an additional 3000 sq. deg of sky, more than triples the number of H-band spectra of stars as part of the Apache Point Observatory (APO) Galactic Evolution Experiment (APOGEE), and includes repeated accurate radial velocity measurements of 5500 stars from the Multi-Object APO Radial Velocity Exoplanet Large-area Survey (MARVELS). The APOGEE outputs now include measured abundances of 15 different elements for each star. In total, SDSS-III added 2350 sq. deg of ugriz imaging; 155,520 spectra of 138,099 stars as part of the Sloan Exploration of Galactic Understanding and Evolution 2 (SEGUE-2) survey; 2,497,484 BOSS spectra of 1,372,737 galaxies, 294,512 quasars, and 247,216 stars over 9376 sq. deg; 618,080 APOGEE spectra of 156,593 stars; and 197,040 MARVELS spectra of 5,513 stars. Since its first light in 1998, SDSS has imaged over 1/3 of the Celestial sphere in five bands and obtained over five million astronomical spectra. △ Less

Submitted 21 May, 2015; v1 submitted 5 January, 2015; originally announced January 2015.

Comments: DR12 data are available at http://www.sdss3.org/dr12. 30 pages. 11 figures. Accepted to ApJS

arXiv:1412.5177 [pdf, other]

doi 10.1088/0004-637X/812/2/128

Constructing A Flexible Likelihood Function For Spectroscopic Inference

Authors: Ian Czekala, Sean M. Andrews, Kaisey S. Mandel, David W. Hogg, Gregory M. Green

Abstract: We present a modular, extensible likelihood framework for spectroscopic inference based on synthetic model spectra. The subtraction of an imperfect model from a continuously sampled spectrum introduces covariance between adjacent datapoints (pixels) into the residual spectrum. For the high signal-to-noise data with large spectral range that is commonly employed in stellar astrophysics, that covari… ▽ More We present a modular, extensible likelihood framework for spectroscopic inference based on synthetic model spectra. The subtraction of an imperfect model from a continuously sampled spectrum introduces covariance between adjacent datapoints (pixels) into the residual spectrum. For the high signal-to-noise data with large spectral range that is commonly employed in stellar astrophysics, that covariant structure can lead to dramatically underestimated parameter uncertainties (and, in some cases, biases). We construct a likelihood function that accounts for the structure of the covariance matrix, utilizing the machinery of Gaussian process kernels. This framework specifically address the common problem of mismatches in model spectral line strengths (with respect to data) due to intrinsic model imperfections (e.g., in the atomic/molecular databases or opacity prescriptions) by develo** a novel local covariance kernel formalism that identifies and self-consistently downweights pathological spectral line "outliers." By fitting many spectra in a hierarchical manner, these local kernels provide a mechanism to learn about and build data-driven corrections to synthetic spectral libraries. An open-source software implementation of this approach is available at http://iancze.github.io/Starfish, including a sophisticated probabilistic scheme for spectral interpolation when using model libraries that are sparsely sampled in the stellar parameters. We demonstrate some salient features of the framework by fitting the high resolution $V$-band spectrum of WASP-14, an F5 dwarf with a transiting exoplanet, and the moderate resolution $K$-band spectrum of Gliese 51, an M5 field dwarf. △ Less

Submitted 15 September, 2015; v1 submitted 16 December, 2014; originally announced December 2014.

Comments: Accepted to ApJ. Incorporated referees' comments. New figures 1, 8, 10, 12, and 14. Supplemental website: http://iancze.github.io/Starfish/

arXiv:1412.1825 [pdf, ps, other]

doi 10.1093/mnras/stv781

GREAT3 results I: systematic errors in shear estimation and the impact of real galaxy morphology

Authors: Rachel Mandelbaum, Barnaby Rowe, Robert Armstrong, Deborah Bard, Emmanuel Bertin, James Bosch, Dominique Boutigny, Frederic Courbin, William A. Dawson, Annamaria Donnarumma, Ian Fenech Conti, Raphael Gavazzi, Marc Gentile, Mandeep S. S. Gill, David W. Hogg, Eric M. Huff, M. James Jee, Tomasz Kacprzak, Martin Kilbinger, Thibault Kuntzer, Dustin Lang, Wentao Luo, Marisa C. March, Philip J. Marshall, Joshua E. Meyers , et al. (18 additional authors not shown)

Abstract: We present first results from the third GRavitational lEnsing Accuracy Testing (GREAT3) challenge, the third in a sequence of challenges for testing methods of inferring weak gravitational lensing shear distortions from simulated galaxy images. GREAT3 was divided into experiments to test three specific questions, and included simulated space- and ground-based data with constant or cosmologically-v… ▽ More We present first results from the third GRavitational lEnsing Accuracy Testing (GREAT3) challenge, the third in a sequence of challenges for testing methods of inferring weak gravitational lensing shear distortions from simulated galaxy images. GREAT3 was divided into experiments to test three specific questions, and included simulated space- and ground-based data with constant or cosmologically-varying shear fields. The simplest (control) experiment included parametric galaxies with a realistic distribution of signal-to-noise, size, and ellipticity, and a complex point spread function (PSF). The other experiments tested the additional impact of realistic galaxy morphology, multiple exposure imaging, and the uncertainty about a spatially-varying PSF; the last two questions will be explored in Paper II. The 24 participating teams competed to estimate lensing shears to within systematic error tolerances for upcoming Stage-IV dark energy surveys, making 1525 submissions overall. GREAT3 saw considerable variety and innovation in the types of methods applied. Several teams now meet or exceed the targets in many of the tests conducted (to within the statistical errors). We conclude that the presence of realistic galaxy morphology in simulations changes shear calibration biases by $\sim 1$ per cent for a wide range of methods. Other effects such as truncation biases due to finite galaxy postage stamps, and the impact of galaxy type as measured by the Sérsic index, are quantified for the first time. Our results generalize previous studies regarding sensitivities to galaxy size and signal-to-noise, and to PSF properties such as seeing and defocus. Almost all methods' results support the simple model in which additive shear biases depend linearly on PSF ellipticity. △ Less

Submitted 30 December, 2022; v1 submitted 4 December, 2014; originally announced December 2014.

Comments: 32 pages + 15 pages of technical appendices; 28 figures; published in MNRAS; all data and other information may be found at https://github.com/barnabytprowe/great3-public as the GREAT3 website and leaderboard are no longer live. v3 has updates to this comment only

Journal ref: Monthly Notices of the Royal Astronomical Society, 2015, 450 (3): 2963-3007

arXiv:1411.2608 [pdf, other]

doi 10.1088/0004-637X/807/1/87

Hierarchical probabilistic inference of cosmic shear

Authors: Michael D. Schneider, David W. Hogg, Philip J. Marshall, William A. Dawson, Joshua Meyers, Deborah J. Bard, Dustin Lang

Abstract: Point estimators for the shearing of galaxy images induced by gravitational lensing involve a complex inverse problem in the presence of noise, pixelization, and model uncertainties. We present a probabilistic forward modeling approach to gravitational lensing inference that has the potential to mitigate the biased inferences in most common point estimators and is practical for upcoming lensing su… ▽ More Point estimators for the shearing of galaxy images induced by gravitational lensing involve a complex inverse problem in the presence of noise, pixelization, and model uncertainties. We present a probabilistic forward modeling approach to gravitational lensing inference that has the potential to mitigate the biased inferences in most common point estimators and is practical for upcoming lensing surveys. The first part of our statistical framework requires specification of a likelihood function for the pixel data in an imaging survey given parameterized models for the galaxies in the images. We derive the lensing shear posterior by marginalizing over all intrinsic galaxy properties that contribute to the pixel data (i.e., not limited to galaxy ellipticities) and learn the distributions for the intrinsic galaxy properties via hierarchical inference with a suitably flexible conditional probabilitiy distribution specification. We use importance sampling to separate the modeling of small imaging areas from the global shear inference, thereby rendering our algorithm computationally tractable for large surveys. With simple numerical examples we demonstrate the improvements in accuracy from our importance sampling approach, as well as the significance of the conditional distribution specification for the intrinsic galaxy properties when the data are generated from an unknown number of distinct galaxy populations with different morphological characteristics. △ Less

Submitted 10 November, 2014; originally announced November 2014.

Comments: 23 pages, 9 figures, submitted, related to the 'MBI' team submissions to the GREAT3 gravitational lensing community challenge

Report number: LLNL-JRNL-661076

arXiv:1410.7397 [pdf, other]

doi 10.3847/0004-6256/151/2/36

WISE photometry for 400 million SDSS sources

Authors: Dustin Lang, David W. Hogg, David J. Schlegel

Abstract: We present photometry of images from the Wide-Field Infrared Survey Explorer (WISE; Wright et al. 2010) of over 400 million sources detected by the Sloan Digital Sky Survey (SDSS; York et al. 2000). We use a "forced photometry" technique, using measured SDSS source positions, star-galaxy separation and galaxy profiles to define the sources whose fluxes are to be measured in the WISE images. We per… ▽ More We present photometry of images from the Wide-Field Infrared Survey Explorer (WISE; Wright et al. 2010) of over 400 million sources detected by the Sloan Digital Sky Survey (SDSS; York et al. 2000). We use a "forced photometry" technique, using measured SDSS source positions, star-galaxy separation and galaxy profiles to define the sources whose fluxes are to be measured in the WISE images. We perform photometry with The Tractor image modeling code, working on our "unWISE" coaddds and taking account of the WISE point-spread function and a noise model. The result is a measurement of the flux of each SDSS source in each WISE band. Many sources have little flux in the WISE bands, so often the measurements we report are consistent with zero. However, for many sources we get three- or four-sigma measurements; these sources would not be reported by the WISE pipeline and will not appear in the WISE catalog, yet they can be highly informative for some scientific questions. In addition, these small-signal measurements can be used in stacking analyses at catalog level. The forced photometry approach has the advantage that we measure a consistent set of sources between SDSS and WISE, taking advantage of the resolution and depth of the SDSS images to interpret the WISE images; objects that are resolved in SDSS but blended together in WISE still have accurate measurements in our photometry. Our results, and the code used to produce them, are publicly available at http://unwise.me. △ Less

Submitted 27 October, 2014; originally announced October 2014.

arXiv:1408.4248 [pdf, ps, other]

doi 10.1088/0004-637X/794/2/161

S4: A Spatial-Spectral model for Speckle Suppression

Authors: Rob Fergus, David W. Hogg, Rebecca Oppenheimer, Douglas Brenner, Laurent Pueyo

Abstract: High dynamic-range imagers aim to block out or null light from a very bright primary star to make it possible to detect and measure far fainter companions; in real systems a small fraction of the primary light is scattered, diffracted, and unocculted. We introduce S4, a flexible data-driven model for the unocculted (and highly speckled) light in the P1640 spectroscopic coronograph. The model uses… ▽ More High dynamic-range imagers aim to block out or null light from a very bright primary star to make it possible to detect and measure far fainter companions; in real systems a small fraction of the primary light is scattered, diffracted, and unocculted. We introduce S4, a flexible data-driven model for the unocculted (and highly speckled) light in the P1640 spectroscopic coronograph. The model uses Principal Components Analysis (PCA) to capture the spatial structure and wavelength dependence of the speckles but not the signal produced by any companion. Consequently, the residual typically includes the companion signal. The companion can thus be found by filtering this error signal with a fixed companion model. The approach is sensitive to companions that are of order a percent of the brightness of the speckles, or up to $10^{-7}$ times the brightness of the primary star. This outperforms existing methods by a factor of 2-3 and is close to the shot-noise physical limit. △ Less

Submitted 19 August, 2014; originally announced August 2014.

Comments: accepted for publication in ApJ

arXiv:1406.6063 [pdf, ps, other]

doi 10.1088/0004-637X/795/1/94

Milky Way Mass and Potential Recovery Using Tidal Streams in a Realistic Halo

Authors: Ana Bonaca, Marla Geha, Andreas H. W. Kuepper, Juerg Diemand, Kathryn V. Johnston, David W. Hogg

Abstract: We present a new method for determining the Galactic gravitational potential based on forward modeling of tidal stellar streams. We use this method to test the performance of smooth and static analytic potentials in representing realistic dark matter halos, which have substructure and are continually evolving by accretion. Our FAST-FORWARD method uses a Markov Chain Monte Carlo algorithm to compar… ▽ More We present a new method for determining the Galactic gravitational potential based on forward modeling of tidal stellar streams. We use this method to test the performance of smooth and static analytic potentials in representing realistic dark matter halos, which have substructure and are continually evolving by accretion. Our FAST-FORWARD method uses a Markov Chain Monte Carlo algorithm to compare, in 6D phase space, an "observed" stream to models created in trial analytic potentials. We analyze a large sample of streams evolved in the Via Lactea II (VL2) simulation, which represents a realistic Galactic halo potential. The recovered potential parameters are in agreement with the best fit to the global, present-day VL2 potential. However, merely assuming an analytic potential limits the dark matter halo mass measurement to an accuracy of 5 to 20%, depending on the choice of analytic parametrization. Collectively, mass estimates using streams from our sample reach this fundamental limit, but individually they can be highly biased. Individual streams can both under- and overestimate the mass, and the bias is progressively worse for those with smaller perigalacticons, motivating the search for tidal streams at galactocentric distances larger than 70 kpc. We estimate that the assumption of a static and smooth dark matter potential in modeling of the GD-1 and Pal5-like streams introduces an error of up to 50% in the Milky Way mass estimates. △ Less

Submitted 23 June, 2014; originally announced June 2014.

Comments: 12 pages, 6 figures, submitted to ApJ; more information on our stream sample and a movie of the potential recovery method used can be found at http://www.astro.yale.edu/abonaca/research/potential_recovery.html

arXiv:1406.3020 [pdf, other]

doi 10.1088/0004-637X/795/1/64

Exoplanet population inference and the abundance of Earth analogs from noisy, incomplete catalogs

Authors: Daniel Foreman-Mackey, David W. Hogg, Timothy D. Morton

Abstract: No true extrasolar Earth analog is known. Hundreds of planets have been found around Sun-like stars that are either Earth-sized but on shorter periods, or else on year-long orbits but somewhat larger. Under strong assumptions, exoplanet catalogs have been used to make an extrapolated estimate of the rate at which Sun-like stars host Earth analogs. These studies are complicated by the fact that eve… ▽ More No true extrasolar Earth analog is known. Hundreds of planets have been found around Sun-like stars that are either Earth-sized but on shorter periods, or else on year-long orbits but somewhat larger. Under strong assumptions, exoplanet catalogs have been used to make an extrapolated estimate of the rate at which Sun-like stars host Earth analogs. These studies are complicated by the fact that every catalog is censored by non-trivial selection effects and detection efficiencies, and every property (period, radius, etc.) is measured noisily. Here we present a general hierarchical probabilistic framework for making justified inferences about the population of exoplanets, taking into account survey completeness and, for the first time, observational uncertainties. We are able to make fewer assumptions about the distribution than previous studies; we only require that the occurrence rate density be a smooth function of period and radius (employing a Gaussian process). By applying our method to synthetic catalogs, we demonstrate that it produces more accurate estimates of the whole population than standard procedures based on weighting by inverse detection efficiency. We apply the method to an existing catalog of small planet candidates around G dwarf stars (Petigura et al. 2013). We confirm a previous result that the radius distribution changes slope near Earth's radius. We find that the rate density of Earth analogs is about 0.02 (per star per natural logarithmic bin in period and radius) with large uncertainty. This number is much smaller than previous estimates made with the same data but stronger assumptions. △ Less

Submitted 3 September, 2014; v1 submitted 11 June, 2014; originally announced June 2014.

Comments: ApJ accepted version. The data and results are available at http://dx.doi.org/10.5281/zenodo.11507 and the code can be found at https://github.com/dfm/exopop

arXiv:1406.1528 [pdf, other]

Towards building a Crowd-Sourced Sky Map

Authors: Dustin Lang, David W. Hogg, Bernhard Scholkopf

Abstract: We describe a system that builds a high dynamic-range and wide-angle image of the night sky by combining a large set of input images. The method makes use of pixel-rank information in the individual input images to improve a "consensus" pixel rank in the combined image. Because it only makes use of ranks and the complexity of the algorithm is linear in the number of images, the method is useful fo… ▽ More We describe a system that builds a high dynamic-range and wide-angle image of the night sky by combining a large set of input images. The method makes use of pixel-rank information in the individual input images to improve a "consensus" pixel rank in the combined image. Because it only makes use of ranks and the complexity of the algorithm is linear in the number of images, the method is useful for large sets of uncalibrated images that might have undergone unknown non-linear tone map** transformations for visualization or aesthetic reasons. We apply the method to images of the night sky (of unknown provenance) discovered on the Web. The method permits discovery of astronomical objects or features that are not visible in any of the input images taken individually. More importantly, however, it permits scientific exploitation of a huge source of astronomical images that would not be available to astronomical research without our automatic system. △ Less

Submitted 5 June, 2014; originally announced June 2014.

Comments: Appeared at AI-STATS 2014

Journal ref: JMLR Workshop and Conference Proceedings, 33 (AI & Statistics 2014), 549

arXiv:1405.6721 [pdf, other]

doi 10.1088/0004-637X/794/1/4

Inferring the gravitational potential of the Milky Way with a few precisely measured stars

Authors: Adrian M. Price-Whelan, David W. Hogg, Kathryn V. Johnston, David Hendel

Abstract: The dark matter halo of the Milky Way is expected to be triaxial and filled with substructure. It is hoped that streams or shells of stars produced by tidal disruption of stellar systems will provide precise measures of the gravitational potential to test these predictions. We develop a method for inferring the Galactic potential with tidal streams based on the idea that the stream stars were once… ▽ More The dark matter halo of the Milky Way is expected to be triaxial and filled with substructure. It is hoped that streams or shells of stars produced by tidal disruption of stellar systems will provide precise measures of the gravitational potential to test these predictions. We develop a method for inferring the Galactic potential with tidal streams based on the idea that the stream stars were once close in phase space. Our method can flexibly adapt to any form for the Galactic potential: it works in phase-space rather than action-space and hence relies neither on our ability to derive actions nor on the integrability of the potential. Our model is probabilistic, with a likelihood function and priors on the parameters. The method can properly account for finite observational uncertainties and missing data dimensions. We test our method on synthetic datasets generated from N-body simulations of satellite disruption in a static, multi-component Milky Way including a triaxial dark matter halo with observational uncertainties chosen to mimic current and near-future surveys of various stars. We find that with just four well-measured stream stars, we can infer properties of a triaxial potential with precisions of order 5-7 percent. Without proper motions we obtain 15 percent constraints on potential parameters and precisions around 25 percent for recovering missing phase-space coordinates. These results are encouraging for the eventual goal of using flexible, time-dependent potential models combined with larger data sets to unravel the detailed shape of the dark matter distribution around the Milky Way. △ Less

Submitted 26 May, 2014; originally announced May 2014.

Comments: 30 pages, 12 figures, submitted to ApJ

arXiv:1405.1072 [pdf, ps, other]

doi 10.1088/0004-637X/799/2/196

IGM Constraints from the SDSS-III/BOSS DR9 Ly-alpha Forest Flux Probability Distribution Function

Authors: Khee-Gan Lee, Joseph P. Hennawi, David N. Spergel, David H. Weinberg, David W. Hogg, Matteo Viel, James S. Bolton, Stephen Bailey, Matthew M. Pieri, William Carithers, David J. Schlegel, Britt Lundgren, Nathalie Palanque-Delabrouille, Nao Suzuki, Donald P. Schneider, Christophe Yeche

Abstract: The Ly$α$ forest transmission probability distribution function (PDF) is an established probe of the intergalactic medium (IGM) astrophysics, especially the temperature-density relationship of the IGM. We measure the transmission PDF from 3393 Baryon Oscillations Spectroscopic Survey (BOSS) quasars from SDSS Data Release 9, and compare with mock spectra that include careful modeling of the noise,… ▽ More The Ly$α$ forest transmission probability distribution function (PDF) is an established probe of the intergalactic medium (IGM) astrophysics, especially the temperature-density relationship of the IGM. We measure the transmission PDF from 3393 Baryon Oscillations Spectroscopic Survey (BOSS) quasars from SDSS Data Release 9, and compare with mock spectra that include careful modeling of the noise, continuum, and astrophysical uncertainties. The BOSS transmission PDFs, measured at $\langle z \rangle = [2.3,2.6,3.0]$, are compared with PDFs created from mock spectra drawn from a suite of hydrodynamical simulations that sample the IGM temperature-density relationship, $γ$, and temperature at mean-density, $T_0$, where $T(Δ) = T_0 Δ^{γ-1}$. We find that a significant population of partial Lyman-limit systems with a column-density distribution slope of $β_\mathrm{pLLS} \sim -2$ are required to explain the data at the low-transmission end of transmission PDF, while uncertainties in the mean Ly$α$ forest transmission affect the high-transmission end. After modelling the LLSs and marginalizing over mean-transmission uncertainties, we find that $γ=1.6$ best describes the data over our entire redshift range, although constraints on $T_0$ are affected by systematic uncertainties. Within our model framework, isothermal or inverted temperature-density relationships ($γ\leq 1$) are disfavored at a significance of over 4$σ$, although this could be somewhat weakened by cosmological and astrophysical uncertainties that we did not model. △ Less

Submitted 4 December, 2014; v1 submitted 5 May, 2014; originally announced May 2014.

Comments: Accepted for publication in ApJ. 35 pages, 25 figures

arXiv:1404.6534 [pdf, other]

doi 10.1088/0004-637X/801/2/98

Action-space clustering of tidal streams to infer the Galactic potential

Authors: Robyn E. Sanderson, Amina Helmi, David W. Hogg

Abstract: We present a new method for constraining the Milky Way halo gravitational potential by simultaneously fitting multiple tidal streams. This method requires full three-dimensional positions and velocities for all stars to be fit, but does not require identification of any specific stream or determination of stream membership for any star. We exploit the principle that the action distribution of stre… ▽ More We present a new method for constraining the Milky Way halo gravitational potential by simultaneously fitting multiple tidal streams. This method requires full three-dimensional positions and velocities for all stars to be fit, but does not require identification of any specific stream or determination of stream membership for any star. We exploit the principle that the action distribution of stream stars is most clustered when the potential used to calculate the actions is closest to the true potential. Clustering is quantified with the Kullback-Leibler Divergence (KLD), which also provides conditional uncertainties for our parameter estimates. We show, for toy Gaia-like data in a spherical isochrone potential, that maximizing the KLD of the action distribution relative to a smoother distribution recovers the true values of the potential parameters. The precision depends on the observational errors and the number of streams in the sample; using KIII giants as tracers, we measure the enclosed mass at the average radius of the sample stars accurate to 3% and precise to 20-40%. Recovery of the scale radius is precise to 25%, and is biased 50% high by the small galactocentric distance range of stars in our mock sample (1-25 kpc, or about three scale radii, with mean 6.5 kpc). About 15 streams, with at least 100 stars per stream, are needed to obtain upper and lower bounds on the enclosed mass and scale radius when observational errors are taken into account; 20-25 streams are required to stabilize the size of the confidence interval. If radial velocities are provided for stars out to 100 kpc (10 scale radii), all parameters can be determined with 10% accuracy and 20% precision (1.3% accuracy in the case of the enclosed mass), underlining the need for ground-based spectroscopic follow-up to complete the radial velocity catalog for faint halo stars observed by Gaia. △ Less

Submitted 5 January, 2015; v1 submitted 25 April, 2014; originally announced April 2014.

Comments: Accepted version

arXiv:1403.6015 [pdf, other]

Fast Direct Methods for Gaussian Processes

Authors: Sivaram Ambikasaran, Daniel Foreman-Mackey, Leslie Greengard, David W. Hogg, Michael O'Neil

Abstract: A number of problems in probability and statistics can be addressed using the multivariate normal (Gaussian) distribution. In the one-dimensional case, computing the probability for a given mean and variance simply requires the evaluation of the corresponding Gaussian density. In the $n$-dimensional setting, however, it requires the inversion of an $n \times n$ covariance matrix, $C$, as well as t… ▽ More A number of problems in probability and statistics can be addressed using the multivariate normal (Gaussian) distribution. In the one-dimensional case, computing the probability for a given mean and variance simply requires the evaluation of the corresponding Gaussian density. In the $n$-dimensional setting, however, it requires the inversion of an $n \times n$ covariance matrix, $C$, as well as the evaluation of its determinant, $\det(C)$. In many cases, such as regression using Gaussian processes, the covariance matrix is of the form $C = σ^2 I + K$, where $K$ is computed using a specified covariance kernel which depends on the data and additional parameters (hyperparameters). The matrix $C$ is typically dense, causing standard direct methods for inversion and determinant evaluation to require $\mathcal O(n^3)$ work. This cost is prohibitive for large-scale modeling. Here, we show that for the most commonly used covariance functions, the matrix $C$ can be hierarchically factored into a product of block low-rank updates of the identity matrix, yielding an $\mathcal O (n\log^2 n) $ algorithm for inversion. More importantly, we show that this factorization enables the evaluation of the determinant $\det(C)$, permitting the direct calculation of probabilities in high dimensions under fairly broad assumptions on the kernel defining $K$. Our fast algorithm brings many problems in marginalization and the adaptation of hyperparameters within practical reach using a single CPU core. The combination of nearly optimal scaling in terms of problem size with high-performance computing resources will permit the modeling of previously intractable problems. We illustrate the performance of the scheme on standard covariance kernels. △ Less

Submitted 4 April, 2015; v1 submitted 24 March, 2014; originally announced March 2014.

arXiv:1403.4931 [pdf, ps, other]

doi 10.1093/mnras/stu572

The nature of massive black hole binary candidates: II. Spectral energy distribution atlas

Authors: Elisabeta Lusso, Roberto Decarli, Massimo Dotti, Carmen Montuori, David W. Hogg, Paraskevi Tsalmantza, Michele Fumagalli, Jason X. Prochaska

Abstract: Recoiling supermassive black holes (SMBHs) are considered one plausible physical mechanism to explain high velocity shifts between narrow and broad emission lines sometimes observed in quasar spectra. If the sphere of influence of the recoiling SMBH is such that only the accretion disc is bound, the dusty torus would be left behind, hence the SED should then present distinctive features (i.e. a mi… ▽ More Recoiling supermassive black holes (SMBHs) are considered one plausible physical mechanism to explain high velocity shifts between narrow and broad emission lines sometimes observed in quasar spectra. If the sphere of influence of the recoiling SMBH is such that only the accretion disc is bound, the dusty torus would be left behind, hence the SED should then present distinctive features (i.e. a mid-infrared deficit). Here we present results from fitting the Spectral Energy Distributions (SEDs) of 32 Type-1 AGN with high velocity shifts between broad and narrow lines. The aim is to find peculiar properties in the multi-wavelength SEDs of such objects by comparing their physical parameters (torus and disc luminosity, intrinsic reddening, and size of the 12$μ$m emitter) with those estimated from a control sample of $\sim1000$ \emph{typical} quasars selected from the Sloan Digital Sky Survey in the same redshift range. We find that all sources, with the possible exception of J1154+0134, analysed here present a significant amount of 12~$μ$m emission. This is in contrast with a scenario of a SMBH displaced from the center of the galaxy, as expected for an undergoing recoil event. △ Less

Submitted 19 March, 2014; originally announced March 2014.

Comments: 19 pages, 7 figures, accepted for publication in the Monthly Notices of the Royal Astronomical Society

arXiv:1401.6128 [pdf, ps, other]

The Probabilities of Orbital-Companion Models for Stellar Radial Velocity Data

Authors: Fengji Hou, Jonathan Goodman, David W. Hogg

Abstract: The fully marginalized likelihood, or Bayesian evidence, is of great importance in probabilistic data analysis, because it is involved in calculating the posterior probability of a model or re-weighting a mixture of models conditioned on data. It is, however, extremely challenging to compute. This paper presents a geometric-path Monte Carlo method, inspired by multi-canonical Monte Carlo to evalua… ▽ More The fully marginalized likelihood, or Bayesian evidence, is of great importance in probabilistic data analysis, because it is involved in calculating the posterior probability of a model or re-weighting a mixture of models conditioned on data. It is, however, extremely challenging to compute. This paper presents a geometric-path Monte Carlo method, inspired by multi-canonical Monte Carlo to evaluate the fully marginalized likelihood. We show that the algorithm is very fast and easy to implement and produces a justified uncertainty estimate on the fully marginalized likelihood. The algorithm performs efficiently on a trial problem and multi-companion model fitting for radial velocity data. For the trial problem, the algorithm returns the correct fully marginalized likelihood, and the estimated uncertainty is also consistent with the standard deviation of results from multiple runs. We apply the algorithm to the problem of fitting radial velocity data from HIP 88048 ($ν$ Oph) and Gliese 581. We evaluate the fully marginalized likelihood of 1, 2, 3, and 4-companion models given data from HIP 88048 and various choices of prior distributions. We consider prior distributions with three different minimum radial velocity amplitude $K_{\mathrm{min}}$. Under all three priors, the 2-companion model has the largest marginalized likelihood, but the detailed values depend strongly on $K_{\mathrm{min}}$. We also evaluate the fully marginalized likelihood of 3, 4, 5, and 6-planet model given data from Gliese 581 and find that the fully marginalized likelihood of the 5-planet model is too close to that of the 6-planet model for us to confidently decide between them. △ Less

Submitted 29 January, 2014; v1 submitted 23 January, 2014; originally announced January 2014.

Comments: 24 pages, 10 figures, 2 tables, submitted to AJ

arXiv:1401.5758 [pdf, other]

Fitting Spectral Energy Distributions of AGN - A Markov Chain Monte Carlo Approach

Authors: Gabriela Calistro Rivera, Elisabeta Lusso, Joseph F. Hennawi, David W. Hogg

Abstract: We present AGNfitter: a Markov Chain Monte Carlo algorithm developed to fit the spectral energy distributions (SEDs) of active galactic nuclei (AGN) with different physical models of AGN components. This code is well suited to determine in a robust way multiple parameters and their uncertainties, which quantify the physical processes responsible for the panchromatic nature of active galaxies and q… ▽ More We present AGNfitter: a Markov Chain Monte Carlo algorithm developed to fit the spectral energy distributions (SEDs) of active galactic nuclei (AGN) with different physical models of AGN components. This code is well suited to determine in a robust way multiple parameters and their uncertainties, which quantify the physical processes responsible for the panchromatic nature of active galaxies and quasars. We describe the technicalities of the code and test its capabilities in the context of X-ray selected obscured AGN using multiwavelength data from the XMM-COSMOS survey. △ Less

Submitted 22 January, 2014; originally announced January 2014.

Comments: Proceedings to be published for the IAU Symposium 304: Multiwavelength AGN Surveys and Studies. 2 pages. 2 figures

arXiv:1401.2134 [pdf, other]

doi 10.1371/journal.pcbi.1003542

10 Simple Rules for the Care and Feeding of Scientific Data

Authors: Alyssa Goodman, Alberto Pepe, Alexander W. Blocker, Christine L. Borgman, Kyle Cranmer, Mercè Crosas, Rosanne Di Stefano, Yolanda Gil, Paul Groth, Margaret Hedstrom, David W. Hogg, Vinay Kashyap, Ashish Mahabal, Aneta Siemiginowska, Aleksandra Slavkovic

Abstract: This article offers a short guide to the steps scientists can take to ensure that their data and associated analyses continue to be of value and to be recognized. In just the past few years, hundreds of scholarly papers and reports have been written on questions of data sharing, data provenance, research reproducibility, licensing, attribution, privacy, and more, but our goal here is not to review… ▽ More This article offers a short guide to the steps scientists can take to ensure that their data and associated analyses continue to be of value and to be recognized. In just the past few years, hundreds of scholarly papers and reports have been written on questions of data sharing, data provenance, research reproducibility, licensing, attribution, privacy, and more, but our goal here is not to review that literature. Instead, we present a short guide intended for researchers who want to know why it is important to "care for and feed" data, with some practical advice on how to do that. △ Less

Submitted 9 January, 2014; originally announced January 2014.

Comments: Accepted in PLOS Computational Biology. This paper was written collaboratively, on the web, in the open, using Authorea. The living version of this article, which includes sources and history, is available at http://www.authorea.com/3410/

arXiv:1309.0654 [pdf, other]

Maximizing Kepler science return per telemetered pixel: Searching the habitable zones of the brightest stars

Authors: Benjamin T. Montet, Ruth Angus, Tom Barclay, Rebekah Dawson, Rob Fergus, Dan Foreman-Mackey, Stefan Harmeling, Michael Hirsch, David W. Hogg, Dustin Lang, David Schiminovich, Bernhard Scholkopf

Abstract: In today's mailing, Hogg et al. propose image modeling techniques to maintain 10-ppm-level precision photometry in Kepler data with only two working reaction wheels. While these results are relevant to many scientific goals for the repurposed mission, all modeling efforts so far have used a toy model of the Kepler telescope. Because the two-wheel performance of Kepler remains to be determined, we… ▽ More In today's mailing, Hogg et al. propose image modeling techniques to maintain 10-ppm-level precision photometry in Kepler data with only two working reaction wheels. While these results are relevant to many scientific goals for the repurposed mission, all modeling efforts so far have used a toy model of the Kepler telescope. Because the two-wheel performance of Kepler remains to be determined, we advocate for the consideration of an alternate strategy for a >1 year program that maximizes the science return from the "low-torque" fields across the ecliptic plane. Assuming we can reach the precision of the original Kepler mission, we expect to detect 800 new planet candidates in the first year of such a mission. Our proposed strategy has benefits for transit timing variation and transit duration variation studies, especially when considered in concert with the future TESS mission. We also expect to help address the first key science goal of Kepler: the frequency of planets in the habitable zone as a function of spectral type. △ Less

Submitted 3 September, 2013; originally announced September 2013.

Comments: A white paper submitted in response to the "Kepler Project Office Call for White Papers: Soliciting Community Input for Alternate Science Investigations for the Kepler Spacecraft"; 14 pages in length (that is, a modest 4 pages over the white-paper page limit)

Showing 101–150 of 298 results for author: Hogg, D W