-
Statistical modelling under differential privacy constraints: A case study in fine-scale geographical analysis with Australian Bureau of Statistics TableBuilder data
Authors:
Ewan Cameron
Abstract:
Guided by the principles of differential privacy protection the Australian Bureau of Statistics modifies the data summaries from the Australian Census provided through TableBuilder to researchers at approved institutions. This modification algorithm includes the injection of a small degree of artificial noise to every nonzero cell count followed by the suppression of very small cell counts to zero…
▽ More
Guided by the principles of differential privacy protection the Australian Bureau of Statistics modifies the data summaries from the Australian Census provided through TableBuilder to researchers at approved institutions. This modification algorithm includes the injection of a small degree of artificial noise to every nonzero cell count followed by the suppression of very small cell counts to zero. Researchers working with small area TableBuilder outputs with a high suppression fraction have proposed various algorithmic solutions to reconciling these with less suppressed outputs from larger enclosing areas. Here we propose that a Bayesian, likelihood-based statistical approach in which the perturbation algorithm itself is explicitly represented is well suited to analyses with such randomly perturbed data. Using both real (TableBuilder) and mock datasets representing dwelling classifications in the Perth Greater Capital City Area we demonstrate the feasibility and utility of multi-scale Bayesian reconstruction of modified cell counts in a spatial setting.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
A technique for improving dispersion within polymer-glass composites using polymer precipitation
Authors:
Reece N. Oosterbeek,
Xiang C. Zhang,
Serena M. Best,
Ruth E. Cameron
Abstract:
Particulate reinforcement of polymeric matrices is a powerful technique for tailoring the mechanical and degradation properties of bioresorbable implant materials. Dispersion of inorganic particles is critical to achieving optimal properties, however established techniques such as twin-screw extrusion or solvent casting can have significant drawbacks including excessive thermal degradation or part…
▽ More
Particulate reinforcement of polymeric matrices is a powerful technique for tailoring the mechanical and degradation properties of bioresorbable implant materials. Dispersion of inorganic particles is critical to achieving optimal properties, however established techniques such as twin-screw extrusion or solvent casting can have significant drawbacks including excessive thermal degradation or particle agglomeration. We present a facile method for production of polymer-inorganic composites that reduces the time at elevated temperature and the time available for particle agglomeration. Glass slurry was added to a dissolved PLLA solution, and ethanol was added to precipitate polymer onto the glass particles. Characterisation of parts formed by subsequent micro-injection moulding of composite precipitate revealed a significant reduction in agglomeration, with d0.9 reduced from 170 to 43 μm. This drastically improved the ductility (εB) from 7% to 120%, without loss of strength or stiffness. The method is versatile and applicable to a wide range of polymer and filler materials.
△ Less
Submitted 14 December, 2021;
originally announced December 2021.
-
The evolution of the structure and mechanical properties of fully bioresorbable polymer-glass composites during degradation
Authors:
Reece N. Oosterbeek,
Xiang C. Zhang,
Serena M. Best,
Ruth E. Cameron
Abstract:
Fully bioresorbable polymer matrix composites have long been considered as potential orthopaedic implant materials, however their combination of mechanical strength, stiffness, ductility and bioresorbability is also attractive for cardiac stent applications. This work investigated reinforcement of polylactide-based polymers with phosphate glasses, addressing key drawbacks of current polymer stents…
▽ More
Fully bioresorbable polymer matrix composites have long been considered as potential orthopaedic implant materials, however their combination of mechanical strength, stiffness, ductility and bioresorbability is also attractive for cardiac stent applications. This work investigated reinforcement of polylactide-based polymers with phosphate glasses, addressing key drawbacks of current polymer stents, and examined the often-neglected evolution of structure and mechanical properties during degradation. Incorporation of 15 - 30wt.% phosphate glass led to modulus increases of up to 80% under simulated body conditions, and 15wt.% glass composites retained comparable ductility to pure polymers, crucial for stent applications where ductility and stiffness are required. Two-stage degradation was observed, dominated by interfacial water absorption and glass dissolution. Polymer embrittlement mechanisms (crystallisation, enthalpy relaxation) were suppressed by glass addition, allowing composites to achieve a more controlled loss of mechanical properties during degradation, which could allow gradual transfer of loading to newly healed tissue. These results provide a valuable new system for understanding the structural and mechanical changes occurring during degradation of fully bioresorbable polymer matrix composites, providing important new data to underpin the design of effective cardiac stent materials.
△ Less
Submitted 14 December, 2021;
originally announced December 2021.
-
Women in academia: a warning on selection bias in gender studies from the astronomical perspective
Authors:
M. L. L. Dantas,
E. Cameron,
Rafael S. de Souza,
A. R. da Silva,
A. L. Chies-Santos,
C. Heneka,
P. R. T. Coelho,
A. Ederoclite,
I. S. Beloto,
V. Branco,
Morgan S. Camargo,
V. M. Carvalho de Oliveira,
C. de Sá-Freitas,
G. Gonçalves,
T. A. Pacheco,
Isabel Rebollido
Abstract:
The recent paper by AlShebli et al. (2020) investigates the impact of mentorship in young scientists. Among their conclusions, they state that female protégés benefit more from male than female mentorship. We herein expose a critical flaw in their methodological design that is a common issue in Astronomy, namely "selection biases". An effect that if not treated properly may lead to unwarranted cau…
▽ More
The recent paper by AlShebli et al. (2020) investigates the impact of mentorship in young scientists. Among their conclusions, they state that female protégés benefit more from male than female mentorship. We herein expose a critical flaw in their methodological design that is a common issue in Astronomy, namely "selection biases". An effect that if not treated properly may lead to unwarranted causality claims. In their analysis, selection biases seem to be present in the response rate of their survey (8.35%), the choice of database, success criterion, and the overlook of the numerous drawbacks female researchers face in academia. We discuss these issues and their implications -- one of them being the potential increase in obstacles for women in academia. Finally, we reinforce the dangers of not considering selection bias effects in studies aimed at retrieving causal relations.
△ Less
Submitted 4 December, 2020;
originally announced December 2020.
-
nazgul: A statistical approach to gamma-ray burst localization. Triangulation via non-stationary time-series models
Authors:
J. Michael Burgess,
Ewan Cameron,
Dmitry Svinkin,
Jochen Greiner
Abstract:
Context. Gamma-ray bursts can be located via arrival time signal triangulation using gamma-ray detectors in orbit throughout the solar system. The classical approach based on cross-correlations of binned light curves ignores the Poisson nature of the time-series data, and is unable to model the full complexity of the problem.
Aims. To present a statistically proper and robust GRB timing/triangul…
▽ More
Context. Gamma-ray bursts can be located via arrival time signal triangulation using gamma-ray detectors in orbit throughout the solar system. The classical approach based on cross-correlations of binned light curves ignores the Poisson nature of the time-series data, and is unable to model the full complexity of the problem.
Aims. To present a statistically proper and robust GRB timing/triangulation algorithm as a modern update to the original procedures used for the Interplanetary Network (IPN).
Methods. A hierarchical Bayesian forward model for the unknown temporal signal evolution is learned via random Fourier features (RFF) and fitted to each detector's time-series data with time-differences that correspond to GRB's position on the sky via the appropriate Poisson likelihood.
Results. Our novel method can robustly estimate the position of a GRB as verified via simulations. The uncertainties generated by the method are robust and in many cases more precise compared to the classical method. Thus, we have a method that can become a valuable tool for gravitational wave follow-up. All software and analysis scripts are made publicly available here (https://github.com/grburgess/nazgul) for the purpose of replication.
△ Less
Submitted 17 September, 2020;
originally announced September 2020.
-
Tuning structural relaxations, mechanical properties, and degradation timescale of PLLA during hydrolytic degradation by blending with PLCL-PEG
Authors:
Reece N. Oosterbeek,
Kyung-Ah Kwon,
Patrick Duffy,
Sean McMahon,
Xiang C. Zhang,
Serena M. Best,
Ruth E. Cameron
Abstract:
Poly-L-lactide (PLLA) is a popular choice for medical devices due to its bioresorbability and superior mechanical properties compared with other polymers. However, although PLLA has been investigated for use in bioresorbable cardiovascular stents, it presents application-specific limitations which hamper device therapies. These include low toughness and strength compared with metals used for this…
▽ More
Poly-L-lactide (PLLA) is a popular choice for medical devices due to its bioresorbability and superior mechanical properties compared with other polymers. However, although PLLA has been investigated for use in bioresorbable cardiovascular stents, it presents application-specific limitations which hamper device therapies. These include low toughness and strength compared with metals used for this purpose, and slow degradation. Blending PLLA with novel polyethylene glycol functionalised poly(L-lactide-co-$\varepsilon$-caprolactone) (PLCL-PEG) materials has been investigated here to tailor the mechanical properties and degradation behaviour of PLLA. This exciting approach provides a foundation for a next generation of bioresorbable materials whose properties can be rapidly tuned. The degradation of PLLA was significantly accelerated by addition of PLCL-PEG. After 30 days of degradation, several structural changes were observed in the polymer blends, which were dependent on the level of PLCL-PEG addition. Blends with low PLCL-PEG content displayed enthalpy relaxation, resulting in embrittlement, while blends with high PLCL-PEG content displayed crystallisation, due to enhanced chain mobility brought on by chain scission, also causing embrittlement. Moderate PLCL-PEG additions (10% PLCL(70:30)-PEG and 20 - 30% PLCL(80:20)-PEG) stabilised the structure, reducing the extent of enthalpy relaxation and crystallisation and thus retaining ductility. Compositional optimisation identified a sweet spot for this blend strategy, whereby the ductility was enhanced while maintaining strength. Our results indicate that blending PLLA with PLCL-PEG provides an effective method of tuning the degradation timescale and mechanical properties of PLLA, and provides important new insight into the mechanisms of structural relaxations that occur during degradation, and strategies for regulating these.
△ Less
Submitted 12 September, 2020;
originally announced September 2020.
-
Non-linear dissolution mechanisms of sodium calcium phosphate glasses as a function of pH in various aqueous media
Authors:
Reece N. Oosterbeek,
Kalliope I. Margaronis,
Xiang C. Zhang,
Serena M. Best,
Ruth E. Cameron
Abstract:
Phosphate glasses for bioresorbable implants display dissolution rates that vary significantly with composition, however currently their mechanisms of dissolution are not well understood. Based on this systematic study we present new insights into these mechanisms. Two-stage dissolution was observed, with time dependence initially parabolic and later linear, and a two-stage model was developed to…
▽ More
Phosphate glasses for bioresorbable implants display dissolution rates that vary significantly with composition, however currently their mechanisms of dissolution are not well understood. Based on this systematic study we present new insights into these mechanisms. Two-stage dissolution was observed, with time dependence initially parabolic and later linear, and a two-stage model was developed to describe this behaviour. Dissolution was accelerated by lower Ca concentration in the glass, and lower pH in the dissolution medium. A new dissolution mechanism is proposed, involving an initial stage where diffusion-controlled formation of a conversion layer occurs. Once the conversion layer is stabilised, layer dissolution reactions become rate-limiting. Under this mechanism the transition time is sensitive to the nature of the conversion layer and solution conditions. These results reveal the dependence of P$_{2}$O$_{5}$-CaO-Na$_{2}$O glass dissolution on solution pH, and provide new insight into the dissolution mechanisms, particularly regarding the transition between the two dissolution stages.
△ Less
Submitted 12 September, 2020;
originally announced September 2020.
-
Spatiotemporal map** of malaria prevalence in Madagascar using routine surveillance and health survey data
Authors:
Rohan Arambepola,
Suzanne H. Keddie,
Emma L. Collins,
Katherine A. Twohig,
Punam Amratia,
Amelia Bertozzi-Villa,
Elisabeth G. Chestnutt,
Joseph Harris,
Justin Millar,
Jennifer Rozier,
Susan F. Rumisha,
Tasmin L. Symons,
Camilo Vargas-Ruiz,
Mauricette Andriamananjara,
Saraha Rabeherisoa,
Arsène C. Ratsimbasoa,
Rosalind E. Howes,
Daniel J. Weiss,
Peter W. Gething,
Ewan Cameron
Abstract:
Malaria transmission in Madagascar is highly heterogeneous, exhibiting spatial, seasonal and long-term trends. Previous efforts to map malaria risk in Madagascar used prevalence data from Malaria Indicator Surveys. These cross-sectional surveys, conducted during the high transmission season most recently in 2013 and 2016, provide nationally representative prevalence data but cover relatively short…
▽ More
Malaria transmission in Madagascar is highly heterogeneous, exhibiting spatial, seasonal and long-term trends. Previous efforts to map malaria risk in Madagascar used prevalence data from Malaria Indicator Surveys. These cross-sectional surveys, conducted during the high transmission season most recently in 2013 and 2016, provide nationally representative prevalence data but cover relatively short time frames. Conversely, monthly case data are collected at health facilities but suffer from biases, including incomplete reporting.
We combined survey and case data to make monthly maps of prevalence between 2013 and 2016. Health facility catchments were estimated and incidence surfaces, environmental and socioeconomic covariates, and survey data informed a Bayesian prevalence model. Prevalence estimates were consistently high in the coastal regions and low in the highlands. Prevalence was lowest in 2014 and peaked in 2015, highlighting the importance of estimates between survey years. Seasonality was widely observed. Similar multi-metric approaches may be applicable across sub-Saharan Africa.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
A simulation study of disaggregation regression for spatial disease map**
Authors:
Rohan Arambepola,
Tim C D Lucas,
Anita K Nandi,
Peter W Gething,
Ewan Cameron
Abstract:
Disaggregation regression has become an important tool in spatial disease map** for making fine-scale predictions of disease risk from aggregated response data. By including high resolution covariate information and modelling the data generating process on a fine scale, it is hoped that these models can accurately learn the relationships between covariates and response at a fine spatial scale. H…
▽ More
Disaggregation regression has become an important tool in spatial disease map** for making fine-scale predictions of disease risk from aggregated response data. By including high resolution covariate information and modelling the data generating process on a fine scale, it is hoped that these models can accurately learn the relationships between covariates and response at a fine spatial scale. However, validating these high resolution predictions can be a challenge, as often there is no data observed at this spatial scale. In this study, disaggregation regression was performed on simulated data in various settings and the resulting fine-scale predictions are compared to the simulated ground truth. Performance was investigated with varying numbers of data points, sizes of aggregated areas and levels of model misspecification. The effectiveness of cross validation on the aggregate level as a measure of fine-scale predictive performance was also investigated. Predictive performance improved as the number of observations increased and as the size of the aggregated areas decreased. When the model was well-specified, fine-scale predictions were accurate even with small numbers of observations and large aggregated areas. Under model misspecification predictive performance was significantly worse for large aggregated areas but remained high when response data was aggregated over smaller regions. Cross-validation correlation on the aggregate level was a moderately good predictor of fine-scale predictive performance. While the simulations are unlikely to capture the nuances of real-life response data, this study gives insight into the effectiveness of disaggregation regression in different contexts.
△ Less
Submitted 7 May, 2020;
originally announced May 2020.
-
Nonparametric Causal Feature Selection for Spatiotemporal Risk Map** of Malaria Incidence in Madagascar
Authors:
Rohan Arambepola,
Peter Gething,
Ewan Cameron
Abstract:
Modern disease map** draws upon a wealth of high resolution spatial data products reflecting environmental and/or socioeconomic factors as covariates, or `features', within a geostatistical framework to improve predictions of disease risk. Feature selection is an important step in building these models, hel** to reduce overfitting and computational complexity, and to improve model interpretabi…
▽ More
Modern disease map** draws upon a wealth of high resolution spatial data products reflecting environmental and/or socioeconomic factors as covariates, or `features', within a geostatistical framework to improve predictions of disease risk. Feature selection is an important step in building these models, hel** to reduce overfitting and computational complexity, and to improve model interpretability. Selecting only features that have a causal relationship with the response variable could potentially improve predictions and generalisability, but identifying these causal features from non-interventional, spatiotemporal data is a challenging problem. Here we examine the performance of a causal feature selection procedure with regard to estimating malaria incidence in Madagascar. The studied procedure designed for this task combines the PC algorithm with spatiotemporal prewhitening and kernel-based independence tests extended to accommodate aggregated data. This case study reveals a clear advantage for causal feature selection in terms of the out-of-sample predictive accuracy in a forward temporal estimation task, but not in a spatiotemporal interpolation task, in comparison with thresholded spike-and-slab, for both linear and non-linear regression models. Compared to no feature selection, causal feature selection was most beneficial in settings wherein the volume of available data was low relative to the model complexity.
△ Less
Submitted 15 March, 2021; v1 submitted 21 January, 2020;
originally announced January 2020.
-
Map** malaria seasonality: a case study from Madagascar
Authors:
Michele Nguyen,
Rosalind E. Howes,
Tim C. D. Lucas,
Katherine E. Battle,
Ewan Cameron,
Harry S. Gibson,
Jennifer Rozier,
Suzanne Keddie,
Emma Collins,
Rohan Arambepola,
Su Yun Kang,
Chantal Hendriks,
Anita Nandi,
Susan F. Rumisha,
Samir Bhatt,
Sedera A. Mioramalala,
Mauricette Andriamananjara Nambinisoa,
Fanjasoa Rakotomanana,
Peter W. Gething,
Daniel J. Weiss
Abstract:
Many malaria-endemic areas experience seasonal fluctuations in case incidence as Anopheles mosquito and Plasmodium parasite life cycles respond to changing environmental conditions. While most existing maps of malaria seasonality use fixed thresholds of rainfall, temperature, and/or vegetation indices to identify suitable transmission months, we develop a statistical modelling framework for charac…
▽ More
Many malaria-endemic areas experience seasonal fluctuations in case incidence as Anopheles mosquito and Plasmodium parasite life cycles respond to changing environmental conditions. While most existing maps of malaria seasonality use fixed thresholds of rainfall, temperature, and/or vegetation indices to identify suitable transmission months, we develop a statistical modelling framework for characterising the seasonal patterns derived directly from case data.
The procedure involves a spatiotemporal regression model for estimating the monthly proportions of total annual cases and an algorithm to identify operationally relevant characteristics such as the transmission start and peak months. A seasonality index combines the monthly proportion estimates and existing estimates of annual case incidence to provide a summary of "how seasonal" locations are relative to their surroundings. An advancement upon past seasonality map** endeavours is the presentation of the uncertainty associated with each map, which will enable policymakers to make more statistically sound decisions. The methodology is illustrated using health facility data from Madagascar.
△ Less
Submitted 17 May, 2019; v1 submitted 30 January, 2019;
originally announced January 2019.
-
Black Hole Mass Scaling Relations for Spiral Galaxies. II. $M_{\rm BH}$-$M_{\rm *,tot}$ and $M_{\rm BH}$-$M_{\rm *,disk}$
Authors:
Benjamin L. Davis,
Alister W. Graham,
Ewan Cameron
Abstract:
Black hole mass ($M_{BH}$) scaling relations are typically derived using the properties of a galaxy's bulge and samples dominated by (high-mass) early-type galaxies. Studying late-type galaxies should provide greater insight into the mutual growth of black holes and galaxies in more gas-rich environments. We have used 40 spiral galaxies to establish how $M_{BH}$ scales with both the total stellar…
▽ More
Black hole mass ($M_{BH}$) scaling relations are typically derived using the properties of a galaxy's bulge and samples dominated by (high-mass) early-type galaxies. Studying late-type galaxies should provide greater insight into the mutual growth of black holes and galaxies in more gas-rich environments. We have used 40 spiral galaxies to establish how $M_{BH}$ scales with both the total stellar mass ($M_{*,tot}$) and the disk's stellar mass, having measured the spheroid (bulge) stellar mass ($M_{*,sph}$) and presented the $M_{BH}$-$M_{*,sph}$ relation in Paper I. The relation involving $M_{*,tot}$ may be beneficial for estimating $M_{BH}$ either from pipeline data or at higher redshift, conditions that are not ideal for the accurate isolation of the bulge. A symmetric Bayesian analysis finds $\log\left(M_{BH}/M_{\odot}\right)=\left(3.05_{-0.49}^{+0.57}\right)\log\left\{M_{*,tot}/[\upsilon(6.37\times10^{10}\,M_{\odot})]\right\}+(7.25_{-0.14}^{+0.13})$. The scatter from the regression of $M_{BH}$ on $M_{*,tot}$ is 0.66 dex; compare 0.56 dex for $M_{BH}$ on $M_{*,sph}$ and $0.57$ dex for $M_{BH}$ on $σ_*$. The slope is $>2$ times that obtained using core-Sérsic early-type galaxies, echoing a similar result involving $M_{*,sph}$, and supporting a varied growth mechanism among different morphological types. This steeper relation has consequences for galaxy/black hole formation theories, simulations, and predicting black hole masses. We caution that (i) an $M_{BH}$-$M_{*,tot}$ relation built from a mixture of early- and late-type galaxies will find an arbitrary slope of approximately 1-3, with no physical meaning beyond one's sample selection, and (ii) evolutionary studies of the $M_{BH}$-$M_{*,tot}$ relation need to be mindful of the galaxy types included at each epoch. We additionally update the $M_{*,tot}$-($\textit{face-on}$ spiral arm pitch angle) relation.
△ Less
Submitted 11 March, 2019; v1 submitted 11 October, 2018;
originally announced October 2018.
-
Black Hole Mass Scaling Relations for Spiral Galaxies. I. $M_{\rm BH}$-$M_{\rm *,sph}$
Authors:
Benjamin L. Davis,
Alister W. Graham,
Ewan Cameron
Abstract:
The (supermassive black hole mass, $M_\text{BH}$)-(bulge stellar mass, $M_{\rm*,sph}$) relation is, obviously, derived using two quantities. We endeavor to provide accurate values for the latter via detailed multicomponent galaxy decompositions for the current full sample of 43 spiral galaxies having directly measured $M_\text{BH}$ values; 35 of these galaxies have been alleged to contain pseudobu…
▽ More
The (supermassive black hole mass, $M_\text{BH}$)-(bulge stellar mass, $M_{\rm*,sph}$) relation is, obviously, derived using two quantities. We endeavor to provide accurate values for the latter via detailed multicomponent galaxy decompositions for the current full sample of 43 spiral galaxies having directly measured $M_\text{BH}$ values; 35 of these galaxies have been alleged to contain pseudobulges, 21 have water maser measurements, and three appear bulgeless. This more than doubles the previous sample size of spiral galaxies with a finessed image analysis. We have analyzed near-infrared images, accounting for not only the bulge, disk (exponential, truncated, or inclined), and bar but also for spiral arms and rings and additional central components (active galactic nuclei (AGNs), etc.). A symmetric Bayesian analysis finds $\log\left(M_\text{BH}/M_{\odot}\right)=\left(2.44_{-0.31}^{+0.35}\right)\log\left\{M_{\rm*,sph}/[\upsilon(1.15\times10^{10}\,M_{\odot})]\right\}+(7.24\pm0.12)$, with $\upsilon$ a stellar mass-to-light ratio term. The level of scatter equals that about the $M_{\rm BH}$-$σ_*$ relation. The nonlinear slope rules out the idea that many mergers, coupled with the central limit theorem, produced this scaling relation, and it corroborates previous observational studies and simulations, which have reported a near-quadratic slope at the low-mass end of the $M_\text{BH}$-$M_{\rm*,sph}$ diagram. Furthermore, bulges with AGNs follow this relation; they are not offset by an order of magnitude, and models that have invoked AGN feedback to establish a linear $M_{\rm BH}$-$M_{\rm*,sph}$ relation need revisiting. We additionally present an updated $M_\text{BH}$-(Sérsic index, $n_\text{sph}$) relation for spiral galaxy bulges with a comparable level of scatter and a new $M_{\rm*,sph}$-(spiral-arm pitch angle, $φ$) relation.
△ Less
Submitted 11 March, 2019; v1 submitted 11 October, 2018;
originally announced October 2018.
-
Variational Learning on Aggregate Outputs with Gaussian Processes
Authors:
Ho Chung Leon Law,
Dino Sejdinovic,
Ewan Cameron,
Tim CD Lucas,
Seth Flaxman,
Katherine Battle,
Kenji Fukumizu
Abstract:
While a typical supervised learning framework assumes that the inputs and the outputs are measured at the same levels of granularity, many applications, including global map** of disease, only have access to outputs at a much coarser level than that of the inputs. Aggregation of outputs makes generalization to new inputs much more difficult. We consider an approach to this problem based on varia…
▽ More
While a typical supervised learning framework assumes that the inputs and the outputs are measured at the same levels of granularity, many applications, including global map** of disease, only have access to outputs at a much coarser level than that of the inputs. Aggregation of outputs makes generalization to new inputs much more difficult. We consider an approach to this problem based on variational learning with a model of output aggregation and Gaussian processes, where aggregation leads to intractability of the standard evidence lower bounds. We propose new bounds and tractable approximations, leading to improved prediction accuracy and scalability to large datasets, while explicitly taking uncertainty into account. We develop a framework which extends to several types of likelihoods, including the Poisson model for aggregated count data. We apply our framework to a challenging and important problem, the fine-scale spatial modelling of malaria incidence, with over 1 million observations.
△ Less
Submitted 22 May, 2018;
originally announced May 2018.
-
Spatial field reconstruction with INLA: Application to IFU galaxy data
Authors:
S. González-Gaitán,
R. S. de Souza,
A. Krone-Martins,
E. Cameron,
P. Coelho,
L. Galbany,
E. E. O. Ishida
Abstract:
Astronomical observations of extended sources, such as cubes of integral field spectroscopy (IFS), encode auto-correlated spatial structures that cannot be optimally exploited by standard methodologies. This work introduces a novel technique to model IFS datasets, which treats the observed galaxy properties as realizations of an unobserved Gaussian Markov random field. The method is computationall…
▽ More
Astronomical observations of extended sources, such as cubes of integral field spectroscopy (IFS), encode auto-correlated spatial structures that cannot be optimally exploited by standard methodologies. This work introduces a novel technique to model IFS datasets, which treats the observed galaxy properties as realizations of an unobserved Gaussian Markov random field. The method is computationally efficient, resilient to the presence of low-signal-to-noise regions, and uses an alternative to Markov Chain Monte Carlo for fast Bayesian inference, the Integrated Nested Laplace Approximation (INLA). As a case study, we analyse 721 IFS data cubes of nearby galaxies from the CALIFA and PISCO surveys, for which we retrieve the maps of the following physical properties: age, metallicity, mass and extinction. The proposed Bayesian approach, built on a generative representation of the galaxy properties, enables the creation of synthetic images, recovery of areas with bad pixels, and an increased power to detect structures in datasets subject to substantial noise and/or sparsity of sampling. A snippet code to reproduce the analysis of this paper is available in the COIN toolbox, together with the field reconstructions of the CALIFA and PISCO samples.
△ Less
Submitted 30 December, 2018; v1 submitted 17 February, 2018;
originally announced February 2018.
-
Improved prediction accuracy for disease risk map** using Gaussian Process stacked generalisation
Authors:
Samir Bhatt,
Ewan Cameron,
Seth R Flaxman,
Daniel J Weiss,
David L Smith,
Peter W Gething
Abstract:
Maps of infectious disease---charting spatial variations in the force of infection, degree of endemicity, and the burden on human health---provide an essential evidence base to support planning towards global health targets. Contemporary disease map** efforts have embraced statistical modelling approaches to properly acknowledge uncertainties in both the available measurements and their spatial…
▽ More
Maps of infectious disease---charting spatial variations in the force of infection, degree of endemicity, and the burden on human health---provide an essential evidence base to support planning towards global health targets. Contemporary disease map** efforts have embraced statistical modelling approaches to properly acknowledge uncertainties in both the available measurements and their spatial interpolation. The most common such approach is that of Gaussian process regression, a mathematical framework comprised of two components: a mean function harnessing the predictive power of multiple independent variables, and a covariance function yielding spatio-temporal shrinkage against residual variation from the mean. Though many techniques have been developed to improve the flexibility and fitting of the covariance function, models for the mean function have typically been restricted to simple linear terms. For infectious diseases, known to be driven by complex interactions between environmental and socio-economic factors, improved modelling of the mean function can greatly boost predictive power. Here we present an ensemble approach based on stacked generalisation that allows for multiple, non-linear algorithmic mean functions to be jointly embedded within the Gaussian process framework. We apply this method to map** Plasmodium falciparum prevalence data in Sub-Saharan Africa and show that the generalised ensemble approach markedly out-performs any individual method.
△ Less
Submitted 10 December, 2016;
originally announced December 2016.
-
Is the cluster environment quenching the Seyfert activity in elliptical and spiral galaxies?
Authors:
R. S. de Souza,
M. L. L. Dantas,
A. Krone-Martins,
E. Cameron,
P. Coelho,
M. W. Hattab,
M. de Val-Borro,
J. M. Hilbe,
J. Elliott,
A. Hagen
Abstract:
We developed a hierarchical Bayesian model (HBM) to investigate how the presence of Seyfert activity relates to their environment, herein represented by the galaxy cluster mass, $M_{200}$, and the normalized cluster-centric distance, $r/r_{200}$. We achieved this by constructing an unbiased sample of galaxies from the Sloan Digital Sky Survey, with morphological classifications provided by the Gal…
▽ More
We developed a hierarchical Bayesian model (HBM) to investigate how the presence of Seyfert activity relates to their environment, herein represented by the galaxy cluster mass, $M_{200}$, and the normalized cluster-centric distance, $r/r_{200}$. We achieved this by constructing an unbiased sample of galaxies from the Sloan Digital Sky Survey, with morphological classifications provided by the Galaxy Zoo Project. A propensity score matching approach is introduced to control for the effects of confounding variables: stellar mass, galaxy colour, and star formation rate. The connection between Seyfert-activity and environmental properties in the de-biased sample is modelled within an HBM framework using the so-called logistic regression technique, suitable for the analysis of binary data (e.g., whether or not a galaxy hosts an AGN). Unlike standard ordinary least square fitting methods, our methodology naturally allows modelling the probability of Seyfert-AGN activity in galaxies on their natural scale, i.e. as a binary variable. Furthermore, we demonstrate how an HBM can incorporate information of each particular galaxy morphological type in a unified framework. In elliptical galaxies, our analysis indicates a strong correlation of Seyfert-AGN activity with $r/r_{200}$, and a weaker correlation with the mass of the host. In spiral galaxies these trends do not appear, suggesting that the link between Seyfert activity and the properties of spiral galaxies are independent of the environment.
△ Less
Submitted 6 July, 2016; v1 submitted 20 March, 2016;
originally announced March 2016.
-
The star cluster mass--galactocentric radius relation: Implications for cluster formation
Authors:
Weijia Sun,
Richard de Grijs,
Zhou Fan,
Ewan Cameron
Abstract:
Whether or not the initial star cluster mass function is established through a universal, galactocentric-distance-independent stochastic process, on the scales of individual galaxies, remains an unsolved problem. This debate has recently gained new impetus through the publication of a study that concluded that the maximum cluster mass in a given population is not solely determined by size-of-sampl…
▽ More
Whether or not the initial star cluster mass function is established through a universal, galactocentric-distance-independent stochastic process, on the scales of individual galaxies, remains an unsolved problem. This debate has recently gained new impetus through the publication of a study that concluded that the maximum cluster mass in a given population is not solely determined by size-of-sample effects. Here, we revisit the evidence in favor and against stochastic cluster formation by examining the young ($\lesssim$ a few $\times 10^8$ yr-old) star cluster mass--galactocentric radius relation in M33, M51, M83, and the Large Magellanic Cloud. To eliminate size-of-sample effects, we first adopt radial bin sizes containing constant numbers of clusters, which we use to quantify the radial distribution of the first- to fifth-ranked most massive clusters using ordinary least-squares fitting. We supplement this analysis with an application of quantile regression, a binless approach to rank-based regression taking an absolute-value-distance penalty. Both methods yield, within the $1σ$ to $3σ$ uncertainties, near-zero slopes in the diagnostic plane, largely irrespective of the maximum age or minimum mass imposed on our sample selection, or of the radial bin size adopted. We conclude that, at least in our four well-studied sample galaxies, star cluster formation does not necessarily require an environment-dependent cluster formation scenario, which thus supports the notion of stochastic star cluster formation as the dominant star cluster-formation process within a given galaxy.
△ Less
Submitted 13 November, 2015;
originally announced November 2015.
-
Using gamma regression for photometric redshifts of survey galaxies
Authors:
J. Elliott,
R. S. de Souza,
A. Krone-Martins,
E. Cameron,
E. E. O. Ishida,
J. Hilbe
Abstract:
Machine learning techniques offer a plethora of opportunities in tackling big data within the astronomical community. We present the set of Generalized Linear Models as a fast alternative for determining photometric redshifts of galaxies, a set of tools not commonly applied within astronomy, despite being widely used in other professions. With this technique, we achieve catastrophic outlier rates…
▽ More
Machine learning techniques offer a plethora of opportunities in tackling big data within the astronomical community. We present the set of Generalized Linear Models as a fast alternative for determining photometric redshifts of galaxies, a set of tools not commonly applied within astronomy, despite being widely used in other professions. With this technique, we achieve catastrophic outlier rates of the order of ~1%, that can be achieved in a matter of seconds on large datasets of size ~1,000,000. To make these techniques easily accessible to the astronomical community, we developed a set of libraries and tools that are publicly available.
△ Less
Submitted 5 July, 2015;
originally announced July 2015.
-
The Overlooked Potential of Generalized Linear Models in Astronomy-III: Bayesian Negative Binomial Regression and Globular Cluster Populations
Authors:
R. S. de Souza,
J. M. Hilbe,
B. Buelens,
J. D. Riggs,
E. Cameron,
E. E. O. Ishida,
A. L. Chies-Santos,
M. Killedar
Abstract:
In this paper, the third in a series illustrating the power of generalized linear models (GLMs) for the astronomical community, we elucidate the potential of the class of GLMs which handles count data. The size of a galaxy's globular cluster population $N_{\rm GC}$ is a prolonged puzzle in the astronomical literature. It falls in the category of count data analysis, yet it is usually modelled as i…
▽ More
In this paper, the third in a series illustrating the power of generalized linear models (GLMs) for the astronomical community, we elucidate the potential of the class of GLMs which handles count data. The size of a galaxy's globular cluster population $N_{\rm GC}$ is a prolonged puzzle in the astronomical literature. It falls in the category of count data analysis, yet it is usually modelled as if it were a continuous response variable. We have developed a Bayesian negative binomial regression model to study the connection between $N_{\rm GC}$ and the following galaxy properties: central black hole mass, dynamical bulge mass, bulge velocity dispersion, and absolute visual magnitude. The methodology introduced herein naturally accounts for heteroscedasticity, intrinsic scatter, errors in measurements in both axes (either discrete or continuous), and allows modelling the population of globular clusters on their natural scale as a non-negative integer variable. Prediction intervals of 99% around the trend for expected $N_{\rm GC}$comfortably envelope the data, notably including the Milky Way, which has hitherto been considered a problematic outlier. Finally, we demonstrate how random intercept models can incorporate information of each particular galaxy morphological type. Bayesian variable selection methodology allows for automatically identifying galaxy types with different productions of GCs, suggesting that on average S0 galaxies have a GC population 35% smaller than other types with similar brightness.
△ Less
Submitted 13 August, 2015; v1 submitted 15 June, 2015;
originally announced June 2015.
-
cosmoabc: Likelihood-free inference via Population Monte Carlo Approximate Bayesian Computation
Authors:
E. E. O. Ishida,
S. D. P. Vitenti,
M. Penna-Lima,
J. Cisewski,
R. S. de Souza,
A. M. M. Trindade,
E. Cameron,
V. C. Busti
Abstract:
Approximate Bayesian Computation (ABC) enables parameter inference for complex physical systems in cases where the true likelihood function is unknown, unavailable, or computationally too expensive. It relies on the forward simulation of mock data and comparison between observed and synthetic catalogues. Here we present cosmoabc, a Python ABC sampler featuring a Population Monte Carlo (PMC) variat…
▽ More
Approximate Bayesian Computation (ABC) enables parameter inference for complex physical systems in cases where the true likelihood function is unknown, unavailable, or computationally too expensive. It relies on the forward simulation of mock data and comparison between observed and synthetic catalogues. Here we present cosmoabc, a Python ABC sampler featuring a Population Monte Carlo (PMC) variation of the original ABC algorithm, which uses an adaptive importance sampling scheme. The code is very flexible and can be easily coupled to an external simulator, while allowing to incorporate arbitrary distance and prior functions. As an example of practical application, we coupled cosmoabc with the numcosmo library and demonstrate how it can be used to estimate posterior probability distributions over cosmological parameters based on measurements of galaxy clusters number counts without computing the likelihood function. cosmoabc is published under the GPLv3 license on PyPI and GitHub and documentation is available at http://goo.gl/SmB8EX
△ Less
Submitted 3 October, 2017; v1 submitted 23 April, 2015;
originally announced April 2015.
-
The Overlooked Potential of Generalized Linear Models in Astronomy-II: Gamma regression and photometric redshifts
Authors:
J. Elliott,
R. S. de Souza,
A. Krone-Martins,
E. Cameron,
E. E. O. Ishida,
J. Hilbe
Abstract:
Machine learning techniques offer a precious tool box for use within astronomy to solve problems involving so-called big data. They provide a means to make accurate predictions about a particular system without prior knowledge of the underlying physical processes of the data. In this article, and the companion papers of this series, we present the set of Generalized Linear Models (GLMs) as a fast…
▽ More
Machine learning techniques offer a precious tool box for use within astronomy to solve problems involving so-called big data. They provide a means to make accurate predictions about a particular system without prior knowledge of the underlying physical processes of the data. In this article, and the companion papers of this series, we present the set of Generalized Linear Models (GLMs) as a fast alternative method for tackling general astronomical problems, including the ones related to the machine learning paradigm. To demonstrate the applicability of GLMs to inherently positive and continuous physical observables, we explore their use in estimating the photometric redshifts of galaxies from their multi-wavelength photometry. Using the gamma family with a log link function we predict redshifts from the PHoto-z Accuracy Testing simulated catalogue and a subset of the Sloan Digital Sky Survey from Data Release 10. We obtain fits that result in catastrophic outlier rates as low as ~1% for simulated and ~2% for real data. Moreover, we can easily obtain such levels of precision within a matter of seconds on a normal desktop computer and with training sets that contain merely thousands of galaxies. Our software is made publicly available as an user-friendly package developed in Python, R and via an interactive web application (https://cosmostatisticsinitiative.shinyapps.io/CosmoPhotoz). This software allows users to apply a set of GLMs to their own photometric catalogues and generates publication quality plots with minimum effort from the user. By facilitating their ease of use to the astronomical community, this paper series aims to make GLMs widely known and to encourage their implementation in future large-scale projects, such as the Large Synoptic Survey Telescope.
△ Less
Submitted 30 December, 2018; v1 submitted 26 September, 2014;
originally announced September 2014.
-
The Overlooked Potential of Generalized Linear Models in Astronomy - I: Binomial Regression
Authors:
R. S. de Souza,
E. Cameron,
M. Killedar,
J. Hilbe,
R. Vilalta,
U. Maio,
V. Biffi,
B. Ciardi,
J. D. Riggs
Abstract:
Revealing hidden patterns in astronomical data is often the path to fundamental scientific breakthroughs; meanwhile the complexity of scientific inquiry increases as more subtle relationships are sought. Contemporary data analysis problems often elude the capabilities of classical statistical techniques, suggesting the use of cutting edge statistical methods. In this light, astronomers have overlo…
▽ More
Revealing hidden patterns in astronomical data is often the path to fundamental scientific breakthroughs; meanwhile the complexity of scientific inquiry increases as more subtle relationships are sought. Contemporary data analysis problems often elude the capabilities of classical statistical techniques, suggesting the use of cutting edge statistical methods. In this light, astronomers have overlooked a whole family of statistical techniques for exploratory data analysis and robust regression, the so-called Generalized Linear Models (GLMs). In this paper -- the first in a series aimed at illustrating the power of these methods in astronomical applications -- we elucidate the potential of a particular class of GLMs for handling binary/binomial data, the so-called logit and probit regression techniques, from both a maximum likelihood and a Bayesian perspective. As a case in point, we present the use of these GLMs to explore the conditions of star formation activity and metal enrichment in primordial minihaloes from cosmological hydro-simulations including detailed chemistry, gas physics, and stellar feedback. We predict that for a dark mini-halo with metallicity $\approx 1.3 \times 10^{-4} Z_{\bigodot}$, an increase of $1.2 \times 10^{-2}$ in the gas molecular fraction, increases the probability of star formation occurrence by a factor of 75%. Finally, we highlight the use of receiver operating characteristic curves as a diagnostic for binary classifiers, and ultimately we use these to demonstrate the competitive predictive performance of GLMs against the popular technique of artificial neural networks.
△ Less
Submitted 4 April, 2015; v1 submitted 26 September, 2014;
originally announced September 2014.
-
What we talk about when we talk about fields
Authors:
Ewan Cameron
Abstract:
In astronomical and cosmological studies one often wishes to infer some properties of an infinite-dimensional field indexed within a finite-dimensional metric space given only a finite collection of noisy observational data. Bayesian inference offers an increasingly-popular strategy to overcome the inherent ill-posedness of this signal reconstruction challenge. However, there remains a great deal…
▽ More
In astronomical and cosmological studies one often wishes to infer some properties of an infinite-dimensional field indexed within a finite-dimensional metric space given only a finite collection of noisy observational data. Bayesian inference offers an increasingly-popular strategy to overcome the inherent ill-posedness of this signal reconstruction challenge. However, there remains a great deal of confusion within the astronomical community regarding the appropriate mathematical devices for framing such analyses and the diversity of available computational procedures for recovering posterior functionals. In this brief research note I will attempt to clarify both these issues from an "applied statistics" perpective, with insights garnered from my post-astronomy experiences as a computational Bayesian / epidemiological geostatistician.
△ Less
Submitted 24 June, 2014;
originally announced June 2014.
-
Galaxy And Mass Assembly (GAMA): Testing galaxy formation models through the most massive galaxies in the Universe
Authors:
P. Oliva-Altamirano,
S. Brough,
C. Lidman,
W. J. Couch,
A. M. Hopkins,
M. Colless,
E. Taylor,
A. S. G. Robotham,
M. L. P. Gunawardhana,
T. Ponman,
I. Baldry,
A. E. Bauer,
J. Bland-Hawthorn,
M. Cluver,
E. Cameron,
C. J. Conselice,
S. Driver,
A. C. Edge,
A. W. Graham,
E. van Kampen,
M. A. Lara-López,
J. Liske,
A. R. López-Sánchez,
J. Loveday,
S. Mahajan
, et al. (4 additional authors not shown)
Abstract:
We have analysed the growth of Brightest Group Galaxies and Brightest Cluster Galaxies (BGGs/BCGs) over the last 3 billion years using a large sample of 883 galaxies from the Galaxy And Mass Assembly Survey. By comparing the stellar mass of BGGs and BCGs in groups and clusters of similar dynamical masses, we find no significant growth between redshift $z=0.27$ and $z=0.09$. We also examine the num…
▽ More
We have analysed the growth of Brightest Group Galaxies and Brightest Cluster Galaxies (BGGs/BCGs) over the last 3 billion years using a large sample of 883 galaxies from the Galaxy And Mass Assembly Survey. By comparing the stellar mass of BGGs and BCGs in groups and clusters of similar dynamical masses, we find no significant growth between redshift $z=0.27$ and $z=0.09$. We also examine the number of BGGs/BCGs that have line emission, finding that approximately 65 per cent of BGGs/BCGs show H$α$ in emission. From the galaxies where the necessary spectroscopic lines were accurately recovered (54 per cent of the sample), we find that half of this (i.e. 27 per cent of the sample) harbour on-going star formation with rates up to $10\,$M$_{\odot}$yr$^{-1}$, and the other half (i.e. 27 per cent of the sample) have an active nucleus (AGN) at the centre. BGGs are more likely to have ongoing star formation, while BCGs show a higher fraction of AGN activity. By examining the position of the BGGs/BCGs with respect to their host dark matter halo we find that around 13 per cent of them do not lie at the centre of the dark matter halo. This could be an indicator of recent cluster-cluster mergers. We conclude that BGGs and BCGs acquired their stellar mass rapidly at higher redshifts as predicted by semi-analytic models, mildly slowing down at low redshifts.
△ Less
Submitted 17 February, 2014;
originally announced February 2014.
-
A Generalized Savage-Dickey Ratio
Authors:
Ewan Cameron
Abstract:
In this brief research note I present a generalized version of the Savage-Dickey Density Ratio for representation of the Bayes factor (or marginal likelihood ratio) of nested statistical models; the new version takes the form of a Radon-Nikodym derivative and is thus applicable to a wider family of probability spaces than the original (restricted to those admitting an ordinary Lebesgue density). A…
▽ More
In this brief research note I present a generalized version of the Savage-Dickey Density Ratio for representation of the Bayes factor (or marginal likelihood ratio) of nested statistical models; the new version takes the form of a Radon-Nikodym derivative and is thus applicable to a wider family of probability spaces than the original (restricted to those admitting an ordinary Lebesgue density). A derivation is given following the measure-theoretic construction of Marin & Robert (2010), and the equivalent estimator is demonstrated in application to a distributional modeling problem.
△ Less
Submitted 6 November, 2013;
originally announced November 2013.
-
Transdimensional Approximate Bayesian Computation for Inference on Invasive Species Models with Latent Variables of Unknown Dimension
Authors:
Oksana A. Chkrebtii,
Erin K. Cameron,
David A. Campbell,
Erin M. Bayne
Abstract:
Accurate information on patterns of introduction and spread of non-native species is essential for making predictions and management decisions. In many cases, estimating unknown rates of introduction and spread from observed data requires evaluating intractable variable-dimensional integrals. In general, inference on the large class of models containing latent variables of large or variable dimens…
▽ More
Accurate information on patterns of introduction and spread of non-native species is essential for making predictions and management decisions. In many cases, estimating unknown rates of introduction and spread from observed data requires evaluating intractable variable-dimensional integrals. In general, inference on the large class of models containing latent variables of large or variable dimension precludes exact sampling techniques. Approximate Bayesian computation (ABC) methods provide an alternative to exact sampling but rely on inefficient conditional simulation of the latent variables. To accomplish this task efficiently, a new transdimensional Monte Carlo sampler is developed for approximate Bayesian model inference and used to estimate rates of introduction and spread for the non-native earthworm species Dendrobaena octaedra (Savigny) along roads in the boreal forest of northern Alberta. Using low and high estimates of introduction and spread rates, the extent of earthworm invasions in northeastern Alberta was simulated to project the proportion of suitable habitat invaded in the year following data collection.
△ Less
Submitted 30 December, 2014; v1 submitted 10 October, 2013;
originally announced October 2013.
-
On the Evidence for Cosmic Variation of the Fine Structure Constant (II): A Semi-Parametric Bayesian Model Selection Analysis of the Quasar Dataset
Authors:
Ewan Cameron,
Tony Pettitt
Abstract:
In the second paper of this series we extend our Bayesian reanalysis of the evidence for a cosmic variation of the fine structure constant to the semi-parametric modelling regime. By adopting a mixture of Dirichlet processes prior for the unexplained errors in each instrumental subgroup of the benchmark quasar dataset we go some way towards freeing our model selection procedure from the apparent s…
▽ More
In the second paper of this series we extend our Bayesian reanalysis of the evidence for a cosmic variation of the fine structure constant to the semi-parametric modelling regime. By adopting a mixture of Dirichlet processes prior for the unexplained errors in each instrumental subgroup of the benchmark quasar dataset we go some way towards freeing our model selection procedure from the apparent subjectivity of a fixed distributional form. Despite the infinite-dimensional domain of the error hierarchy so constructed we are able to demonstrate a recursive scheme for marginal likelihood estimation with prior-sensitivity analysis directly analogous to that presented in Paper I, thereby allowing the robustness of our posterior Bayes factors to hyper-parameter choice and model specification to be readily verified. In the course of this work we elucidate various similarities between unexplained error problems in the seemingly disparate fields of astronomy and clinical meta-analysis, and we highlight a number of sophisticated techniques for handling such problems made available by past research in the latter. It is our hope that the novel approach to semi-parametric model selection demonstrated herein may serve as a useful reference for others exploring this potentially difficult class of error model.
△ Less
Submitted 11 September, 2013;
originally announced September 2013.
-
Importance Nested Sampling and the MultiNest Algorithm
Authors:
F. Feroz,
M. P. Hobson,
E. Cameron,
A. N. Pettitt
Abstract:
Bayesian inference involves two main computational challenges. First, in estimating the parameters of some model for the data, the posterior distribution may well be highly multi-modal: a regime in which the convergence to stationarity of traditional Markov Chain Monte Carlo (MCMC) techniques becomes incredibly slow. Second, in selecting between a set of competing models the necessary estimation o…
▽ More
Bayesian inference involves two main computational challenges. First, in estimating the parameters of some model for the data, the posterior distribution may well be highly multi-modal: a regime in which the convergence to stationarity of traditional Markov Chain Monte Carlo (MCMC) techniques becomes incredibly slow. Second, in selecting between a set of competing models the necessary estimation of the Bayesian evidence for each is, by definition, a (possibly high-dimensional) integration over the entire parameter space; again this can be a daunting computational task, although new Monte Carlo (MC) integration algorithms offer solutions of ever increasing efficiency. Nested sampling (NS) is one such contemporary MC strategy targeted at calculation of the Bayesian evidence, but which also enables posterior inference as a by-product, thereby allowing simultaneous parameter estimation and model selection. The widely-used MultiNest algorithm presents a particularly efficient implementation of the NS technique for multi-modal posteriors. In this paper we discuss importance nested sampling (INS), an alternative summation of the MultiNest draws, which can calculate the Bayesian evidence at up to an order of magnitude higher accuracy than `vanilla' NS with no change in the way MultiNest explores the parameter space. This is accomplished by treating as a (pseudo-)importance sample the totality of points collected by MultiNest, including those previously discarded under the constrained likelihood sampling of the NS algorithm. We apply this technique to several challenging test problems and compare the accuracy of Bayesian evidences obtained with INS against those from vanilla NS.
△ Less
Submitted 26 November, 2019; v1 submitted 10 June, 2013;
originally announced June 2013.
-
Newly-quenched galaxies as the cause for the apparent evolution in average size of the population
Authors:
C. M. Carollo,
T. J. Bschorr,
A. Renzini,
S. J. Lilly,
P. Capak,
A. Cibinel,
O. Ilbert,
M. Onodera,
N. Scoville,
E. Cameron,
B. Mobasher,
D. Sanders,
Y. Taniguchi
Abstract:
Abridged. We use COSMOS to study in a self-consistent way the change in the number densities of quenched early-type galaxies (Q-ETGs) of a given size over the interval 0.2 < z < 1.0 to study the claimed size evolution of these galaxies. At 10^10.5<Mgalaxy<10^11 Msun, we see no change in the number density of compact Q-ETGs, while at >10^11 Msun we find a decrease by 30%. In both mass bins, the inc…
▽ More
Abridged. We use COSMOS to study in a self-consistent way the change in the number densities of quenched early-type galaxies (Q-ETGs) of a given size over the interval 0.2 < z < 1.0 to study the claimed size evolution of these galaxies. At 10^10.5<Mgalaxy<10^11 Msun, we see no change in the number density of compact Q-ETGs, while at >10^11 Msun we find a decrease by 30%. In both mass bins, the increase of the median sizes of Q-ETGs with time is primarily caused by the addition to the size function of larger and more diffuse Q-ETGs. At all masses, compact Q-ETGs become systematically redder towards later epochs, with a (U-V) difference consistent with passive evolution of their stellar populations, indicating that they are a population that does not appreciably evolve in size. At all epochs, the larger Q-ETGs (at least in the lower mass bin) have average rest-frame colors systematically bluer than those of the more compact Q-ETGs, suggesting that the former are younger than the latter. The idea that new, large, Q-ETGs are responsible for the observed growth in the median size of the population at a given mass is supported by the sizes and number of the star-forming galaxies that are expected to be progenitors of the new Q-ETGs over the same period. In the low mass bin, the new Q-ETG have 30% smaller sizes than their star-forming progenitors. This is likely due to the fading of their disks after they cease star-formation. Comparison with higher z shows that the median size of newly-quenched galaxies roughly scales, at constant mass, as (1+z)^-1. The dominant cause of the size evolution seen in the Q-ETG population is thus that the average sizes of individual Q-ETGs scale with the average density of the Universe at the time when they were quenched, with subsequent size changes in individual objects through eg merging of secondary importance, especially at masses <10^11 Msun.
△ Less
Submitted 8 July, 2013; v1 submitted 20 February, 2013;
originally announced February 2013.
-
Galaxy And Mass Assembly (GAMA): Spectroscopic analysis
Authors:
A. M. Hopkins,
S. P. Driver,
S. Brough,
M. S. Owers,
A. E. Bauer,
M. L. P. Gunawardhana,
M. E. Cluver,
M. Colless,
C. Foster,
M. A. Lara-Lopez,
I. Roseboom,
R. Sharp,
O. Steele,
D. Thomas,
I. K. Baldry,
M. J. I. Brown,
J. Liske,
P. Norberg,
A. S. G. Robotham,
S. Bamford,
J. Bland-Hawthorn,
M. J. Drinkwater,
J. Loveday,
M. Meyer,
J. A. Peacock
, et al. (57 additional authors not shown)
Abstract:
The Galaxy And Mass Assembly (GAMA) survey is a multiwavelength photometric and spectroscopic survey, using the AAOmega spectrograph on the Anglo-Australian Telescope to obtain spectra for up to ~300000 galaxies over 280 square degrees, to a limiting magnitude of r_pet < 19.8 mag. The target galaxies are distributed over 0<z<0.5 with a median redshift of z~0.2, although the redshift distribution i…
▽ More
The Galaxy And Mass Assembly (GAMA) survey is a multiwavelength photometric and spectroscopic survey, using the AAOmega spectrograph on the Anglo-Australian Telescope to obtain spectra for up to ~300000 galaxies over 280 square degrees, to a limiting magnitude of r_pet < 19.8 mag. The target galaxies are distributed over 0<z<0.5 with a median redshift of z~0.2, although the redshift distribution includes a small number of systems, primarily quasars, at higher redshifts, up to and beyond z=1. The redshift accuracy ranges from sigma_v~50km/s to sigma_v~100km/s depending on the signal-to-noise of the spectrum. Here we describe the GAMA spectroscopic reduction and analysis pipeline. We present the steps involved in taking the raw two-dimensional spectroscopic images through to flux-calibrated one-dimensional spectra. The resulting GAMA spectra cover an observed wavelength range of 3750<lambda<8850 A at a resolution of R~1300. The final flux calibration is typically accurate to 10-20%, although the reliability is worse at the extreme wavelength ends, and poorer in the blue than the red. We present details of the measurement of emission and absorption features in the GAMA spectra. These measurements are characterised through a variety of quality control analyses detailing the robustness and reliability of the measurements. We illustrate the quality of the measurements with a brief exploration of elementary emission line properties of the galaxies in the GAMA sample. We demonstrate the luminosity dependence of the Balmer decrement, consistent with previously published results, and explore further how Balmer decrement varies with galaxy mass and redshift. We also investigate the mass and redshift dependencies of the [NII]/Halpha vs [OIII]/Hbeta spectral diagnostic diagram, commonly used to discriminate between star forming and nuclear activity in galaxies.
△ Less
Submitted 29 January, 2013;
originally announced January 2013.
-
Recursive Pathways to Marginal Likelihood Estimation with Prior-Sensitivity Analysis
Authors:
Ewan Cameron,
Anthony Pettitt
Abstract:
We investigate the utility to computational Bayesian analyses of a particular family of recursive marginal likelihood estimators characterized by the (equivalent) algorithms known as "biased sampling" or "reverse logistic regression" in the statistics literature and "the density of states" in physics. Through a pair of numerical examples (including mixture modeling of the well-known galaxy data se…
▽ More
We investigate the utility to computational Bayesian analyses of a particular family of recursive marginal likelihood estimators characterized by the (equivalent) algorithms known as "biased sampling" or "reverse logistic regression" in the statistics literature and "the density of states" in physics. Through a pair of numerical examples (including mixture modeling of the well-known galaxy data set) we highlight the remarkable diversity of sampling schemes amenable to such recursive normalization, as well as the notable efficiency of the resulting pseudo-mixture distributions for gauging prior sensitivity in the Bayesian model selection context. Our key theoretical contributions are to introduce a novel heuristic ("thermodynamic integration via importance sampling") for qualifying the role of the bridging sequence in this procedure and to reveal various connections between these recursive estimators and the nested sampling technique.
△ Less
Submitted 15 October, 2014; v1 submitted 28 January, 2013;
originally announced January 2013.
-
Herschel-ATLAS/GAMA: spatial clustering of low-redshift sub-mm galaxies
Authors:
E. van Kampen,
D. J. B. Smith,
S. Maddox,
A. M. Hopkins,
I. Valtchanov,
J. A. Peacock,
M. J. Michalowski,
P. Norberg,
S. Eales,
L. Dunne,
J. Liske,
M. Baes,
D. Scott,
E. Rigby,
A. Robotham,
P. van der Werf,
E. Ibar,
M. J. Jarvis,
J. Loveday,
R. Auld,
I. K. Baldry,
S. Bamford,
E. Cameron,
S. Croom,
S. Buttiglione
, et al. (22 additional authors not shown)
Abstract:
We have measured the clustering properties of low-redshift (z < 0.3) sub-mm galaxies detected at 250 micron in the Herschel-ATLAS Science Demonstration Phase (SDP) field. We selected a sample for which we have high-quality spectroscopic redshifts, obtained from reliably matching the 250-micron sources to a complete (for r < 19.4) sample of galaxies from the GAMA database. Both the angular and spat…
▽ More
We have measured the clustering properties of low-redshift (z < 0.3) sub-mm galaxies detected at 250 micron in the Herschel-ATLAS Science Demonstration Phase (SDP) field. We selected a sample for which we have high-quality spectroscopic redshifts, obtained from reliably matching the 250-micron sources to a complete (for r < 19.4) sample of galaxies from the GAMA database. Both the angular and spatial clustering strength are measured for all z < 0.3 sources as well as for five redshift slices with thickness delta z=0.05 in the range 0.05 < z < 0.3. Our measured spatial clustering length r_0 is comparable to that of optically-selected, moderately star-forming (blue) galaxies: we find values around 5 Mpc. One of the redshift bins contains an interesting structure, at z = 0.164.
△ Less
Submitted 14 September, 2012;
originally announced September 2012.
-
Galaxy And Mass Assembly (GAMA): The mass-metallicity relationship
Authors:
C. Foster,
A. M. Hopkins,
M. Gunawardhana,
M. A. Lara-Lopez,
R. G. Sharp,
O. Steele,
E. N. Taylor,
S. P. Driver,
I. K. Baldryi,
S. P. Bamford,
J. Liske,
J. Loveday,
P. Norberg,
J. A. Peacock,
M. Alpaslan,
A. E. Bauer,
J. Bland-Hawthorn,
S. Brough,
E. Cameron,
M. Colless,
C. J. Conselice,
S. M. Croom,
C. S. Frenk,
D. T. Hill,
D. H. Jones
, et al. (15 additional authors not shown)
Abstract:
Context: The mass-metallicity relationship (MMR) of star-forming galaxies is well-established, however there is still some disagreement with respect to its exact shape and its possible dependence on other observables. Aims: We measure the MMR in the Galaxy And Mass Assembly (GAMA) survey. We compare our measured MMR to that measured in the Sloan Digital Sky Survey (SDSS) and study the dependence o…
▽ More
Context: The mass-metallicity relationship (MMR) of star-forming galaxies is well-established, however there is still some disagreement with respect to its exact shape and its possible dependence on other observables. Aims: We measure the MMR in the Galaxy And Mass Assembly (GAMA) survey. We compare our measured MMR to that measured in the Sloan Digital Sky Survey (SDSS) and study the dependence of the MMR on various selection criteria to identify potential causes for disparities seen in the literature. Methods: We use strong emission line ratio diagnostics to derive oxygen abundances. We then apply a range of selection criteria for the minimum signal-to-noise in various emission lines, as well as the apparent and absolute magnitude to study variations in the inferred MMR. Results: The shape and position of the MMR can differ significantly depending on the metallicity calibration and selection used. After selecting a robust metallicity calibration amongst those tested, we find that the mass-metallicity relation for redshifts 0.061< z<0.35 in GAMA is in reasonable agreement with that found in the SDSS despite the difference in the luminosity range probed. Conclusions: In view of the significant variations of the MMR brought about by reasonable changes in the sample selection criteria and method, we recommend that care be taken when comparing the MMR from different surveys and studies directly. We also conclude that there could be a modest level of evolution over 0.06<z<0.35 within the GAMA sample.
△ Less
Submitted 7 September, 2012;
originally announced September 2012.
-
Galaxy And Mass Assembly (GAMA): The 0.013 < z < 0.1 cosmic spectral energy distribution from 0.1 micron to 1mm
Authors:
S. P. Driver,
A. S. G. Robotham,
L. Kelvin,
M. Alpaslan,
I. K. Baldry,
S. P. Bamford,
S. Brough,
M. Brown,
A. M. Hopkins,
J. Liske,
J. Loveday,
P. Norberg,
J. A. Peacock,
E. Andrae,
J. Bland-Hawthorn,
N. Bourne,
E. Cameron,
M. Colless,
C. J. Conselice,
S. M. Croom,
L. Dunne,
C. S. Frenk,
Alister W. Graham,
M. Gunawardhana,
D. T. Hill
, et al. (18 additional authors not shown)
Abstract:
We use the GAMA I dataset combined with GALEX, SDSS and UKIDSS imaging to construct the low-redshift (z<0.1) galaxy luminosity functions in FUV, NUV, ugriz, and YJHK bands from within a single well constrained volume of 3.4 x 10^5 (Mpc/h)^{3}. The derived luminosity distributions are normalised to the SDSS DR7 main survey to reduce the estimated cosmic variance to the 5 per cent level. The data ar…
▽ More
We use the GAMA I dataset combined with GALEX, SDSS and UKIDSS imaging to construct the low-redshift (z<0.1) galaxy luminosity functions in FUV, NUV, ugriz, and YJHK bands from within a single well constrained volume of 3.4 x 10^5 (Mpc/h)^{3}. The derived luminosity distributions are normalised to the SDSS DR7 main survey to reduce the estimated cosmic variance to the 5 per cent level. The data are used to construct the cosmic spectral energy distribution (CSED) from 0.1 to 2.1 \mum free from any wavelength dependent cosmic variance for both the elliptical and non-elliptical populations. The two populations exhibit dramatically different CSEDs as expected for a predominantly old and young population respectively. Using the Driver et al. (2008) prescription for the azimuthally averaged photon escape fraction, the non-ellipticals are corrected for the impact of dust attenuation and the combined CSED constructed. The final results show that the Universe is currently generating (1.8 +/- 0.3) x 10^{35} h W Mpc^{-3} of which (1.2 +/- 0.1) x 10^{35} h W Mpc^{-3} is directly released into the inter-galactic medium and (0.6 +/- 0.1) x 10^{35} h W Mpc^{-3} is reprocessed and reradiated by dust in the far-IR. Using the GAMA data and our dust model we predict the mid and far-IR emission which agrees remarkably well with available data. We therefore provide a robust description of the pre- and post dust attenuated energy output of the nearby Universe from 0.1micron to 0.6mm. The largest uncertainty in this measurement lies in the mid and far-IR bands stemming from the dust attenuation correction and its currently poorly constrained dependence on environment, stellar mass, and morphology.
△ Less
Submitted 3 September, 2012;
originally announced September 2012.
-
On the Evidence for Cosmic Variation of the Fine Structure Constant (I): A Parametric Bayesian Model Selection Analysis of the Quasar Dataset
Authors:
Ewan Cameron,
Tony Pettitt
Abstract:
We review the evidence behind recent claims of spatial variation in the fine structure constant deriving from observations of ionic absorption lines in the light from distant quasars. To this end we expand upon previous non-Bayesian analyses limited by the assumptions of an unbiased and strictly Normal distribution for the "unexplained errors" of the benchmark quasar dataset. Through the technique…
▽ More
We review the evidence behind recent claims of spatial variation in the fine structure constant deriving from observations of ionic absorption lines in the light from distant quasars. To this end we expand upon previous non-Bayesian analyses limited by the assumptions of an unbiased and strictly Normal distribution for the "unexplained errors" of the benchmark quasar dataset. Through the technique of reverse logistic regression we estimate and compare marginal likelihoods for three competing hypotheses---(i) the null hypothesis (no cosmic variation), (ii) the monopole hypothesis (a constant Earth-to-quasar offset), and (iii) the monopole+dipole hypothesis (a cosmic variation manifest to the Earth-bound observer as a North-South divergence)---under a variety of candidate parametric forms for the unexplained error term. Our analysis reveals weak support for a skeptical interpretation in which the apparent dipole effect is driven solely by systematic errors of opposing sign inherent in measurements from the two telescopes employed to obtain these observations. Throughout we seek to exemplify a 'best practice' approach to Bayesian model selection with prior-sensitivity analysis; in a companion paper we extend this methodology to a semi-parametric framework using the infinite-dimensional Dirichlet process.
△ Less
Submitted 11 September, 2013; v1 submitted 26 July, 2012;
originally announced July 2012.
-
The Zurich Environmental Study of Galaxies in Groups along the Cosmic Web. III. Galaxy Photometric Measurements and the Spatially-Resolved Color Properties of Early- and Late-Type Satellites in Diverse Environments
Authors:
A. Cibinel,
C. M. Carollo,
S. J. Lilly,
S. Bonoli,
F. Miniati,
A. Pipino,
J. D. Silverman,
J. H. van Gorkom,
E. Cameron,
A. Finoguenov,
P. Norberg,
Y. Peng,
C. S. Rudick
Abstract:
We present photometric measurements for the galaxies - and when possible their bulges and disks - in the 0.05<z<0.0585 groups of the Zurich Environmental Study (ZENS); these measurements include (B-I) colors, color gradients and maps, color dispersions, as well as stellar masses and star-formation rates. The ZENS galaxies are classified into quenched, moderately star-forming, and strongly star-for…
▽ More
We present photometric measurements for the galaxies - and when possible their bulges and disks - in the 0.05<z<0.0585 groups of the Zurich Environmental Study (ZENS); these measurements include (B-I) colors, color gradients and maps, color dispersions, as well as stellar masses and star-formation rates. The ZENS galaxies are classified into quenched, moderately star-forming, and strongly star-forming using a combination of spectral features and FUV-to-optical colors; this approach optimally distinguishes quenched systems from dust-reddened star-forming galaxies. The latter contribute up to 50% to the (B-I) "red sequence" at ~10^10Msun. At fixed morphological or spectral type, we find that galaxy stellar masses are largely independent of environment, and especially of halo mass. As a first utilization of our photometric database, we study, at fixed stellar mass and Hubble type, how (B-I) colors, color gradients and color dispersion of disk satellites depend on group mass (M_GROUP), group-centric distance (R/R_200) and large-scale structure overdensity. The strongest environmental trend is found for disk-dominated satellites with M_GROUP and R/R_200. At M<10^10 Msun, disk-dominated satellites are redder in the inner regions of the groups than in the outer parts. At M>10^10 Msun, these satellites have shallower color gradients in higher mass groups and in the cores of groups compared with lower mass groups and the outskirts of groups. Stellar population analyses and semi-analytic models suggest that disk-dominated satellites undergo quenching of star formation in their outer disks, on timescales ~2 Gyr, as they progressively move inside the group potential.
△ Less
Submitted 14 October, 2013; v1 submitted 27 June, 2012;
originally announced June 2012.
-
The Zurich Environmental Study (ZENS) of Galaxies in Groups along the Cosmic Web. II. Galaxy Structural Measurements and the Concentration of Morphologically Classified Satellites in Diverse Environments
Authors:
A. Cibinel,
C. M. Carollo,
S. J. Lilly,
F. Miniati,
J. D. Silverman,
J. H. van Gorkom,
E. Cameron,
A. Finoguenov,
P. Norberg,
Y. Peng,
A. Pipino,
C. S. Rudick
Abstract:
We present structural measurements for the galaxies in the 0.05<z<0.0585 groups of the Zurich Environmental Study, aimed at establishing how galaxy properties depend on four environmental parameters: group halo mass M_GROUP, group-centric distance R/R_200, ranking into central or satellite, and large-scale structure density delta_LSS. Global galaxy structure is quantified both parametrically and n…
▽ More
We present structural measurements for the galaxies in the 0.05<z<0.0585 groups of the Zurich Environmental Study, aimed at establishing how galaxy properties depend on four environmental parameters: group halo mass M_GROUP, group-centric distance R/R_200, ranking into central or satellite, and large-scale structure density delta_LSS. Global galaxy structure is quantified both parametrically and non-parametrically. We correct all these measurements for observational biases due to PSF blurring and surface brightness effects as a function of galaxy size, magnitude, steepness of light profile and ellipticity. Structural parameters are derived also for bulges, disks and bars. We use the galaxy bulge-to-total ratios (B/T), together with the calibrated non-parametric structural estimators, to implement a quantitative morphological classification that maximizes purity in the resulting morphological samples. We investigate how the concentration C of satellite galaxies depends on galaxy mass for each Hubble type, and on M_GROUP, R/R_200 and delta_LSS. At galaxy masses M>10^10 M_sun, the concentration of disk satellites increases with increasing stellar mass, separately within each morphological bin of B/T. The known increase in concentration with stellar mass for disk satellites is thus due, at least in part, to an increase in galaxy central stellar density at constant B/T. The correlation between concentration and galaxy stellar mass becomes progressively steeper for later morphological types. The concentration of disk satellites shows a barely significant dependence on delta_LSS or R/R_200. The strongest environmental effect is found with group mass for M>10^10 M_sun disk-dominated satellites, which are ~10% more concentrated in high mass groups than in lower mass groups.
△ Less
Submitted 28 August, 2013; v1 submitted 26 June, 2012;
originally announced June 2012.
-
The Zurich Environmental Study (ZENS) of Galaxies in Groups along the Cosmic Web. I. Which Environment Affects Galaxy Evolution?
Authors:
C. M. Carollo,
A. Cibinel,
S. J. Lilly,
F. Miniati,
P. Norberg,
J. D. Silverman,
J. van Gorkom,
E. Cameron,
A. Finoguenov,
Y. Peng,
A. Pipino,
C. S. Rudick
Abstract:
The Zurich Environmental Study (ZENS) is based on a sample of ~1500 galaxy members of 141 groups in the mass range ~10^12.5-14.5 M_sun within the narrow redshift range 0.05<z<0.0585. ZENS adopts novel approaches, here described, to quantify four different galactic environments, namely: (1) the mass of the host group halo; (2) the projected halo-centric distance; (3) the rank of galaxies as central…
▽ More
The Zurich Environmental Study (ZENS) is based on a sample of ~1500 galaxy members of 141 groups in the mass range ~10^12.5-14.5 M_sun within the narrow redshift range 0.05<z<0.0585. ZENS adopts novel approaches, here described, to quantify four different galactic environments, namely: (1) the mass of the host group halo; (2) the projected halo-centric distance; (3) the rank of galaxies as central or satellites within their group halos; and (4) the filamentary large-scale structure (LSS) density. No self-consistent identification of a central galaxy is found in ~40% of <10^13.5 M_sun groups, from which we estimate that ~15% of groups at these masses are dynamically unrelaxed systems. Central galaxies in relaxed and unrelaxed groups have in general similar properties, suggesting that centrals are regulated by their mass and not by their environment. Centrals in relaxed groups have however ~30% larger sizes than in unrelaxed groups, possibly due accretion of small satellites in virialized group halos. At M>10^10 M_sun, satellite galaxies in relaxed and unrelaxed groups have similar size, color and (specific) star formation rate distributions; at lower galaxy masses, satellites are marginally redder in relaxed relative to unrelaxed groups, suggesting quenching of star formation in low-mass satellites by physical processes active in relaxed halos. Finally, relaxed and unrelated groups show similar stellar mass conversion efficiencies, peaking at halo masses around 10^12.5 M_sun. In the enclosed ZENS catalogue we publish all environmental diagnostics as well as the galaxy structural and photometric measurements described in companion ZENS papers II and III.
△ Less
Submitted 28 August, 2013; v1 submitted 25 June, 2012;
originally announced June 2012.
-
Galaxy and Mass Assembly (GAMA): Colour and luminosity dependent clustering from calibrated photometric redshifts
Authors:
L. Christodoulou,
C. Eminian,
J. Loveday,
P. Norberg,
I. K. Baldry,
P. D. Hurley,
S. P. Driver,
S. P. Bamford,
A. M. Hopkins,
J. Liske,
J. A. Peacock,
J. Bland-Hawthorn,
S. Brough,
E. Cameron,
C. J. Conselice,
S. M. Croom,
C. S. Frenk,
M. Gunawardhana,
D. H. Jones,
L. S. Kelvin,
K. Kuijken,
R. C. Nichol,
H. Parkinson,
K. A. Pimbblet,
C. C. Popescu
, et al. (9 additional authors not shown)
Abstract:
We measure the two-point angular correlation function of a sample of 4,289,223 galaxies with r < 19.4 mag from the Sloan Digital Sky Survey as a function of photometric redshift, absolute magnitude and colour down to M_r - 5log h = -14 mag. Photometric redshifts are estimated from ugriz model magnitudes and two Petrosian radii using the artificial neural network package ANNz, taking advantage of t…
▽ More
We measure the two-point angular correlation function of a sample of 4,289,223 galaxies with r < 19.4 mag from the Sloan Digital Sky Survey as a function of photometric redshift, absolute magnitude and colour down to M_r - 5log h = -14 mag. Photometric redshifts are estimated from ugriz model magnitudes and two Petrosian radii using the artificial neural network package ANNz, taking advantage of the Galaxy and Mass Assembly (GAMA) spectroscopic sample as our training set. The photometric redshifts are then used to determine absolute magnitudes and colours. For all our samples, we estimate the underlying redshift and absolute magnitude distributions using Monte-Carlo resampling. These redshift distributions are used in Limber's equation to obtain spatial correlation function parameters from power law fits to the angular correlation function. We confirm an increase in clustering strength for sub-L* red galaxies compared with ~L* red galaxies at small scales in all redshift bins, whereas for the blue population the correlation length is almost independent of luminosity for ~L* galaxies and fainter. A linear relation between relative bias and log luminosity is found to hold down to luminosities L~0.03L*. We find that the redshift dependence of the bias of the L* population can be described by the passive evolution model of Tegmark & Peebles (1998). A visual inspection of a random sample of our r < 19.4 sample of SDSS galaxies reveals that about 10 per cent are spurious, with a higher contamination rate towards very faint absolute magnitudes due to over-deblended nearby galaxies. We correct for this contamination in our clustering analysis.
△ Less
Submitted 5 June, 2012;
originally announced June 2012.
-
Galaxy And Mass Assembly (GAMA): Galaxy environments and star formation rate variations
Authors:
D. B. Wijesinghe,
A. M. Hopkins,
S. Brough,
E. N. Taylor,
P. Norberg,
A. Bauer,
M. J. I. Brown,
E. Cameron,
C. J. Conselice,
S. Croom,
S. Driver,
M. W. Grootes,
D. H. Jones,
L. Kelvin,
J. Loveday,
K. A. Pimbblet,
C. C. Popescu,
M. Prescott,
R. Sharp,
I. Baldry,
E. M. Sadler,
J. Liske,
A. S. G. Robotham,
S. Bamford,
J. Bland-Hawthorn
, et al. (1 additional authors not shown)
Abstract:
We present a detailed investigation into the effects of galaxy environment on their star formation rates (SFR) using galaxies observed in the Galaxy and Mass Assembly Survey (GAMA). We use three independent volume-limited samples of galaxies within z < 0.2 and Mr < -17.8. We investigate the known SFR-density relationship and explore in detail the dependence of SFR on stellar mass and density. We s…
▽ More
We present a detailed investigation into the effects of galaxy environment on their star formation rates (SFR) using galaxies observed in the Galaxy and Mass Assembly Survey (GAMA). We use three independent volume-limited samples of galaxies within z < 0.2 and Mr < -17.8. We investigate the known SFR-density relationship and explore in detail the dependence of SFR on stellar mass and density. We show that the SFR-density trend is only visible when we include the passive galaxy population along with the star-forming population. This SFR-density relation is absent when we consider only the star-forming population of galaxies, consistent with previous work. While there is a strong dependence of the EWH?a on density we find, as in previous studies, that these trends are largely due to the passive galaxy population and this relationship is absent when considering a "star-forming" sample of galaxies. We find that stellar mass has the strongest influence on SFR and EWH?a with the environment having no significant effect on the star-formation properties of the star forming population. We also show that the SFR-density relationship is absent for both early and late-type star-forming galaxies. We conclude that the stellar mass has the largest impact on the current SFR of a galaxy, and any environmental effect is not detectable. The observation that the trends with density are due to the changing morphology fraction with density implies that the timescales must be very short for any quenching of the SFR in infalling galaxies. Alternatively galaxies may in fact undergo predominantly in-situ evolution where the infall and quenching of galaxies from the field into dense environments is not the dominant evolutionary mode.
△ Less
Submitted 15 May, 2012;
originally announced May 2012.
-
Approximate Bayesian Computation for Astronomical Model Analysis: A Case Study in Galaxy Demographics and Morphological Transformation at High Redshift
Authors:
E. Cameron,
A. N. Pettitt
Abstract:
"Approximate Bayesian Computation" (ABC) represents a powerful methodology for the analysis of complex stochastic systems for which the likelihood of the observed data under an arbitrary set of input parameters may be entirely intractable-the latter condition rendering useless the standard machinery of tractable likelihood-based, Bayesian statistical inference (e.g. conventional Markov Chain Monte…
▽ More
"Approximate Bayesian Computation" (ABC) represents a powerful methodology for the analysis of complex stochastic systems for which the likelihood of the observed data under an arbitrary set of input parameters may be entirely intractable-the latter condition rendering useless the standard machinery of tractable likelihood-based, Bayesian statistical inference (e.g. conventional Markov Chain Monte Carlo simulation; MCMC). In this article we demonstrate the potential of ABC for astronomical model analysis by application to a case study in the morphological transformation of high redshift galaxies. To this end we develop, first, a stochastic model for the competing processes of merging and secular evolution in the early Universe; and second, through an ABC-based comparison against the observed demographics of massive (M_gal > 10^11 M_sun) galaxies (at 1.5 < z < 3) in the CANDELS/EGS dataset we derive posterior probability densities for the key parameters of this model. The "Sequential Monte Carlo" (SMC) implementation of ABC exhibited herein, featuring both a self-generating target sequence and self-refining MCMC kernel, is amongst the most efficient of contemporary approaches to this important statistical algorithm. We highlight as well through our chosen case study the value of careful summary statistic selection, and demonstrate two modern strategies for assessment and optimisation in this regard. Ultimately, our ABC analysis of the high redshift morphological mix returns tight constraints on the evolving merger rate in the early Universe and favours major merging (with disc survival or rapid reformation) over secular evolution as the mechanism most responsible for building up the first generation of bulges in early-type disks.
△ Less
Submitted 17 May, 2012; v1 submitted 7 February, 2012;
originally announced February 2012.
-
Galaxy and Mass Assembly (GAMA): ugriz galaxy luminosity functions
Authors:
J. Loveday,
P. Norberg,
I. K. Baldry,
S. P. Driver,
A. M. Hopkins,
J. A. Peacock,
S. P. Bamford,
J. Liske,
J. Bland-Hawthorn,
S. Brough,
M. J. I. Brown,
E. Cameron,
C. J. Conselice,
S. M. Croom,
C. S. Frenk,
M. Gunawardhana,
D. T. Hill,
D. H. Jones,
L. S. Kelvin,
K. Kuijken,
R. C. Nichol,
H. R. Parkinson,
S. Phillipps,
K. A. Pimbblet,
C. C. Popescu
, et al. (9 additional authors not shown)
Abstract:
Galaxy and Mass Assembly (GAMA) is a project to study galaxy formation and evolution, combining imaging data from ultraviolet to radio with spectroscopic data from the AAOmega spectrograph on the Anglo-Australian Telescope. Using data from phase 1 of GAMA, taken over three observing seasons, and correcting for various minor sources of incompleteness, we calculate galaxy luminosity functions (LFs)…
▽ More
Galaxy and Mass Assembly (GAMA) is a project to study galaxy formation and evolution, combining imaging data from ultraviolet to radio with spectroscopic data from the AAOmega spectrograph on the Anglo-Australian Telescope. Using data from phase 1 of GAMA, taken over three observing seasons, and correcting for various minor sources of incompleteness, we calculate galaxy luminosity functions (LFs) and their evolution in the ugriz passbands.
At low redshift, z < 0.1, we find that blue galaxies, defined according to a magnitude-dependent but non-evolving colour cut, are reasonably well fit over a range of more than ten magnitudes by simple Schechter functions in all bands. Red galaxies, and the combined blue-plus-red sample, require double power-law Schechter functions to fit a dip in their LF faintward of the characteristic magnitude M* before a steepening faint end. This upturn is at least partly due to dust-reddened disk galaxies.
We measure evolution of the galaxy LF over the redshift range 0.002 < z < 0.5 using both a parametric fit and by measuring binned LFs in redshift slices. The characteristic luminosity L* is found to increase with redshift in all bands, with red galaxies showing stronger luminosity evolution than blue galaxies. The comoving number density of blue galaxies increases with redshift, while that of red galaxies decreases, consistent with prevailing movement from blue cloud to red sequence. As well as being more numerous at higher redshift, blue galaxies also dominate the overall luminosity density beyond redshifts z = 0.2. At lower redshifts, the luminosity density is dominated by red galaxies in the riz bands, by blue galaxies in u and g.
△ Less
Submitted 28 November, 2011; v1 submitted 1 November, 2011;
originally announced November 2011.
-
The near-IR $M_{bh}$ - L and $M_{bh}$ - n relations
Authors:
Marina Vika,
Simon P. Driver,
Ewan Cameron,
Lee Kelvin,
Aaron Robotham
Abstract:
We present near-IR surface photometry (2D-profiling) for a sample of 29 nearby galaxies for which super-massive black hole (SMBH) masses are constrained. The data is derived from the UKIDSS-LASS survey representing a significant improvement in image quality and depth over previous studies based on 2MASS data. We derive the spheroid luminosity and spheroid Sérsic index for each galaxy with GALFIT3…
▽ More
We present near-IR surface photometry (2D-profiling) for a sample of 29 nearby galaxies for which super-massive black hole (SMBH) masses are constrained. The data is derived from the UKIDSS-LASS survey representing a significant improvement in image quality and depth over previous studies based on 2MASS data. We derive the spheroid luminosity and spheroid Sérsic index for each galaxy with GALFIT3 and use these data to construct SMBH mass -bulge luminosity ($M_{\rm bh}$--$L$) and SMBH - Sérsic index ($M_{\rm bh}$--$n$) relations. The best fit K-band relation for elliptical and disk galaxies is $\log(M_{\rm bh}/M_{\odot})= -0.36(\pm 0.03) (M_{\rm K} + 18) + 6.17(\pm 0.16)$ with an intrinsic scatter of 0.4$^{+0.09}_{-0.06}$dex whilst for elliptical galaxies we find $\log(M_{\rm bh}/M_{\odot})= -0.42(\pm 0.06) (M_{\rm K} + 22) + 7.5(\pm 0.15)$ with an intrinsic scatter of 0.31$^{+0.087}_{-0.047}$dex. Our revised $M_{\rm bh}$--$L$ relation agrees closely with the previous near-IR constraint by \citet{tex:G07}. The lack of improvement in the intrinsic scatter in moving to higher quality near-IR data suggests that the SMBH relations are not currently limited by the quality of the imaging data but is either intrinsic or a result of uncertainty in the precise number of required components required in the profiling process. Contrary to expectation (see \citealt{tex:GD07a}) a relation between SMBH mass and the Sérsic index was not found at near-IR wavelengths. This latter outcome is believed to be explained by the generic inconsistencies between 1D and 2D galaxy profiling which are currently under further investigation.
△ Less
Submitted 5 October, 2011;
originally announced October 2011.
-
Galaxy And Mass Assembly: Stellar Mass Estimates
Authors:
Edward N Taylor,
Andrew M Hopkins,
Ivan K Baldry,
Michael J I Brown,
Simon P Driver,
Lee S Kelvin,
David T Hill,
Aaron S G Robotham,
Joss Bland-Hawthorn,
D H Jones,
R G Sharp,
Daniel Thomas,
Jochen Liske,
Jon Loveday,
Peder Norberg,
J A Peacock,
Steven P Bamford,
Sarah Brough,
Matthew Colless,
Ewan Cameron,
Chistopher J Conselice,
Scott M Croom,
C S Frenk,
Madusha Gunawardhana,
Konrad Kuijken
, et al. (10 additional authors not shown)
Abstract:
This paper describes the first catalogue of photometrically-derived stellar mass estimates for intermediate-redshift (z < 0.65) galaxies in the Galaxy And Mass Assembly (GAMA) spectroscopic redshift survey. These masses, as well as the full set of ancillary stellar population parameters, will be made public as part of GAMA data release 2. Although the GAMA database does include NIR photometry, we…
▽ More
This paper describes the first catalogue of photometrically-derived stellar mass estimates for intermediate-redshift (z < 0.65) galaxies in the Galaxy And Mass Assembly (GAMA) spectroscopic redshift survey. These masses, as well as the full set of ancillary stellar population parameters, will be made public as part of GAMA data release 2. Although the GAMA database does include NIR photometry, we show that the quality of our stellar population synthesis fits is significantly poorer when these NIR data are included. Further, for a large fraction of galaxies, the stellar population parameters inferred from the optical-plus-NIR photometry are formally inconsistent with those inferred from the optical data alone. This may indicate problems in our stellar population library, or NIR data issues, or both; these issues will be addressed for future versions of the catalogue. For now, we have chosen to base our stellar mass estimates on optical photometry only. In light of our decision to ignore the available NIR data, we examine how well stellar mass can be constrained based on optical data alone. We use generic properties of stellar population synthesis models to demonstrate that restframe colour alone is in principle a very good estimator of stellar mass-to-light ratio, M*/Li. Further, we use the observed relation between restframe (g-i) and M*/Li for real GAMA galaxies to argue that, modulo uncertainties in the stellar evolution models themselves, (g-i) colour can in practice be used to estimate M*/Li to an accuracy of < ~0.1 dex. This 'empirically calibrated' (g-i)-M*/Li relation offers a simple and transparent means for estimating galaxies' stellar masses based on minimal data, and so provides a solid basis for other surveys to compare their results to z < ~0.4 measurements from GAMA.
△ Less
Submitted 3 August, 2011; v1 submitted 2 August, 2011;
originally announced August 2011.
-
Galaxy And Mass Assembly (GAMA): the red fraction and radial distribution of satellite galaxies
Authors:
Matthew Prescott,
I. K. Baldry,
P. A. James,
S. P. Bamford,
J. Bland-Hawthorn,
S. Brough,
M. J. I. Brown,
E. Cameron,
C. J. Conselice,
S. M. Croom,
S. P. Driver,
C. S. Frenk,
M. Gunawardhana,
D. T. Hill,
A. M. Hopkins,
D. H. Jones,
L. S. Kelvin,
K. Kuijken,
J. Liske,
J. Loveday,
R. C. Nichol,
P. Norberg,
H. R. Parkinson,
J. A. Peacock,
S. Phillipps
, et al. (9 additional authors not shown)
Abstract:
We investigate the properties of satellite galaxies that surround isolated hosts within the redshift range 0.01 < z < 0.15, using data taken as part of the Galaxy And Mass Assembly survey. Making use of isolation and satellite criteria that take into account stellar mass estimates, we find 3514 isolated galaxies of which 1426 host a total of 2998 satellites. Separating the red and blue populations…
▽ More
We investigate the properties of satellite galaxies that surround isolated hosts within the redshift range 0.01 < z < 0.15, using data taken as part of the Galaxy And Mass Assembly survey. Making use of isolation and satellite criteria that take into account stellar mass estimates, we find 3514 isolated galaxies of which 1426 host a total of 2998 satellites. Separating the red and blue populations of satellites and hosts, using colour-mass diagrams, we investigate the radial distribution of satellite galaxies and determine how the red fraction of satellites varies as a function of satellite mass, host mass and the projected distance from their host. Comparing the red fraction of satellites to a control sample of small neighbours at greater projected radii, we show that the increase in red fraction is primarily a function of host mass. The satellite red fraction is about 0.2 higher than the control sample for hosts with 11.0 < log M_* < 11.5, while the red fractions show no difference for hosts with 10.0 < log M_* < 10.5. For the satellites of more massive hosts the red fraction also increases as a function of decreasing projected distance. Our results suggest that the likely main mechanism for the quenching of star formation in satellites hosted by isolated galaxies is strangulation.
△ Less
Submitted 1 July, 2011;
originally announced July 2011.
-
Galaxy and Mass Assembly (GAMA): The GAMA Galaxy Group Catalogue (G3Cv1)
Authors:
A. S. G. Robotham,
P. Norberg,
S. P. Driver,
I. K. Baldry,
S. P. Bamford,
A. M. Hopkins,
J. Liske,
J. Loveday,
A. Merson,
J. A. Peacock,
S. Brough,
E. Cameron,
C. J. Conselice,
S. M. Croom,
C. S. Frenk,
M. Gunawardhana,
D. T. Hill,
D. H. Jones,
L. S. Kelvin,
K. Kuijken,
R. C. Nichol,
H. R. Parkinson,
K. A. Pimbblet,
S. Phillipps,
C. C. Popescu
, et al. (8 additional authors not shown)
Abstract:
Using the complete GAMA-I survey covering ~142 sq. deg. to r=19.4, of which ~47 sq. deg. is to r=19.8, we create the GAMA-I galaxy group catalogue (G3Cv1), generated using a friends-of-friends (FoF) based grou** algorithm. Our algorithm has been tested extensively on one family of mock GAMA lightcones, constructed from Lambda-CDM N-body simulations populated with semi-analytic galaxies. Recovere…
▽ More
Using the complete GAMA-I survey covering ~142 sq. deg. to r=19.4, of which ~47 sq. deg. is to r=19.8, we create the GAMA-I galaxy group catalogue (G3Cv1), generated using a friends-of-friends (FoF) based grou** algorithm. Our algorithm has been tested extensively on one family of mock GAMA lightcones, constructed from Lambda-CDM N-body simulations populated with semi-analytic galaxies. Recovered group properties are robust to the effects of interlopers and are median unbiased in the most important respects. G3Cv1 contains 14,388 galaxy groups (with multiplicity >= 2$), including 44,186 galaxies out of a possible 110,192 galaxies, implying ~40% of all galaxies are assigned to a group. The similarities of the mock group catalogues and G3Cv1 are multiple: global characteristics are in general well recovered. However, we do find a noticeable deficit in the number of high multiplicity groups in GAMA compared to the mocks. Additionally, despite exceptionally good local spatial completeness, G3Cv1 contains significantly fewer compact groups with 5 or more members, this effect becoming most evident for high multiplicity systems. These two differences are most likely due to limitations in the physics included of the current GAMA lightcone mock. Further studies using a variety of galaxy formation models are required to confirm their exact origin.
△ Less
Submitted 14 June, 2011; v1 submitted 10 June, 2011;
originally announced June 2011.
-
Galaxy And Mass Assembly (GAMA): The star formation rate dependence of the stellar initial mass function
Authors:
M. L. P. Gunawardhana,
A. M. Hopkins,
R. G. Sharp,
S. Brough,
E. Taylor,
J. Bland-Hawthorn,
C. Maraston,
R. J. Tuffs,
C. C. Popescu,
D. Wijesinghe,
D. H. Jones,
S. Croom,
E. Sadler,
S. Wilkins,
S. P. Driver,
J. Liske,
P. Norberg,
I. K. Baldry,
S. P. Bamford,
J. Loveday,
J. A. Peacock,
A. S. G. Robotham,
D. B. Zucker,
Q. A. Parker,
C. J. Conselice
, et al. (13 additional authors not shown)
Abstract:
The stellar initial mass function (IMF) describes the distribution in stellar masses produced from a burst of star formation. For more than fifty years, the implicit assumption underpinning most areas of research involving the IMF has been that it is universal, regardless of time and environment. We measure the high-mass IMF slope for a sample of low-to-moderate redshift galaxies from the Galaxy A…
▽ More
The stellar initial mass function (IMF) describes the distribution in stellar masses produced from a burst of star formation. For more than fifty years, the implicit assumption underpinning most areas of research involving the IMF has been that it is universal, regardless of time and environment. We measure the high-mass IMF slope for a sample of low-to-moderate redshift galaxies from the Galaxy And Mass Assembly survey. The large range in luminosities and galaxy masses of the sample permits the exploration of underlying IMF dependencies. A strong IMF-star formation rate dependency is discovered, which shows that highly star forming galaxies form proportionally more massive stars (they have IMFs with flatter power-law slopes) than galaxies with low star formation rates. This has a significant impact on a wide variety of galaxy evolution studies, all of which rely on assumptions about the slope of the IMF. Our result is supported by, and provides an explanation for, the results of numerous recent explorations suggesting a variation of or evolution in the IMF.
△ Less
Submitted 13 April, 2011;
originally announced April 2011.
-
GAMA/H-ATLAS: The ultraviolet spectral slope and obscuration in galaxies
Authors:
Dinuka B. Wijesinghe,
Elisabete. da Cunha,
Andrew. M. Hopkins,
Loretta. Dunne,
R. Sharp,
M. Gunawardhana,
S. Brough,
E. M. Sadler,
S. Driver,
I. Baldry,
S. Bamford,
J. Liske,
J. Loveday,
P. Norberg,
J. Peacock,
C. C. Popescu,
R. Tuffs,
E. Andrae,
R. Auld,
M. Baes,
J. Bland-Hawthorn,
S. Buttiglione,
A. Cava,
E. Cameron,
C. J. Conselice
, et al. (37 additional authors not shown)
Abstract:
We use multiwavelength data from the Galaxy And Mass Assembly (GAMA) and Herschel ATLAS (H-ATLAS) surveys to compare the relationship between various dust obscuration measures in galaxies. We explore the connections between the ultraviolet (UV) spectral slope, $β$, the Balmer decrement, and the far infrared (IR) to $150\,$nm far ultraviolet (FUV) luminosity ratio. We explore trends with galaxy mas…
▽ More
We use multiwavelength data from the Galaxy And Mass Assembly (GAMA) and Herschel ATLAS (H-ATLAS) surveys to compare the relationship between various dust obscuration measures in galaxies. We explore the connections between the ultraviolet (UV) spectral slope, $β$, the Balmer decrement, and the far infrared (IR) to $150\,$nm far ultraviolet (FUV) luminosity ratio. We explore trends with galaxy mass, star formation rate (SFR) and redshift in order to identify possible systematics in these various measures. We reiterate the finding of other authors that there is a large scatter between the Balmer decrement and the $β$ parameter, and that $β$ may be poorly constrained when derived from only two broad passbands in the UV. We also emphasise that FUV derived SFRs, corrected for dust obscuration using $β$, will be overestimated unless a modified relation between $β$ and the attenuation factor is used. Even in the optimum case, the resulting SFRs have a significant scatter, well over an order of magnitude. While there is a stronger correlation between the IR to FUV luminosity ratio and $β$ parameter than with the Balmer decrement, neither of these correlations are particularly tight, and dust corrections based on $β$ for high redshift galaxy SFRs must be treated with caution. We conclude with a description of the extent to which the different obscuration measures are consistent with each other as well as the effects of including other galactic properties on these correlations.
△ Less
Submitted 27 March, 2011; v1 submitted 15 March, 2011;
originally announced March 2011.
-
Galaxy And Mass Assembly (GAMA): Galaxies at the faint end of the Halpha luminosity function
Authors:
S. Brough,
A. M. Hopkins,
R. G. Sharp,
M. Gunawardhana,
D. Wijesinghe,
A. S. G. Robotham,
S. P. Driver,
I. K. Baldry,
S. P. Bamford,
J. Liske,
J. Loveday,
P. Norberg,
J. A. Peacock,
J. H. Bland-Hawthorn,
M. J. I. Brown,
E. Cameron,
S. M. Croom,
C. S. Frenk,
C. Foster,
D. T. Hill,
D. H. Jones,
L. S. Kelvin,
K. Kuijken,
R. C. Nichol,
H. R. Parkinson
, et al. (8 additional authors not shown)
Abstract:
We present an analysis of the properties of the lowest Halpha-luminosity galaxies (L_Halpha<4x10^32 W; SFR<0.02 Msun/yr) in the Galaxy And Mass Assembly (GAMA) survey. These galaxies make up the the rise above a Schechter function in the number density of systems seen at the faint end of the Halpha luminosity function. Above our flux limit we find that these galaxies are principally composed of in…
▽ More
We present an analysis of the properties of the lowest Halpha-luminosity galaxies (L_Halpha<4x10^32 W; SFR<0.02 Msun/yr) in the Galaxy And Mass Assembly (GAMA) survey. These galaxies make up the the rise above a Schechter function in the number density of systems seen at the faint end of the Halpha luminosity function. Above our flux limit we find that these galaxies are principally composed of intrinsically low stellar mass systems (median stellar mass =2.5x10^8 Msun) with only 5/90 having stellar masses M>10^10 Msun. The low SFR systems are found to exist predominantly in the lowest density environments (median density ~0.02 galaxy Mpc^-2 with none in environments more dense than ~1.5 galaxy Mpc^-2). Their current specific star formation rates (SSFR; -8.5 < log(SSFR[yr^-1])<-12.) are consistent with their having had a variety of star formation histories. The low density environments of these galaxies demonstrates that such low-mass, star-forming systems can only remain as low-mass and forming stars if they reside sufficiently far from other galaxies to avoid being accreted, dispersed through tidal effects or having their gas reservoirs rendered ineffective through external processes.
△ Less
Submitted 16 December, 2010;
originally announced December 2010.