-
Four principles for improved statistical ecology
Authors:
Gordana Popovic,
Tanya J. Mason,
Tiago A. Marques,
Joanne Potts,
Szymon M. Drobniak,
Rocío Joo,
Res Altwegg,
Carolyn C. I. Burns,
Michael A. McCarthy,
Alison Johnston,
Shinichi Nakagawa,
Louise McMillan,
Kadambari Devarajan,
Patrick l. Taggart,
Alison C. Wunderlich,
Magdalena M. Mair,
Juan Andrés Martínez-Lanfranco,
Malgorzata Lagisz,
Patrice P. Pottier
Abstract:
Increasing attention has been drawn to the misuse of statistical methods over recent years, with particular concern about the prevalence of practices such as poor experimental design, cherry-picking and inadequate reporting. These failures are largely unintentional and no more common in ecology than in other scientific disciplines, with many of them easily remedied given the right guidance.
Orig…
▽ More
Increasing attention has been drawn to the misuse of statistical methods over recent years, with particular concern about the prevalence of practices such as poor experimental design, cherry-picking and inadequate reporting. These failures are largely unintentional and no more common in ecology than in other scientific disciplines, with many of them easily remedied given the right guidance.
Originating from a discussion at the 2020 International Statistical Ecology Conference, we show how ecologists can build their research following four guiding principles for impactful statistical research practices: 1. Define a focused research question, then plan sampling and analysis to answer it; 2. Develop a model that accounts for the distribution and dependence of your data; 3. Emphasise effect sizes to replace statistical significance with ecological relevance; 4. Report your methods and findings in sufficient detail so that your research is valid and reproducible.
Listed in approximate order of importance, these principles provide a framework for experimental design and reporting that guards against unsound practices. Starting with a well-defined research question allows researchers to create an efficient study to answer it, and guards against poor research practices that lead to false positives and poor replicability. Correct and appropriate statistical models give sound conclusions, good reporting practices and a focus on ecological relevance make results impactful and replicable.
Illustrated with an example from a recent study into the impact of disturbance on upland swamps, this paper explains the rationale for the selection and use of effective statistical practices and provides practical guidance for ecologists seeking to improve their use of statistical methods.
△ Less
Submitted 2 February, 2023;
originally announced February 2023.
-
How many sites? Methods to assist design decisions when collecting multivariate data in ecology
Authors:
Ben Maslen,
Gordana Popovic,
Adriana Vergés,
Ezequiel Marzinelli,
David Warton
Abstract:
1. Sample size estimation through power analysis is a fundamental tool in planning an ecological study, yet there are currently no well-established procedures for when multivariate abundances are to be collected. A power analysis procedure would need to address three challenges: designing a parsimonious simulation model that captures key community data properties; measuring effect size in a realis…
▽ More
1. Sample size estimation through power analysis is a fundamental tool in planning an ecological study, yet there are currently no well-established procedures for when multivariate abundances are to be collected. A power analysis procedure would need to address three challenges: designing a parsimonious simulation model that captures key community data properties; measuring effect size in a realistic yet interpretable fashion; and ensuring computational feasibility when simulation is used both for power estimation and significance testing. 2. Here we propose a power analysis procedure that addresses these three challenges by: using for simulation a Gaussian copula model with factor analytical structure, fitted to pilot data; assuming a common effect size across all taxa, but applied in different directions according to expert opinion (to "increaser", "decreaser" or "no effect" taxa); using a critical value approach to estimate power, which reduces computation time by a factor of 500 with little loss of accuracy. 3. The procedure is demonstrated on pilot data from fish assemblages in a restoration study, where it was found that the planned study design would only be capable of detecting relatively large effects (change in abundance by a factor of 1.5 or more). 4. The methods outlined in this paper are available in accompanying R software (the ecopower package), which allows researchers with pilot data to answer a wide range of design questions to assist them in planning their studies.
△ Less
Submitted 19 June, 2022;
originally announced June 2022.
-
Computationally efficient dense moving object detection based on reduced space disparity estimation
Authors:
Goran Popović,
Antea Hadviger,
Ivan Marković,
Ivan Petrović
Abstract:
Computationally efficient moving object detection and depth estimation from a stereo camera is an extremely useful tool for many computer vision applications, including robotics and autonomous driving. In this paper we show how moving objects can be densely detected by estimating disparity using an algorithm that improves complexity and accuracy of stereo matching by relying on information from pr…
▽ More
Computationally efficient moving object detection and depth estimation from a stereo camera is an extremely useful tool for many computer vision applications, including robotics and autonomous driving. In this paper we show how moving objects can be densely detected by estimating disparity using an algorithm that improves complexity and accuracy of stereo matching by relying on information from previous frames. The main idea behind this approach is that by using the ego-motion estimation and the disparity map of the previous frame, we can set a prior base that enables us to reduce the complexity of the current frame disparity estimation, subsequently also detecting moving objects in the scene. For each pixel we run a Kalman filter that recursively fuses the disparity prediction and reduced space semi-global matching (SGM) measurements. The proposed algorithm has been implemented and optimized using streaming single instruction multiple data instruction set and multi-threading. Furthermore, in order to estimate the process and measurement noise as reliably as possible, we conduct extensive experiments on the KITTI suite using the ground truth obtained by the 3D laser range sensor. Concerning disparity estimation, compared to the OpenCV SGM implementation, the proposed method yields improvement on the KITTI dataset sequences in terms of both speed and accuracy.
△ Less
Submitted 21 September, 2018;
originally announced September 2018.
-
Diagnostic tools of approximate Bayesian computation using the coverage property
Authors:
D. Prangle,
M. G. B. Blum,
G. Popovic,
S. A. Sisson
Abstract:
Approximate Bayesian computation (ABC) is an approach for sampling from an approximate posterior distribution in the presence of a computationally intractable likelihood function. A common implementation is based on simulating model, parameter and dataset triples, (m,θ,y), from the prior, and then accepting as samples from the approximate posterior, those pairs (m,θ) for which y, or a summary of y…
▽ More
Approximate Bayesian computation (ABC) is an approach for sampling from an approximate posterior distribution in the presence of a computationally intractable likelihood function. A common implementation is based on simulating model, parameter and dataset triples, (m,θ,y), from the prior, and then accepting as samples from the approximate posterior, those pairs (m,θ) for which y, or a summary of y, is "close" to the observed data. Closeness is typically determined though a distance measure and a kernel scale parameter, ε. Appropriate choice of εis important to producing a good quality approximation. This paper proposes diagnostic tools for the choice of εbased on assessing the coverage property, which asserts that credible intervals have the correct coverage levels. We provide theoretical results on coverage for both model and parameter inference, and adapt these into diagnostics for the ABC context. We re-analyse a study on human demographic history to determine whether the adopted posterior approximation was appropriate. R code implementing the proposed methodology is freely available in the package "abc."
△ Less
Submitted 14 January, 2013;
originally announced January 2013.