-
Comparison of methods for analyzing environmental mixtures effects on survival outcomes and application to a population-based cohort study
Authors:
Melanie N. Mayer,
Arce Domingo-Relloso,
Marianthi-Anna Kioumourtzoglou,
Ana Navas-Acien,
Brent Coull,
Linda Valeri
Abstract:
The estimation of the effect of environmental exposures and overall mixtures on a survival time outcome is common in environmental epidemiological studies. While advanced statistical methods are increasingly being used for mixture analyses, their applicability and performance for survival outcomes has yet to be explored. We identified readily available methods for analyzing an environmental mixtur…
▽ More
The estimation of the effect of environmental exposures and overall mixtures on a survival time outcome is common in environmental epidemiological studies. While advanced statistical methods are increasingly being used for mixture analyses, their applicability and performance for survival outcomes has yet to be explored. We identified readily available methods for analyzing an environmental mixture's effect on a survival outcome and assessed their performance via simulations replicating various real-life scenarios. Using prespecified criteria, we selected Bayesian Additive Regression Trees (BART), Cox Elastic Net, Cox Proportional Hazards (PH) with and without penalized splines, Gaussian Process Regression (GPR) and Multivariate Adaptive Regression Splines (MARS) to compare the bias and efficiency produced when estimating individual exposure, overall mixture, and interaction effects on a survival outcome. We illustrate the selected methods in a real-world data application. We estimated the effects of arsenic, cadmium, molybdenum, selenium, tungsten, and zinc on incidence of cardiovascular disease in American Indians using data from the Strong Heart Study (SHS). In the simulation study, there was a consistent bias-variance trade off. The more flexible models (BART, GPR and MARS) were found to be most advantageous in the presence of nonproportional hazards, where the Cox models often did not capture the true effects due to their higher bias and lower variance. In the SHS, estimates of the effect of selenium and the overall mixture indicated negative effects, but the magnitudes of the estimated effects varied across methods. In practice, we recommend evaluating if findings are consistent across methods.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?
Authors:
Nikolaus Mayer,
Eddy Ilg,
Philipp Fischer,
Caner Hazirbas,
Daniel Cremers,
Alexey Dosovitskiy,
Thomas Brox
Abstract:
The finding that very large networks can be trained efficiently and reliably has led to a paradigm shift in computer vision from engineered solutions to learning formulations. As a result, the research challenge shifts from devising algorithms to creating suitable and abundant training data for supervised learning. How to efficiently create such training data? The dominant data acquisition method…
▽ More
The finding that very large networks can be trained efficiently and reliably has led to a paradigm shift in computer vision from engineered solutions to learning formulations. As a result, the research challenge shifts from devising algorithms to creating suitable and abundant training data for supervised learning. How to efficiently create such training data? The dominant data acquisition method in visual recognition is based on web data and manual annotation. Yet, for many computer vision problems, such as stereo or optical flow estimation, this approach is not feasible because humans cannot manually enter a pixel-accurate flow field. In this paper, we promote the use of synthetically generated data for the purpose of training deep networks on such tasks.We suggest multiple ways to generate such data and evaluate the influence of dataset properties on the performance and generalization properties of the resulting networks. We also demonstrate the benefit of learning schedules that use different types of data at selected stages of the training process.
△ Less
Submitted 22 March, 2018; v1 submitted 19 January, 2018;
originally announced January 2018.
-
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation
Authors:
Nikolaus Mayer,
Eddy Ilg,
Philip Häusser,
Philipp Fischer,
Daniel Cremers,
Alexey Dosovitskiy,
Thomas Brox
Abstract:
Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we pro…
▽ More
Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluating scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.
△ Less
Submitted 7 December, 2015;
originally announced December 2015.