Semi-Supervised Non-Parametric Bayesian Modelling of Spatial Proteomics
Authors:
Oliver M. Crook,
Kathryn S. Lilley,
Laurent Gatto,
Paul D. W. Kirk
Abstract:
Understanding sub-cellular protein localisation is an essential component to analyse context specific protein function. Recent advances in quantitative mass-spectrometry (MS) have led to high resolution map** of thousands of proteins to sub-cellular locations within the cell. Novel modelling considerations to capture the complex nature of these data are thus necessary. We approach analysis of sp…
▽ More
Understanding sub-cellular protein localisation is an essential component to analyse context specific protein function. Recent advances in quantitative mass-spectrometry (MS) have led to high resolution map** of thousands of proteins to sub-cellular locations within the cell. Novel modelling considerations to capture the complex nature of these data are thus necessary. We approach analysis of spatial proteomics data in a non-parametric Bayesian framework, using mixtures of Gaussian process regression models. The Gaussian process regression model accounts for correlation structure within a sub-cellular niche, with each mixture component capturing the distinct correlation structure observed within each niche. Proteins with a priori labelled locations motivate using semi-supervised learning to inform the Gaussian process hyperparameters. We moreover provide an efficient Hamiltonian-within-Gibbs sampler for our model. As in other recent work, we reduce the computational burden associated with inversion of covariance matrices by exploiting the structure in the covariance matrix. A tensor decomposition of our covariance matrices allows extended Trench and Durbin algorithms to be applied it order to reduce the computational complexity of inversion and hence accelerate computation. A stand-alone R-package implementing these methods using high-performance C++ libraries is available at: https://github.com/ococrook/toeplitz
△ Less
Submitted 11 March, 2019; v1 submitted 7 March, 2019;
originally announced March 2019.
Guidelines for reporting the use of gel electrophoresis in proteomics
Authors:
Frank Gibson,
Leigh Anderson,
Gyorgy Babnigg,
Mark Baker,
Matthias Berth,
Pierre-Alain Binz,
Andy Borthwick,
Phil Cash,
Billy W Day,
David B Friedman,
Donita Garland,
Howard B Gutstein,
Christine Hoogland,
Neil A Jones,
Alamgir Khan,
Joachim Klose,
Angus I Lamond,
Peter F Lemkin,
Kathryn S Lilley,
Jonathan Minden,
Nicholas J Morris,
Norman W Paton,
Michael R Pisano,
John E Prime,
Thierry Rabilloud
, et al. (5 additional authors not shown)
Abstract:
the MIAPE Gel Electrophoresis (MIAPE-GE) guidelines specify the minimum information that should be provided when reporting the use of n-dimensional gel electrophoresis in a proteomics experiment. Developed through a joint effort between the gel-based analysis working group of the Human Proteome Organisation's Proteomics Standards Initiative (HUPO-PSI; http://www.psidev.info/) and the wider prote…
▽ More
the MIAPE Gel Electrophoresis (MIAPE-GE) guidelines specify the minimum information that should be provided when reporting the use of n-dimensional gel electrophoresis in a proteomics experiment. Developed through a joint effort between the gel-based analysis working group of the Human Proteome Organisation's Proteomics Standards Initiative (HUPO-PSI; http://www.psidev.info/) and the wider proteomics community, they constitute one part of the overall Minimum Information about a Proteomics Experiment (MIAPE) documentation system published last August in Nature Biotechnology
△ Less
Submitted 4 April, 2009;
originally announced April 2009.