-
CustOmics: A versatile deep-learning based strategy for multi-omics integration
Authors:
Hakim Benkirane,
Yoann Pradat,
Stefan Michiels,
Paul-Henry Cournède
Abstract:
Recent advances in high-throughput sequencing technologies have enabled the extraction of multiple features that depict patient samples at diverse and complementary molecular levels. The generation of such data has led to new challenges in computational biology regarding the integration of high-dimensional and heterogeneous datasets that capture the interrelationships between multiple genes and th…
▽ More
Recent advances in high-throughput sequencing technologies have enabled the extraction of multiple features that depict patient samples at diverse and complementary molecular levels. The generation of such data has led to new challenges in computational biology regarding the integration of high-dimensional and heterogeneous datasets that capture the interrelationships between multiple genes and their functions. Thanks to their versatility and ability to learn synthetic latent representations of complex data, deep learning methods offer promising perspectives for integrating multi-omics data. These methods have led to the conception of many original architectures that are primarily based on autoencoder models. However, due to the difficulty of the task, the integration strategy is fundamental to take full advantage of the sources' particularities without losing the global trends. This paper presents a novel strategy to build a customizable autoencoder model that adapts to the dataset used in the case of high-dimensional multi-source integration. We will assess the impact of integration strategies on the latent representation and combine the best strategies to propose a new method, CustOmics (https://github.com/HakimBenkirane/CustOmics). We focus here on the integration of data from multiple omics sources and demonstrate the performance of the proposed method on test cases for several tasks such as classification and survival analysis.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
A biology-driven deep generative model for cell-type annotation in cytometry
Authors:
Quentin Blampey,
Nadège Bercovici,
Charles-Antoine Dutertre,
Isabelle Pic,
Fabrice André,
Joana Mourato Ribeiro,
Paul-Henry Cournède
Abstract:
Cytometry enables precise single-cell phenoty** within heterogeneous populations. These cell types are traditionally annotated via manual gating, but this method suffers from a lack of reproducibility and sensitivity to batch-effect. Also, the most recent cytometers - spectral flow or mass cytometers - create rich and high-dimensional data whose analysis via manual gating becomes challenging and…
▽ More
Cytometry enables precise single-cell phenoty** within heterogeneous populations. These cell types are traditionally annotated via manual gating, but this method suffers from a lack of reproducibility and sensitivity to batch-effect. Also, the most recent cytometers - spectral flow or mass cytometers - create rich and high-dimensional data whose analysis via manual gating becomes challenging and time-consuming. To tackle these limitations, we introduce Scyan (https://github.com/MICS-Lab/scyan), a Single-cell Cytometry Annotation Network that automatically annotates cell types using only prior expert knowledge about the cytometry panel. We demonstrate that Scyan significantly outperforms the related state-of-the-art models on multiple public datasets while being faster and interpretable. In addition, Scyan overcomes several complementary tasks such as batch-effect removal, debarcoding, and population discovery. Overall, this model accelerates and eases cell population characterisation, quantification, and discovery in cytometry.
△ Less
Submitted 21 April, 2023; v1 submitted 11 August, 2022;
originally announced August 2022.
-
Mathematical modelling, selection and hierarchical inference to determine the minimal dose in IFN$α$ therapy against Myeloproliferative Neoplasms
Authors:
Gurvan Hermange,
William Vainchenker,
Isabelle Plo,
Paul-Henry Cournède
Abstract:
Myeloproliferative Neoplasms (MPN) are blood cancers that appear after acquiring a driver mutation in a hematopoietic stem cell. These hematological malignancies result in the overproduction of mature blood cells and, if not treated, induce a risk of cardiovascular events and thrombosis. Pegylated IFN$α$ is commonly used to treat MPN, but no clear guidelines exist concerning the dose prescribed to…
▽ More
Myeloproliferative Neoplasms (MPN) are blood cancers that appear after acquiring a driver mutation in a hematopoietic stem cell. These hematological malignancies result in the overproduction of mature blood cells and, if not treated, induce a risk of cardiovascular events and thrombosis. Pegylated IFN$α$ is commonly used to treat MPN, but no clear guidelines exist concerning the dose prescribed to patients. We applied a model selection procedure and ran a hierarchical Bayesian inference method to decipher how dose variations impact the response to the therapy. We inferred that IFN$α$ acts on mutated stem cells by inducing their differentiation into progenitor cells, the higher the dose, the higher the effect. We found that when a sufficient (patient-dependent) dose is reached, the treatment can induce a long-term remission. We determined this minimal dose for individuals in a cohort of patients and estimated the most suitable starting dose to give to a new patient to increase the chances of being cured.
△ Less
Submitted 20 December, 2021;
originally announced December 2021.
-
Mean field approximation of a heterogeneous population of plants in competition
Authors:
Antonin Della Noce,
Amélie Mathieu,
Paul-Henry Cournède
Abstract:
The processes of interplant competition within a field are still poorly understood. However, they explain a large part of the heterogeneity in a field and may have longer-term consequences, especially in mixed stands. Modeling can help to better understand these phenomena but requires simulating the interactions between different individuals. In the case of large populations, assessing the paramet…
▽ More
The processes of interplant competition within a field are still poorly understood. However, they explain a large part of the heterogeneity in a field and may have longer-term consequences, especially in mixed stands. Modeling can help to better understand these phenomena but requires simulating the interactions between different individuals. In the case of large populations, assessing the parameters of a heterogeneous population model from experimental data is intractable computationally. This paper investigates the mean-field approximation of large dynamical systems with random initial conditions and individual parameters, and with interaction being represented by pairwise potentials between individuals. Under this approximation, each individual is in interaction with an infinitely-crowded population, summarized by a probability measure, the mean-field limit distribution, being itself the weak solution of a non-linear hyperbolic partial differential equation. In particular, the phenomenon of chaos propagation implies that the individuals are independent asymptotically when the size of the population tends towards infinity. This result provides perspectives for a possible simplification of the inference problem. The simulation of the mean-field distribution, consisting in a semi-Lagrangian scheme with an interpolation step using Gaussian process regression, is illustrated for a heterogeneous population model representing plants in competition for light.
△ Less
Submitted 4 June, 2019;
originally announced June 2019.
-
Quantitative Genetics and Functional-Structural Plant Growth Models: Simulation of Quantitative Trait Loci Detection for Model Parameters and Application to Potential Yield Optimization
Authors:
Veronique Letort,
Paul Mahe,
Paul-Henry Cournède,
Philippe De Reffye,
Brigitte Courtois
Abstract:
Background and Aims: Prediction of phenotypic traits from new genotypes under untested environmental conditions is crucial to build simulations of breeding strategies to improve target traits. Although the plant response to environmental stresses is characterized by both architectural and functional plasticity, recent attempts to integrate biological knowledge into genetics models have mainly conc…
▽ More
Background and Aims: Prediction of phenotypic traits from new genotypes under untested environmental conditions is crucial to build simulations of breeding strategies to improve target traits. Although the plant response to environmental stresses is characterized by both architectural and functional plasticity, recent attempts to integrate biological knowledge into genetics models have mainly concerned specific physiological processes or crop models without architecture, and thus may prove limited when studying genotype x environment interactions. Consequently, this paper presents a simulation study introducing genetics into a functional-structural growth model, which gives access to more fundamental traits for quantitative trait loci (QTL) detection and thus to promising tools for yield optimization.
Methods: The GreenLab model was selected as a reasonable choice to link growth model parameters to QTL. Virtual genes and virtual chromosomes were defined to build a simple genetic model that drove the settings of the species-specific parameters of the model. The QTL Cartographer software was used to study QTL detection of simulated plant traits. A genetic algorithm was implemented to define the ideotype for yield maximization based on the model parameters and the associated allelic combination.
Key Results and Conclusions: By kee** the environmental factors constant and using a virtual population with a large number of individuals generated by a Mendelian genetic model, results for an ideal case could be simulated. Virtual QTL detection was compared in the case of phenotypic traits - such as cob weight - and when traits were model parameters, and was found to be more accurate in the latter case. The practical interest of this approach is illustrated by calculating the parameters (and the corresponding genotype) associated with yield optimization of a GreenLab maize model. The paper discusses the potentials of GreenLab to represent environment x genotype interactions, in particular through its main state variable, the ratio of biomass supply over demand.
△ Less
Submitted 25 October, 2010;
originally announced October 2010.
-
Parametric identification of a functional-structural tree growth model and application to beech trees (Fagus sylvatica)
Authors:
Veronique Letort,
Paul-Henry Cournède,
Amélie Mathieu,
Philippe De Reffye,
Thiéry Constant
Abstract:
Functional-structural models provide detailed representations of tree growth and their application to forestry seems full of prospects. However, owing to the complexity of tree architecture, parametric identification of such models remains a critical issue. We present the GreenLab approach for modelling tree growth. It simulates tree growth plasticity in response to changes of their internal level…
▽ More
Functional-structural models provide detailed representations of tree growth and their application to forestry seems full of prospects. However, owing to the complexity of tree architecture, parametric identification of such models remains a critical issue. We present the GreenLab approach for modelling tree growth. It simulates tree growth plasticity in response to changes of their internal level of trophic competition, especially topological development and cambial growth. The model includes a simplified representation of tree architecture, based on a species-specific description of branching patterns. We study whether those simplifications allow enough flexibility to reproduce with the same set of parameters the growth of two observed understorey beech trees (Fagus sylvatica L.) of different ages in different environmental conditions. The parametric identification of the model is global, i.e. all parameters are estimated simultaneously, potentially providing a better description of interactions between sub-processes. As a result, the source-sink dynamics throughout tree development is retrieved. Simulated and measured trees were compared for their trunk profiles (fresh masses and dimensions of every growth units, ring diameters at different heights) and compartment masses of their order 2 branches. Possible improvements of this method by including topological criteria are discussed.
△ Less
Submitted 25 October, 2010;
originally announced October 2010.
-
A morphogenetic crop model for sugar-beet (Beta vulgaris L.)
Authors:
Sébastien Lemaire,
Fabienne Maupas,
Paul-Henry Cournède,
Philippe De Reffye
Abstract:
This paper is the instructions for the proceeding of the International Symposium on Crop. Sugar beet crop models have rarely taken into account the morphogenetic process generating plant architecture despite the fact that plant architectural plasticity plays a key role during growth, especially under stress conditions. The objective of this paper is to develop this approach by applying the Green…
▽ More
This paper is the instructions for the proceeding of the International Symposium on Crop. Sugar beet crop models have rarely taken into account the morphogenetic process generating plant architecture despite the fact that plant architectural plasticity plays a key role during growth, especially under stress conditions. The objective of this paper is to develop this approach by applying the GreenLab model of plant growth to sugar beet and to study the potential advantages for applicative purposes. Experiments were conducted with husbandry practices in 2006. The study of sugar beet development, mostly phytomer appearance, organ expansion and leaf senescence, allowed us to define a morphogenetic model of sugar beet growth based on GreenLab. It simulates organogenesis, biomass production and biomass partitioning. The functional parameters controlling source-sink relationships during plant growth were estimated from organ and compartment dry masses, measured at seven different times, for samples of plants. The fitting results are good, which shows that the introduced framework is adapted to analyse source-sink dynamics and shoot-root allocation throughout the season. However, this approach still needs to be fully validated, particularly among seasons.
△ Less
Submitted 16 February, 2009; v1 submitted 4 November, 2008;
originally announced November 2008.
-
The Influence of Photosynthesis on the Number of Metamers per Growth Unit in GreenLab Model
Authors:
Amelie Mathieu,
Paul-Henry Cournède,
Philippe De Reffye
Abstract:
GreenLab Model is a functional-structural plant growth model that combines both organogenesis (at each cycle, new organs are created with respect to genetic rules) and photosynthesis (organs are filled with the biomass produced by the leaves photosynthesis). Our new developments of the model concern the retroaction of photosynthesis on organogenesis. We present here the first step towards the to…
▽ More
GreenLab Model is a functional-structural plant growth model that combines both organogenesis (at each cycle, new organs are created with respect to genetic rules) and photosynthesis (organs are filled with the biomass produced by the leaves photosynthesis). Our new developments of the model concern the retroaction of photosynthesis on organogenesis. We present here the first step towards the total representation of this retroaction, where the influence of available biomass on the number of metamers in new growth units us modelled. The theory is introduced and applied to a Corner model tree. Different interesting behaviours are pointed out.
△ Less
Submitted 17 January, 2007;
originally announced January 2007.