Skip to main content

Showing 1–7 of 7 results for author: Notin, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.04064  [pdf, other

    q-bio.QM cs.LG stat.ME

    DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design

    Authors: Clare Lyle, Arash Mehrjou, Pascal Notin, Andrew Jesson, Stefan Bauer, Yarin Gal, Patrick Schwab

    Abstract: The discovery of therapeutics to treat genetically-driven pathologies relies on identifying genes involved in the underlying disease mechanisms. Existing approaches search over the billions of potential interventions to maximize the expected influence on the target phenotype. However, to reduce the risk of failure in future stages of trials, practical experiment design aims to find a set of interv… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Journal ref: International Conference on Machine Learning, 2023

  2. arXiv:2308.15395  [pdf, other

    cs.LG q-bio.MN q-bio.QM

    The CausalBench challenge: A machine learning contest for gene network inference from single-cell perturbation data

    Authors: Mathieu Chevalley, Jacob Sackett-Sanders, Yusuf Roohani, Pascal Notin, Artemy Bakulin, Dariusz Brzezinski, Kaiwen Deng, Yuanfang Guan, Justin Hong, Michael Ibrahim, Wojciech Kotlowski, Marcin Kowiel, Panagiotis Misiakos, Achille Nazaret, Markus Püschel, Chris Wendler, Arash Mehrjou, Patrick Schwab

    Abstract: In drug discovery, map** interactions between genes within cellular systems is a crucial early step. This helps formulate hypotheses regarding molecular mechanisms that could potentially be targeted by future medicines. The CausalBench Challenge was an initiative to invite the machine learning community to advance the state of the art in constructing gene-gene interaction networks. These network… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  3. arXiv:2205.13760  [pdf, other

    cs.LG

    Tranception: protein fitness prediction with autoregressive transformers and inference-time retrieval

    Authors: Pascal Notin, Mafalda Dias, Jonathan Frazer, Javier Marchena-Hurtado, Aidan Gomez, Debora S. Marks, Yarin Gal

    Abstract: The ability to accurately model the fitness landscape of protein sequences is critical to a wide range of applications, from quantifying the effects of human variants on disease likelihood, to predicting immune-escape mutations in viruses and designing novel biotherapeutic proteins. Deep generative models of protein sequences trained on multiple sequence alignments have been the most successful ap… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: ICML 2022

  4. arXiv:2205.05789  [pdf, other

    q-bio.QM cs.LG

    RITA: a Study on Scaling Up Generative Protein Sequence Models

    Authors: Daniel Hesslow, Niccoló Zanichelli, Pascal Notin, Iacopo Poli, Debora Marks

    Abstract: In this work we introduce RITA: a suite of autoregressive generative models for protein sequences, with up to 1.2 billion parameters, trained on over 280 million protein sequences belonging to the UniRef-100 database. Such generative models hold the promise of greatly accelerating protein design. We conduct the first systematic study of how capabilities evolve with model size for autoregressive tr… ▽ More

    Submitted 14 July, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

  5. arXiv:2110.11875  [pdf, other

    cs.LG stat.ML

    GeneDisco: A Benchmark for Experimental Design in Drug Discovery

    Authors: Arash Mehrjou, Ashkan Soleymani, Andrew Jesson, Pascal Notin, Yarin Gal, Stefan Bauer, Patrick Schwab

    Abstract: In vitro cellular experimentation with genetic interventions, using for example CRISPR technologies, is an essential step in early-stage drug discovery and target validation that serves to assess initial hypotheses about causal associations between biological mechanisms and disease pathologies. With billions of potential hypotheses to test, the experimental design space for in vitro genetic experi… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  6. arXiv:2107.00096  [pdf, other

    cs.LG stat.ML

    Improving black-box optimization in VAE latent space using decoder uncertainty

    Authors: Pascal Notin, José Miguel Hernández-Lobato, Yarin Gal

    Abstract: Optimization in the latent space of variational autoencoders is a promising approach to generate high-dimensional discrete objects that maximize an expensive black-box property (e.g., drug-likeness in molecular generation, function approximation with arithmetic expressions). However, existing methods lack robustness as they may decide to explore areas of the latent space for which no data was avai… ▽ More

    Submitted 30 June, 2021; originally announced July 2021.

  7. arXiv:2007.10909  [pdf, other

    cs.LG stat.ML

    Improving compute efficacy frontiers with SliceOut

    Authors: Pascal Notin, Aidan N. Gomez, Joanna Yoo, Yarin Gal

    Abstract: Pushing forward the compute efficacy frontier in deep learning is critical for tasks that require frequent model re-training or workloads that entail training a large number of models. We introduce SliceOut -- a dropout-inspired scheme designed to take advantage of GPU memory layout to train deep learning models faster without impacting final test accuracy. By drop** contiguous sets of units at… ▽ More

    Submitted 31 March, 2021; v1 submitted 21 July, 2020; originally announced July 2020.