Skip to main content

Showing 1–33 of 33 results for author: Mehrjou, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18314  [pdf, other

    cs.LG

    Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm

    Authors: Mathieu Chevalley, Patrick Schwab, Arash Mehrjou

    Abstract: Targeted and uniform interventions to a system are crucial for unveiling causal relationships. While several methods have been developed to leverage interventional data for causal structure learning, their practical application in real-world scenarios often remains challenging. Recent benchmark studies have highlighted these difficulties, even when large numbers of single-variable intervention sam… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2312.04064  [pdf, other

    q-bio.QM cs.LG stat.ME

    DiscoBAX: Discovery of Optimal Intervention Sets in Genomic Experiment Design

    Authors: Clare Lyle, Arash Mehrjou, Pascal Notin, Andrew Jesson, Stefan Bauer, Yarin Gal, Patrick Schwab

    Abstract: The discovery of therapeutics to treat genetically-driven pathologies relies on identifying genes involved in the underlying disease mechanisms. Existing approaches search over the billions of potential interventions to maximize the expected influence on the target phenotype. However, to reduce the risk of failure in future stages of trials, practical experiment design aims to find a set of interv… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Journal ref: International Conference on Machine Learning, 2023

  3. arXiv:2308.15395  [pdf, other

    cs.LG q-bio.MN q-bio.QM

    The CausalBench challenge: A machine learning contest for gene network inference from single-cell perturbation data

    Authors: Mathieu Chevalley, Jacob Sackett-Sanders, Yusuf Roohani, Pascal Notin, Artemy Bakulin, Dariusz Brzezinski, Kaiwen Deng, Yuanfang Guan, Justin Hong, Michael Ibrahim, Wojciech Kotlowski, Marcin Kowiel, Panagiotis Misiakos, Achille Nazaret, Markus Püschel, Chris Wendler, Arash Mehrjou, Patrick Schwab

    Abstract: In drug discovery, map** interactions between genes within cellular systems is a crucial early step. This helps formulate hypotheses regarding molecular mechanisms that could potentially be targeted by future medicines. The CausalBench Challenge was an initiative to invite the machine learning community to advance the state of the art in constructing gene-gene interaction networks. These network… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  4. arXiv:2306.09391  [pdf, other

    q-bio.QM cs.CV cs.LG q-bio.GN

    Multi-omics Prediction from High-content Cellular Imaging with Deep Learning

    Authors: Rahil Mehrizi, Arash Mehrjou, Maryana Alegro, Yi Zhao, Benedetta Carbone, Carl Fishwick, Johanna Vappiani, **g Bi, Siobhan Sanford, Hakan Keles, Marcus Bantscheff, Cuong Nguyen, Patrick Schwab

    Abstract: High-content cellular imaging, transcriptomics, and proteomics data provide rich and complementary views on the molecular layers of biology that influence cellular states and function. However, the biological determinants through which changes in multi-omics measurements influence cellular morphology have not yet been systematically explored, and the degree to which cell imaging could potentially… ▽ More

    Submitted 21 May, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  5. arXiv:2211.03846  [pdf, other

    cs.LG cs.MA stat.ME

    Federated Causal Discovery From Interventions

    Authors: Amin Abyaneh, Nino Scherrer, Patrick Schwab, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou

    Abstract: Causal discovery serves a pivotal role in mitigating model uncertainty through recovering the underlying causal mechanisms among variables. In many practical domains, such as healthcare, access to the data gathered by individual entities is limited, primarily for privacy and regulatory constraints. However, the majority of existing causal discovery methods require the data to be available in a cen… ▽ More

    Submitted 11 February, 2024; v1 submitted 7 November, 2022; originally announced November 2022.

  6. arXiv:2210.17283  [pdf, other

    cs.LG

    CausalBench: A Large-scale Benchmark for Network Inference from Single-cell Perturbation Data

    Authors: Mathieu Chevalley, Yusuf Roohani, Arash Mehrjou, Jure Leskovec, Patrick Schwab

    Abstract: Causal inference is a vital aspect of multiple scientific disciplines and is routinely applied to high-impact applications such as medicine. However, evaluating the performance of causal inference methods in real-world environments is challenging due to the need for observations under both interventional and control conditions. Traditional evaluations conducted on synthetic datasets do not reflect… ▽ More

    Submitted 3 July, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  7. arXiv:2210.13774  [pdf, other

    cs.LG

    From Points to Functions: Infinite-dimensional Representations in Diffusion Models

    Authors: Sarthak Mittal, Guillaume Lajoie, Stefan Bauer, Arash Mehrjou

    Abstract: Diffusion-based generative models learn to iteratively transfer unstructured noise to a complex target distribution as opposed to Generative Adversarial Networks (GANs) or the decoder of Variational Autoencoders (VAEs) which produce samples from the target distribution in a single step. Thus, in diffusion models every sample is naturally connected to a random trajectory which is a solution to a le… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  8. arXiv:2206.07696  [pdf, other

    cs.CV cs.LG stat.ML

    Diffusion Models for Video Prediction and Infilling

    Authors: Tobias Höppe, Arash Mehrjou, Stefan Bauer, Didrik Nielsen, Andrea Dittadi

    Abstract: Predicting and anticipating future outcomes or reasoning about missing information in a sequence are critical skills for agents to be able to make intelligent decisions. This requires strong, temporally coherent generative capabilities. Diffusion models have shown remarkable success in several generative tasks, but have not been extensively explored in the video domain. We present Random-Mask Vide… ▽ More

    Submitted 14 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Published in TMLR (11/2022)

  9. arXiv:2204.09328  [pdf, other

    cs.LG stat.ML

    Federated Learning in Multi-Center Critical Care Research: A Systematic Case Study using the eICU Database

    Authors: Arash Mehrjou, Ashkan Soleymani, Annika Buchholz, Jürgen Hetzel, Patrick Schwab, Stefan Bauer

    Abstract: Federated learning (FL) has been proposed as a method to train a model on different units without exchanging data. This offers great opportunities in the healthcare sector, where large datasets are available but cannot be shared to ensure patient privacy. We systematically investigate the effectiveness of FL on the publicly available eICU dataset for predicting the survival of each ICU stay. We em… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

  10. arXiv:2201.05830  [pdf, other

    cs.RO math.DS stat.ML

    Physical Derivatives: Computing policy gradients by physical forward-propagation

    Authors: Arash Mehrjou, Ashkan Soleymani, Stefan Bauer, Bernhard Schölkopf

    Abstract: Model-free and model-based reinforcement learning are two ends of a spectrum. Learning a good policy without a dynamic model can be prohibitively expensive. Learning the dynamic model of a system can reduce the cost of learning the policy, but it can also introduce bias if it is not accurate. We propose a middle ground where instead of the transition model, the sensitivity of the trajectories with… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

  11. arXiv:2110.15489  [pdf, other

    cs.LG cs.AI

    GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL

    Authors: Sumedh A Sontakke, Stephen Iota, Zizhao Hu, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf

    Abstract: Out-of-distribution (OOD) detection is a well-studied topic in supervised learning. Extending the successes in supervised learning methods to the reinforcement learning (RL) setting, however, is difficult due to the data generating process - RL agents actively query their environment for data, and the data are a function of the policy followed by the agent. An agent could thus neglect a shift in t… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  12. arXiv:2110.11875  [pdf, other

    cs.LG stat.ML

    GeneDisco: A Benchmark for Experimental Design in Drug Discovery

    Authors: Arash Mehrjou, Ashkan Soleymani, Andrew Jesson, Pascal Notin, Yarin Gal, Stefan Bauer, Patrick Schwab

    Abstract: In vitro cellular experimentation with genetic interventions, using for example CRISPR technologies, is an essential step in early-stage drug discovery and target validation that serves to assess initial hypotheses about causal associations between biological mechanisms and disease pathologies. With billions of potential hypotheses to test, the experimental design space for in vitro genetic experi… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  13. arXiv:2107.03770  [pdf, other

    stat.ML cs.LG math.DS math.OC math.PR

    Federated Learning as a Mean-Field Game

    Authors: Arash Mehrjou

    Abstract: We establish a connection between federated learning, a concept from machine learning, and mean-field games, a concept from game theory and control theory. In this analogy, the local federated learners are considered as the players and the aggregation of the gradients in a central server is the mean-field effect. We present federated learning as a differential game and discuss the properties of th… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  14. arXiv:2105.14257  [pdf, other

    cs.LG cs.CV

    Diffusion-Based Representation Learning

    Authors: Korbinian Abstreiter, Sarthak Mittal, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou

    Abstract: Diffusion-based methods represented as stochastic differential equations on a continuous-time domain have recently proven successful as a non-adversarial generative model. Training such models relies on denoising score matching, which can be seen as multi-scale denoising autoencoders. Here, we augment the denoising score matching framework to enable representation learning without any supervised s… ▽ More

    Submitted 1 August, 2022; v1 submitted 29 May, 2021; originally announced May 2021.

  15. arXiv:2103.15561  [pdf, other

    q-bio.PE cs.AI cs.LG cs.MA eess.SY

    Pyfectious: An individual-level simulator to discover optimal containment polices for epidemic diseases

    Authors: Arash Mehrjou, Ashkan Soleymani, Amin Abyaneh, Samir Bhatt, Bernhard Schölkopf, Stefan Bauer

    Abstract: Simulating the spread of infectious diseases in human communities is critical for predicting the trajectory of an epidemic and verifying various policies to control the devastating impacts of the outbreak. Many existing simulators are based on compartment models that divide people into a few subsets and simulate the dynamics among those subsets using hypothesized differential equations. However, t… ▽ More

    Submitted 20 April, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

  16. arXiv:2010.03110  [pdf, other

    cs.LG cs.AI cs.RO

    Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning

    Authors: Sumedh A. Sontakke, Arash Mehrjou, Laurent Itti, Bernhard Schölkopf

    Abstract: Animals exhibit an innate ability to learn regularities of the world through interaction. By performing experiments in their environment, they are able to discern the causal factors of variation and infer how they affect the world's dynamics. Inspired by this, we attempt to equip reinforcement learning agents with the ability to perform experiments that facilitate a categorization of the rolled-ou… ▽ More

    Submitted 6 August, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: International Conference on Machine Learning, PMLR 139, 2021

  17. arXiv:2008.13412  [pdf, other

    stat.AP cs.LG q-bio.QM

    Real-time Prediction of COVID-19 related Mortality using Electronic Health Records

    Authors: Patrick Schwab, Arash Mehrjou, Sonali Parbhoo, Leo Anthony Celi, Jürgen Hetzel, Markus Hofer, Bernhard Schölkopf, Stefan Bauer

    Abstract: Coronavirus Disease 2019 (COVID-19) is an emerging respiratory disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) with rapid human-to-human transmission and a high case fatality rate particularly in older patients. Due to the exponential growth of infections, many healthcare systems across the world are under pressure to care for increasing amounts of at-risk patien… ▽ More

    Submitted 31 August, 2020; originally announced August 2020.

  18. arXiv:2008.10053  [pdf, other

    stat.ML cs.LG eess.SY

    Learning Dynamical Systems using Local Stability Priors

    Authors: Arash Mehrjou, Andrea Iannelli, Bernhard Schölkopf

    Abstract: A coupled computational approach to simultaneously learn a vector field and the region of attraction of an equilibrium point from generated trajectories of the system is proposed. The nonlinear identification leverages the local stability information as a prior on the system, effectively endowing the estimate with this important structural property. In addition, the knowledge of the region of attr… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

  19. arXiv:2006.11113  [pdf, other

    cs.OH

    Artificial Buildings: Safety, Complexity and a Quantifiable Measure of Beauty

    Authors: Arash Mehrjou

    Abstract: A place to live is one of the most crucial necessities for all living organisms since the advent of life on planet Earth. The nature of homes has changed considerably over time. At the very early stages, human begins lived in natural places such as caves. Later on, they started to use their intelligence to build places with special purposes. Nowadays, modern technologies such as robotics and artif… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 8 pages

  20. arXiv:2006.03947  [pdf, other

    eess.SY cs.LG stat.ML

    Neural Lyapunov Redesign

    Authors: Arash Mehrjou, Mohammad Ghavamzadeh, Bernhard Schölkopf

    Abstract: Learning controllers merely based on a performance metric has been proven effective in many physical and non-physical tasks in both control theory and reinforcement learning. However, in practice, the controller must guarantee some notion of safety to ensure that it does not harm either the agent or the environment. Stability is a crucial notion of safety, whose violation can certainly cause unsaf… ▽ More

    Submitted 22 November, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: 27 pages

  21. arXiv:1910.14428   

    stat.ML cs.LG math.DS

    Kernel-Guided Training of Implicit Generative Models with Stability Guarantees

    Authors: Arash Mehrjou, Wittawat Jitkrittum, Krikamol Muandet, Bernhard Schölkopf

    Abstract: Modern implicit generative models such as generative adversarial networks (GANs) are generally known to suffer from issues such as instability, uninterpretability, and difficulty in assessing their performance. If we see these implicit models as dynamical systems, some of these issues are caused by being unable to control their behavior in a meaningful way during the course of training. In this wo… ▽ More

    Submitted 3 November, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: There was a misunderstanding in how an article should be updated on arXiv. We have withdrawn this article from this link. The same article can be found at arXiv:1901.09206

  22. arXiv:1910.12358  [pdf, ps, other

    stat.ML cs.LG econ.EM

    Dual Instrumental Variable Regression

    Authors: Krikamol Muandet, Arash Mehrjou, Si Kai Lee, Anant Raj

    Abstract: We present a novel algorithm for non-linear instrumental variable (IV) regression, DualIV, which simplifies traditional two-stage methods via a dual formulation. Inspired by problems in stochastic programming, we show that two-stage procedures for non-linear IV regression can be reformulated as a convex-concave saddle-point problem. Our formulation enables us to circumvent the first-stage regressi… ▽ More

    Submitted 24 October, 2020; v1 submitted 27 October, 2019; originally announced October 2019.

    Comments: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  23. arXiv:1905.06642  [pdf, other

    stat.ML cs.LG

    The Incomplete Rosetta Stone Problem: Identifiability Results for Multi-View Nonlinear ICA

    Authors: Luigi Gresele, Paul K. Rubenstein, Arash Mehrjou, Francesco Locatello, Bernhard Schölkopf

    Abstract: We consider the problem of recovering a common latent source with independent components from multiple views. This applies to settings in which a variable is measured with multiple experimental modalities, and where the goal is to synthesize the disparate measurements into a single unified representation. We consider the case that the observed views are a nonlinear mixing of component-wise corrupt… ▽ More

    Submitted 1 August, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Journal ref: Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence, 2019

  24. arXiv:1901.09206  [pdf, other

    cs.LG stat.ML

    Kernel-Guided Training of Implicit Generative Models with Stability Guarantees

    Authors: Arash Mehrjou, Wittawat Jitkrittum, Krikamol Muandet, Bernhard Schölkopf

    Abstract: Modern implicit generative models such as generative adversarial networks (GANs) are generally known to suffer from issues such as instability, uninterpretability, and difficulty in assessing their performance. If we see these implicit models as dynamical systems, some of these issues are caused by being unable to control their behavior in a meaningful way during the course of training. In this wo… ▽ More

    Submitted 6 November, 2019; v1 submitted 26 January, 2019; originally announced January 2019.

    Comments: This article supersedes arXiv:1901.09206 version 1. The paper is restructured, its writing is improved, and new experiments are added. The main result on stability is unchanged

  25. arXiv:1812.03253  [pdf, other

    cs.LG stat.ML

    Counterfactuals uncover the modular structure of deep generative models

    Authors: Michel Besserve, Arash Mehrjou, Rémy Sun, Bernhard Schölkopf

    Abstract: Deep generative models can emulate the perceptual properties of complex image datasets, providing a latent representation of the data. However, manipulating such representation to perform meaningful and controllable transformations in the data space remains challenging without some form of supervision. While previous work has focused on exploiting statistical independence to disentangle latent fac… ▽ More

    Submitted 12 December, 2019; v1 submitted 7 December, 2018; originally announced December 2018.

    Comments: 26 pages, 17 figures

  26. arXiv:1811.05933  [pdf, other

    eess.SP cs.LG stat.ML

    Deep Nonlinear Non-Gaussian Filtering for Dynamical Systems

    Authors: Arash Mehrjou, Bernhard Schölkopf

    Abstract: Filtering is a general name for inferring the states of a dynamical system given observations. The most common filtering approach is Gaussian Filtering (GF) where the distribution of the inferred states is a Gaussian whose mean is an affine function of the observations. There are two restrictions in this model: Gaussianity and Affinity. We propose a model to relax both these assumptions based on r… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

  27. arXiv:1805.10615  [pdf, other

    stat.ML cs.LG math.DS

    A Local Information Criterion for Dynamical Systems

    Authors: Arash Mehrjou, Friedrich Solowjow, Sebastian Trimpe, Bernhard Schölkopf

    Abstract: Encoding a sequence of observations is an essential task with many applications. The encoding can become highly efficient when the observations are generated by a dynamical system. A dynamical system imposes regularities on the observations that can be leveraged to achieve a more efficient code. We propose a method to encode a given or learned dynamical system. Apart from its application for encod… ▽ More

    Submitted 27 May, 2018; originally announced May 2018.

  28. arXiv:1805.08916  [pdf, other

    stat.ML cs.LG

    Distribution Aware Active Learning

    Authors: Arash Mehrjou, Mehran Khodabandeh, Greg Mori

    Abstract: Discriminative learning machines often need a large set of labeled samples for training. Active learning (AL) settings assume that the learner has the freedom to ask an oracle to label its desired samples. Traditional AL algorithms heuristically choose query samples about which the current learner is uncertain. This strategy does not make good use of the structure of the dataset at hand and is pro… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

  29. arXiv:1805.08306  [pdf, other

    stat.ML cs.LG

    Deep Energy Estimator Networks

    Authors: Saeed Saremi, Arash Mehrjou, Bernhard Schölkopf, Aapo Hyvärinen

    Abstract: Density estimation is a fundamental problem in statistical learning. This problem is especially challenging for complex high-dimensional data due to the curse of dimensionality. A promising solution to this problem is given here in an inference-free hierarchical framework that is built on score matching. We revisit the Bayesian interpretation of the score function and the Parzen score matching, an… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

  30. arXiv:1803.05045  [pdf, other

    stat.ML cs.LG

    Analysis of Nonautonomous Adversarial Systems

    Authors: Arash Mehrjou

    Abstract: Generative adversarial networks are used to generate images but still their convergence properties are not well understood. There have been a few studies who intended to investigate the stability properties of GANs as a dynamical system. This short writing can be seen in that direction. Among the proposed methods for stabilizing training of GANs, ß-GAN was the first who proposed a complete anneali… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.

    Comments: 5 pages

  31. arXiv:1802.04374  [pdf, other

    stat.ML cs.CR cs.LG

    Tempered Adversarial Networks

    Authors: Mehdi S. M. Sajjadi, Giambattista Parascandolo, Arash Mehrjou, Bernhard Schölkopf

    Abstract: Generative adversarial networks (GANs) have been shown to produce realistic samples from high-dimensional distributions, but training them is considered hard. A possible explanation for training instabilities is the inherent imbalance between the networks: While the discriminator is trained directly on both real and fake samples, the generator only has control over the fake samples it produces sin… ▽ More

    Submitted 11 July, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: accepted to ICML 2018

  32. arXiv:1711.02799  [pdf, other

    cs.LG cs.CL cs.NE

    Fidelity-Weighted Learning

    Authors: Mostafa Dehghani, Arash Mehrjou, Stephan Gouws, Jaap Kamps, Bernhard Schölkopf

    Abstract: Training deep neural networks requires many training samples, but in practice training labels are expensive to obtain and may be of varying quality, as some may be from trusted expert labelers while others might be from heuristics or other sources of weak supervision such as crowd-sourcing. This creates a fundamental quality versus-quantity trade-off in the learning process. Do we learn from the s… ▽ More

    Submitted 23 May, 2018; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: Published as a conference paper at ICLR 2018

  33. arXiv:1705.07505  [pdf, other

    stat.ML cs.LG

    Annealed Generative Adversarial Networks

    Authors: Arash Mehrjou, Bernhard Schölkopf, Saeed Saremi

    Abstract: We introduce a novel framework for adversarial training where the target distribution is annealed between the uniform distribution and the data distribution. We posited a conjecture that learning under continuous annealing in the nonparametric regime is stable irrespective of the divergence measures in the objective function and proposed an algorithm, dubbed ß-GAN, in corollary. In this framework,… ▽ More

    Submitted 21 May, 2017; originally announced May 2017.

    Comments: 9 pages, 6 figures