Skip to main content

Showing 1–8 of 8 results for author: Yosef, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.07008  [pdf, other

    cs.LG cs.AI cs.CL stat.ME

    AutoEval Done Right: Using Synthetic Data for Model Evaluation

    Authors: Pierre Boyeau, Anastasios N. Angelopoulos, Nir Yosef, Jitendra Malik, Michael I. Jordan

    Abstract: The evaluation of machine learning models using human-labeled validation data can be expensive and time-consuming. AI-labeled synthetic data can be used to decrease the number of human annotations required for this purpose in a process called autoevaluation. We suggest efficient and statistically principled algorithms for this purpose that improve sample efficiency while remaining unbiased. These… ▽ More

    Submitted 28 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: New experiments, fix fig 1

  2. arXiv:2002.07217  [pdf, other

    stat.ML cs.AI cs.LG

    Decision-Making with Auto-Encoding Variational Bayes

    Authors: Romain Lopez, Pierre Boyeau, Nir Yosef, Michael I. Jordan, Jeffrey Regier

    Abstract: To make decisions based on a model fit with auto-encoding variational Bayes (AEVB), practitioners often let the variational distribution serve as a surrogate for the posterior distribution. This approach yields biased estimates of the expected risk, and therefore leads to poor decisions for two reasons. First, the model fit with AEVB may not equal the underlying data distribution. Second, the vari… ▽ More

    Submitted 21 October, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Journal ref: Advances in Neural Information Processing Systems 2020

  3. arXiv:1905.02269  [pdf, other

    cs.LG q-bio.GN stat.ML

    A joint model of unpaired data from scRNA-seq and spatial transcriptomics for imputing missing gene expression measurements

    Authors: Romain Lopez, Achille Nazaret, Maxime Langevin, Jules Samaran, Jeffrey Regier, Michael I. Jordan, Nir Yosef

    Abstract: Spatial studies of transcriptome provide biologists with gene expression maps of heterogeneous and complex tissues. However, most experimental protocols for spatial transcriptomics suffer from the need to select beforehand a small fraction of genes to be quantified over the entire transcriptome. Standard single-cell RNA sequencing (scRNA-seq) is more prevalent, easier to implement and can in princ… ▽ More

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: submitted to the 2019 ICML Workshop on Computational Biology

  4. arXiv:1809.05957  [pdf, ps, other

    cs.LG stat.ML

    A Deep Generative Model for Semi-Supervised Classification with Noisy Labels

    Authors: Maxime Langevin, Edouard Mehlman, Jeffrey Regier, Romain Lopez, Michael I. Jordan, Nir Yosef

    Abstract: Class labels are often imperfectly observed, due to mistakes and to genuine ambiguity among classes. We propose a new semi-supervised deep generative model that explicitly models noisy labels, called the Mislabeled VAE (M-VAE). The M-VAE can perform better than existing deep generative models which do not account for label noise. Additionally, the derivation of M-VAE gives new theoretical insights… ▽ More

    Submitted 16 September, 2018; originally announced September 2018.

    Comments: accepted to BayLearn 2018

    MSC Class: 68T37

  5. arXiv:1805.08672  [pdf, other

    cs.LG q-bio.GN stat.ML

    Information Constraints on Auto-Encoding Variational Bayes

    Authors: Romain Lopez, Jeffrey Regier, Michael I. Jordan, Nir Yosef

    Abstract: Parameterizing the approximate posterior of a generative model with neural networks has become a common theme in recent machine learning research. While providing appealing flexibility, this approach makes it difficult to impose or assess structural constraints such as conditional independence. We propose a framework for learning representations that relies on Auto-Encoding Variational Bayes and w… ▽ More

    Submitted 28 November, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

    Journal ref: Advances in Neural Information Processing Systems 31 (2018)

  6. arXiv:1710.05086   

    cs.LG q-bio.GN stat.ML

    A deep generative model for single-cell RNA sequencing with application to detecting differentially expressed genes

    Authors: Romain Lopez, Jeffrey Regier, Michael Cole, Michael Jordan, Nir Yosef

    Abstract: We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing. In the model, each cell has a low-dimensional latent representation. Additional latent variables account for technical effects that may erroneously set some observations of gene expression levels to zero. Conditional distributions are specified by neural networks, giving t… ▽ More

    Submitted 16 October, 2017; v1 submitted 13 October, 2017; originally announced October 2017.

    Comments: Updated a previous submission instead. See arXiv:1709.02082

  7. arXiv:1709.02082  [pdf, other

    cs.LG q-bio.GN stat.ML

    A deep generative model for gene expression profiles from single-cell RNA sequencing

    Authors: Romain Lopez, Jeffrey Regier, Michael Cole, Michael Jordan, Nir Yosef

    Abstract: We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing. In the model, each cell has a low-dimensional latent representation. Additional latent variables account for technical effects that may erroneously set some observations of gene expression levels to zero. Conditional distributions are specified by neural networks, giving t… ▽ More

    Submitted 16 January, 2018; v1 submitted 7 September, 2017; originally announced September 2017.

    Comments: BayLearn2017, NIPS workshop MLCB 2017

  8. arXiv:1609.04918  [pdf, other

    cs.CC cs.DS

    Steiner Network Problems on Temporal Graphs

    Authors: Alex Khodaverdian, Benjamin Weitz, Jimmy Wu, Nir Yosef

    Abstract: We introduce a temporal Steiner network problem in which a graph, as well as changes to its edges and/or vertices over a set of discrete times, are given as input; the goal is to find a minimal subgraph satisfying a set of $k$ time-sensitive connectivity demands. We show that this problem, $k$-Temporal Steiner Network ($k$-TSN), is NP-hard to approximate to a factor of $k - ε$, for every fixed… ▽ More

    Submitted 31 August, 2017; v1 submitted 16 September, 2016; originally announced September 2016.

    ACM Class: F.2.2