Showing 1–13 of 13 results for author: Entezari, R

  1. arXiv:2403.03206

    cs.CV

    Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

    Authors: Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Kyle Lacey, Alex Goodwin, Yannik Marek, Robin Rombach

    Abstract: Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for high-dimensional, perceptual data such as images and videos. Rectified flow is a recent generative model formulation that connects data and noise in a straight line. Despite its better theoretical properties and conceptual simplicity, it is n…

    Submitted 5 March, 2024; originally announced March 2024.

  2. arXiv:2304.14108

    cs.CV cs.CL cs.LG

    DataComp: In search of the next generation of multimodal datasets

    Authors: Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander Ratner, Shuran Song, et al. (9 additional authors not shown)

    Abstract: Multimodal datasets are a critical component in recent breakthroughs such as Stable Diffusion and GPT-4, yet their design does not receive the same research attention as model architectures or training algorithms. To address this shortcoming in the ML ecosystem, we introduce DataComp, a testbed for dataset experiments centered around a new candidate pool of 12.8 billion image-text pairs from Commo…

    Submitted 20 October, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks Track

  3. arXiv:2302.13602

    cs.CV cs.LG

    The Role of Pre-training Data in Transfer Learning

    Authors: Rahim Entezari, Mitchell Wortsman, Olga Saukh, M. Moein Shariatnia, Hanie Sedghi, Ludwig Schmidt

    Abstract: The transfer learning paradigm of model pre-training and subsequent fine-tuning produces high-accuracy models. While most studies recommend scaling the pre-training size to benefit most from transfer learning, a question remains: what data and method should be used for pre-training? We investigate the impact of pre-training data distribution on the few-shot and full fine-tuning performance using 3…

    Submitted 1 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  4. arXiv:2211.08403

    cs.LG cs.AI cs.CV stat.ML

    REPAIR: REnormalizing Permuted Activations for Interpolation Repair

    Authors: Keller Jordan, Hanie Sedghi, Olga Saukh, Rahim Entezari, Behnam Neyshabur

    Abstract: In this paper we look into the conjecture of Entezari et al. (2021) which states that if the permutation invariance of neural networks is taken into account, then there is likely no loss barrier to the linear interpolation between SGD solutions. First, we observe that neuron alignment methods alone are insufficient to establish low-barrier linear connectivity between SGD solutions due to a phenome…

    Submitted 25 September, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  5. arXiv:2207.00200

    cs.LG cs.CV

    Studying the impact of magnitude pruning on contrastive learning methods

    Authors: Francesco Corti, Rahim Entezari, Sara Hooker, Davide Bacciu, Olga Saukh

    Abstract: We study the impact of different pruning techniques on the representation learned by deep neural networks trained with contrastive loss functions. Our work finds that at high sparsity levels, contrastive learning results in a higher number of misclassified examples relative to models trained with traditional cross-entropy loss. To understand this pronounced difference, we use metrics such as the n…

    Submitted 1 July, 2022; originally announced July 2022.

  6. arXiv:2206.10915

    cs.CV

    Understanding the effect of sparsity on neural networks robustness

    Authors: Lukas Timpl, Rahim Entezari, Hanie Sedghi, Behnam Neyshabur, Olga Saukh

    Abstract: This paper examines the impact of static sparsity on the robustness of a trained network to weight perturbations, data corruption, and adversarial examples. We show that, up to a certain sparsity achieved by increasing network width and depth while keeping the network capacity fixed, sparsified networks consistently match and often outperform their initially dense versions. Robustness and accuracy…

    Submitted 22 June, 2022; originally announced June 2022.

  7. arXiv:2206.07422

    eess.IV cs.CV

    Deep Neural Network Pruning for Nuclei Instance Segmentation in Hematoxylin & Eosin-Stained Histological Images

    Authors: Amirreza Mahbod, Rahim Entezari, Isabella Ellinger, Olga Saukh

    Abstract: Recently, pruning deep neural networks (DNNs) has received a lot of attention for improving accuracy and generalization power, reducing network size, and increasing inference speed on specialized hardwares. Although pruning was mainly tested on computer vision tasks, its application in the context of medical image analysis has hardly been explored. This work investigates the impact of well-known p…

    Submitted 15 June, 2022; originally announced June 2022.

  8. arXiv:2110.06296

    cs.LG

    The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks

    Authors: Rahim Entezari, Hanie Sedghi, Olga Saukh, Behnam Neyshabur

    Abstract: In this paper, we conjecture that if the permutation invariance of neural networks is taken into account, SGD solutions will likely have no barrier in the linear interpolation between them. Although it is a bold conjecture, we show how extensive empirical attempts fall short of refuting it. We further provide a preliminary theoretical result to support our conjecture. Our conjecture has implicatio…

    Submitted 5 July, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  9. arXiv:1909.10364

    cs.LG cs.CV

    Class-dependent Compression of Deep Neural Networks

    Authors: Rahim Entezari, Olga Saukh

    Abstract: Today's deep neural networks require substantial computation resources for their training, storage, and inference, which limits their effective use on resource-constrained devices. Many recent research activities explore different options for compressing and optimizing deep models. On the one hand, in many real-world applications, we face the data imbalance challenge, i.e. when the number of label…

    Submitted 19 April, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

  10. arXiv:1807.01334

    stat.ML cs.LG stat.AP stat.CO

    Breast Cancer Diagnosis via Classification Algorithms

    Authors: Reihaneh Entezari

    Abstract: In this paper, we analyze the Wisconsin Diagnostic Breast Cancer Data using Machine Learning classification techniques, such as the SVM, Bayesian Logistic Regression (Variational Approximation), and K-Nearest-Neighbors. We describe each model, and compare their performance through different measures. We conclude that SVM has the best performance among all other classifiers, while it competes close…

    Submitted 3 July, 2018; originally announced July 2018.

  11. arXiv:1807.01239

    stat.AP stat.CO stat.ME

    Bayesian Spatial Analysis of Hardwood Tree Counts in Forests via MCMC

    Authors: Reihaneh Entezari, Patrick E. Brown, Jeffrey S. Rosenthal

    Abstract: In this paper, we perform Bayesian Inference to analyze spatial tree count data from the Timiskaming and Abitibi River forests in Ontario, Canada. We consider a Bayesian Generalized Linear Geostatistical Model and implement a Markov Chain Monte Carlo algorithm to sample from its posterior distribution. How spatial predictions for new sites in the forests change as the amount of training data is re…

    Submitted 3 July, 2018; originally announced July 2018.

  12. arXiv:1805.09521

    cs.CV

    AVID: Adversarial Visual Irregularity Detection

    Authors: Mohammad Sabokrou, Masoud Pourreza, Mohsen Fayyaz, Rahim Entezari, Mahmood Fathy, Jürgen Gall, Ehsan Adeli

    Abstract: Real-time detection of irregularities in visual data is very invaluable and useful in many prospective applications including surveillance, patient monitoring systems, etc. With the surge of deep learning methods in the recent years, researchers have tried a wide spectrum of methods for different applications. However, for the case of irregularity or anomaly detection in videos, training an end-to…

    Submitted 17 July, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

  13. arXiv:1605.02113

    stat.ML stat.CO

    Likelihood Inflating Sampling Algorithm

    Authors: Reihaneh Entezari, Radu V. Craiu, Jeffrey S. Rosenthal

    Abstract: Markov Chain Monte Carlo (MCMC) sampling from a posterior distribution corresponding to a massive data set can be computationally prohibitive since producing one sample requires a number of operations that is linear in the data size. In this paper, we introduce a new communication-free parallel method, the Likelihood Inflating Sampling Algorithm (LISA), that significantly reduces computational cos…

    Submitted 30 June, 2017; v1 submitted 6 May, 2016; originally announced May 2016.

    Comments: 32 pages, 3 figures, submitted