Skip to main content

Showing 1–11 of 11 results for author: Fabian, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17902  [pdf, other

    eess.IV cs.CV cs.LG

    Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models

    Authors: Mohammad Shahab Sepehri, Zalan Fabian, Mahdi Soltanolkotabi

    Abstract: The landscape of computational building blocks of efficient image restoration architectures is dominated by a combination of convolutional processing and various attention mechanisms. However, convolutional filters, while efficient, are inherently local and therefore struggle with modeling long-range dependencies in images. In contrast, attention excels at capturing global interactions between arb… ▽ More

    Submitted 29 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 7 figures, under review

    ACM Class: I.4.4; I.4.5

  2. arXiv:2311.07784  [pdf, other

    cs.LG cs.CV

    A Data-Free Approach to Mitigate Catastrophic Forgetting in Federated Class Incremental Learning for Vision Tasks

    Authors: Sara Babakniya, Zalan Fabian, Chaoyang He, Mahdi Soltanolkotabi, Salman Avestimehr

    Abstract: Deep learning models often suffer from forgetting previously learned information when trained on new data. This problem is exacerbated in federated learning (FL), where the data is distributed and can change independently for each user. Many solutions are proposed to resolve this catastrophic forgetting in a centralized setting. However, they do not apply directly to FL because of its unique compl… ▽ More

    Submitted 21 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted in NeurIPS 2023. arXiv admin note: text overlap with arXiv:2307.00497

  3. arXiv:2311.01064  [pdf, other

    cs.CV cs.LG

    Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images

    Authors: Zalan Fabian, Zhongqi Miao, Chunyuan Li, Yuanhan Zhang, Ziwei Liu, Andrés Hernández, Andrés Montes-Rojas, Rafael Escucha, Laura Siabatto, Andrés Link, Pablo Arbeláez, Rahul Dodhia, Juan Lavista Ferres

    Abstract: Due to deteriorating environmental conditions and increasing human activity, conservation efforts directed towards wildlife is crucial. Motion-activated camera traps constitute an efficient tool for tracking and monitoring wildlife populations across the globe. Supervised learning techniques have been successfully deployed to analyze such imagery, however training such techniques requires annotati… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 18 pages, 9 figures

  4. arXiv:2309.06642  [pdf, other

    eess.IV cs.LG

    Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion Models

    Authors: Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi

    Abstract: Inverse problems arise in a multitude of applications, where the goal is to recover a clean signal from noisy and possibly (non)linear observations. The difficulty of a reconstruction problem depends on multiple factors, such as the structure of the ground truth signal, the severity of the degradation and the complex interactions between the above. This results in natural sample-by-sample variatio… ▽ More

    Submitted 4 February, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 30 pages, 21 figures, preliminary version

    ACM Class: I.2.6; I.4.5

  5. arXiv:2307.13744  [pdf, other

    cs.LG math.OC

    mL-BFGS: A Momentum-based L-BFGS for Distributed Large-Scale Neural Network Optimization

    Authors: Yue Niu, Zalan Fabian, Sunwoo Lee, Mahdi Soltanolkotabi, Salman Avestimehr

    Abstract: Quasi-Newton methods still face significant challenges in training large-scale neural networks due to additional compute costs in the Hessian related computations and instability issues in stochastic training. A well-known method, L-BFGS that efficiently approximates the Hessian using history parameter and gradient changes, suffers convergence instability in stochastic training. So far, attempts t… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to TMLR 2023 (21 pages, 8 figures)

  6. arXiv:2307.00497  [pdf, other

    cs.LG cs.AI

    Don't Memorize; Mimic The Past: Federated Class Incremental Learning Without Episodic Memory

    Authors: Sara Babakniya, Zalan Fabian, Chaoyang He, Mahdi Soltanolkotabi, Salman Avestimehr

    Abstract: Deep learning models are prone to forgetting information learned in the past when trained on new data. This problem becomes even more pronounced in the context of federated learning (FL), where data is decentralized and subject to independent changes for each user. Continual Learning (CL) studies this so-called \textit{catastrophic forgetting} phenomenon primarily in centralized settings, where th… ▽ More

    Submitted 17 July, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

  7. arXiv:2303.14353  [pdf, other

    eess.IV cs.CV cs.LG

    DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency

    Authors: Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi

    Abstract: Diffusion models have established new state of the art in a multitude of computer vision tasks, including image restoration. Diffusion-based inverse problem solvers generate reconstructions of exceptional visual quality from heavily corrupted measurements. However, in what is widely known as the perception-distortion trade-off, the price of perceptually appealing reconstructions is often paid in d… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 28 pages, 13 figures, preliminary version

    ACM Class: I.2.6; I.4.4; I.4.5

  8. arXiv:2203.08213  [pdf, other

    eess.IV cs.CV cs.LG

    HUMUS-Net: Hybrid unrolled multi-scale network architecture for accelerated MRI reconstruction

    Authors: Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi

    Abstract: In accelerated MRI reconstruction, the anatomy of a patient is recovered from a set of under-sampled and noisy measurements. Deep learning approaches have been proven to be successful in solving this ill-posed inverse problem and are capable of producing very high quality reconstructions. However, current architectures heavily rely on convolutions, that are content-independent and have difficultie… ▽ More

    Submitted 16 March, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: 18 pages, 11 figures, NeurIPS 2022

    ACM Class: I.2; I.4; J.3

  9. arXiv:2106.14947  [pdf, other

    eess.IV cs.CV cs.LG

    Data augmentation for deep learning based accelerated MRI reconstruction with limited data

    Authors: Zalan Fabian, Reinhard Heckel, Mahdi Soltanolkotabi

    Abstract: Deep neural networks have emerged as very successful tools for image restoration and reconstruction tasks. These networks are often trained end-to-end to directly reconstruct an image from a noisy or corrupted measurement of that image. To achieve state-of-the-art performance, training on large and diverse sets of images is considered critical. However, it is often difficult and/or expensive to co… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 27 pages, 19 figures, to be published in ICML2021

    ACM Class: I.2; I.4; J.3

  10. arXiv:2006.10581  [pdf, other

    cs.LG cs.IT stat.ML

    Minimax Lower Bounds for Transfer Learning with Linear and One-hidden Layer Neural Networks

    Authors: Seyed Mohammadreza Mousavi Kalan, Zalan Fabian, A. Salman Avestimehr, Mahdi Soltanolkotabi

    Abstract: Transfer learning has emerged as a powerful technique for improving the performance of machine learning models on new domains where labeled training data may be scarce. In this approach a model trained for a source task, where plenty of labeled training data is available, is used as a starting point for training a model on a related target task with only few labeled training data. Despite recent e… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

  11. arXiv:1906.05392  [pdf, other

    cs.LG math.OC stat.ML

    Generalization Guarantees for Neural Networks via Harnessing the Low-rank Structure of the Jacobian

    Authors: Samet Oymak, Zalan Fabian, Mingchen Li, Mahdi Soltanolkotabi

    Abstract: Modern neural network architectures often generalize well despite containing many more parameters than the size of the training dataset. This paper explores the generalization capabilities of neural networks trained via gradient descent. We develop a data-dependent optimization and generalization theory which leverages the low-rank structure of the Jacobian matrix associated with the network. Our… ▽ More

    Submitted 3 July, 2019; v1 submitted 12 June, 2019; originally announced June 2019.