Skip to main content

Showing 1–32 of 32 results for author: Sulam, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16052  [pdf, other

    cs.LG eess.SP stat.ML

    Pivotal Auto-Encoder via Self-Normalizing ReLU

    Authors: Nelson Goldenstein, Jeremias Sulam, Yaniv Romano

    Abstract: Sparse auto-encoders are useful for extracting low-dimensional representations from high-dimensional data. However, their performance degrades sharply when the input noise at test time differs from the noise employed during training. This limitation hinders the applicability of auto-encoders in real-world scenarios where the level of noise in the input is unpredictable. In this paper, we formalize… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2405.19146  [pdf, other

    stat.ML cs.LG

    I Bet You Did Not Mean That: Testing Semantic Importance via Betting

    Authors: Jacopo Teneggi, Jeremias Sulam

    Abstract: Recent works have extended notions of feature importance to \emph{semantic concepts} that are inherently interpretable to the users interacting with a black-box predictive model. Yet, precise statistical guarantees, such as false positive rate control, are needed to communicate findings transparently and to avoid unintended consequences in real-world scenarios. In this paper, we formalize the glob… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.14176  [pdf, other

    cs.LG cs.AI

    Certified Robustness against Sparse Adversarial Perturbations via Data Localization

    Authors: Ambar Pal, René Vidal, Jeremias Sulam

    Abstract: Recent work in adversarial robustness suggests that natural data distributions are localized, i.e., they place high probability in small volume regions of the input space, and that this property can be utilized for designing classifiers with improved robustness guarantees for $\ell_2$-bounded perturbations. Yet, it is still unclear if this observation holds true for more general metrics. In this w… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2310.14344  [pdf, other

    cs.CV cs.LG

    What's in a Prior? Learned Proximal Networks for Inverse Problems

    Authors: Zhenghan Fang, Sam Buchanan, Jeremias Sulam

    Abstract: Proximal operators are ubiquitous in inverse problems, commonly appearing as part of algorithmic strategies to regularize problems that are otherwise ill-posed. Modern deep learning models have been brought to bear for these tasks too, as in the framework of plug-and-play or deep unrolling, where they loosely resemble proximal operators. Yet, something essential is lost in employing these purely d… ▽ More

    Submitted 27 March, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

  5. arXiv:2309.16096  [pdf, other

    cs.LG cs.AI

    Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness

    Authors: Ambar Pal, Jeremias Sulam, René Vidal

    Abstract: The susceptibility of modern machine learning classifiers to adversarial examples has motivated theoretical results suggesting that these might be unavoidable. However, these results can be too general to be applicable to natural data distributions. Indeed, humans are quite robust for tasks involving vision. This apparent conflict motivates a deeper dive into the question: Are adversarial examples… ▽ More

    Submitted 25 May, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to Neural Information Processing Systems (NeurIPS) 2023

  6. arXiv:2307.00426  [pdf, other

    cs.LG cs.AI

    Sparsity-aware generalization theory for deep neural networks

    Authors: Ramchandran Muthukumar, Jeremias Sulam

    Abstract: Deep artificial neural networks achieve surprising generalization abilities that remain poorly understood. In this paper, we present a new approach to analyzing generalization for deep feed-forward ReLU networks that takes advantage of the degree of sparsity that is achieved in the hidden layer activations. By develo** a framework that accounts for this reduced effective model size for each inpu… ▽ More

    Submitted 4 July, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

  7. arXiv:2305.04746  [pdf, other

    cs.LG cs.AI

    Understanding Noise-Augmented Training for Randomized Smoothing

    Authors: Ambar Pal, Jeremias Sulam

    Abstract: Randomized smoothing is a technique for providing provable robustness guarantees against adversarial attacks while making minimal assumptions about a classifier. This method relies on taking a majority vote of any base classifier over multiple noise-perturbed inputs to obtain a smoothed classifier, and it remains the tool of choice to certify deep and complex neural network models. Nonetheless, no… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Transactions on Machine Learning Research, 2023

  8. arXiv:2302.03791  [pdf, other

    stat.ML cs.CV cs.LG

    How to Trust Your Diffusion Model: A Convex Optimization Approach to Conformal Risk Control

    Authors: Jacopo Teneggi, Matthew Tivnan, J. Webster Stayman, Jeremias Sulam

    Abstract: Score-based generative modeling, informally referred to as diffusion models, continue to grow in popularity across several important domains and tasks. While they provide high-quality and diverse samples from empirical distributions, important questions remain on the reliability and trustworthiness of these sampling procedures for their responsible use in critical scenarios. Conformal prediction i… ▽ More

    Submitted 27 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Journal ref: International Conference on Machine Learning (2023)

  9. arXiv:2211.15924  [pdf, other

    cs.CV

    Weakly Supervised Learning Significantly Reduces the Number of Labels Required for Intracranial Hemorrhage Detection on Head CT

    Authors: Jacopo Teneggi, Paul H. Yi, Jeremias Sulam

    Abstract: Modern machine learning pipelines, in particular those based on deep learning (DL) models, require large amounts of labeled data. For classification problems, the most common learning paradigm consists of presenting labeled examples during training, thus providing strong supervision on what constitutes positive and negative samples. This constitutes a major obstacle for the development of DL model… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  10. arXiv:2209.04504  [pdf, other

    eess.IV cs.CV cs.LG

    DeepSTI: Towards Tensor Reconstruction using Fewer Orientations in Susceptibility Tensor Imaging

    Authors: Zhenghan Fang, Kuo-Wei Lai, Peter van Zijl, Xu Li, Jeremias Sulam

    Abstract: Susceptibility tensor imaging (STI) is an emerging magnetic resonance imaging technique that characterizes the anisotropic tissue magnetic susceptibility with a second-order tensor model. STI has the potential to provide information for both the reconstruction of white matter fiber pathways and detection of myelin changes in the brain at mm resolution or less, which would be of great value for und… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

  11. arXiv:2207.12497  [pdf, other

    cs.LG cs.CY

    Estimating and Controlling for Equalized Odds via Sensitive Attribute Predictors

    Authors: Beepul Bharti, Paul Yi, Jeremias Sulam

    Abstract: As the use of machine learning models in real world high-stakes decision settings continues to grow, it is highly important that we are able to audit and control for any potential fairness violations these models may exhibit towards certain groups. To do so, one naturally requires access to sensitive attributes, such as demographics, gender, or other potentially sensitive features that determine g… ▽ More

    Submitted 8 June, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

  12. arXiv:2207.07038  [pdf, other

    cs.LG

    SHAP-XRT: The Shapley Value Meets Conditional Independence Testing

    Authors: Jacopo Teneggi, Beepul Bharti, Yaniv Romano, Jeremias Sulam

    Abstract: The complex nature of artificial neural networks raises concerns on their reliability, trustworthiness, and fairness in real-world scenarios. The Shapley value -- a solution concept from game theory -- is one of the most popular explanation methods for machine learning models. More traditionally, from a statistical perspective, feature importance is defined in terms of conditional independence. So… ▽ More

    Submitted 27 December, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Journal ref: Transactions on Machine Learning Research (2023)

  13. arXiv:2202.13216  [pdf, other

    cs.LG

    Adversarial robustness of sparse local Lipschitz predictors

    Authors: Ramchandran Muthukumar, Jeremias Sulam

    Abstract: This work studies the adversarial robustness of parametric functions composed of a linear predictor and a non-linear representation map. % that satisfies certain stability condition. Our analysis relies on \emph{sparse local Lipschitzness} (SLL), an extension of local Lipschitz continuity that better captures the stability and reduced effective dimensionality of predictors upon local perturbations… ▽ More

    Submitted 3 March, 2023; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: Updated experiments

  14. arXiv:2112.07782  [pdf, other

    q-bio.BM cs.LG

    Deciphering antibody affinity maturation with language models and weakly supervised learning

    Authors: Jeffrey A. Ruffolo, Jeffrey J. Gray, Jeremias Sulam

    Abstract: In response to pathogens, the adaptive immune system generates specific antibodies that bind and neutralize foreign antigens. Understanding the composition of an individual's immune repertoire can provide insights into this process and reveal potential therapeutic antibodies. In this work, we explore the application of antibody-specific language models to aid understanding of immune repertoires. W… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: Presented at Machine Learning for Structural Biology Workshop, NeurIPS 2021

  15. arXiv:2109.10778  [pdf, other

    cs.CV cs.LG

    Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images

    Authors: Zhenzhen Wang, Carla Saoud, Sintawat Wangsiricharoen, Aaron W. James, Aleksander S. Popel, Jeremias Sulam

    Abstract: Annotating cancerous regions in whole-slide images (WSIs) of pathology samples plays a critical role in clinical diagnosis, biomedical research, and machine learning algorithms development. However, generating exhaustive and accurate annotations is labor-intensive, challenging, and costly. Drawing only coarse and approximate annotations is a much easier task, less costly, and it alleviates patholo… ▽ More

    Submitted 7 June, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

  16. arXiv:2105.02375  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    A Geometric Analysis of Neural Collapse with Unconstrained Features

    Authors: Zhihui Zhu, Tianyu Ding, **xin Zhou, Xiao Li, Chong You, Jeremias Sulam, Qing Qu

    Abstract: We provide the first global optimization landscape analysis of $Neural\;Collapse$ -- an intriguing empirical phenomenon that arises in the last-layer classifiers and features of neural networks during the terminal phase of training. As recently reported by Papyan et al., this phenomenon implies that ($i$) the class means and the last-layer classifiers all collapse to the vertices of a Simplex Equi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 42 pages, 8 figures, 1 table; the first two authors contributed to this work equally

  17. Fast Hierarchical Games for Image Explanations

    Authors: Jacopo Teneggi, Alexandre Luster, Jeremias Sulam

    Abstract: As modern complex neural networks keep breaking records and solving harder problems, their predictions also become less and less intelligible. The current lack of interpretability often undermines the deployment of accurate machine learning tools in sensitive settings. In this work, we present a model-agnostic explanation method for image classification based on a hierarchical extension of Shapley… ▽ More

    Submitted 9 June, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: 20 pages, 8 figures

  18. arXiv:2010.12088  [pdf, other

    cs.LG stat.ML

    Adversarial Robustness of Supervised Sparse Coding

    Authors: Jeremias Sulam, Ramchandran Muthukumar, Raman Arora

    Abstract: Several recent results provide theoretical insights into the phenomena of adversarial examples. Existing results, however, are often limited due to a gap between the simplicity of the models studied and the complexity of those deployed in practice. In this work, we strike a better balance by considering a model that involves learning a representation while at the same time giving a precise general… ▽ More

    Submitted 4 January, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Journal ref: Advances in Neural Information Processing Systems, 2020

  19. arXiv:2008.05024  [pdf, other

    eess.IV cs.CV cs.LG

    Learned Proximal Networks for Quantitative Susceptibility Map**

    Authors: Kuo-Wei Lai, Manisha Aggarwal, Peter van Zijl, Xu Li, Jeremias Sulam

    Abstract: Quantitative Susceptibility Map** (QSM) estimates tissue magnetic susceptibility distributions from Magnetic Resonance (MR) phase measurements by solving an ill-posed dipole inversion problem. Conventional single orientation QSM methods usually employ regularization strategies to stabilize such inversion, but may suffer from streaking artifacts or over-smoothing. Multiple orientation QSM such as… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 11 pages

  20. arXiv:2007.08383  [pdf, other

    q-bio.BM cs.LG

    Deep Learning in Protein Structural Modeling and Design

    Authors: Wenhao Gao, Sai Pooja Mahajan, Jeremias Sulam, Jeffrey J. Gray

    Abstract: Deep learning is catalyzing a scientific revolution fueled by big data, accessible toolkits, and powerful computational resources, impacting many fields including protein structural modeling. Protein structural modeling, such as predicting structure from amino acid sequence and evolutionary information, designing proteins toward desirable functionality, or predicting properties or behavior of a pr… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

  21. arXiv:2006.06179  [pdf, other

    cs.LG stat.ML

    Recovery and Generalization in Over-Realized Dictionary Learning

    Authors: Jeremias Sulam, Chong You, Zhihui Zhu

    Abstract: In over two decades of research, the field of dictionary learning has gathered a large collection of successful applications, and theoretical guarantees for model recovery are known only whenever optimization is carried out in the same model class as that of the underlying dictionary. This work characterizes the surprising phenomenon that dictionary recovery can be facilitated by searching over th… ▽ More

    Submitted 1 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

  22. arXiv:1811.00312  [pdf, other

    cs.CV

    A Local Block Coordinate Descent Algorithm for the Convolutional Sparse Coding Model

    Authors: Ev Zisselman, Jeremias Sulam, Michael Elad

    Abstract: The Convolutional Sparse Coding (CSC) model has recently gained considerable traction in the signal and image processing communities. By providing a global, yet tractable, model that operates on the whole image, the CSC was shown to overcome several limitations of the patch-based sparse model while achieving superior performance in various applications. Contemporary methods for pursuit and learnin… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: 13 pages, 10 figures

    MSC Class: 08

  23. MMSE Approximation For Sparse Coding Algorithms Using Stochastic Resonance

    Authors: Dror Simon, Jeremias Sulam, Yaniv Romano, Yue M. Lu, Michael Elad

    Abstract: Sparse coding refers to the pursuit of the sparsest representation of a signal in a typically overcomplete dictionary. From a Bayesian perspective, sparse coding provides a Maximum a Posteriori (MAP) estimate of the unknown vector under a sparse prior. In this work, we suggest enhancing the performance of sparse coding algorithms by a deliberate and controlled contamination of the input with rando… ▽ More

    Submitted 11 April, 2019; v1 submitted 26 June, 2018; originally announced June 2018.

  24. arXiv:1806.00701  [pdf, other

    cs.LG stat.ML

    On Multi-Layer Basis Pursuit, Efficient Algorithms and Convolutional Neural Networks

    Authors: Jeremias Sulam, Aviad Aberdam, Amir Beck, Michael Elad

    Abstract: Parsimonious representations are ubiquitous in modeling and processing information. Motivated by the recent Multi-Layer Convolutional Sparse Coding (ML-CSC) model, we herein generalize the traditional Basis Pursuit problem to a multi-layer setting, introducing similar sparse enforcing penalties at different representation layers in a symbiotic relation between synthesis and analysis sparse priors.… ▽ More

    Submitted 21 November, 2018; v1 submitted 2 June, 2018; originally announced June 2018.

  25. arXiv:1805.11596  [pdf, other

    stat.ML cs.IT cs.LG

    Adversarial Noise Attacks of Deep Learning Architectures -- Stability Analysis via Sparse Modeled Signals

    Authors: Yaniv Romano, Aviad Aberdam, Jeremias Sulam, Michael Elad

    Abstract: Despite their impressive performance, deep convolutional neural networks (CNNs) have been shown to be sensitive to small adversarial perturbations. These nuisances, which one can barely notice, are powerful enough to fool sophisticated and well performing classifiers, leading to ridiculous misclassification results. In this paper we analyze the stability of state-of-the-art deep-learning classific… ▽ More

    Submitted 5 August, 2019; v1 submitted 29 May, 2018; originally announced May 2018.

  26. arXiv:1804.09788  [pdf, other

    eess.IV cs.LG eess.SP

    Multi-Layer Sparse Coding: The Holistic Way

    Authors: Aviad Aberdam, Jeremias Sulam, Michael Elad

    Abstract: The recently proposed multi-layer sparse model has raised insightful connections between sparse representations and convolutional neural networks (CNN). In its original conception, this model was restricted to a cascade of convolutional synthesis representations. In this paper, we start by addressing a more general model, revealing interesting ties to fully connected networks. We then show that th… ▽ More

    Submitted 25 July, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

  27. arXiv:1708.08705  [pdf, other

    cs.CV cs.LG stat.ML

    Multi-Layer Convolutional Sparse Modeling: Pursuit and Dictionary Learning

    Authors: Jeremias Sulam, Vardan Papyan, Yaniv Romano, Michael Elad

    Abstract: The recently proposed Multi-Layer Convolutional Sparse Coding (ML-CSC) model, consisting of a cascade of convolutional sparse layers, provides a new interpretation of Convolutional Neural Networks (CNNs). Under this framework, the computation of the forward pass in a CNN is equivalent to a pursuit algorithm aiming to estimate the nested sparse representation vectors -- or feature maps -- from a gi… ▽ More

    Submitted 30 June, 2018; v1 submitted 29 August, 2017; originally announced August 2017.

    Journal ref: IEEE Transactions on Signal Processing, vol. 66, no. 15, pp. 4090-4104, Aug.1, 1 2018

  28. Working Locally Thinking Globally: Theoretical Guarantees for Convolutional Sparse Coding

    Authors: Vardan Papyan, Jeremias Sulam, Michael Elad

    Abstract: The celebrated sparse representation model has led to remarkable results in various signal processing tasks in the last decade. However, despite its initial purpose of serving as a global prior for entire signals, it has been commonly used for modeling low dimensional patches due to the computational constraints it entails when deployed with learned dictionaries. A way around this problem has been… ▽ More

    Submitted 12 July, 2017; originally announced July 2017.

    Comments: This is the journal version of arXiv:1607.02005 and arXiv:1607.02009, accepted to IEEE Transactions on Signal Processing

  29. arXiv:1705.03239  [pdf, other

    cs.CV

    Convolutional Dictionary Learning via Local Processing

    Authors: Vardan Papyan, Yaniv Romano, Jeremias Sulam, Michael Elad

    Abstract: Convolutional Sparse Coding (CSC) is an increasingly popular model in the signal and image processing communities, tackling some of the limitations of traditional patch-based sparse representations. Although several works have addressed the dictionary learning problem under this model, these relied on an ADMM formulation in the Fourier domain, losing the sense of locality and the relation to the t… ▽ More

    Submitted 9 May, 2017; originally announced May 2017.

  30. arXiv:1607.02009  [pdf, other

    cs.IT

    Working Locally Thinking Globally - Part II: Stability and Algorithms for Convolutional Sparse Coding

    Authors: Vardan Papyan, Jeremias Sulam, Michael Elad

    Abstract: The convolutional sparse model has recently gained increasing attention in the signal and image processing communities, and several methods have been proposed for solving the pursuit problem emerging from it -- in particular its convex relaxation, Basis Pursuit. In the first of this two-part work, we have provided a theoretical back-bone for this model, providing guarantees for the uniqueness of t… ▽ More

    Submitted 22 February, 2017; v1 submitted 7 July, 2016; originally announced July 2016.

  31. arXiv:1607.02005  [pdf, other

    cs.IT

    Working Locally Thinking Globally - Part I: Theoretical Guarantees for Convolutional Sparse Coding

    Authors: Vardan Papyan, Jeremias Sulam, Michael Elad

    Abstract: The celebrated sparse representation model has led to remarkable results in various signal processing tasks in the last decade. However, despite its initial purpose of serving as a global prior for entire signals, it has been commonly used for modeling low dimensional patches due to the computational constraints it entails when deployed with learned dictionaries. A way around this problem has been… ▽ More

    Submitted 22 February, 2017; v1 submitted 7 July, 2016; originally announced July 2016.

  32. Trainlets: Dictionary Learning in High Dimensions

    Authors: Jeremias Sulam, Boaz Ophir, Michael Zibulevsky, Michael Elad

    Abstract: Sparse representations has shown to be a very powerful model for real world signals, and has enabled the development of applications with notable performance. Combined with the ability to learn a dictionary from signal examples, sparsity-inspired algorithms are often achieving state-of-the-art results in a wide variety of tasks. Yet, these methods have traditionally been restricted to small dimens… ▽ More

    Submitted 12 May, 2016; v1 submitted 31 January, 2016; originally announced February 2016.