Skip to main content

Showing 1–50 of 60 results for author: Kolouri, S

.
  1. arXiv:2406.19301  [pdf, other

    cs.LG

    MCNC: Manifold Constrained Network Compression

    Authors: Chayne Thrash, Ali Abbasi, Parsa Nooralinejad, Soroush Abbasi Koohpayegani, Reed Andreas, Hamed Pirsiavash, Soheil Kolouri

    Abstract: The outstanding performance of large foundational models across diverse tasks-from computer vision to speech and natural language processing-has significantly increased their demand. However, storing and transmitting these models pose significant challenges due to their massive size (e.g., 350GB for GPT-3). Recent literature has focused on compressing the original weights or reducing the number of… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2405.19047  [pdf, other

    cs.LG cs.AI

    Statistical Context Detection for Deep Lifelong Reinforcement Learning

    Authors: Jeffery Dick, Saptarshi Nath, Christos Peridis, Eseoghene Benjamin, Soheil Kolouri, Andrea Soltoggio

    Abstract: Context detection involves labeling segments of an online stream of data as belonging to different tasks. Task labels are used in lifelong learning algorithms to perform consolidation or other procedures that prevent catastrophic forgetting. Inferring task labels from online experiences remains a challenging problem. Most approaches assume finite and low-dimension observation spaces or a prelimina… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 10 pages excluding references and bibliography. Submitted to CoLLAs 2024

  3. arXiv:2405.16770  [pdf, other

    cs.LG

    Physics informed cell representations for variational formulation of multiscale problems

    Authors: Yuxiang Gao, Soheil Kolouri, Ravindra Duddu

    Abstract: With the rapid advancement of graphical processing units, Physics-Informed Neural Networks (PINNs) are emerging as a promising tool for solving partial differential equations (PDEs). However, PINNs are not well suited for solving PDEs with multiscale features, particularly suffering from slow convergence and poor accuracy. To address this limitation of PINNs, this article proposes physics-informed… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  4. arXiv:2403.19786  [pdf, other

    cs.CV

    Zero-shot Prompt-based Video Encoder for Surgical Gesture Recognition

    Authors: Mingxing Rao, Yinhong Qin, Soheil Kolouri, Jie Ying Wu, Daniel Moyer

    Abstract: Purpose: Surgical video is an important data stream for gesture recognition. Thus, robust visual encoders for those data-streams is similarly important. Methods: Leveraging the Bridge-Prompt framework, we fine-tune a pre-trained vision-text model (CLIP) for gesture recognition in surgical videos. This can utilize extensive outside video data such as text, but also make use of label meta-data and w… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 17 pages,4 figures, 7 tables, IPCAI 2024

  5. arXiv:2403.07142  [pdf, other

    cs.CV cs.CL cs.LG

    One Category One Prompt: Dataset Distillation using Diffusion Models

    Authors: Ali Abbasi, Ashkan Shahbazi, Hamed Pirsiavash, Soheil Kolouri

    Abstract: The extensive amounts of data required for training deep neural networks pose significant challenges on storage and transmission fronts. Dataset distillation has emerged as a promising technique to condense the information of massive datasets into a much smaller yet representative set of synthetic samples. However, traditional dataset distillation approaches often struggle to scale effectively wit… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  6. arXiv:2402.03664  [pdf, other

    cs.LG stat.ML

    Partial Gromov-Wasserstein Metric

    Authors: Yikun Bai, Rocio Diaz Martin, Abihith Kothapalli, Hengrong Du, Xinran Liu, Soheil Kolouri

    Abstract: The Gromov-Wasserstein (GW) distance has gained increasing interest in the machine learning community in recent years, as it allows for the comparison of measures in different metric spaces. To overcome the limitations imposed by the equal mass requirements of the classical GW problem, researchers have begun exploring its application in unbalanced settings. However, Unbalanced GW (UGW) can only be… ▽ More

    Submitted 28 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2402.02345  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Stereographic Spherical Sliced Wasserstein Distances

    Authors: Huy Tran, Yikun Bai, Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Rocio Diaz Martin, Soheil Kolouri

    Abstract: Comparing spherical probability distributions is of great interest in various fields, including geology, medical domains, computer vision, and deep representation learning. The utility of optimal transport-based distances, such as the Wasserstein distance, for comparing probability measures has spurred active research in develo** computationally efficient variations of these distances for spheri… ▽ More

    Submitted 9 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024 (Spotlight). Project page: https://abi-kothapalli.github.io/s3w/

  8. arXiv:2311.12999  [pdf, other

    cs.LG cs.AI

    CovarNav: Machine Unlearning via Model Inversion and Covariance Navigation

    Authors: Ali Abbasi, Chayne Thrash, Elaheh Akbari, Daniel Zhang, Soheil Kolouri

    Abstract: The rapid progress of AI, combined with its unprecedented public adoption and the propensity of large neural networks to memorize training data, has given rise to significant data privacy concerns. To address these concerns, machine unlearning has emerged as an essential technique to selectively remove the influence of specific training data points on trained models. In this paper, we approach the… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  9. arXiv:2311.11995  [pdf, other

    cs.LG cs.AI cs.CR

    BrainWash: A Poisoning Attack to Forget in Continual Learning

    Authors: Ali Abbasi, Parsa Nooralinejad, Hamed Pirsiavash, Soheil Kolouri

    Abstract: Continual learning has gained substantial attention within the deep learning community, offering promising solutions to the challenging problem of sequential learning. Yet, a largely unexplored facet of this paradigm is its susceptibility to adversarial attacks, especially with the aim of inducing forgetting. In this paper, we introduce "BrainWash," a novel data poisoning method tailored to impose… ▽ More

    Submitted 23 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

  10. arXiv:2310.06002  [pdf, other

    cs.LG

    LCOT: Linear circular optimal transport

    Authors: Rocio Diaz Martin, Ivan Medri, Yikun Bai, Xinran Liu, Kangbai Yan, Gustavo K. Rohde, Soheil Kolouri

    Abstract: The optimal transport problem for measures supported on non-Euclidean spaces has recently gained ample interest in diverse applications involving representation learning. In this paper, we focus on circular probability measures, i.e., probability measures supported on the unit circle, and introduce a new computationally efficient metric for these measures, denoted as Linear Circular Optimal Transp… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  11. arXiv:2310.02556  [pdf, other

    cs.CL cs.CV

    NOLA: Compressing LoRA using Linear Combination of Random Basis

    Authors: Soroush Abbasi Koohpayegani, KL Navaneet, Parsa Nooralinejad, Soheil Kolouri, Hamed Pirsiavash

    Abstract: Fine-tuning Large Language Models (LLMs) and storing them for each downstream task or domain is impractical because of the massive model size (e.g., 350GB in GPT-3). Current literature, such as LoRA, showcases the potential of low-rank modifications to the original weights of an LLM, enabling efficient adaptation and storage for task-specific models. These methods can reduce the number of paramete… ▽ More

    Submitted 29 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. Our code is available here: https://github.com/UCDvision/NOLA

  12. arXiv:2309.15787  [pdf, other

    cs.CV cs.LG

    Partial Transport for Point-Cloud Registration

    Authors: Yikun Bai, Huy Tran, Steven B. Damelin, Soheil Kolouri

    Abstract: Point cloud registration plays a crucial role in various fields, including robotics, computer graphics, and medical imaging. This process involves determining spatial relationships between different sets of points, typically within a 3D space. In real-world scenarios, complexities arise from non-rigid movements and partial visibility, such as occlusions or sensor noise, making non-rigid registrati… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  13. arXiv:2307.13571  [pdf, other

    cs.LG

    PT$\mathrm{L}^{p}$: Partial Transport $\mathrm{L}^{p}$ Distances

    Authors: Xinran Liu, Yikun Bai, Huy Tran, Zhanqi Zhu, Matthew Thorpe, Soheil Kolouri

    Abstract: Optimal transport and its related problems, including optimal partial transport, have proven to be valuable tools in machine learning for computing meaningful distances between probability or positive measures. This success has led to a growing interest in defining transport-based distances that allow for comparing signed measures and, more generally, multi-channeled signals. Transport… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  14. arXiv:2306.05553  [pdf, other

    cs.CV cs.LG

    Equivariant vs. Invariant Layers: A Comparison of Backbone and Pooling for Point Cloud Classification

    Authors: Ashkan Shahbazi, Abihith Kothapalli, Xinran Liu, Robert Sheng, Soheil Kolouri

    Abstract: Learning from set-structured data, such as point clouds, has gained significant attention from the community. Geometric deep learning provides a blueprint for designing effective set neural networks by incorporating permutation symmetry. Of our interest are permutation invariant networks, which are composed of a permutation equivariant backbone, permutation invariant global pooling, and regression… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  15. arXiv:2305.15640  [pdf, other

    cs.LG cs.CV

    Characterizing Out-of-Distribution Error via Optimal Transport

    Authors: Yuzhe Lu, Yilong Qin, Runtian Zhai, Andrew Shen, Ketong Chen, Zhenlin Wang, Soheil Kolouri, Simon Stepputtis, Joseph Campbell, Katia Sycara

    Abstract: Out-of-distribution (OOD) data poses serious challenges in deployed machine learning models, so methods of predicting a model's performance on OOD data without labels are important for machine learning safety. While a number of methods have been proposed by prior work, they often underestimate the actual error, sometimes by a large margin, which greatly impacts their applicability to real tasks. I… ▽ More

    Submitted 27 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  16. arXiv:2305.10997  [pdf, other

    cs.LG cs.AI cs.DC cs.MA

    Sharing Lifelong Reinforcement Learning Knowledge via Modulating Masks

    Authors: Saptarshi Nath, Christos Peridis, Eseoghene Ben-Iwhiwhu, Xinran Liu, Shirin Dora, Cong Liu, Soheil Kolouri, Andrea Soltoggio

    Abstract: Lifelong learning agents aim to learn multiple tasks sequentially over a lifetime. This involves the ability to exploit previous knowledge when learning new tasks and to avoid forgetting. Modulating masks, a specific type of parameter isolation approach, have recently shown promise in both supervised and reinforcement learning. While lifelong learning algorithms have been investigated mainly withi… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 25 pages, 14 figures, 9 tables, to be published in the Second Conference on Lifelong Learning Agents (CoLLAs 2023), code can be found at https://github.com/DMIU-ShELL/deeprl-shell

  17. arXiv:2302.10887  [pdf, other

    cs.LG cs.AI

    The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning

    Authors: Andrea Soltoggio, Eseoghene Ben-Iwhiwhu, Christos Peridis, Pawel Ladosz, Jeffery Dick, Praveen K. Pilly, Soheil Kolouri

    Abstract: This paper introduces a set of formally defined and transparent problems for reinforcement learning algorithms with the following characteristics: (1) variable degrees of observability (non-Markov observations), (2) distal and sparse rewards, (3) variable and hierarchical reward structure, (4) multiple-task generation, (5) variable problem complexity. The environment provides 1D or 2D categorical… ▽ More

    Submitted 21 January, 2023; originally announced February 2023.

  18. arXiv:2302.05018  [pdf, other

    cs.LG cs.CV

    Predicting Out-of-Distribution Error with Confidence Optimal Transport

    Authors: Yuzhe Lu, Zhenlin Wang, Runtian Zhai, Soheil Kolouri, Joseph Campbell, Katia Sycara

    Abstract: Out-of-distribution (OOD) data poses serious challenges in deployed machine learning models as even subtle changes could incur significant performance drops. Being able to estimate a model's performance on test data is important in practice as it indicates when to trust to model's decisions. We present a simple yet effective method to predict a model's performance on an unknown distribution withou… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  19. arXiv:2302.03232  [pdf, other

    cs.LG math.OC

    Linear Optimal Partial Transport Embedding

    Authors: Yikun Bai, Ivan Medri, Rocio Diaz Martin, Rana Muhammad Shahroz Khan, Soheil Kolouri

    Abstract: Optimal transport (OT) has gained popularity due to its various applications in fields such as machine learning, statistics, and signal processing. However, the balanced mass requirement limits its performance in practical problems. To address these limitations, variants of the OT problem, including unbalanced OT, Optimal partial transport (OPT), and Hellinger Kantorovich (HK), have been proposed.… ▽ More

    Submitted 23 April, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

  20. A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

    Authors: Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee , et al. (22 additional authors not shown)

    Abstract: Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through th… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: To appear in Neural Networks

  21. arXiv:2212.11110  [pdf, other

    cs.LG cs.AI stat.ML

    Lifelong Reinforcement Learning with Modulating Masks

    Authors: Eseoghene Ben-Iwhiwhu, Saptarshi Nath, Praveen K. Pilly, Soheil Kolouri, Andrea Soltoggio

    Abstract: Lifelong learning aims to create AI systems that continuously and incrementally learn during a lifetime, similar to biological learning. Attempts so far have met problems, including catastrophic forgetting, interference among tasks, and the inability to exploit previous knowledge. While considerable research has focused on learning multiple supervised classification tasks that involve changes in t… ▽ More

    Submitted 1 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: Code available at https://github.com/dlpbc/mask-lrl

    Journal ref: Transactions on Machine Learning Research (2023)

  22. arXiv:2212.08049  [pdf, other

    cs.LG math.OC stat.ML

    Sliced Optimal Partial Transport

    Authors: Yikun Bai, Berhnard Schmitzer, Mathew Thorpe, Soheil Kolouri

    Abstract: Optimal transport (OT) has become exceedingly popular in machine learning, data science, and computer vision. The core assumption in the OT problem is the equal total amount of mass in source and target measures, which limits its application. Optimal Partial Transport (OPT) is a recently proposed solution to this limitation. Similar to the OT problem, the computation of OPT relies on solving a lin… ▽ More

    Submitted 7 August, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: modify the link of Github page

  23. arXiv:2210.14797  [pdf, other

    cs.LG cs.CV

    Is Multi-Task Learning an Upper Bound for Continual Learning?

    Authors: Zihao Wu, Huy Tran, Hamed Pirsiavash, Soheil Kolouri

    Abstract: Continual and multi-task learning are common machine learning approaches to learning from multiple tasks. The existing works in the literature often assume multi-task learning as a sensible performance upper bound for various continual learning algorithms. While this assumption is empirically verified for different continual learning benchmarks, it is not rigorously justified. Moreover, it is imag… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  24. arXiv:2208.11726  [pdf, other

    cs.LG

    Wasserstein Task Embedding for Measuring Task Similarities

    Authors: Xinran Liu, Yikun Bai, Yuzhe Lu, Andrea Soltoggio, Soheil Kolouri

    Abstract: Measuring similarities between different tasks is critical in a broad spectrum of machine learning problems, including transfer, multi-task, continual, and meta-learning. Most current approaches to measuring task similarities are architecture-dependent: 1) relying on pre-trained models, or 2) training networks on tasks and using forward transfer as a proxy for task similarity. In this paper, we le… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  25. Learning to Solve Optimization Problems with Hard Linear Constraints

    Authors: Meiyi Li, Soheil Kolouri, Javad Mohammadi

    Abstract: Constrained optimization problems appear in a wide variety of challenging real-world problems, where constraints often capture the physics of the underlying system. Classic methods for solving these problems rely on iterative algorithms that explore the feasible domain in the search for the best solution. These iterative methods are often the computational bottleneck in decision-making and adverse… ▽ More

    Submitted 31 August, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

  26. arXiv:2206.08464  [pdf, other

    cs.LG

    PRANC: Pseudo RAndom Networks for Compacting deep models

    Authors: Parsa Nooralinejad, Ali Abbasi, Soroush Abbasi Koohpayegani, Kossar Pourahmadi Meibodi, Rana Muhammad Shahroz Khan, Soheil Kolouri, Hamed Pirsiavash

    Abstract: We demonstrate that a deep model can be reparametrized as a linear combination of several randomly initialized and frozen deep models in the weight space. During training, we seek local minima that reside within the subspace spanned by these random models (i.e., `basis' networks). Our framework, PRANC, enables significant compaction of a deep model. The model can be reconstructed using a single sc… ▽ More

    Submitted 28 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

  27. arXiv:2203.06514  [pdf, other

    cs.LG cs.AI cs.CV

    Sparsity and Heterogeneous Dropout for Continual Learning in the Null Space of Neural Activations

    Authors: Ali Abbasi, Parsa Nooralinejad, Vladimir Braverman, Hamed Pirsiavash, Soheil Kolouri

    Abstract: Continual/lifelong learning from a non-stationary input data stream is a cornerstone of intelligence. Despite their phenomenal performance in a wide variety of applications, deep neural networks are prone to forgetting their previously learned information upon learning new ones. This phenomenon is called "catastrophic forgetting" and is deeply rooted in the stability-plasticity dilemma. Overcoming… ▽ More

    Submitted 8 July, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

  28. arXiv:2202.04104  [pdf, other

    cs.LG

    Teaching Networks to Solve Optimization Problems

    Authors: Xinran Liu, Yuzhe Lu, Ali Abbasi, Meiyi Li, Javad Mohammadi, Soheil Kolouri

    Abstract: Leveraging machine learning to facilitate the optimization process is an emerging field that holds the promise to bypass the fundamental computational bottleneck caused by classic iterative solvers in critical applications requiring near-real-time optimization. The majority of existing approaches focus on learning data-driven optimizers that lead to fewer iterations in solving an optimization. In… ▽ More

    Submitted 15 July, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  29. arXiv:2112.05872  [pdf, other

    cs.LG cs.CV

    SLOSH: Set LOcality Sensitive Hashing via Sliced-Wasserstein Embeddings

    Authors: Yuzhe Lu, Xinran Liu, Andrea Soltoggio, Soheil Kolouri

    Abstract: Learning from set-structured data is an essential problem with many applications in machine learning and computer vision. This paper focuses on non-parametric and data-independent learning from set-structured data using approximate nearest neighbor (ANN) solutions, particularly locality-sensitive hashing. We consider the problem of set retrieval from an input set query. Such retrieval problem requ… ▽ More

    Submitted 8 February, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

  30. arXiv:2104.08604  [pdf, other

    cs.LG

    Lifelong Learning with Sketched Structural Regularization

    Authors: Haoran Li, Aditya Krishnan, **gfeng Wu, Soheil Kolouri, Praveen K. Pilly, Vladimir Braverman

    Abstract: Preventing catastrophic forgetting while continually learning new tasks is an essential problem in lifelong learning. Structural regularization (SR) refers to a family of algorithms that mitigate catastrophic forgetting by penalizing the network for changing its "critical parameters" from previous tasks while learning a new one. The penalty is often induced via a quadratic regularizer defined by a… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  31. arXiv:2103.03892  [pdf, other

    cs.LG

    Set Representation Learning with Generalized Sliced-Wasserstein Embeddings

    Authors: Navid Naderializadeh, Soheil Kolouri, Joseph F. Comer, Reed W. Andrews, Heiko Hoffmann

    Abstract: An increasing number of machine learning tasks deal with learning representations from set-structured data. Solutions to these problems involve the composition of permutation-equivariant modules (e.g., self-attention, or individual processing via feed-forward neural networks) and permutation-invariant modules (e.g., global average pooling, or pooling by multi-head attention). In this paper, we pro… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  32. arXiv:2006.09430  [pdf, other

    cs.LG stat.ML

    Wasserstein Embedding for Graph Learning

    Authors: Soheil Kolouri, Navid Naderializadeh, Gustavo K. Rohde, Heiko Hoffmann

    Abstract: We present Wasserstein Embedding for Graph Learning (WEGL), a novel and fast framework for embedding entire graphs in a vector space, in which various machine learning models are applicable for graph-level prediction tasks. We leverage new insights on defining similarity between graphs as a function of the similarity between their node embedding distributions. Specifically, we use the Wasserstein… ▽ More

    Submitted 1 March, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Final version to be presented at the Ninth International Conference on Learning Representations (ICLR 2021)

  33. arXiv:2004.03669  [pdf, other

    cs.CV cs.LG eess.IV

    Radon cumulative distribution transform subspace modeling for image classification

    Authors: Mohammad Shifat-E-Rabbi, Xuwang Yin, Abu Hasnat Mohammad Rubaiyat, Shiying Li, Soheil Kolouri, Akram Aldroubi, Jonathan M. Nichols, Gustavo K. Rohde

    Abstract: We present a new supervised image classification method applicable to a broad class of image deformation models. The method makes use of the previously described Radon Cumulative Distribution Transform (R-CDT) for image data, whose mathematical properties are exploited to express the image data in a form that is more suitable for machine learning. While certain operations such as translation, scal… ▽ More

    Submitted 2 March, 2022; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: 14 pages, 11 figures

  34. arXiv:2003.05783  [pdf, other

    stat.ML cs.LG

    Statistical and Topological Properties of Sliced Probability Divergences

    Authors: Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli

    Abstract: The idea of slicing divergences has been proven to be successful when comparing two probability measures in various machine learning applications including generative modeling, and consists in computing the expected value of a `base divergence' between one-dimensional random projections of the two measures. However, the topological, statistical, and computational consequences of this technique hav… ▽ More

    Submitted 4 January, 2022; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Published at NeurIPS 2020 (Spotlight)

  35. arXiv:2002.12537  [pdf, other

    stat.ML cs.LG

    Generalized Sliced Distances for Probability Distributions

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Shahin Shahrampour

    Abstract: Probability metrics have become an indispensable part of modern statistics and machine learning, and they play a quintessential role in various applications, including statistical hypothesis testing and generative modeling. However, in a practical setting, the convergence behavior of the algorithms built upon these distances have not been well established, except for a few specific cases. In this… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

  36. arXiv:1909.09902  [pdf, other

    cs.LG stat.ML

    Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture

    Authors: Pawel Ladosz, Eseoghene Ben-Iwhiwhu, Jeffery Dick, Yang Hu, Nicholas Ketz, Soheil Kolouri, Jeffrey L. Krichmar, Praveen Pilly, Andrea Soltoggio

    Abstract: This paper presents a new neural architecture that combines a modulated Hebbian network (MOHN) with DQN, which we call modulated Hebbian plus Q network architecture (MOHQA). The hypothesis is that such a combination allows MOHQA to solve difficult partially observable Markov decision process (POMDP) problems which impair temporal difference (TD)-based RL algorithms such as DQN, as the TD error can… ▽ More

    Submitted 14 October, 2021; v1 submitted 21 September, 2019; originally announced September 2019.

  37. arXiv:1907.02271  [pdf, other

    cs.LG stat.ML

    Learning a Domain-Invariant Embedding for Unsupervised Domain Adaptation Using Class-Conditioned Distribution Alignment

    Authors: Alex Gabourie, Mohammad Rostami, Philip Pope, Soheil Kolouri, Kyungnam Kim

    Abstract: We address the problem of unsupervised domain adaptation (UDA) by learning a cross-domain agnostic embedding space, where the distance between the probability distributions of the two source and target visual domains is minimized. We use the output space of a shared cross-domain deep encoder to model the embedding space anduse the Sliced-Wasserstein Distance (SWD) to measure and minimize the dista… ▽ More

    Submitted 24 September, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

  38. arXiv:1907.02220  [pdf, other

    stat.ML cs.LG

    Neural Networks, Hypersurfaces, and Radon Transforms

    Authors: Soheil Kolouri, Xuwang Yin, Gustavo K. Rohde

    Abstract: Connections between integration along hypersufaces, Radon transforms, and neural networks are exploited to highlight an integral geometric mathematical interpretation of neural networks. By analyzing the properties of neural networks as operators on probability distributions for observed data, we show that the distribution of outputs for any node in a neural network can be interpreted as a nonline… ▽ More

    Submitted 4 July, 2019; originally announced July 2019.

  39. arXiv:1906.10842  [pdf, other

    cs.CV

    Universal Litmus Patterns: Revealing Backdoor Attacks in CNNs

    Authors: Soheil Kolouri, Aniruddha Saha, Hamed Pirsiavash, Heiko Hoffmann

    Abstract: The unprecedented success of deep neural networks in many applications has made these networks a prime target for adversarial exploitation. In this paper, we introduce a benchmark technique for detecting backdoor attacks (aka Trojan attacks) on deep convolutional neural networks (CNNs). We introduce the concept of Universal Litmus Patterns (ULPs), which enable one to reveal backdoor attacks by fee… ▽ More

    Submitted 14 May, 2020; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: CVPR 2020 Oral

  40. arXiv:1906.10509  [pdf, other

    cs.CV cs.LG stat.ML

    Zero-Shot Image Classification Using Coupled Dictionary Embedding

    Authors: Mohammad Rostami, Soheil Kolouri, Zak Murez, Yuri Owekcho, Eric Eaton, Kuyngnam Kim

    Abstract: Zero-shot learning (ZSL) is a framework to classify images belonging to unseen classes based on solely semantic information about these unseen classes. In this paper, we propose a new ZSL algorithm using coupled dictionary learning. The core idea is that the visual features and the semantic attributes of an image can share the same sparse representation in an intermediate space. We use images from… ▽ More

    Submitted 23 October, 2021; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1709.03688

  41. arXiv:1906.03744  [pdf, other

    cs.LG cs.AI stat.ML

    Generative Continual Concept Learning

    Authors: Mohammad Rostami, Soheil Kolouri, James McClelland, Praveen Pilly

    Abstract: After learning a concept, humans are also able to continually generalize their learned concepts to new domains by observing only a few labeled instances without any interference with the past learned knowledge. In contrast, learning concepts efficiently in a continual learning setting remains an open challenge for current Artificial Intelligence algorithms as persistent model retraining is necessa… ▽ More

    Submitted 7 September, 2019; v1 submitted 9 June, 2019; originally announced June 2019.

  42. arXiv:1905.11475  [pdf, other

    cs.LG cs.CR stat.ML

    GAT: Generative Adversarial Training for Adversarial Example Detection and Robust Classification

    Authors: Xuwang Yin, Soheil Kolouri, Gustavo K. Rohde

    Abstract: The vulnerabilities of deep neural networks against adversarial examples have become a significant concern for deploying these models in sensitive domains. Devising a definitive defense against such attacks is proven to be challenging, and the methods relying on detecting adversarial samples are only valid when the attacker is oblivious to the detection mechanism. In this paper we propose a princi… ▽ More

    Submitted 1 October, 2022; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: ICLR 2020, code is available at https://github.com/xuwangyin/GAT-Generative-Adversarial-Training; v4 fixed error in Figure 2

  43. arXiv:1903.08329  [pdf, other

    cs.LG cs.AI stat.ML

    On Sampling Random Features From Empirical Leverage Scores: Implementation and Theoretical Guarantees

    Authors: Shahin Shahrampour, Soheil Kolouri

    Abstract: Random features provide a practical framework for large-scale kernel approximation and supervised learning. It has been shown that data-dependent sampling of random features using leverage scores can significantly reduce the number of features required to achieve optimal learning bounds. Leverage scores introduce an optimized distribution for features based on an infinite-dimensional integral oper… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: 23 pages

  44. arXiv:1903.06070  [pdf, other

    cs.NE cs.LG stat.ML

    Attention-Based Structural-Plasticity

    Authors: Soheil Kolouri, Nicholas Ketz, Xinyun Zou, Jeffrey Krichmar, Praveen Pilly

    Abstract: Catastrophic forgetting/interference is a critical problem for lifelong learning machines, which impedes the agents from maintaining their previously learned knowledge while learning new tasks. Neural networks, in particular, suffer plenty from the catastrophic forgetting phenomenon. Recently there has been several efforts towards overcoming catastrophic forgetting in neural networks. Here, we pro… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

  45. arXiv:1903.04566  [pdf, other

    cs.LG stat.ML

    Complementary Learning for Overcoming Catastrophic Forgetting Using Experience Replay

    Authors: Mohammad Rostami, Soheil Kolouri, Praveen K. Pilly

    Abstract: Despite huge success, deep networks are unable to learn effectively in sequential multitask learning settings as they forget the past learned tasks after learning new tasks. Inspired from complementary learning systems theory, we address this challenge by learning a generative model that couples the current task to the past learned tasks through a discriminative embedding space. We learn an abstra… ▽ More

    Submitted 31 May, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

  46. arXiv:1903.02647  [pdf, other

    cs.LG stat.ML

    Continual Learning Using World Models for Pseudo-Rehearsal

    Authors: Nicholas Ketz, Soheil Kolouri, Praveen Pilly

    Abstract: The utility of learning a dynamics/world model of the environment in reinforcement learning has been shown in a many ways. When using neural networks, however, these models suffer catastrophic forgetting when learned in a lifelong or continual fashion. Current solutions to the continual learning problem require experience to be segmented and labeled as discrete tasks, however, in continuous experi… ▽ More

    Submitted 11 June, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    MSC Class: 68T05 91E40

  47. arXiv:1903.00068  [pdf, other

    cs.NE cs.CV cs.LG

    Neuromodulated Goal-Driven Perception in Uncertain Domains

    Authors: Xinyun Zou, Soheil Kolouri, Praveen K. Pilly, Jeffrey L. Krichmar

    Abstract: In uncertain domains, the goals are often unknown and need to be predicted by the organism or system. In this paper, contrastive excitation backprop (c-EB) was used in a goal-driven perception task with pairs of noisy MNIST digits, where the system had to increase attention to one of the two digits corresponding to a goal (i.e., even, odd, low value, or high value) and decrease attention to the di… ▽ More

    Submitted 16 February, 2019; originally announced March 2019.

  48. arXiv:1902.00434  [pdf, other

    cs.LG stat.ML

    Generalized Sliced Wasserstein Distances

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Roland Badeau, Gustavo K. Rohde

    Abstract: The Wasserstein distance and its variations, e.g., the sliced-Wasserstein (SW) distance, have recently drawn attention from the machine learning community. The SW distance, specifically, was shown to have similar properties to the Wasserstein distance, while being much simpler to compute, and is therefore used in various applications including generative modeling and general supervised/unsupervise… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

  49. arXiv:1812.00265  [pdf, other

    cs.LG stat.ML

    Discovering Molecular Functional Groups Using Graph Convolutional Neural Networks

    Authors: Phillip Pope, Soheil Kolouri, Mohammad Rostrami, Charles Martin, Heiko Hoffmann

    Abstract: Functional groups (FGs) are molecular substructures that are served as a foundation for analyzing and predicting chemical properties of molecules. Automatic discovery of FGs will impact various fields of research, including medicinal chemistry and material sciences, by reducing the amount of lab experiments required for discovery or synthesis of new molecules. In this paper, we investigate methods… ▽ More

    Submitted 9 October, 2019; v1 submitted 1 December, 2018; originally announced December 2018.

  50. arXiv:1804.01947  [pdf, other

    cs.LG stat.ML

    Sliced-Wasserstein Autoencoder: An Embarrassingly Simple Generative Model

    Authors: Soheil Kolouri, Phillip E. Pope, Charles E. Martin, Gustavo K. Rohde

    Abstract: In this paper we study generative modeling via autoencoders while using the elegant geometric properties of the optimal transport (OT) problem and the Wasserstein distances. We introduce Sliced-Wasserstein Autoencoders (SWAE), which are generative models that enable one to shape the distribution of the latent space into any samplable probability distribution without the need for training an advers… ▽ More

    Submitted 26 June, 2018; v1 submitted 5 April, 2018; originally announced April 2018.