Showing 1–22 of 22 results for author: Kolouri, S

Searching in archive stat.
  1. arXiv:2402.03664

    cs.LG stat.ML

    Partial Gromov-Wasserstein Metric

    Authors: Yikun Bai, Rocio Diaz Martin, Abihith Kothapalli, Hengrong Du, Xinran Liu, Soheil Kolouri

    Abstract: The Gromov-Wasserstein (GW) distance has gained increasing interest in the machine learning community in recent years, as it allows for the comparison of measures in different metric spaces. To overcome the limitations imposed by the equal mass requirements of the classical GW problem, researchers have begun exploring its application in unbalanced settings. However, Unbalanced GW (UGW) can only be…

    Submitted 28 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  2. arXiv:2402.02345

    cs.LG cs.AI cs.CV stat.ML

    Stereographic Spherical Sliced Wasserstein Distances

    Authors: Huy Tran, Yikun Bai, Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Rocio Diaz Martin, Soheil Kolouri

    Abstract: Comparing spherical probability distributions is of great interest in various fields, including geology, medical domains, computer vision, and deep representation learning. The utility of optimal transport-based distances, such as the Wasserstein distance, for comparing probability measures has spurred active research in developing computationally efficient variations of these distances for spheri…

    Submitted 9 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024 (Spotlight). Project page: https://abi-kothapalli.github.io/s3w/

  3. arXiv:2212.11110

    cs.LG cs.AI stat.ML

    Lifelong Reinforcement Learning with Modulating Masks

    Authors: Eseoghene Ben-Iwhiwhu, Saptarshi Nath, Praveen K. Pilly, Soheil Kolouri, Andrea Soltoggio

    Abstract: Lifelong learning aims to create AI systems that continuously and incrementally learn during a lifetime, similar to biological learning. Attempts so far have met problems, including catastrophic forgetting, interference among tasks, and the inability to exploit previous knowledge. While considerable research has focused on learning multiple supervised classification tasks that involve changes in t…

    Submitted 1 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: Code available at https://github.com/dlpbc/mask-lrl

    Journal ref: Transactions on Machine Learning Research (2023)

  4. arXiv:2212.08049

    cs.LG math.OC stat.ML

    Sliced Optimal Partial Transport

    Authors: Yikun Bai, Bernhard Schmitzer, Matthew Thorpe, Soheil Kolouri

    Abstract: Optimal transport (OT) has become exceedingly popular in machine learning, data science, and computer vision. The core assumption in the OT problem is the equal total amount of mass in source and target measures, which limits its application. Optimal Partial Transport (OPT) is a recently proposed solution to this limitation. Similar to the OT problem, the computation of OPT relies on solving a lin…

    Submitted 7 August, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: modify the link of Github page

  5. arXiv:2006.09430

    cs.LG stat.ML

    Wasserstein Embedding for Graph Learning

    Authors: Soheil Kolouri, Navid Naderializadeh, Gustavo K. Rohde, Heiko Hoffmann

    Abstract: We present Wasserstein Embedding for Graph Learning (WEGL), a novel and fast framework for embedding entire graphs in a vector space, in which various machine learning models are applicable for graph-level prediction tasks. We leverage new insights on defining similarity between graphs as a function of the similarity between their node embedding distributions. Specifically, we use the Wasserstein…

    Submitted 1 March, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Final version to be presented at the Ninth International Conference on Learning Representations (ICLR 2021)

  6. arXiv:2003.05783

    stat.ML cs.LG

    Statistical and Topological Properties of Sliced Probability Divergences

    Authors: Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli

    Abstract: The idea of slicing divergences has been proven to be successful when comparing two probability measures in various machine learning applications including generative modeling, and consists in computing the expected value of a 'base divergence' between one-dimensional random projections of the two measures. However, the topological, statistical, and computational consequences of this technique hav…

    Submitted 4 January, 2022; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Published at NeurIPS 2020 (Spotlight)

  7. arXiv:2002.12537

    stat.ML cs.LG

    Generalized Sliced Distances for Probability Distributions

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Shahin Shahrampour

    Abstract: Probability metrics have become an indispensable part of modern statistics and machine learning, and they play a quintessential role in various applications, including statistical hypothesis testing and generative modeling. However, in a practical setting, the convergence behavior of the algorithms built upon these distances has not been well established, except for a few specific cases. In this…

    Submitted 27 February, 2020; originally announced February 2020.

  8. arXiv:1909.09902

    cs.LG stat.ML

    Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture

    Authors: Pawel Ladosz, Eseoghene Ben-Iwhiwhu, Jeffery Dick, Yang Hu, Nicholas Ketz, Soheil Kolouri, Jeffrey L. Krichmar, Praveen Pilly, Andrea Soltoggio

    Abstract: This paper presents a new neural architecture that combines a modulated Hebbian network (MOHN) with DQN, which we call modulated Hebbian plus Q network architecture (MOHQA). The hypothesis is that such a combination allows MOHQA to solve difficult partially observable Markov decision process (POMDP) problems which impair temporal difference (TD)-based RL algorithms such as DQN, as the TD error can…

    Submitted 14 October, 2021; v1 submitted 21 September, 2019; originally announced September 2019.

  9. arXiv:1907.02271

    cs.LG stat.ML

    Learning a Domain-Invariant Embedding for Unsupervised Domain Adaptation Using Class-Conditioned Distribution Alignment

    Authors: Alex Gabourie, Mohammad Rostami, Phillip Pope, Soheil Kolouri, Kyungnam Kim

    Abstract: We address the problem of unsupervised domain adaptation (UDA) by learning a cross-domain agnostic embedding space, where the distance between the probability distributions of the two source and target visual domains is minimized. We use the output space of a shared cross-domain deep encoder to model the embedding space and use the Sliced-Wasserstein Distance (SWD) to measure and minimize the dista…

    Submitted 24 September, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

  10. arXiv:1907.02220

    stat.ML cs.LG

    Neural Networks, Hypersurfaces, and Radon Transforms

    Authors: Soheil Kolouri, Xuwang Yin, Gustavo K. Rohde

    Abstract: Connections between integration along hypersurfaces, Radon transforms, and neural networks are exploited to highlight an integral geometric mathematical interpretation of neural networks. By analyzing the properties of neural networks as operators on probability distributions for observed data, we show that the distribution of outputs for any node in a neural network can be interpreted as a nonline…

    Submitted 4 July, 2019; originally announced July 2019.

  11. arXiv:1906.10509

    cs.CV cs.LG stat.ML

    Zero-Shot Image Classification Using Coupled Dictionary Embedding

    Authors: Mohammad Rostami, Soheil Kolouri, Zak Murez, Yuri Owechko, Eric Eaton, Kyungnam Kim

    Abstract: Zero-shot learning (ZSL) is a framework to classify images belonging to unseen classes based solely on semantic information about these unseen classes. In this paper, we propose a new ZSL algorithm using coupled dictionary learning. The core idea is that the visual features and the semantic attributes of an image can share the same sparse representation in an intermediate space. We use images from…

    Submitted 23 October, 2021; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1709.03688

  12. arXiv:1906.03744

    cs.LG cs.AI stat.ML

    Generative Continual Concept Learning

    Authors: Mohammad Rostami, Soheil Kolouri, James McClelland, Praveen Pilly

    Abstract: After learning a concept, humans are also able to continually generalize their learned concepts to new domains by observing only a few labeled instances without any interference with the past learned knowledge. In contrast, learning concepts efficiently in a continual learning setting remains an open challenge for current Artificial Intelligence algorithms as persistent model retraining is necessa…

    Submitted 7 September, 2019; v1 submitted 9 June, 2019; originally announced June 2019.

  13. arXiv:1905.11475

    cs.LG cs.CR stat.ML

    GAT: Generative Adversarial Training for Adversarial Example Detection and Robust Classification

    Authors: Xuwang Yin, Soheil Kolouri, Gustavo K. Rohde

    Abstract: The vulnerabilities of deep neural networks against adversarial examples have become a significant concern for deploying these models in sensitive domains. Devising a definitive defense against such attacks has proven to be challenging, and the methods relying on detecting adversarial samples are only valid when the attacker is oblivious to the detection mechanism. In this paper we propose a princi…

    Submitted 1 October, 2022; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: ICLR 2020, code is available at https://github.com/xuwangyin/GAT-Generative-Adversarial-Training; v4 fixed error in Figure 2

  14. arXiv:1903.08329

    cs.LG cs.AI stat.ML

    On Sampling Random Features From Empirical Leverage Scores: Implementation and Theoretical Guarantees

    Authors: Shahin Shahrampour, Soheil Kolouri

    Abstract: Random features provide a practical framework for large-scale kernel approximation and supervised learning. It has been shown that data-dependent sampling of random features using leverage scores can significantly reduce the number of features required to achieve optimal learning bounds. Leverage scores introduce an optimized distribution for features based on an infinite-dimensional integral oper…

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: 23 pages

  15. arXiv:1903.06070

    cs.NE cs.LG stat.ML

    Attention-Based Structural-Plasticity

    Authors: Soheil Kolouri, Nicholas Ketz, Xinyun Zou, Jeffrey Krichmar, Praveen Pilly

    Abstract: Catastrophic forgetting/interference is a critical problem for lifelong learning machines, which impedes the agents from maintaining their previously learned knowledge while learning new tasks. Neural networks, in particular, suffer plenty from the catastrophic forgetting phenomenon. Recently there have been several efforts towards overcoming catastrophic forgetting in neural networks. Here, we pro…

    Submitted 2 March, 2019; originally announced March 2019.

  16. arXiv:1903.04566

    cs.LG stat.ML

    Complementary Learning for Overcoming Catastrophic Forgetting Using Experience Replay

    Authors: Mohammad Rostami, Soheil Kolouri, Praveen K. Pilly

    Abstract: Despite huge success, deep networks are unable to learn effectively in sequential multitask learning settings as they forget the past learned tasks after learning new tasks. Inspired by complementary learning systems theory, we address this challenge by learning a generative model that couples the current task to the past learned tasks through a discriminative embedding space. We learn an abstra…

    Submitted 31 May, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

  17. arXiv:1903.02647

    cs.LG stat.ML

    Continual Learning Using World Models for Pseudo-Rehearsal

    Authors: Nicholas Ketz, Soheil Kolouri, Praveen Pilly

    Abstract: The utility of learning a dynamics/world model of the environment in reinforcement learning has been shown in many ways. When using neural networks, however, these models suffer catastrophic forgetting when learned in a lifelong or continual fashion. Current solutions to the continual learning problem require experience to be segmented and labeled as discrete tasks; however, in continuous experi…

    Submitted 11 June, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    MSC Class: 68T05 91E40

  18. arXiv:1902.00434

    cs.LG stat.ML

    Generalized Sliced Wasserstein Distances

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Roland Badeau, Gustavo K. Rohde

    Abstract: The Wasserstein distance and its variations, e.g., the sliced-Wasserstein (SW) distance, have recently drawn attention from the machine learning community. The SW distance, specifically, was shown to have similar properties to the Wasserstein distance, while being much simpler to compute, and is therefore used in various applications including generative modeling and general supervised/unsupervise…

    Submitted 1 February, 2019; originally announced February 2019.

  19. arXiv:1812.00265

    cs.LG stat.ML

    Discovering Molecular Functional Groups Using Graph Convolutional Neural Networks

    Authors: Phillip Pope, Soheil Kolouri, Mohammad Rostami, Charles Martin, Heiko Hoffmann

    Abstract: Functional groups (FGs) are molecular substructures that serve as a foundation for analyzing and predicting chemical properties of molecules. Automatic discovery of FGs will impact various fields of research, including medicinal chemistry and material sciences, by reducing the amount of lab experiments required for discovery or synthesis of new molecules. In this paper, we investigate methods…

    Submitted 9 October, 2019; v1 submitted 1 December, 2018; originally announced December 2018.

  20. arXiv:1804.01947

    cs.LG stat.ML

    Sliced-Wasserstein Autoencoder: An Embarrassingly Simple Generative Model

    Authors: Soheil Kolouri, Phillip E. Pope, Charles E. Martin, Gustavo K. Rohde

    Abstract: In this paper we study generative modeling via autoencoders while using the elegant geometric properties of the optimal transport (OT) problem and the Wasserstein distances. We introduce Sliced-Wasserstein Autoencoders (SWAE), which are generative models that enable one to shape the distribution of the latent space into any samplable probability distribution without the need for training an advers…

    Submitted 26 June, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

  21. arXiv:1711.05376

    cs.CV cs.LG stat.ML

    Sliced Wasserstein Distance for Learning Gaussian Mixture Models

    Authors: Soheil Kolouri, Gustavo K. Rohde, Heiko Hoffmann

    Abstract: Gaussian mixture models (GMM) are powerful parametric tools with many applications in machine learning and computer vision. Expectation maximization (EM) is the most popular algorithm for estimating the GMM parameters. However, EM guarantees only convergence to a stationary point of the log-likelihood function, which could be arbitrarily worse than the optimal solution. Inspired by the relationshi…

    Submitted 15 November, 2017; v1 submitted 14 November, 2017; originally announced November 2017.

  22. arXiv:1511.03198

    cs.LG stat.ML

    Sliced Wasserstein Kernels for Probability Distributions

    Authors: Soheil Kolouri, Yang Zou, Gustavo K. Rohde

    Abstract: Optimal transport distances, otherwise known as Wasserstein distances, have recently drawn ample attention in computer vision and machine learning as a powerful discrepancy measure for probability distributions. The recent developments on alternative formulations of the optimal transport have allowed for faster solutions to the problem and have revamped its practical applications in machine learnin…

    Submitted 10 November, 2015; originally announced November 2015.