Skip to main content

Showing 1–23 of 23 results for author: Ghodsi, A

Searching in archive stat. Search in all archives.
.
  1. Theoretical Connection between Locally Linear Embedding, Factor Analysis, and Probabilistic PCA

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: Locally Linear Embedding (LLE) is a nonlinear spectral dimensionality reduction and manifold learning method. It has two main steps which are linear reconstruction and linear embedding of points in the input space and embedding space, respectively. In this work, we look at the linear reconstruction step from a stochastic perspective where it is assumed that every data point is conditioned on its l… ▽ More

    Submitted 10 August, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted for presentation at the Canadian AI 2022 (Canadian Conference on Artificial Intelligence). This paper has some shared materials with our other paper arXiv:2104.01525 but its focus and aim are different from that paper. v2: corrected a mathematical typo

    Journal ref: Proceedings of the 35th Canadian Conference on Artificial Intelligence, Canadian Artificial Intelligence Association, 2022

  2. arXiv:2201.09267  [pdf, other

    stat.ML cs.CV cs.LG

    Spectral, Probabilistic, and Deep Metric Learning: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on metric learning. Algorithms are divided into spectral, probabilistic, and deep metric learning. We first start with the definition of distance metric, Mahalanobis distance, and generalized Mahalanobis distance. In spectral methods, we start with methods using scatters of data, including the first spectral metric learning, relevant methods to Fisher discrimina… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning

  3. arXiv:2111.13282  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Generative Adversarial Networks and Adversarial Autoencoders: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on Generative Adversarial Network (GAN), adversarial autoencoders, and their variants. We start with explaining adversarial learning and the vanilla GAN. Then, we explain the conditional GAN and DCGAN. The mode collapse problem is introduced and various methods, including minibatch GAN, unrolled GAN, BourGAN, mixture GAN, D2GAN, and Wasserstein GAN, are introduc… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning

  4. arXiv:2110.09620  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    Sufficient Dimension Reduction for High-Dimensional Regression and Low-Dimensional Embedding: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on various methods for Sufficient Dimension Reduction (SDR). We cover these methods with both statistical high-dimensional regression perspective and machine learning approach for dimensionality reduction. We start with introducing inverse regression methods including Sliced Inverse Regression (SIR), Sliced Average Variance Estimation (SAVE), contour regression,… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning

  5. arXiv:2108.04172  [pdf, other

    stat.ML cs.DS cs.LG math.PR

    Johnson-Lindenstrauss Lemma, Linear and Nonlinear Random Projections, Random Fourier Features, and Random Kitchen Sinks: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on the Johnson-Lindenstrauss (JL) lemma and linear and nonlinear random projections. We start with linear random projection and then justify its correctness by JL lemma and its proof. Then, sparse random projections with $\ell_1$ norm and interpolation norm are introduced. Two main applications of random projection, which are low-rank matrix approximation and ap… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning

  6. arXiv:2107.12521  [pdf, other

    cs.LG cs.NE physics.data-an stat.ML

    Restricted Boltzmann Machine and Deep Belief Network: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on Boltzmann Machine (BM), Restricted Boltzmann Machine (RBM), and Deep Belief Network (DBN). We start with the required background on probabilistic graphical models, Markov random field, Gibbs sampling, statistical physics, Ising model, and the Hopfield network. Then, we introduce the structures of BM and RBM. The conditional distributions of visible and hidden… ▽ More

    Submitted 5 August, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning. v2: applied readers' feedback

  7. arXiv:2106.15379  [pdf, other

    stat.ML cs.CV cs.LG

    Unified Framework for Spectral Dimensionality Reduction, Maximum Variance Unfolding, and Kernel Learning By Semidefinite Programming: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on unification of spectral dimensionality reduction methods, kernel learning by Semidefinite Programming (SDP), Maximum Variance Unfolding (MVU) or Semidefinite Embedding (SDE), and its variants. We first explain how the spectral dimensionality reduction methods can be unified as kernel Principal Component Analysis (PCA) with different kernels. This unification… ▽ More

    Submitted 3 August, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning. v2: corrected some typos

  8. arXiv:2106.08443  [pdf, other

    stat.ML cs.LG math.FA

    Reproducing Kernel Hilbert Space, Mercer's Theorem, Eigenfunctions, Nyström Method, and Use of Kernels in Machine Learning: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on kernels, kernel methods, and related fields. We start with reviewing the history of kernels in functional analysis and machine learning. Then, Mercer kernel, Hilbert and Banach spaces, Reproducing Kernel Hilbert Space (RKHS), Mercer's theorem and its proof, frequently used kernels, kernel construction from distance metric, important classes of kernels (includ… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning

  9. arXiv:2106.02154  [pdf, other

    stat.ML cs.CV cs.LG

    Laplacian-Based Dimensionality Reduction Including Spectral Clustering, Laplacian Eigenmap, Locality Preserving Projection, Graph Embedding, and Diffusion Map: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper for nonlinear dimensionality and feature extraction methods which are based on the Laplacian of graph of data. We first introduce adjacency matrix, definition of Laplacian matrix, and the interpretation of Laplacian. Then, we cover the cuts of graph and spectral clustering which applies clustering in a subspace of data. Different optimization variants of Laplaci… ▽ More

    Submitted 5 August, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning. v2: applied readers' feedback

  10. arXiv:2104.01525  [pdf, other

    stat.ML cs.CV cs.LG

    Generative Locally Linear Embedding

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: Locally Linear Embedding (LLE) is a nonlinear spectral dimensionality reduction and manifold learning method. It has two main steps which are linear reconstruction and linear embedding of points in the input space and embedding space, respectively. In this work, we propose two novel generative versions of LLE, named Generative LLE (GLLE), whose linear reconstruction steps are stochastic rather tha… ▽ More

    Submitted 3 April, 2021; originally announced April 2021.

  11. arXiv:2101.00734  [pdf, other

    stat.ML cs.CV cs.LG

    Factor Analysis, Probabilistic Principal Component Analysis, Variational Inference, and Variational Autoencoder: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper on factor analysis, probabilistic Principal Component Analysis (PCA), variational inference, and Variational Autoencoder (VAE). These methods, which are tightly related, are dimensionality reduction and generative models. They assume that every data point is generated from or caused by a low-dimensional latent factor. By learning the parameters of distribution o… ▽ More

    Submitted 23 May, 2022; v1 submitted 3 January, 2021; originally announced January 2021.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning. v2: corrected some mathematical typos

  12. arXiv:2011.10925  [pdf, other

    stat.ML cs.CV cs.LG

    Locally Linear Embedding and its Variants: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: This is a tutorial and survey paper for Locally Linear Embedding (LLE) and its variants. The idea of LLE is fitting the local structure of manifold in the embedding space. In this paper, we first cover LLE, kernel LLE, inverse LLE, and feature fusion with LLE. Then, we cover out-of-sample embedding using linear reconstruction, eigenfunctions, and kernel map**. Incremental LLE is explained for em… ▽ More

    Submitted 21 November, 2020; originally announced November 2020.

    Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning

  13. arXiv:2009.10301  [pdf, ps, other

    stat.ML cs.CV cs.LG

    Stochastic Neighbor Embedding with Gaussian and Student-t Distributions: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: Stochastic Neighbor Embedding (SNE) is a manifold learning and dimensionality reduction method with a probabilistic approach. In SNE, every point is consider to be the neighbor of all other points with some probability and this probability is tried to be preserved in the embedding space. SNE considers Gaussian distribution for the probability in both the input and embedding spaces. However, t-SNE… ▽ More

    Submitted 3 August, 2022; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: To appear as a part of an upcoming academic book on dimensionality reduction and manifold learning. v2: applied readers' feedback

  14. arXiv:2009.08136  [pdf, other

    stat.ML cs.CV cs.LG

    Multidimensional Scaling, Sammon Map**, and Isomap: Tutorial and Survey

    Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

    Abstract: Multidimensional Scaling (MDS) is one of the first fundamental manifold learning methods. It can be categorized into several methods, i.e., classical MDS, kernel classical MDS, metric MDS, and non-metric MDS. Sammon map** and Isomap can be considered as special cases of metric MDS and kernel classical MDS, respectively. In this tutorial and survey paper, we review the theory of MDS, Sammon mappi… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: To appear as a part of an upcoming academic book on dimensionality reduction and manifold learning

  15. arXiv:1904.08514  [pdf, other

    cs.LG q-bio.BM stat.ML

    DeepNovoV2: Better de novo peptide sequencing with deep learning

    Authors: Rui Qiao, Ngoc Hieu Tran, Lei Xin, Baozhen Shan, Ming Li, Ali Ghodsi

    Abstract: Personalized cancer vaccines are envisioned as the next generation rational cancer immunotherapy. The key step in develo** personalized therapeutic cancer vaccines is to identify tumor-specific neoantigens that are on the surface of tumor cells. A promising method for this is through de novo peptide sequencing from mass spectrometry data. In this paper we introduce DeepNovoV2, the state-of-the-a… ▽ More

    Submitted 22 May, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

  16. arXiv:1812.07641  [pdf, other

    cs.LG stat.ML

    Deep Variational Sufficient Dimensionality Reduction

    Authors: Ershad Banijamali, Amir-Hossein Karimi, Ali Ghodsi

    Abstract: We consider the problem of sufficient dimensionality reduction (SDR), where the high-dimensional observation is transformed to a low-dimensional sub-space in which the information of the observations regarding the label variable is preserved. We propose DVSDR, a deep variational approach for sufficient dimensionality reduction. The deep structure in our model has a bottleneck that represent the lo… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

  17. arXiv:1811.03166  [pdf, other

    cs.LG stat.ML

    SRP: Efficient class-aware embedding learning for large-scale data via supervised random projections

    Authors: Amir-Hossein Karimi, Alexander Wong, Ali Ghodsi

    Abstract: Supervised dimensionality reduction strategies have been of great interest. However, current supervised dimensionality reduction approaches are difficult to scale for situations characterized by large datasets given the high computational complexities associated with such methods. While stochastic approximation strategies have been explored for unsupervised dimensionality reduction to tackle this… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

  18. arXiv:1711.09163  [pdf, other

    cs.LG stat.ML

    JADE: Joint Autoencoders for Dis-Entanglement

    Authors: Ershad Banijamali, Amir-Hossein Karimi, Alexander Wong, Ali Ghodsi

    Abstract: The problem of feature disentanglement has been explored in the literature, for the purpose of image and video processing and text analysis. State-of-the-art methods for disentangling feature representations rely on the presence of many labeled samples. In this work, we present a novel method for disentangling factors of variation in data-scarce regimes. Specifically, we explore the application of… ▽ More

    Submitted 24 November, 2017; originally announced November 2017.

    Comments: 5 pages

  19. arXiv:1707.00081  [pdf, ps, other

    cs.NE cs.AI cs.CV stat.ML

    Synthesizing Deep Neural Network Architectures using Biological Synaptic Strength Distributions

    Authors: A. H. Karimi, M. J. Shafiee, A. Ghodsi, A. Wong

    Abstract: In this work, we perform an exploratory study on synthesizing deep neural networks using biological synaptic strength distributions, and the potential influence of different distributions on modelling performance particularly for the scenario associated with small data sets. Surprisingly, a CNN with convolutional layer synaptic strengths drawn from biologically-inspired distributions such as log-n… ▽ More

    Submitted 30 June, 2017; originally announced July 2017.

  20. arXiv:1704.02345  [pdf, other

    cs.LG stat.ML

    Fast Spectral Clustering Using Autoencoders and Landmarks

    Authors: Ershad Banijamali, Ali Ghodsi

    Abstract: In this paper, we introduce an algorithm for performing spectral clustering efficiently. Spectral clustering is a powerful clustering algorithm that suffers from high computational complexity, due to eigen decomposition. In this work, we first build the adjacency matrix of the corresponding graph of the dataset. To build this matrix, we only consider a limited number of points, called landmarks, a… ▽ More

    Submitted 7 April, 2017; originally announced April 2017.

    Comments: 8 Pages- Accepted in 14th International Conference on Image Analysis and Recognition

  21. arXiv:1702.03307  [pdf, other

    cs.LG stat.ML

    Generative Mixture of Networks

    Authors: Ershad Banijamali, Ali Ghodsi, Pascal Poupart

    Abstract: A generative model based on training deep architectures is proposed. The model consists of K networks that are trained together to learn the underlying distribution of a given data set. The process starts with dividing the input data into K clusters and feeding each of them into a separate network. After few iterations of training networks separately, we use an EM-like algorithm to train the netwo… ▽ More

    Submitted 10 February, 2017; originally announced February 2017.

    Comments: 9 pages

  22. arXiv:1312.6820  [pdf, ps, other

    cs.DS cs.LG stat.ML

    A Fast Greedy Algorithm for Generalized Column Subset Selection

    Authors: Ahmed K. Farahat, Ali Ghodsi, Mohamed S. Kamel

    Abstract: This paper defines a generalized column subset selection problem which is concerned with the selection of a few columns from a source matrix A that best approximate the span of a target matrix B. The paper then proposes a fast greedy algorithm for solving this problem and draws connections to different problems that can be efficiently solved using the proposed algorithm.

    Submitted 24 December, 2013; originally announced December 2013.

    Comments: NIPS'13 Workshop on Greedy Algorithms, Frank-Wolfe and Friends

  23. arXiv:1210.4903  [pdf

    stat.ME cs.CE

    Detecting Change-Points in Time Series by Maximum Mean Discrepancy of Ordinal Pattern Distributions

    Authors: Mathieu Sinn, Ali Ghodsi, Karsten Keller

    Abstract: As a new method for detecting change-points in high-resolution time series, we apply Maximum Mean Discrepancy to the distributions of ordinal patterns in different parts of a time series. The main advantage of this approach is its computational simplicity and robustness with respect to (non-linear) monotonic transformations, which makes it particularly well-suited for the analysis of long biophysi… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-786-794