Skip to main content

Showing 1–18 of 18 results for author: Soltani, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2103.12827  [pdf, other

    cs.LG eess.IV stat.ML

    Fisher Task Distance and Its Application in Neural Architecture Search

    Authors: Cat P. Le, Mohammadreza Soltani, Juncheng Dong, Vahid Tarokh

    Abstract: We formulate an asymmetric (or non-commutative) distance between tasks based on Fisher Information Matrices, called Fisher task distance. This distance represents the complexity of transferring the knowledge from one task to another. We provide a proof of consistency for our distance through theorems and experiments on various classification tasks from MNIST, CIFAR-10, CIFAR-100, ImageNet, and Tas… ▽ More

    Submitted 30 April, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Published in IEEE Access, Volume 10, 2022

  2. arXiv:2007.06682  [pdf, other

    cs.LG cs.CV stat.ML

    GeoStat Representations of Time Series for Fast Classification

    Authors: Robert J. Ravier, Mohammadreza Soltani, Miguel Simões, Denis Garagic, Vahid Tarokh

    Abstract: Recent advances in time series classification have largely focused on methods that either employ deep learning or utilize other machine learning models for feature extraction. Though successful, their power often comes at the requirement of computational complexity. In this paper, we introduce GeoStat representations for time series. GeoStat representations are based off of a generalization of rec… ▽ More

    Submitted 11 January, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 28 pages, 8 tables, 5 figures

  3. arXiv:2007.06140  [pdf, other

    cs.LG stat.ML

    Projected Latent Markov Chain Monte Carlo: Conditional Sampling of Normalizing Flows

    Authors: Chris Cannella, Mohammadreza Soltani, Vahid Tarokh

    Abstract: We introduce Projected Latent Markov Chain Monte Carlo (PL-MCMC), a technique for sampling from the high-dimensional conditional distributions learned by a normalizing flow. We prove that a Metropolis-Hastings implementation of PL-MCMC asymptotically samples from the exact conditional distributions associated with a normalizing flow. As a conditional sampling method, PL-MCMC enables Monte Carlo Ex… ▽ More

    Submitted 26 February, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: 27 pages, 22 figures, 4 tables

  4. arXiv:2007.04087  [pdf, other

    cs.LG stat.ML

    Hyperparameter Optimization in Neural Networks via Structured Sparse Recovery

    Authors: Minsu Cho, Mohammadreza Soltani, Chinmay Hegde

    Abstract: In this paper, we study two important problems in the automated design of neural networks -- Hyper-parameter Optimization (HPO), and Neural Architecture Search (NAS) -- through the lens of sparse recovery methods. In the first part of this paper, we establish a novel connection between HPO and structured sparse recovery. In particular, we show that a special encoding of the hyperparameter space en… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: text overlap with arXiv:1906.02869

  5. arXiv:2004.07296  [pdf, other

    cs.LG stat.ML

    Clustering Time Series Data through Autoencoder-based Deep Learning Models

    Authors: Neda Tavakoli, Sima Siami-Namini, Mahdi Adl Khanghah, Fahimeh Mirza Soltani, Akbar Siami Namin

    Abstract: Machine learning and in particular deep learning algorithms are the emerging approaches to data analysis. These techniques have transformed traditional data mining-based analysis radically into a learning-based model in which existing data sets along with their cluster labels (i.e., train set) are learned to build a supervised learning model and predict the cluster labels of unseen data (i.e., tes… ▽ More

    Submitted 11 April, 2020; originally announced April 2020.

  6. arXiv:1910.09122  [pdf, other

    cs.LG cs.CV stat.ML

    Perception-Distortion Trade-off with Restricted Boltzmann Machines

    Authors: Chris Cannella, Jie Ding, Mohammadreza Soltani, Vahid Tarokh

    Abstract: In this work, we introduce a new procedure for applying Restricted Boltzmann Machines (RBMs) to missing data inference tasks, based on linearization of the effective energy function governing the distribution of observations. We compare the performance of our proposed procedure with those obtained using existing reconstruction procedures trained on incomplete data. We place these performance compa… ▽ More

    Submitted 20 October, 2019; originally announced October 2019.

    Comments: 5 pages, 1 figure

  7. arXiv:1906.02869  [pdf, other

    cs.LG stat.ML

    One-Shot Neural Architecture Search via Compressive Sensing

    Authors: Minsu Cho, Mohammadreza Soltani, Chinmay Hegde

    Abstract: Neural Architecture Search remains a very challenging meta-learning problem. Several recent techniques based on parameter-sharing idea have focused on reducing the NAS running time by leveraging proxy models, leading to architectures with competitive performance compared to those with hand-crafted designs. In this paper, we propose an iterative technique for NAS, inspired by algorithms for learnin… ▽ More

    Submitted 7 February, 2022; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: 2nd Workshop on Neural Architecture Search at ICLR 2021

  8. arXiv:1903.07045  [pdf, other

    cs.LG stat.ML

    Deep Feature Selection using a Teacher-Student Network

    Authors: Ali Mirzaei, Vahid Pourahmadi, Mehran Soltani, Hamid Sheikhzadeh

    Abstract: High-dimensional data in many machine learning applications leads to computational and analytical complexities. Feature selection provides an effective way for solving these problems by removing irrelevant and redundant features, thus reducing model complexity and improving accuracy and generalization capability of the model. In this paper, we present a novel teacher-student feature selection (TSF… ▽ More

    Submitted 17 March, 2019; originally announced March 2019.

    Comments: 28 pages

  9. arXiv:1902.04664  [pdf, other

    stat.ML cs.LG

    Learning Generative Models of Structured Signals from Their Superposition Using GANs with Application to Denoising and Demixing

    Authors: Mohammadreza Soltani, Swayambhoo Jain, Abhinav Sambasivan

    Abstract: Recently, Generative Adversarial Networks (GANs) have emerged as a popular alternative for modeling complex high dimensional distributions. Most of the existing works implicitly assume that the clean samples from the target distribution are easily available. However, in many applications, this assumption is violated. In this paper, we consider the observation setting when the samples from target d… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  10. arXiv:1810.05893  [pdf, other

    cs.IT cs.LG eess.SP stat.ML

    Deep Learning-Based Channel Estimation

    Authors: Mehran Soltani, Vahid Pourahmadi, Ali Mirzaei, Hamid Sheikhzadeh

    Abstract: In this paper, we present a deep learning (DL) algorithm for channel estimation in communication systems. We consider the time-frequency response of a fast fading communication channel as a two-dimensional image. The aim is to find the unknown values of the channel response using some known values at the pilot locations. To this end, a general pipeline using deep image processing techniques, image… ▽ More

    Submitted 19 February, 2019; v1 submitted 13 October, 2018; originally announced October 2018.

    Comments: 4 pages , 5 figures , Accepted for publication in the IEEE Communications Letters

  11. arXiv:1712.03281  [pdf, other

    stat.ML

    Fast Low-Rank Matrix Estimation without the Condition Number

    Authors: Mohammadreza Soltani, Chinmay Hegde

    Abstract: In this paper, we study the general problem of optimizing a convex function $F(L)$ over the set of $p \times p$ matrices, subject to rank constraints on $L$. However, existing first-order methods for solving such problems either are too slow to converge, or require multiple invocations of singular value decompositions. On the other hand, factorization-based non-convex algorithms, while being much… ▽ More

    Submitted 8 December, 2017; originally announced December 2017.

  12. arXiv:1710.00109  [pdf, other

    stat.ML

    Reconstruction from Periodic Nonlinearities, With Applications to HDR Imaging

    Authors: Viraj Shah, Mohammadreza Soltani, Chinmay Hegde

    Abstract: We consider the problem of reconstructing signals and images from periodic nonlinearities. For such problems, we design a measurement scheme that supports efficient reconstruction; moreover, our method can be adapted to extend to compressive sensing-based signal and image acquisition systems. Our techniques can be potentially useful for reducing the measurement complexity of high dynamic range (HD… ▽ More

    Submitted 29 September, 2017; originally announced October 2017.

  13. arXiv:1708.02999  [pdf, other

    stat.ML

    Demixing Structured Superposition Signals from Periodic and Aperiodic Nonlinear Observations

    Authors: Mohammadreza Soltani, Chinmay Hegde

    Abstract: We consider the demixing problem of two (or more) structured high-dimensional vectors from a limited number of nonlinear observations where this nonlinearity is due to either a periodic or an aperiodic function. We study certain families of structured superposition models, and propose a method which provably recovers the components given (nearly) $m = \mathcal{O}(s)$ samples where $s$ denotes the… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1701.06597

  14. arXiv:1706.08936  [pdf, other

    stat.ML

    Fast Algorithms for Learning Latent Variables in Graphical Models

    Authors: Mohammadreza Soltani, Chinmay Hegde

    Abstract: We study the problem of learning latent variables in Gaussian graphical models. Existing methods for this problem assume that the precision matrix of the observed variables is the superposition of a sparse and a low-rank component. In this paper, we focus on the estimation of the low-rank component, which encodes the effect of marginalization over the latent variables. We introduce fast, proper le… ▽ More

    Submitted 11 July, 2017; v1 submitted 27 June, 2017; originally announced June 2017.

  15. arXiv:1705.07469  [pdf, other

    stat.ML

    Improved Algorithms for Matrix Recovery from Rank-One Projections

    Authors: Mohammadreza Soltani, Chinmay Hegde

    Abstract: We consider the problem of estimation of a low-rank matrix from a limited number of noisy rank-one projections. In particular, we propose two fast, non-convex \emph{proper} algorithms for matrix recovery and support them with rigorous theoretical analysis. We show that the proposed algorithms enjoy linear convergence and that their sample complexity is independent of the condition number of the un… ▽ More

    Submitted 21 May, 2017; originally announced May 2017.

  16. arXiv:1701.06607  [pdf, other

    stat.ML

    Stable Recovery Of Sparse Vectors From Random Sinusoidal Feature Maps

    Authors: Mohammadreza Soltani, Chinmay Hegde

    Abstract: Random sinusoidal features are a popular approach for speeding up kernel-based inference in large datasets. Prior to the inference stage, the approach suggests performing dimensionality reduction by first multiplying each data vector by a random Gaussian matrix, and then computing an element-wise sinusoid. Theoretical analysis shows that collecting a sufficient number of such features can be relia… ▽ More

    Submitted 11 July, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

  17. arXiv:1701.06597  [pdf, other

    stat.ML

    Iterative Thresholding for Demixing Structured Superpositions in High Dimensions

    Authors: Mohammadreza Soltani, Chinmay Hegde

    Abstract: We consider the demixing problem of two (or more) high-dimensional vectors from nonlinear observations when the number of such observations is far less than the ambient dimension of the underlying vectors. Specifically, we demonstrate an algorithm that stably estimate the underlying components under general \emph{structured sparsity} assumptions on these components. Specifically, we show that for… ▽ More

    Submitted 23 January, 2017; originally announced January 2017.

  18. Fast Algorithms for Demixing Sparse Signals from Nonlinear Observations

    Authors: Mohammadreza Soltani, Chinmay Hegde

    Abstract: We study the problem of demixing a pair of sparse signals from noisy, nonlinear observations of their superposition. Mathematically, we consider a nonlinear signal observation model, $y_i = g(a_i^Tx) + e_i, \ i=1,\ldots,m$, where $x = Φw+Ψz$ denotes the superposition signal, $Φ$ and $Ψ$ are orthonormal bases in $\mathbb{R}^n$, and $w, z\in\mathbb{R}^n$ are sparse coefficient vectors of the constit… ▽ More

    Submitted 21 July, 2017; v1 submitted 3 August, 2016; originally announced August 2016.