Skip to main content

Showing 1–25 of 25 results for author: Amini, A A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10686  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Neural Thompson Sampling

    Authors: Shuang Wu, Arash A. Amini

    Abstract: We consider an online decision-making problem with a reward function defined over graph-structured data. We formally formulate the problem as an instance of graph action bandit. We then propose \texttt{GNN-TS}, a Graph Neural Network (GNN) powered Thompson Sampling (TS) algorithm which employs a GNN approximator for estimating the mean reward function and the graph neural tangent features for unce… ▽ More

    Submitted 20 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  2. arXiv:2406.06014  [pdf, other

    math.ST cs.SI stat.ME stat.ML

    Network two-sample test for block models

    Authors: Chung Kyong Nguen, Oscar Hernan Madrid Padilla, Arash A. Amini

    Abstract: We consider the two-sample testing problem for networks, where the goal is to determine whether two sets of networks originated from the same stochastic model. Assuming no vertex correspondence and allowing for different numbers of nodes, we address a fundamental network testing problem that goes beyond simple adjacency matrix comparisons. We adopt the stochastic block model (SBM) for network dist… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2310.05250  [pdf, other

    cs.LG stat.ML

    Simplifying GNN Performance with Low Rank Kernel Models

    Authors: Luciano Vinas, Arash A. Amini

    Abstract: We revisit recent spectral GNN approaches to semi-supervised node classification (SSNC). We posit that many of the current GNN architectures may be over-engineered. Instead, simpler, traditional methods from nonparametric estimation, applied in the spectral domain, could replace many deep-learning inspired GNN designs. These conventional techniques appear to be well suited for a variety of graph t… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  4. arXiv:2307.09210  [pdf, other

    stat.ME cs.SI stat.ML

    Nested stochastic block model for simultaneously clustering networks and nodes

    Authors: Nathaniel Josephs, Arash A. Amini, Marina Paez, Lizhen Lin

    Abstract: We introduce the nested stochastic block model (NSBM) to cluster a collection of networks while simultaneously detecting communities within each network. NSBM has several appealing features including the ability to work on unlabeled networks with potentially different node sets, the flexibility to model heterogeneous communities, and the means to automatically select the number of classes for the… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  5. arXiv:2209.13619  [pdf, other

    physics.med-ph cs.CV stat.AP

    LapGM: A Multisequence MR Bias Correction and Normalization Model

    Authors: Luciano Vinas, Arash A. Amini, Jade Fischer, Atchar Sudhyadhom

    Abstract: A spatially regularized Gaussian mixture model, LapGM, is proposed for the bias field correction and magnetic resonance normalization problem. The proposed spatial regularizer gives practitioners fine-tuned control between balancing bias field removal and preserving image contrast preservation for multi-sequence, magnetic resonance images. The fitted Gaussian parameters of LapGM serve as control v… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  6. arXiv:2206.14255  [pdf, other

    cs.LG math.ST stat.ML

    Target alignment in truncated kernel ridge regression

    Authors: Arash A. Amini, Richard Baumgartner, Dai Feng

    Abstract: Kernel ridge regression (KRR) has recently attracted renewed interest due to its potential for explaining the transient effects, such as double descent, that emerge during neural network training. In this work, we study how the alignment between the target function and the kernel affects the performance of the KRR. We focus on the truncated KRR (TKRR) which utilizes an additional parameter that co… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  7. arXiv:2206.05829  [pdf, other

    math.ST cs.DM stat.ML

    A non-graphical representation of conditional independence via the neighbourhood lattice

    Authors: Arash A. Amini, Bryon Aragam, Qing Zhou

    Abstract: We introduce and study the neighbourhood lattice decomposition of a distribution, which is a compact, non-graphical representation of conditional independence that is valid in the absence of a faithful graphical representation. The idea is to view the set of neighbourhoods of a variable as a subset lattice, and partition this lattice into convex sublattices, each of which directly encodes a collec… ▽ More

    Submitted 12 June, 2022; originally announced June 2022.

    Comments: 30 pages, 3 figures

  8. arXiv:2201.09194  [pdf, other

    stat.ME cs.LG stat.CO

    Distributed Learning of Generalized Linear Causal Networks

    Authors: Qiaoling Ye, Arash A. Amini, Qing Zhou

    Abstract: We consider the task of learning causal structures from data stored on multiple machines, and propose a novel structure learning method called distributed annealing on regularized likelihood score (DARLS) to solve this problem. We model causal structures by a directed acyclic graph that is parameterized with generalized linear models, so that our method is applicable to various types of data. To o… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: 27 pages, 3 tables, 3 figures

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 (Early Access)

  9. arXiv:2012.15047  [pdf, other

    math.ST cs.SI stat.ML

    Adjusted chi-square test for degree-corrected block models

    Authors: Linfan Zhang, Arash A. Amini

    Abstract: We propose a goodness-of-fit test for degree-corrected stochastic block models (DCSBM). The test is based on an adjusted chi-square statistic for measuring equality of means among groups of $n$ multinomial distributions with $d_1,\dots,d_n$ observations. In the context of network models, the number of multinomials, $n$, grows much faster than the number of observations, $d_i$, corresponding to the… ▽ More

    Submitted 22 September, 2022; v1 submitted 30 December, 2020; originally announced December 2020.

  10. arXiv:1909.03347  [pdf, other

    math.ST cs.LG stat.ML

    Concentration of kernel matrices with application to kernel spectral clustering

    Authors: Arash A. Amini, Zahra S. Razaee

    Abstract: We study the concentration of random kernel matrices around their mean. We derive nonasymptotic exponential concentration inequalities for Lipschitz kernels assuming that the data points are independent draws from a class of multivariate distributions on $\mathbb R^d$, including the strongly log-concave distributions under affine transformations. A feature of our result is that the data points nee… ▽ More

    Submitted 27 January, 2020; v1 submitted 7 September, 2019; originally announced September 2019.

  11. arXiv:1909.01978  [pdf, other

    math.ST cs.LG stat.ML

    On perfectness in Gaussian graphical models

    Authors: Arash A. Amini, Bryon Aragam, Qing Zhou

    Abstract: Knowing when a graphical model is perfect to a distribution is essential in order to relate separation in the graph to conditional independence in the distribution, and this is particularly important when performing inference from data. When the model is perfect, there is a one-to-one correspondence between conditional independence statements in the distribution and separation statements in the gr… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: This note is based on a result that first appeared in arXiv:1711.00991v1. The original article has now been split into two parts

  12. arXiv:1906.06276  [pdf, ps, other

    stat.ML cs.LG math.ST

    Spectrally-truncated kernel ridge regression and its free lunch

    Authors: Arash A. Amini

    Abstract: Kernel ridge regression (KRR) is a well-known and popular nonparametric regression approach with many desirable properties, including minimax rate-optimality in estimating functions that belong to common reproducing kernel Hilbert spaces (RKHS). The approach, however, is computationally intensive for large data sets, due to the need to operate on a dense $n \times n$ kernel matrix, where $n$ is th… ▽ More

    Submitted 12 October, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  13. arXiv:1906.03052  [pdf, other

    cs.SI physics.soc-ph stat.AP

    Approximate Identification of the Optimal Epidemic Source in Complex Networks

    Authors: S. Jalil Kazemitabar, Arash A. Amini

    Abstract: We consider the problem of identifying the source of an epidemic, spreading through a network, from a complete observation of the infected nodes in a snapshot of the network. Previous work on the problem has often employed geometric, spectral or heuristic approaches to identify the source, with the trees being the most studied network topology. We take a fully statistical approach and derive novel… ▽ More

    Submitted 12 June, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  14. Optimizing regularized Cholesky score for order-based learning of Bayesian networks

    Authors: Qiaoling Ye, Arash A. Amini, Qing Zhou

    Abstract: Bayesian networks are a class of popular graphical models that encode causal and conditional independence relations among variables by directed acyclic graphs (DAGs). We propose a novel structure learning method, annealing on regularized Cholesky score (ARCS), to search over topological sorts, or permutations of nodes, for a high-scoring Bayesian network. Our scoring function is derived from regul… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.

    Comments: 15 pages, 7 figures, 5 tables

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)

  15. arXiv:1904.05330  [pdf, other

    cs.SI cs.LG stat.ME stat.ML

    Hierarchical Stochastic Block Model for Community Detection in Multiplex Networks

    Authors: Arash A. Amini, Marina S. Paez, Lizhen Lin

    Abstract: Multiplex networks have become increasingly more prevalent in many fields, and have emerged as a powerful tool for modeling the complexity of real networks. There is a critical need for develo** inference models for multiplex networks that can take into account potential dependencies across different layers, particularly when the aim is community detection. We add to a limited literature by prop… ▽ More

    Submitted 12 February, 2023; v1 submitted 29 March, 2019; originally announced April 2019.

    Comments: 27 pages, 11 figures

  16. arXiv:1903.09631  [pdf, other

    math.ST cs.LG eess.SP stat.ML

    High-Dimensional Bernoulli Autoregressive Process with Long-Range Dependence

    Authors: Parthe Pandit, Mojtaba Sahraee-Ardakan, Arash A. Amini, Sundeep Rangan, Alyson K. Fletcher

    Abstract: We consider the problem of estimating the parameters of a multivariate Bernoulli process with auto-regressive feedback in the high-dimensional setting where the number of samples available is much less than the number of parameters. This problem arises in learning interconnections of networks of dynamical systems with spiking or binary-valued data. We allow the process to depend on its past up to… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: To appear at AISTATS 2019 titled "Sparse Multivariate Bernoulli Processes in High Dimensions"

    Journal ref: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019, Naha, Okinawa, Japan. PMLR: Volume 89

  17. arXiv:1903.08829  [pdf, other

    stat.ML cs.LG stat.CO

    Exact slice sampler for Hierarchical Dirichlet Processes

    Authors: Arash A. Amini, Marina Paez, Lizhen Lin, Zahra S. Razaee

    Abstract: We propose an exact slice sampler for Hierarchical Dirichlet process (HDP) and its associated mixture models (Teh et al., 2006). Although there are existing MCMC algorithms for sampling from the HDP, a slice sampler has been missing from the literature. Slice sampling is well-known for its desirable properties including its fast mixing and its natural potential for parallelization. On the other ha… ▽ More

    Submitted 21 March, 2019; originally announced March 2019.

  18. arXiv:1803.06031  [pdf, other

    math.ST cs.SI stat.ML

    Optimal Bipartite Network Clustering

    Authors: Zhixin Zhou, Arash A. Amini

    Abstract: We study bipartite community detection in networks, or more generally the network biclustering problem. We present a fast two-stage procedure based on spectral initialization followed by the application of a pseudo-likelihood classifier twice. Under mild regularity conditions, we establish the weak consistency of the procedure (i.e., the convergence of the misclassification rate to zero) under a g… ▽ More

    Submitted 22 December, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

  19. arXiv:1803.04547  [pdf, other

    math.ST cs.SI stat.ML

    Analysis of spectral clustering algorithms for community detection: the general bipartite setting

    Authors: Zhixin Zhou, Arash A. Amini

    Abstract: We consider spectral clustering algorithms for community detection under a general bipartite stochastic block model (SBM). A modern spectral clustering algorithm consists of three steps: (1) regularization of an appropriate adjacency or Laplacian matrix (2) a form of spectral truncation and (3) a k-means type algorithm in the reduced spectral domain. We focus on the adjacency-based spectral cluste… ▽ More

    Submitted 22 December, 2018; v1 submitted 12 March, 2018; originally announced March 2018.

  20. arXiv:1711.00991  [pdf, other

    math.ST cs.LG stat.ML

    The neighborhood lattice for encoding partial correlations in a Hilbert space

    Authors: Arash A. Amini, Bryon Aragam, Qing Zhou

    Abstract: Neighborhood regression has been a successful approach in graphical and structural equation modeling, with applications to learning undirected and directed graphical models. We extend these ideas by defining and studying an algebraic structure called the neighborhood lattice based on a generalized notion of neighborhood regression. We show that this algebraic structure has the potential to provide… ▽ More

    Submitted 6 February, 2019; v1 submitted 2 November, 2017; originally announced November 2017.

  21. arXiv:1703.04943  [pdf, other

    cs.SI cs.LG stat.ML

    Matched bipartite block model with covariates

    Authors: Zahra S. Razaee, Arash A. Amini, **gyi Jessica Li

    Abstract: Community detection or clustering is a fundamental task in the analysis of network data. Many real networks have a bipartite structure which makes community detection challenging. In this paper, we consider a model which allows for matched communities in the bipartite setting, in addition to node covariates with information about the matching. We derive a simple fast algorithm for fitting the mode… ▽ More

    Submitted 15 March, 2017; originally announced March 2017.

  22. arXiv:1511.08963  [pdf, ps, other

    math.ST cs.LG stat.ML

    Learning Directed Acyclic Graphs with Penalized Neighbourhood Regression

    Authors: Bryon Aragam, Arash A. Amini, Qing Zhou

    Abstract: We study a family of regularized score-based estimators for learning the structure of a directed acyclic graph (DAG) for a multivariate normal distribution from high-dimensional data with $p\gg n$. Our main results establish support recovery guarantees and deviation bounds for a family of penalized least-squares estimators under concave regularization without assuming prior knowledge of a variable… ▽ More

    Submitted 1 October, 2017; v1 submitted 28 November, 2015; originally announced November 2015.

    Comments: 54 pages, 1 figure

  23. arXiv:1406.5647  [pdf, ps, other

    cs.LG cs.SI stat.ML

    On semidefinite relaxations for the block model

    Authors: Arash A. Amini, Elizaveta Levina

    Abstract: The stochastic block model (SBM) is a popular tool for community detection in networks, but fitting it by maximum likelihood (MLE) involves a computationally infeasible optimization problem. We propose a new semidefinite programming (SDP) solution to the problem of fitting the SBM, derived as a relaxation of the MLE. We put ours and previously proposed SDPs in a unified framework, as relaxations o… ▽ More

    Submitted 16 March, 2016; v1 submitted 21 June, 2014; originally announced June 2014.

  24. arXiv:1207.2340  [pdf, ps, other

    cs.SI cs.LG math.ST physics.soc-ph stat.ML

    Pseudo-likelihood methods for community detection in large sparse networks

    Authors: Arash A. Amini, Aiyou Chen, Peter J. Bickel, Elizaveta Levina

    Abstract: Many algorithms have been proposed for fitting network models with communities, but most of them do not scale well to large networks, and often fail on sparse networks. Here we propose a new fast pseudo-likelihood method for fitting the stochastic block model for networks, as well as a variant that allows for an arbitrary degree distribution by conditioning on degrees. We show that the algorithms… ▽ More

    Submitted 5 November, 2013; v1 submitted 10 July, 2012; originally announced July 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1138 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1138

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 4, 2097-2122

  25. arXiv:0803.4026  [pdf, ps, other

    math.ST cs.IT

    High-dimensional analysis of semidefinite relaxations for sparse principal components

    Authors: Arash A. Amini, Martin J. Wainwright

    Abstract: Principal component analysis (PCA) is a classical method for dimensionality reduction based on extracting the dominant eigenvectors of the sample covariance matrix. However, PCA is well known to behave poorly in the ``large $p$, small $n$'' setting, in which the problem dimension $p$ is comparable to or larger than the sample size $n$. This paper studies PCA in this high-dimensional regime, but… ▽ More

    Submitted 26 August, 2009; v1 submitted 27 March, 2008; originally announced March 2008.

    Comments: Published in at http://dx.doi.org/10.1214/08-AOS664 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS664 MSC Class: 62H25 (Primary) 62F12 (Secondary)

    Journal ref: Annals of Statistics 2009, Vol. 37, No. 5B, 2877-2921