Skip to main content

Showing 1–8 of 8 results for author: Kolda, T G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2305.06927  [pdf, other

    cs.LG math.OC stat.ML

    Convergence of Alternating Gradient Descent for Matrix Factorization

    Authors: Rachel Ward, Tamara G. Kolda

    Abstract: We consider alternating gradient descent (AGD) with fixed step size applied to the asymmetric matrix factorization objective. We show that, for a rank-$r$ matrix $\mathbf{A} \in \mathbb{R}^{m \times n}$, $T = C (\frac{σ_1(\mathbf{A})}{σ_r(\mathbf{A})})^2 \log(1/ε)$ iterations of alternating gradient descent suffice to reach an $ε$-optimal factorization… ▽ More

    Submitted 7 February, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

  2. arXiv:2202.06930  [pdf, other

    stat.ML cs.LG math.NA

    Tensor Moments of Gaussian Mixture Models: Theory and Applications

    Authors: João M. Pereira, Joe Kileel, Tamara G. Kolda

    Abstract: Gaussian mixture models (GMMs) are fundamental tools in statistical and data sciences. We study the moments of multivariate Gaussians and GMMs. The $d$-th moment of an $n$-dimensional random variable is a symmetric $d$-way tensor of size $n^d$, so working with moments naively is assumed to be prohibitively expensive for $d>2$ and larger values of $n$. In this work, we develop theory and numerical… ▽ More

    Submitted 21 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

  3. arXiv:1906.01687  [pdf, other

    math.NA cs.LG stat.ML

    Stochastic Gradients for Large-Scale Tensor Decomposition

    Authors: Tamara G. Kolda, David Hong

    Abstract: Tensor decomposition is a well-known tool for multiway data analysis. This work proposes using stochastic gradients for efficient generalized canonical polyadic (GCP) tensor decomposition of large-scale tensors. GCP tensor decomposition is a recently proposed version of tensor decomposition that allows for a variety of loss functions such as Bernoulli loss for binary data or Huber loss for robust… ▽ More

    Submitted 7 July, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Journal ref: SIAM Journal on Mathematics of Data Science, Vol. 2, No. 4, pp. 1066-1095, 2020

  4. arXiv:1808.07510  [pdf, other

    stat.ML cs.LG

    XPCA: Extending PCA for a Combination of Discrete and Continuous Variables

    Authors: Clifford Anderson-Bergman, Tamara G. Kolda, Kina Kincher-Winoto

    Abstract: Principal component analysis (PCA) is arguably the most popular tool in multivariate exploratory data analysis. In this paper, we consider the question of how to handle heterogeneous variables that include continuous, binary, and ordinal. In the probabilistic interpretation of low-rank PCA, the data has a normal multivariate distribution and, therefore, normal marginal distributions for each colum… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

  5. arXiv:1105.3422  [pdf, other

    math.NA physics.data-an stat.ML

    All-at-once Optimization for Coupled Matrix and Tensor Factorizations

    Authors: Evrim Acar, Tamara G. Kolda, Daniel M. Dunlavy

    Abstract: Joint analysis of data from multiple sources has the potential to improve our understanding of the underlying structures in complex data sets. For instance, in restaurant recommendation systems, recommendations can be based on rating histories of customers. In addition to rating histories, customers' social networks (e.g., Facebook friendships) and restaurant categories information (e.g., Thai or… ▽ More

    Submitted 17 May, 2011; originally announced May 2011.

  6. arXiv:1103.2068  [pdf, other

    cs.LG cs.DC stat.ML

    COMET: A Recipe for Learning and Using Large Ensembles on Massive Data

    Authors: Justin D. Basilico, M. Arthur Munson, Tamara G. Kolda, Kevin R. Dixon, W. Philip Kegelmeyer

    Abstract: COMET is a single-pass MapReduce algorithm for learning on large-scale data. It builds multiple random forest ensembles on distributed blocks of data and merges them into a mega-ensemble. This approach is appropriate when learning from massive-scale data that is too large to fit on a single machine. To get the best accuracy, IVoting should be used instead of bagging to generate the training subset… ▽ More

    Submitted 8 September, 2011; v1 submitted 10 March, 2011; originally announced March 2011.

    ACM Class: I.5; I.2.6; H.2.8

    Journal ref: ICDM 2011: Proceedings of the 2011 IEEE International Conference on Data Mining, pp. 41-50, 2011

  7. arXiv:1010.3043  [pdf, other

    math.NA stat.CO stat.ME

    Making Tensor Factorizations Robust to Non-Gaussian Noise

    Authors: Eric C. Chi, Tamara G. Kolda

    Abstract: Tensors are multi-way arrays, and the Candecomp/Parafac (CP) tensor factorization has found application in many different domains. The CP model is typically fit using a least squares objective function, which is a maximum likelihood estimate under the assumption of i.i.d. Gaussian noise. We demonstrate that this loss function can actually be highly sensitive to non-Gaussian noise. Therefore, we pr… ▽ More

    Submitted 14 October, 2010; originally announced October 2010.

    Comments: Contributed presentation at the NIPS Workshop on Tensors, Kernels, and Machine Learning, Whistler, BC, Canada, December 10, 2010

  8. arXiv:1005.4006  [pdf, other

    math.NA physics.data-an stat.ML

    Temporal Link Prediction using Matrix and Tensor Factorizations

    Authors: Daniel M. Dunlavy, Tamara G. Kolda, Evrim Acar

    Abstract: The data in many disciplines such as social networks, web analysis, etc. is link-based, and the link structure can be exploited for many different data mining tasks. In this paper, we consider the problem of temporal link prediction: Given link data for times 1 through T, can we predict the links at time T+1? If our data has underlying periodic structure, can we predict out even further in time, i… ▽ More

    Submitted 19 June, 2010; v1 submitted 21 May, 2010; originally announced May 2010.

    Journal ref: ACM Transactions on Knowledge Discovery from Data 5(2):10 (27 pages), February 2011