Skip to main content

Showing 1–50 of 110 results for author: Needell, D

Searching in archive math. Search in all archives.
.
  1. Stochastic Iterative Methods for Online Rank Aggregation from Pairwise Comparisons

    Authors: Benjamin Jarman, Lara Kassab, Deanna Needell, Alexander Sietsema

    Abstract: In this paper, we consider large-scale ranking problems where one is given a set of (possibly non-redundant) pairwise comparisons and the underlying ranking explained by those comparisons is desired. We show that stochastic gradient descent approaches can be leveraged to offer convergence to a solution that reveals the underlying ranking while requiring low-memory operations. We introduce several… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Journal ref: Bit Numer Math 64, 26 (2024)

  2. arXiv:2406.12021  [pdf, other

    math.OC math.NA

    Block Matrix and Tensor Randomized Kaczmarz Methods for Linear Feasibility Problems

    Authors: Minxin Zhang, Jamie Haddock, Deanna Needell

    Abstract: The randomized Kaczmarz methods are a popular and effective family of iterative methods for solving large-scale linear systems of equations, which have also been applied to linear feasibility problems. In this work, we propose a new block variant of the randomized Kaczmarz method, B-MRK, for solving linear feasibility problems defined by matrices. We show that B-MRK converges linearly in expectati… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2405.05818  [pdf, ps, other

    cs.DS cs.LG math.NA math.OC

    Fine-grained Analysis and Faster Algorithms for Iteratively Solving Linear Systems

    Authors: Michał Dereziński, Daniel LeJeune, Deanna Needell, Elizaveta Rebrova

    Abstract: While effective in practice, iterative methods for solving large systems of linear equations can be significantly affected by problem-dependent condition number quantities. This makes characterizing their time complexity challenging, particularly when we wish to make comparisons between deterministic and stochastic methods, that may or may not rely on preconditioning and/or fast matrix multiplicat… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 32 pages

  4. arXiv:2405.03073  [pdf, other

    math.OC stat.ML

    Convergence and Complexity Guarantee for Inexact First-order Riemannian Optimization Algorithms

    Authors: Yuchen Li, Laura Balzano, Deanna Needell, Hanbaek Lyu

    Abstract: We analyze inexact Riemannian gradient descent (RGD) where Riemannian gradients and retractions are inexactly (and cheaply) computed. Our focus is on understanding when inexact RGD converges and what is the complexity in the general nonconvex and constrained setting. We answer these questions in a general framework of tangential Block Majorization-Minimization (tBMM). We establish that tBMM conver… ▽ More

    Submitted 9 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: 23 pages, 5 figures. ICML 2024. Appendix revised

  5. arXiv:2403.14688  [pdf, other

    cs.LG math.NA

    Kernel Alignment for Unsupervised Feature Selection via Matrix Factorization

    Authors: Ziyuan Lin, Deanna Needell

    Abstract: By removing irrelevant and redundant features, feature selection aims to find a good representation of the original features. With the prevalence of unlabeled data, unsupervised feature selection has been proven effective in alleviating the so-called curse of dimensionality. Most existing matrix factorization-based unsupervised feature selection methods are built upon subspace learning, but they h… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    MSC Class: 65F10; 65F22; 90C26

  6. arXiv:2403.01204  [pdf, ps, other

    cs.LG math.NA stat.ML

    Stochastic gradient descent for streaming linear and rectified linear systems with Massart noise

    Authors: Halyun Jeong, Deanna Needell, Elizaveta Rebrova

    Abstract: We propose SGD-exp, a stochastic gradient descent approach for linear and ReLU regressions under Massart noise (adversarial semi-random corruption model) for the fully streaming setting. We show novel nearly linear convergence guarantees of SGD-exp to the true parameter with up to $50\%$ Massart corruption rate, and with any corruption rate in the case of symmetric oblivious corruptions. This is t… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: Submitted to a journal

    MSC Class: 65F10; 60-XX

  7. arXiv:2312.10330  [pdf, other

    math.OC stat.ML

    Convergence and complexity of block majorization-minimization for constrained block-Riemannian optimization

    Authors: Yuchen Li, Laura Balzano, Deanna Needell, Hanbaek Lyu

    Abstract: Block majorization-minimization (BMM) is a simple iterative algorithm for nonconvex optimization that sequentially minimizes a majorizing surrogate of the objective function in each block coordinate while the other block coordinates are held fixed. We consider a family of BMM algorithms for minimizing smooth nonconvex objectives, where each parameter block is constrained within a subset of a Riema… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 54 pages, 8 figures

  8. arXiv:2311.10789  [pdf, other

    cs.LG math.NA

    Stratified-NMF for Heterogeneous Data

    Authors: James Chapman, Yotam Yaniv, Deanna Needell

    Abstract: Non-negative matrix factorization (NMF) is an important technique for obtaining low dimensional representations of datasets. However, classical NMF does not take into account data that is collected at different times or in different locations, which may exhibit heterogeneity. We resolve this problem by solving a modified NMF objective, Stratified-NMF, that simultaneously learns strata-dependent st… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 5 pages. Will appear in IEEE Asilomar Conference on Signals, Systems, and Computers 2023

    ACM Class: G.1.6; I.5.3; I.5.4

  9. arXiv:2308.13709  [pdf, other

    cs.IT math.NA

    Fast and Low-Memory Compressive Sensing Algorithms for Low Tucker-Rank Tensor Approximation from Streamed Measurements

    Authors: Cullen Haselby, Mark A. Iwen, Deanna Needell, Elizaveta Rebrova, William Swartworth

    Abstract: In this paper we consider the problem of recovering a low-rank Tucker approximation to a massive tensor based solely on structured random compressive measurements. Crucially, the proposed random measurement ensembles are both designed to be compactly represented (i.e., low-memory), and can also be efficiently computed in one-pass over the tensor. Thus, the proposed compressive sensing approach may… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 59 pages, 8 figures

    MSC Class: 65F55

  10. arXiv:2307.04056  [pdf, other

    stat.ML cs.LG eess.SP math.NA

    Manifold Filter-Combine Networks

    Authors: Joyce Chew, Edward De Brouwer, Smita Krishnaswamy, Deanna Needell, Michael Perlmutter

    Abstract: We introduce a class of manifold neural networks (MNNs) that we call Manifold Filter-Combine Networks (MFCNs), that aims to further our understanding of MNNs, analogous to how the aggregate-combine framework helps with the understanding of graph neural networks (GNNs). This class includes a wide variety of subclasses that can be thought of as the manifold analog of various popular GNNs. We then co… ▽ More

    Submitted 5 September, 2023; v1 submitted 8 July, 2023; originally announced July 2023.

  11. arXiv:2306.04730  [pdf, other

    eess.SP cs.LG math.NA math.OC stat.ML

    Stochastic Natural Thresholding Algorithms

    Authors: Rachel Grotheer, Shuang Li, Anna Ma, Deanna Needell, **g Qin

    Abstract: Sparse signal recovery is one of the most fundamental problems in various applications, including medical imaging and remote sensing. Many greedy algorithms based on the family of hard thresholding operators have been developed to solve the sparse signal recovery problem. More recently, Natural Thresholding (NT) has been proposed with improved computational efficiency. This paper proposes and disc… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  12. arXiv:2306.00507  [pdf, other

    math.NA math.DG math.OC

    Curvature corrected tangent space-based approximation of manifold-valued data

    Authors: Willem Diepeveen, Joyce Chew, Deanna Needell

    Abstract: When generalizing schemes for real-valued data approximation or decomposition to data living in Riemannian manifolds, tangent space-based schemes are very attractive for the simple reason that these spaces are linear. An open challenge is to do this in such a way that the generalized scheme is applicable to general Riemannian manifolds, is global-geometry aware and is computationally feasible. Exi… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    MSC Class: 53Z50; 15A69; 90C26; 90C30; 53-04; 53-08; 49Q99

  13. arXiv:2305.04080  [pdf, other

    math.NA cs.LG

    Robust Tensor CUR Decompositions: Rapid Low-Tucker-Rank Tensor Recovery with Sparse Corruption

    Authors: HanQin Cai, Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: We study the tensor robust principal component analysis (TRPCA) problem, a tensorial extension of matrix robust principal component analysis (RPCA), that aims to split the given tensor into an underlying low-rank component and a sparse outlier component. This work proposes a fast algorithm, called Robust Tensor CUR Decompositions (RTCUR), for large-scale non-convex TRPCA problems under the Tucker… ▽ More

    Submitted 10 October, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    MSC Class: 68p20; 68W20; 68W25; 68Q25; 65F30

    Journal ref: SIAM Journal on Imaging Sciences 17 (1), 225-247, 2024

  14. arXiv:2304.10123  [pdf, other

    stat.ML math.NA

    Linear Convergence of Reshuffling Kaczmarz Methods With Sparse Constraints

    Authors: Halyun Jeong, Deanna Needell

    Abstract: The Kaczmarz method (KZ) and its variants, which are types of stochastic gradient descent (SGD) methods, have been extensively studied due to their simplicity and efficiency in solving linear equation systems. The iterative thresholding (IHT) method has gained popularity in various research fields, including compressed sensing or sparse linear regression, machine learning with additional structure… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Submitted to a journal

    MSC Class: 65F10; 65F22; 90C26

  15. arXiv:2304.04860  [pdf, other

    math.OC

    Iterative Singular Tube Hard Thresholding Algorithms for Tensor Recovery

    Authors: Rachel Grotheer, Shuang Li, Anna Ma, Deanna Needell, **g Qin

    Abstract: Due to the explosive growth of large-scale data sets, tensors have been a vital tool to analyze and process high-dimensional data. Different from the matrix case, tensor decomposition has been defined in various formats, which can be further used to define the best low-rank approximation of a tensor to significantly reduce the dimensionality for signal compression and recovery. In this paper, we c… ▽ More

    Submitted 26 December, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  16. arXiv:2302.14615  [pdf, other

    math.OC cs.CR cs.LG math.NA

    Randomized Kaczmarz in Adversarial Distributed Setting

    Authors: Longxiu Huang, Xia Li, Deanna Needell

    Abstract: Develo** large-scale distributed methods that are robust to the presence of adversarial or corrupted workers is an important part of making such methods practical for real-world problems. In this paper, we propose an iterative approach that is adversary-tolerant for convex optimization problems. By leveraging simple statistics, our method ensures convergence and is capable of adapting to adversa… ▽ More

    Submitted 13 March, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

    MSC Class: 65F20; 65F10; 65K10

  17. arXiv:2302.10755  [pdf, other

    cs.LG cs.IT math.NA

    Federated Gradient Matching Pursuit

    Authors: Halyun Jeong, Deanna Needell, **g Qin

    Abstract: Traditional machine learning techniques require centralizing all training data on one server or data hub. Due to the development of communication technologies and a huge amount of decentralized data on many clients, collaborative machine learning has become the main interest while providing privacy-preserving frameworks. In particular, federated learning (FL) provides such a solution to learn a sh… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: Submitted to a journal

    MSC Class: 65-xx; 68W15; 68W20

  18. arXiv:2212.12606  [pdf, other

    cs.LG eess.SP math.NA stat.ML

    A Convergence Rate for Manifold Neural Networks

    Authors: Joyce Chew, Deanna Needell, Michael Perlmutter

    Abstract: High-dimensional data arises in numerous applications, and the rapidly develo** field of geometric deep learning seeks to develop neural network architectures to analyze such data in non-Euclidean domains, such as graphs and manifolds. Recent work by Z. Wang, L. Ruiz, and A. Ribeiro has introduced a method for constructing manifold neural networks using the spectral decomposition of the Laplace… ▽ More

    Submitted 20 July, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

  19. arXiv:2212.03962  [pdf, other

    math.NA

    Multi-Randomized Kaczmarz for Latent Class Regression

    Authors: Erin George, Yotam Yaniv, Deanna Needell

    Abstract: Linear regression is effective at identifying interpretable trends in a data set, but averages out potentially different effects on subgroups within data. We propose an iterative algorithm based on the randomized Kaczmarz (RK) method to automatically identify subgroups in data and perform linear regression on these groups simultaneously. We prove almost sure convergence for this method, as well as… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  20. arXiv:2211.06391  [pdf, other

    math.NA

    Online Signal Recovery via Heavy Ball Kaczmarz

    Authors: Benjamin Jarman, Yotam Yaniv, Deanna Needell

    Abstract: Recovering a signal $x^\ast \in \mathbb{R}^n$ from a sequence of linear measurements is an important problem in areas such as computerized tomography and compressed sensing. In this work, we consider an online setting in which measurements are sampled one-by-one from some source distribution. We propose solving this problem with a variant of the Kaczmarz method with an additional heavy ball moment… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 6 pages

  21. Matrix Completion with Cross-Concentrated Sampling: Bridging Uniform Sampling and CUR Sampling

    Authors: HanQin Cai, Longxiu Huang, Pengyu Li, Deanna Needell

    Abstract: While uniform sampling has been widely studied in the matrix completion literature, CUR sampling approximates a low-rank matrix via row and column samples. Unfortunately, both sampling models lack flexibility for various circumstances in real-world applications. In this work, we propose a novel and easy-to-implement sampling strategy, coined Cross-Concentrated Sampling (CCS). By bridging uniform s… ▽ More

    Submitted 21 March, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

  22. arXiv:2208.08561  [pdf, other

    stat.ML cs.LG math.SP

    Geometric Scattering on Measure Spaces

    Authors: Joyce Chew, Matthew Hirn, Smita Krishnaswamy, Deanna Needell, Michael Perlmutter, Holly Steach, Siddharth Viswanath, Hau-Tieng Wu

    Abstract: The scattering transform is a multilayered, wavelet-based transform initially introduced as a model of convolutional neural networks (CNNs) that has played a foundational role in our understanding of these networks' stability and invariance properties. Subsequently, there has been widespread interest in extending the success of CNNs to data sets with non-Euclidean structure, such as graphs and man… ▽ More

    Submitted 13 October, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

    MSC Class: 68T07

  23. arXiv:2207.08171  [pdf, other

    cs.LG math.OC

    SP2: A Second Order Stochastic Polyak Method

    Authors: Shuang Li, William J. Swartworth, Martin Takáč, Deanna Needell, Robert M. Gower

    Abstract: Recently the "SP" (Stochastic Polyak step size) method has emerged as a competitive adaptive method for setting the step sizes of SGD. SP can be interpreted as a method specialized to interpolated models, since it solves the interpolation equations. SP solves these equation by using local linearizations of the model. We take a step further and develop a method for solving the interpolation equatio… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

  24. On Block Accelerations of Quantile Randomized Kaczmarz for Corrupted Systems of Linear Equations

    Authors: Lu Cheng, Benjamin Jarman, Deanna Needell, Elizaveta Rebrova

    Abstract: With the growth of large data as well as large-scale learning tasks, the need for efficient and robust linear system solvers is greater than ever. The randomized Kaczmarz method (RK) and similar stochastic iterative methods have received considerable recent attention due to their efficient implementation and memory footprint. These methods can tolerate streaming data, accessing only part of the da… ▽ More

    Submitted 21 December, 2022; v1 submitted 25 June, 2022; originally announced June 2022.

  25. arXiv:2206.10078  [pdf, other

    cs.LG eess.SP math.NA stat.ML

    The Manifold Scattering Transform for High-Dimensional Point Cloud Data

    Authors: Joyce Chew, Holly R. Steach, Siddharth Viswanath, Hau-Tieng Wu, Matthew Hirn, Deanna Needell, Smita Krishnaswamy, Michael Perlmutter

    Abstract: The manifold scattering transform is a deep feature extractor for data defined on a Riemannian manifold. It is one of the first examples of extending convolutional neural network-like operators to general manifolds. The initial work on this model focused primarily on its theoretical stability and invariance properties but did not provide methods for its numerical implementation except in the case… ▽ More

    Submitted 21 January, 2024; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in the TAG in DS Workshop at ICML. For subsequent theoretical guarantees, please see Section 6 of arXiv:2208.08561

    MSC Class: 68T07 ACM Class: I.2.6

  26. arXiv:2204.03782  [pdf, ps, other

    cs.DS math.NA

    Testing Positive Semidefiniteness Using Linear Measurements

    Authors: Deanna Needell, William Swartworth, David P. Woodruff

    Abstract: We study the problem of testing whether a symmetric $d \times d$ input matrix $A$ is symmetric positive semidefinite (PSD), or is $ε$-far from the PSD cone, meaning that $λ_{\min}(A) \leq - ε\|A\|_p$, where $\|A\|_p$ is the Schatten-$p$ norm of $A$. In applications one often needs to quickly tell if an input matrix is PSD, and a small distance from the PSD cone may be tolerable. We consider two we… ▽ More

    Submitted 25 October, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    ACM Class: F.2.1

  27. arXiv:2203.03551  [pdf, other

    cs.IR cs.LG math.NA

    Semi-supervised Nonnegative Matrix Factorization for Document Classification

    Authors: Jamie Haddock, Lara Kassab, Sixian Li, Alona Kryshchenko, Rachel Grotheer, Elena Sizikova, Chuntian Wang, Thomas Merkh, RWMA Madushani, Miju Ahn, Deanna Needell, Kathryn Leonard

    Abstract: We propose new semi-supervised nonnegative matrix factorization (SSNMF) models for document classification and provide motivation for these models as maximum likelihood estimators. The proposed SSNMF models simultaneously provide both a topic model and a model for classification, thereby offering highly interpretable classification results. We derive training methods using multiplicative updates f… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2010.07956

  28. arXiv:2203.00095  [pdf, ps, other

    cs.LG math.NA

    Distributed randomized Kaczmarz for the adversarial workers

    Authors: Xia Li, Longxiu Huang, Deanna Needell

    Abstract: Develo** large-scale distributed methods that are robust to the presence of adversarial or corrupted workers is an important part of making such methods practical for real-world problems. Here, we propose an iterative approach that is adversary-tolerant for least-squares problems. The algorithm utilizes simple statistics to guarantee convergence and is capable of learning the adversarial distrib… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

  29. arXiv:2110.04703  [pdf, other

    math.NA

    Selectable Set Randomized Kaczmarz

    Authors: Yotam Yaniv, Jacob D. Moorman, William Swartworth, Thomas Tu, Daji Landis, Deanna Needell

    Abstract: The Randomized Kaczmarz method (RK) is a stochastic iterative method for solving linear systems that has recently grown in popularity due to its speed and low memory requirement. Selectable Set Randomized Kaczmarz (SSRK) is an variant of RK that leverages existing information about the Kaczmarz iterate to identify an adaptive "selectable set" and thus yields an improved convergence guarantee. In t… ▽ More

    Submitted 2 February, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

  30. arXiv:2109.14079  [pdf, other

    cs.IT math.NA stat.CO

    Robust recovery of bandlimited graph signals via randomized dynamical sampling

    Authors: Longxiu Huang, Deanna Needell, Sui Tang

    Abstract: Heat diffusion processes have found wide applications in modelling dynamical systems over graphs. In this paper, we consider the recovery of a $k$-bandlimited graph signal that is an initial signal of a heat diffusion process from its space-time samples. We propose three random space-time sampling regimes, termed dynamical sampling techniques, that consist in selecting a small subset of space-time… ▽ More

    Submitted 3 October, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: corrected mistakes in plotting. arXiv admin note: text overlap with arXiv:1511.05118 by other authors

    MSC Class: 94A20; 94A12

  31. arXiv:2109.10454  [pdf, other

    math.NA

    Modewise Operators, the Tensor Restricted Isometry Property, and Low-Rank Tensor Recovery

    Authors: Mark A. Iwen, Deanna Needell, Michael Perlmutter, Elizaveta Rebrova

    Abstract: Recovery of sparse vectors and low-rank matrices from a small number of linear measurements is well-known to be possible under various model assumptions on the measurements. The key requirement on the measurement matrices is typically the restricted isometry property, that is, approximate orthonormality when acting on the subspace to be recovered. Among the most widely used random matrix measureme… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    MSC Class: 15B52; 15A69; 15A83; 97N40

  32. arXiv:2108.10448  [pdf, other

    cs.LG cs.CV eess.IV math.OC

    Fast Robust Tensor Principal Component Analysis via Fiber CUR Decomposition

    Authors: HanQin Cai, Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: We study the problem of tensor robust principal component analysis (TRPCA), which aims to separate an underlying low-multilinear-rank tensor and a sparse outlier tensor from their sum. In this work, we propose a fast non-convex algorithm, coined Robust Tensor CUR (RTCUR), for large-scale TRPCA problems. RTCUR considers a framework of alternating projections and utilizes the recently developed tens… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: Accepted to Workshop on Robust Subspace Learning and Applications in Computer Vision, International Conference on Computer Vision (ICCV) 2021

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, pages 189-197, 2021

  33. arXiv:2108.02304  [pdf, other

    math.NA

    QuantileRK: Solving Large-Scale Linear Systems with Corrupted, Noisy Data

    Authors: Benjamin Jarman, Deanna Needell

    Abstract: Measurement data in linear systems arising from real-world applications often suffers from both large, sparse corruptions, and widespread small-scale noise. This can render many popular solvers ineffective, as the least squares solution is far from the desired solution, and the underlying consistent system becomes harder to identify and solve. QuantileRK is a member of the Kaczmarz family of itera… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    MSC Class: 65F10; 68W20

  34. arXiv:2107.09188  [pdf, other

    physics.soc-ph cs.CG math.AT q-bio.PE

    Analysis of Spatial and Spatiotemporal Anomalies Using Persistent Homology: Case Studies with COVID-19 Data

    Authors: Abigail Hickok, Deanna Needell, Mason A. Porter

    Abstract: We develop a method for analyzing spatial and spatiotemporal anomalies in geospatial data using topological data analysis (TDA). To do this, we use persistent homology (PH), which allows one to algorithmically detect geometric voids in a data set and quantify the persistence of such voids. We construct an efficient filtered simplicial complex (FSC) such that the voids in our FSC are in one-to-one… ▽ More

    Submitted 24 February, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: revised version

    MSC Class: 55N31; 68T09; 92D30

  35. arXiv:2103.11037  [pdf, other

    math.NA cs.IT cs.LG eess.IV

    Mode-wise Tensor Decompositions: Multi-dimensional Generalizations of CUR Decompositions

    Authors: HanQin Cai, Keaton Hamm, Longxiu Huang, Deanna Needell

    Abstract: Low rank tensor approximation is a fundamental tool in modern machine learning and data science. In this paper, we study the characterization, perturbation analysis, and an efficient sampling strategy for two primary tensor CUR approximations, namely Chidori and Fiber CUR. We characterize exact tensor CUR decompositions for low multilinear rank tensors. We also present theoretical error bounds of… ▽ More

    Submitted 25 June, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Journal ref: The Journal of Machine Learning Research 22.185 (2021): 1-36

  36. arXiv:2010.07956  [pdf, other

    cs.LG math.OC

    Semi-supervised NMF Models for Topic Modeling in Learning Tasks

    Authors: Jamie Haddock, Lara Kassab, Sixian Li, Alona Kryshchenko, Rachel Grotheer, Elena Sizikova, Chuntian Wang, Thomas Merkh, R. W. M. A. Madushani, Miju Ahn, Deanna Needell, Kathryn Leonard

    Abstract: We propose several new models for semi-supervised nonnegative matrix factorization (SSNMF) and provide motivation for SSNMF models as maximum likelihood estimators given specific distributions of uncertainty. We present multiplicative updates training methods for each new model, and demonstrate the application of these models to classification, although they are flexible to other supervised learni… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 4 figures, 12 tables

  37. arXiv:2010.01600  [pdf, other

    cs.IR cs.SI math.NA

    Sparseness-constrained Nonnegative Tensor Factorization for Detecting Topics at Different Time Scales

    Authors: Lara Kassab, Alona Kryshchenko, Hanbaek Lyu, Denali Molitor, Deanna Needell, Elizaveta Rebrova, Jiahong Yuan

    Abstract: Temporal data (such as news articles or Twitter feeds) often consists of a mixture of long-lasting trends and popular but short-lasting topics of interest. A truly successful topic modeling strategy should be able to detect both types of topics and clearly locate them in time. In this paper, we first show that nonnegative CANDECOMP/PARAFAC decomposition (NCPD) is able to discover topics of variabl… ▽ More

    Submitted 31 August, 2023; v1 submitted 4 October, 2020; originally announced October 2020.

  38. arXiv:2009.08089  [pdf, other

    math.NA

    Quantile-based Iterative Methods for Corrupted Systems of Linear Equations

    Authors: Jamie Haddock, Deanna Needell, Elizaveta Rebrova, William Swartworth

    Abstract: Often in applications ranging from medical imaging and sensor networks to error correction and data science (and beyond), one needs to solve large-scale linear systems in which a fraction of the measurements have been corrupted. We consider solving such large-scale systems of linear equations $\mathbf{A}\mathbf{x}=\mathbf{b}$ that are inconsistent due to corruptions in the measurement vector… ▽ More

    Submitted 7 July, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    MSC Class: 65F10; 68W20; 60B20

  39. arXiv:2009.07612  [pdf, other

    stat.ML cs.LG math.OC

    Online nonnegative CP-dictionary learning for Markovian data

    Authors: Hanbaek Lyu, Christopher Strohmeier, Deanna Needell

    Abstract: Online Tensor Factorization (OTF) is a fundamental tool in learning low-dimensional interpretable features from streaming multi-modal data. While various algorithmic and theoretical aspects of OTF have been investigated recently, a general convergence guarantee to stationary points of the objective function without any incoherence or sparsity assumptions is still lacking even for the i.i.d. case.… ▽ More

    Submitted 2 April, 2022; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: 41 pages, 5 figures

  40. arXiv:2007.15776  [pdf, other

    stat.ML cs.IT cs.LG math.PR

    Random Vector Functional Link Networks for Function Approximation on Manifolds

    Authors: Deanna Needell, Aaron A. Nelson, Rayan Saab, Palina Salanevich, Olov Schavemaker

    Abstract: The learning speed of feed-forward neural networks is notoriously slow and has presented a bottleneck in deep learning applications for several decades. For instance, gradient-based learning algorithms, which are used extensively to train neural networks, tend to work slowly when all of the network parameters must be iteratively tuned. To counter this, both researchers and practitioners have tried… ▽ More

    Submitted 28 March, 2024; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: 37 pages, 1 figure

    MSC Class: 62M45

  41. arXiv:2004.09112  [pdf, other

    cs.LG math.OC stat.ML

    COVID-19 Time-series Prediction by Joint Dictionary Learning and Online NMF

    Authors: Hanbaek Lyu, Christopher Strohmeier, Georg Menz, Deanna Needell

    Abstract: Predicting the spread and containment of COVID-19 is a challenge of utmost importance that the broader scientific community is currently facing. One of the main sources of difficulty is that a very limited amount of daily COVID-19 case data is available, and with few exceptions, the majority of countries are currently in the "exponential spread stage," and thus there is scarce information availabl… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: 8 pages, 4 figures

  42. arXiv:2003.09062  [pdf, other

    math.NA

    Tensor Completion through Total Variationwith Initialization from Weighted HOSVD

    Authors: Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: In our paper, we have studied the tensor completion problem when the sampling pattern is deterministic. We first propose a simple but efficient weighted HOSVD algorithm for recovery from noisy observations. Then we use the weighted HOSVD result as an initialization for the total variation. We have proved the accuracy of the weighted HOSVD algorithm from theoretical and numerical perspectives. In t… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: 8 pages, 6 figures, ITA 2020

  43. arXiv:2003.08537  [pdf, other

    math.NA cs.IT

    HOSVD-Based Algorithm for Weighted Tensor Completion

    Authors: Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: Matrix completion, the problem of completing missing entries in a data matrix with low dimensional structure (such as rank), has seen many fruitful approaches and analyses. Tensor completion is the tensor analog, that attempts to impute missing tensor entries from similar low-rank type assumptions. In this paper, we study the tensor completion problem when the sampling pattern is deterministic and… ▽ More

    Submitted 6 July, 2021; v1 submitted 18 March, 2020; originally announced March 2020.

    MSC Class: 15A69; 15A83; 65F30; 68P99; 68W20; 65F99

    Journal ref: journal of imaging, 2021

  44. arXiv:2002.04126  [pdf, other

    math.NA

    Randomized Kaczmarz with Averaging

    Authors: Jacob D. Moorman, Thomas K. Tu, Denali Molitor, Deanna Needell

    Abstract: The randomized Kaczmarz (RK) method is an iterative method for approximating the least-squares solution of large linear systems of equations. The standard RK method uses sequential updates, making parallel computation difficult. Here, we study a parallel version of RK where a weighted average of independent updates is used. We analyze the convergence of RK with averaging and demonstrate its perfor… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 19 pages, 9 figures

    MSC Class: 15A06; 15B52; 65F10; 65F20; 65Y20; 68Q25; 68W10; 68W20; 68W40

  45. arXiv:2002.02041  [pdf, other

    math.NA

    An Adaptation for Iterative Structured Matrix Completion

    Authors: Henry Adams, Lara Kassab, Deanna Needell

    Abstract: The task of predicting missing entries of a matrix, from a subset of known entries, is known as \textit{matrix completion}. In today's data-driven world, data completion is essential whether it is the main goal or a pre-processing step. Structured matrix completion includes any setting in which data is not missing uniformly at random. In recent work, a modification to the standard nuclear norm min… ▽ More

    Submitted 14 May, 2021; v1 submitted 5 February, 2020; originally announced February 2020.

    MSC Class: 15A83; 65F55 (Primary); 65F50 (Secondary)

  46. arXiv:1912.08294  [pdf, other

    math.NA stat.ML

    Lower Memory Oblivious (Tensor) Subspace Embeddings with Fewer Random Bits: Modewise Methods for Least Squares

    Authors: M. A. Iwen, D. Needell, E. Rebrova, A. Zare

    Abstract: In this paper new general modewise Johnson-Lindenstrauss (JL) subspace embeddings are proposed that are both considerably faster to generate and easier to store than traditional JL embeddings when working with extremely large vectors and/or tensors. Corresponding embedding results are then proven for two different types of low-dimensional (tensor) subspaces. The first of these new subspace embed… ▽ More

    Submitted 16 December, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

  47. arXiv:1912.00771  [pdf, other

    math.NA math.PR

    Sketching for Motzkin's Iterative Method for Linear Systems

    Authors: Elizaveta Rebrova, Deanna Needell

    Abstract: Projection-based iterative methods for solving large over-determined linear systems are well-known for their simplicity and computational efficiency. It is also known that the correct choice of a sketching procedure (i.e., preprocessing steps that reduce the dimension of each iteration) can improve the performance of iterative methods in multiple ways, such as, to speed up the convergence of the m… ▽ More

    Submitted 28 November, 2019; originally announced December 2019.

  48. arXiv:1912.00315  [pdf, other

    cs.CL cs.LG math.OC stat.ML

    Topic-aware chatbot using Recurrent Neural Networks and Nonnegative Matrix Factorization

    Authors: Yuchen Guo, Nicholas Hanoian, Zhexiao Lin, Nicholas Liskij, Hanbaek Lyu, Deanna Needell, Jiahao Qu, Henry Sojico, Yuliang Wang, Zhe Xiong, Zhenhong Zou

    Abstract: We propose a novel model for a topic-aware chatbot by combining the traditional Recurrent Neural Network (RNN) encoder-decoder model with a topic attention layer based on Nonnegative Matrix Factorization (NMF). After learning topic vectors from an auxiliary text corpus via NMF, the decoder is trained so that it is more likely to sample response words from the most correlated topic vectors. One of… ▽ More

    Submitted 4 December, 2019; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: 14 pages, 1 figure, 2 tables

  49. arXiv:1911.01931  [pdf, other

    cs.LG cs.DS math.OC math.PR stat.ML

    Online matrix factorization for Markovian data and applications to Network Dictionary Learning

    Authors: Hanbaek Lyu, Deanna Needell, Laura Balzano

    Abstract: Online Matrix Factorization (OMF) is a fundamental tool for dictionary learning problems, giving an approximate representation of complex data sets in terms of a reduced number of extracted features. Convergence guarantees for most of the OMF algorithms in the literature assume independence between data matrices, and the case of dependent data streams remains largely unexplored. In this paper, we… ▽ More

    Submitted 7 November, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: 39 pages, 13 figures

    Journal ref: Journal of Machine Learning Research 21 (2020)

  50. arXiv:1910.13986  [pdf, other

    cs.IT math.ST

    Weighted matrix completion from non-random, non-uniform sampling patterns

    Authors: Simon Foucart, Deanna Needell, Reese Pathak, Yaniv Plan, Mary Wootters

    Abstract: We study the matrix completion problem when the observation pattern is deterministic and possibly non-uniform. We propose a simple and efficient debiased projection scheme for recovery from noisy observations and analyze the error under a suitable weighted metric. We introduce a simple function of the weight matrix and the sampling pattern that governs the accuracy of the recovered matrix. We deri… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: 41 pages, 4 figures