Showing 51–100 of 156 results for author: Needell, D

  1. arXiv:2107.09188  [pdf, other]

    physics.soc-ph cs.CG math.AT q-bio.PE

    Analysis of Spatial and Spatiotemporal Anomalies Using Persistent Homology: Case Studies with COVID-19 Data

    Authors: Abigail Hickok, Deanna Needell, Mason A. Porter

    Abstract: We develop a method for analyzing spatial and spatiotemporal anomalies in geospatial data using topological data analysis (TDA). To do this, we use persistent homology (PH), which allows one to algorithmically detect geometric voids in a data set and quantify the persistence of such voids. We construct an efficient filtered simplicial complex (FSC) such that the voids in our FSC are in one-to-one…

    Submitted 24 February, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: revised version

    MSC Class: 55N31; 68T09; 92D30

  2. arXiv:2105.10598  [pdf, other]

    cs.CV cs.AI cs.LG

    Embracing New Techniques in Deep Learning for Estimating Image Memorability

    Authors: Coen D. Needell, Wilma A. Bainbridge

    Abstract: Various work has suggested that the memorability of an image is consistent across people, and thus can be treated as an intrinsic property of an image. Using computer vision models, we can make specific predictions about what people will remember or forget. While older work has used now-outdated deep learning architectures to predict image memorability, innovations in the field have given us new t…

    Submitted 8 January, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: 27 pages, 15 figures, Presented at the Proceedings of the Vision Sciences Society 2021

    ACM Class: J.4; I.2.10

  3. arXiv:2105.09065  [pdf, other]

    stat.AP

    Statistical Learning for Best Practices in Tattoo Removal

    Authors: Richard Yim, Jamie Haddock, Deanna Needell

    Abstract: The causes behind complications in laser-assisted tattoo removal are currently not well understood, and in the literature relating to tattoo removal the emphasis on removal treatment is on removal technologies and tools, not best parameters involved in the treatment process. Additionally, the very challenge of determining best practices is difficult given the complexity of interactions between fac…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 15 pages, 2 figures, 9 tables

  4. arXiv:2104.14028  [pdf, other]

    cs.LG cs.CY

    Analysis of Legal Documents via Non-negative Matrix Factorization Methods

    Authors: Ryan Budahazy, Lu Cheng, Yihuan Huang, Andrew Johnson, Pengyu Li, Joshua Vendrow, Zhoutong Wu, Denali Molitor, Elizaveta Rebrova, Deanna Needell

    Abstract: The California Innocence Project (CIP), a clinical law school program aiming to free wrongfully convicted prisoners, evaluates thousands of mails containing new requests for assistance and corresponding case files. Processing and interpreting this large amount of information presents a significant challenge for CIP officials, which can be successfully aided by topic modeling techniques. In this pap…

    Submitted 6 November, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: 16 pages, 4 figures

  5. arXiv:2103.11037  [pdf, other]

    math.NA cs.IT cs.LG eess.IV

    Mode-wise Tensor Decompositions: Multi-dimensional Generalizations of CUR Decompositions

    Authors: HanQin Cai, Keaton Hamm, Longxiu Huang, Deanna Needell

    Abstract: Low rank tensor approximation is a fundamental tool in modern machine learning and data science. In this paper, we study the characterization, perturbation analysis, and an efficient sampling strategy for two primary tensor CUR approximations, namely Chidori and Fiber CUR. We characterize exact tensor CUR decompositions for low multilinear rank tensors. We also present theoretical error bounds of…

    Submitted 25 June, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Journal ref: The Journal of Machine Learning Research 22.185 (2021): 1-36

  6. arXiv:2101.05231  [pdf, other]

    cs.CV cs.LG eess.IV

    Robust CUR Decomposition: Theory and Imaging Applications

    Authors: HanQin Cai, Keaton Hamm, Longxiu Huang, Deanna Needell

    Abstract: This paper considers the use of Robust PCA in a CUR decomposition framework and applications thereof. Our main algorithms produce a robust version of column-row factorizations of matrices $\mathbf{D}=\mathbf{L}+\mathbf{S}$ where $\mathbf{L}$ is low-rank and $\mathbf{S}$ contains sparse outliers. These methods yield interpretable factorizations at low computational cost, and provide new CUR decompo…

    Submitted 5 August, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

    MSC Class: 15A23; 65F30; 68P20; 68W20; 68W25; 68Q25

    Journal ref: SIAM Journal on Imaging Sciences 14.4 (2021): 1472-1503

  7. arXiv:2011.05384  [pdf, other]

    cs.LG

    Applications of Online Nonnegative Matrix Factorization to Image and Time-Series Data

    Authors: Hanbaek Lyu, Georg Menz, Deanna Needell, Christopher Strohmeier

    Abstract: Online nonnegative matrix factorization (ONMF) is a matrix factorization technique in the online setting where data are acquired in a streaming fashion and the matrix factors are updated each time. This enables factor analysis to be performed concurrently with the arrival of new data samples. In this article, we demonstrate how one can use online nonnegative matrix factorization algorithms to lear…

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: 9 pages, 8 figures

    Journal ref: 2020 Information Theory and Applications Workshop (ITA)

  8. arXiv:2010.11365  [pdf, other]

    cs.LG

    On a Guided Nonnegative Matrix Factorization

    Authors: Joshua Vendrow, Jamie Haddock, Elizaveta Rebrova, Deanna Needell

    Abstract: Fully unsupervised topic models have found fantastic success in document clustering and classification. However, these models often suffer from the tendency to learn less-than-meaningful or even redundant topics when the data is biased towards a set of features. For this reason, we propose an approach based upon the nonnegative matrix factorization (NMF) model, deemed \textit{Guided NMF}, that inc…

    Submitted 5 February, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: 6 pages, 6 tables

  9. arXiv:2010.07956  [pdf, other]

    cs.LG math.OC

    Semi-supervised NMF Models for Topic Modeling in Learning Tasks

    Authors: Jamie Haddock, Lara Kassab, Sixian Li, Alona Kryshchenko, Rachel Grotheer, Elena Sizikova, Chuntian Wang, Thomas Merkh, R. W. M. A. Madushani, Miju Ahn, Deanna Needell, Kathryn Leonard

    Abstract: We propose several new models for semi-supervised nonnegative matrix factorization (SSNMF) and provide motivation for SSNMF models as maximum likelihood estimators given specific distributions of uncertainty. We present multiplicative updates training methods for each new model, and demonstrate the application of these models to classification, although they are flexible to other supervised learni…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 4 figures, 12 tables

  10. arXiv:2010.01600  [pdf, other]

    cs.IR cs.SI math.NA

    Sparseness-constrained Nonnegative Tensor Factorization for Detecting Topics at Different Time Scales

    Authors: Lara Kassab, Alona Kryshchenko, Hanbaek Lyu, Denali Molitor, Deanna Needell, Elizaveta Rebrova, Jiahong Yuan

    Abstract: Temporal data (such as news articles or Twitter feeds) often consists of a mixture of long-lasting trends and popular but short-lasting topics of interest. A truly successful topic modeling strategy should be able to detect both types of topics and clearly locate them in time. In this paper, we first show that nonnegative CANDECOMP/PARAFAC decomposition (NCPD) is able to discover topics of variabl…

    Submitted 31 August, 2023; v1 submitted 4 October, 2020; originally announced October 2020.

  11. arXiv:2009.09087  [pdf, other]

    cs.CY cs.LG stat.ML

    Feature Selection on Lyme Disease Patient Survey Data

    Authors: Joshua Vendrow, Jamie Haddock, Deanna Needell, Lorraine Johnson

    Abstract: Lyme disease is a rapidly growing illness that remains poorly understood within the medical community. Critical questions about when and why patients respond to treatment or stay ill, what kinds of treatments are effective, and even how to properly diagnose the disease remain largely unanswered. We investigate these questions by applying machine learning techniques to a large scale Lyme disease pa…

    Submitted 24 August, 2020; originally announced September 2020.

    Comments: 9 pages, 8 figures, 6 tables

  12. arXiv:2009.09074  [pdf, other]

    cs.DL cs.IR cs.LG stat.ML

    COVID-19 Literature Topic-Based Search via Hierarchical NMF

    Authors: Rachel Grotheer, Yihuan Huang, Pengyu Li, Elizaveta Rebrova, Deanna Needell, Longxiu Huang, Alona Kryshchenko, Xia Li, Kyung Ha, Oleksandr Kryshchenko

    Abstract: A dataset of COVID-19-related scientific literature is compiled, combining the articles from several online libraries and selecting those with open access and full text available. Then, hierarchical nonnegative matrix factorization is used to organize literature related to the novel coronavirus into a tree structure that allows researchers to search for relevant literature based on detected topics…

    Submitted 7 September, 2020; originally announced September 2020.

  13. arXiv:2009.08089  [pdf, other]

    math.NA

    Quantile-based Iterative Methods for Corrupted Systems of Linear Equations

    Authors: Jamie Haddock, Deanna Needell, Elizaveta Rebrova, William Swartworth

    Abstract: Often in applications ranging from medical imaging and sensor networks to error correction and data science (and beyond), one needs to solve large-scale linear systems in which a fraction of the measurements have been corrupted. We consider solving such large-scale systems of linear equations $\mathbf{A}\mathbf{x}=\mathbf{b}$ that are inconsistent due to corruptions in the measurement vector…

    Submitted 7 July, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    MSC Class: 65F10; 68W20; 60B20

  14. arXiv:2009.07612  [pdf, other]

    stat.ML cs.LG math.OC

    Online nonnegative CP-dictionary learning for Markovian data

    Authors: Hanbaek Lyu, Christopher Strohmeier, Deanna Needell

    Abstract: Online Tensor Factorization (OTF) is a fundamental tool in learning low-dimensional interpretable features from streaming multi-modal data. While various algorithmic and theoretical aspects of OTF have been investigated recently, a general convergence guarantee to stationary points of the objective function without any incoherence or sparsity assumptions is still lacking even for the i.i.d. case.…

    Submitted 2 April, 2022; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: 41 pages, 5 figures

  15. arXiv:2009.01279  [pdf, other]

    stat.ML cs.LG eess.SP

    Clustering of Nonnegative Data and an Application to Matrix Completion

    Authors: C. Strohmeier, D. Needell

    Abstract: In this paper, we propose a simple algorithm to cluster nonnegative data lying in disjoint subspaces. We analyze its performance in relation to a certain measure of correlation between said subspaces. We use our clustering algorithm to develop a matrix completion algorithm which can outperform standard matrix completion algorithms on data matrices satisfying certain natural conditions.

    Submitted 2 September, 2020; originally announced September 2020.

  16. arXiv:2007.15776  [pdf, other]

    stat.ML cs.IT cs.LG math.PR

    Random Vector Functional Link Networks for Function Approximation on Manifolds

    Authors: Deanna Needell, Aaron A. Nelson, Rayan Saab, Palina Salanevich, Olov Schavemaker

    Abstract: The learning speed of feed-forward neural networks is notoriously slow and has presented a bottleneck in deep learning applications for several decades. For instance, gradient-based learning algorithms, which are used extensively to train neural networks, tend to work slowly when all of the network parameters must be iteratively tuned. To counter this, both researchers and practitioners have tried…

    Submitted 28 March, 2024; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: 37 pages, 1 figure

    MSC Class: 62M45

  17. arXiv:2004.09112  [pdf, other]

    cs.LG math.OC stat.ML

    COVID-19 Time-series Prediction by Joint Dictionary Learning and Online NMF

    Authors: Hanbaek Lyu, Christopher Strohmeier, Georg Menz, Deanna Needell

    Abstract: Predicting the spread and containment of COVID-19 is a challenge of utmost importance that the broader scientific community is currently facing. One of the main sources of difficulty is that a very limited amount of daily COVID-19 case data is available, and with few exceptions, the majority of countries are currently in the "exponential spread stage," and thus there is scarce information availabl…

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: 8 pages, 4 figures

  18. arXiv:2003.09062  [pdf, other]

    math.NA

    Tensor Completion through Total Variation with Initialization from Weighted HOSVD

    Authors: Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: In our paper, we have studied the tensor completion problem when the sampling pattern is deterministic. We first propose a simple but efficient weighted HOSVD algorithm for recovery from noisy observations. Then we use the weighted HOSVD result as an initialization for the total variation. We have proved the accuracy of the weighted HOSVD algorithm from theoretical and numerical perspectives. In t…

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: 8 pages, 6 figures, ITA 2020

  19. arXiv:2003.08537  [pdf, other]

    math.NA cs.IT

    HOSVD-Based Algorithm for Weighted Tensor Completion

    Authors: Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: Matrix completion, the problem of completing missing entries in a data matrix with low dimensional structure (such as rank), has seen many fruitful approaches and analyses. Tensor completion is the tensor analog, that attempts to impute missing tensor entries from similar low-rank type assumptions. In this paper, we study the tensor completion problem when the sampling pattern is deterministic and…

    Submitted 6 July, 2021; v1 submitted 18 March, 2020; originally announced March 2020.

    MSC Class: 15A69; 15A83; 65F30; 68P99; 68W20; 65F99

    Journal ref: Journal of Imaging, 2021

  20. arXiv:2002.04126  [pdf, other]

    math.NA

    Randomized Kaczmarz with Averaging

    Authors: Jacob D. Moorman, Thomas K. Tu, Denali Molitor, Deanna Needell

    Abstract: The randomized Kaczmarz (RK) method is an iterative method for approximating the least-squares solution of large linear systems of equations. The standard RK method uses sequential updates, making parallel computation difficult. Here, we study a parallel version of RK where a weighted average of independent updates is used. We analyze the convergence of RK with averaging and demonstrate its perfor…

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 19 pages, 9 figures

    MSC Class: 15A06; 15B52; 65F10; 65F20; 65Y20; 68Q25; 68W10; 68W20; 68W40

  21. arXiv:2002.02041  [pdf, other]

    math.NA

    An Adaptation for Iterative Structured Matrix Completion

    Authors: Henry Adams, Lara Kassab, Deanna Needell

    Abstract: The task of predicting missing entries of a matrix, from a subset of known entries, is known as \textit{matrix completion}. In today's data-driven world, data completion is essential whether it is the main goal or a pre-processing step. Structured matrix completion includes any setting in which data is not missing uniformly at random. In recent work, a modification to the standard nuclear norm min…

    Submitted 14 May, 2021; v1 submitted 5 February, 2020; originally announced February 2020.

    MSC Class: 15A83; 65F55 (Primary); 65F50 (Secondary)

  22. arXiv:2001.00631  [pdf, other]

    cs.LG stat.ML

    On Large-Scale Dynamic Topic Modeling with Nonnegative CP Tensor Decomposition

    Authors: Miju Ahn, Nicole Eikmeier, Jamie Haddock, Lara Kassab, Alona Kryshchenko, Kathryn Leonard, Deanna Needell, R. W. M. A. Madushani, Elena Sizikova, Chuntian Wang

    Abstract: There is currently an unprecedented demand for large-scale temporal data analysis due to the explosive growth of data. Dynamic topic modeling has been widely used in social and data sciences with the goal of learning latent topics that emerge, evolve, and fade over time. Previous work on dynamic topic modeling primarily employs the method of nonnegative matrix factorization (NMF), where slices of t…

    Submitted 14 October, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: 23 pages, 29 figures, submitted to Women in Data Science and Mathematics (WiSDM) Workshop Proceedings, "Advances in Data Science", AWM-Springer series

  23. arXiv:1912.08294  [pdf, other]

    math.NA stat.ML

    Lower Memory Oblivious (Tensor) Subspace Embeddings with Fewer Random Bits: Modewise Methods for Least Squares

    Authors: M. A. Iwen, D. Needell, E. Rebrova, A. Zare

    Abstract: In this paper new general modewise Johnson-Lindenstrauss (JL) subspace embeddings are proposed that are both considerably faster to generate and easier to store than traditional JL embeddings when working with extremely large vectors and/or tensors. Corresponding embedding results are then proven for two different types of low-dimensional (tensor) subspaces. The first of these new subspace embed…

    Submitted 16 December, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

  24. arXiv:1912.00771  [pdf, other]

    math.NA math.PR

    Sketching for Motzkin's Iterative Method for Linear Systems

    Authors: Elizaveta Rebrova, Deanna Needell

    Abstract: Projection-based iterative methods for solving large over-determined linear systems are well-known for their simplicity and computational efficiency. It is also known that the correct choice of a sketching procedure (i.e., preprocessing steps that reduce the dimension of each iteration) can improve the performance of iterative methods in multiple ways, such as, to speed up the convergence of the m…

    Submitted 28 November, 2019; originally announced December 2019.

  25. arXiv:1912.00315  [pdf, other]

    cs.CL cs.LG math.OC stat.ML

    Topic-aware chatbot using Recurrent Neural Networks and Nonnegative Matrix Factorization

    Authors: Yuchen Guo, Nicholas Hanoian, Zhexiao Lin, Nicholas Liskij, Hanbaek Lyu, Deanna Needell, Jiahao Qu, Henry Sojico, Yuliang Wang, Zhe Xiong, Zhenhong Zou

    Abstract: We propose a novel model for a topic-aware chatbot by combining the traditional Recurrent Neural Network (RNN) encoder-decoder model with a topic attention layer based on Nonnegative Matrix Factorization (NMF). After learning topic vectors from an auxiliary text corpus via NMF, the decoder is trained so that it is more likely to sample response words from the most correlated topic vectors. One of…

    Submitted 4 December, 2019; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: 14 pages, 1 figure, 2 tables

  26. arXiv:1911.01931  [pdf, other]

    cs.LG cs.DS math.OC math.PR stat.ML

    Online matrix factorization for Markovian data and applications to Network Dictionary Learning

    Authors: Hanbaek Lyu, Deanna Needell, Laura Balzano

    Abstract: Online Matrix Factorization (OMF) is a fundamental tool for dictionary learning problems, giving an approximate representation of complex data sets in terms of a reduced number of extracted features. Convergence guarantees for most of the OMF algorithms in the literature assume independence between data matrices, and the case of dependent data streams remains largely unexplored. In this paper, we…

    Submitted 7 November, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: 39 pages, 13 figures

    Journal ref: Journal of Machine Learning Research 21 (2020)

  27. arXiv:1910.13986  [pdf, other]

    cs.IT math.ST

    Weighted matrix completion from non-random, non-uniform sampling patterns

    Authors: Simon Foucart, Deanna Needell, Reese Pathak, Yaniv Plan, Mary Wootters

    Abstract: We study the matrix completion problem when the observation pattern is deterministic and possibly non-uniform. We propose a simple and efficient debiased projection scheme for recovery from noisy observations and analyze the error under a suitable weighted metric. We introduce a simple function of the weight matrix and the sampling pattern that governs the accuracy of the recovered matrix. We deri…

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: 41 pages, 4 figures

  28. arXiv:1909.10132  [pdf, other]

    math.NA

    Stochastic Iterative Hard Thresholding for Low-Tucker-Rank Tensor Recovery

    Authors: Rachel Grotheer, Shuang Li, Anna Ma, Deanna Needell, Jing Qin

    Abstract: Low-rank tensor recovery problems have been widely studied in many applications of signal processing and machine learning. Tucker decomposition is known as one of the most popular decompositions in the tensor framework. In recent years, researchers have developed many state-of-the-art algorithms to address the problem of low-Tucker-rank tensor recovery. Motivated by the favorable properties of the…

    Submitted 16 July, 2020; v1 submitted 22 September, 2019; originally announced September 2019.

  29. arXiv:1909.03604  [pdf, other]

    math.NA

    Adaptive Sketch-and-Project Methods for Solving Linear Systems

    Authors: Robert Gower, Denali Molitor, Jacob Moorman, Deanna Needell

    Abstract: We present new adaptive sampling rules for the sketch-and-project method for solving linear systems. To deduce our new sampling rules, we first show how the progress of one step of the sketch-and-project method depends directly on a sketched residual. Based on this insight, we derive a 1) max-distance sampling rule, by sampling the sketch with the largest sketched residual 2) a proportional sampli…

    Submitted 8 September, 2019; originally announced September 2019.

    MSC Class: 15A06; 15B52; 65F10; 68W20; 65N75; 65Y20; 68Q25; 68W40; 90C20

  30. arXiv:1908.08479  [pdf, other]

    math.NA stat.ML

    Iterative Hard Thresholding for Low CP-rank Tensor Models

    Authors: Rachel Grotheer, Shuang Li, Anna Ma, Deanna Needell, Jing Qin

    Abstract: Recovery of low-rank matrices from a small number of linear measurements is now well-known to be possible under various model assumptions on the measurements. Such results demonstrate robustness and are backed with provable theoretical guarantees. However, extensions to tensor recovery have only recently begun to be studied and developed, despite an abundance of practical tensor applications. Rece…

    Submitted 22 August, 2019; originally announced August 2019.

  31. arXiv:1907.11746  [pdf, other]

    stat.ML cs.LG

    Bias of Homotopic Gradient Descent for the Hinge Loss

    Authors: Denali Molitor, Deanna Needell, Rachel Ward

    Abstract: Gradient descent is a simple and widely used optimization method for machine learning. For homogeneous linear classifiers applied to separable data, gradient descent has been shown to converge to the maximal margin (or equivalently, the minimal norm) solution for various smooth loss functions. The previous theory does not, however, apply to non-smooth functions such as the hinge loss which is wide…

    Submitted 26 July, 2019; originally announced July 2019.

  32. arXiv:1907.03028  [pdf, other]

    math.ST

    On Inferences from Completed Data

    Authors: Jamie Haddock, Denali Molitor, Deanna Needell, Sneha Sambandam, Joy Song, Simon Sun

    Abstract: Matrix completion has become an extremely important technique as data scientists are routinely faced with large, incomplete datasets on which they wish to perform statistical inferences. We investigate how error introduced via matrix completion affects statistical inference. Furthermore, we prove recovery error bounds which depend upon the matrix recovery error for several common statistical infer…

    Submitted 5 July, 2019; originally announced July 2019.

  33. arXiv:1905.13404  [pdf, other]

    cs.LG math.OC stat.ML

    Data-driven Algorithm Selection and Parameter Tuning: Two Case studies in Optimization and Signal Processing

    Authors: Jesus A. De Loera, Jamie Haddock, Anna Ma, Deanna Needell

    Abstract: Machine learning algorithms typically rely on optimization subroutines and are well-known to provide very effective outcomes for many types of problems. Here, we flip the reliance and ask the reverse question: can machine learning algorithms lead to more effective outcomes for optimization problems? Our goal is to train machine learning methods to automatically improve the performance of optimizat…

    Submitted 26 July, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

  34. arXiv:1905.08894  [pdf, other]

    math.PR math.NA

    On block Gaussian sketching for the Kaczmarz method

    Authors: Deanna Needell, Elizaveta Rebrova

    Abstract: The Kaczmarz algorithm is one of the most popular methods for solving large-scale over-determined linear systems due to its simplicity and computational efficiency. This method can be viewed as a special instance of a more general class of sketch and project methods. Recently, a block Gaussian version was proposed that uses a block Gaussian sketch, enjoying the regularization properties of Gaussia…

    Submitted 21 January, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

    MSC Class: 65F10; 68W20; 60B20

  35. arXiv:1904.08540  [pdf, other]

    cs.LG stat.ML

    Matrix Completion With Selective Sampling

    Authors: Christian Parkinson, Kevin Huynh, Deanna Needell

    Abstract: Matrix completion is a classical problem in data science wherein one attempts to reconstruct a low-rank matrix while only observing some subset of the entries. Previous authors have phrased this problem as a nuclear norm minimization problem. Almost all previous work assumes no explicit structure of the matrix and uses uniform sampling to decide the observed entries. We suggest methods for selecti…

    Submitted 17 April, 2019; originally announced April 2019.

    Comments: 4 pages, 4 figures

  36. arXiv:1902.02862  [pdf, other]

    math.CO math.MG math.NA math.NT

    Lattices from tight frames and vertex transitive graphs

    Authors: Lenny Fukshansky, Deanna Needell, Josiah Park, Yuxin Xin

    Abstract: We show that real tight frames that generate lattices must be rational, and use this observation to describe a construction of lattices from vertex transitive graphs. In the case of irreducible group frames, we show that the corresponding lattice is always strongly eutactic. This is the case for the more restrictive class of distance transitive graphs. We show that such lattices exist in arbitrari…

    Submitted 18 August, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: some corrections of typos made; updated bibliography

    MSC Class: 11H31; 52C17; 42C15; 05C50; 05C76

  37. arXiv:1809.03041  [pdf, other]

    stat.ML cs.LG

    An iterative method for classification of binary data

    Authors: Denali Molitor, Deanna Needell

    Abstract: In today's data driven world, storing, processing, and gleaning insights from large-scale data are major challenges. Data compression is often required in order to store large amounts of high-dimensional data, and thus, efficient inference methods for analyzing compressed data are necessary. Building on a recently designed simple framework for classification using binary data, we demonstrate that…

    Submitted 9 September, 2018; originally announced September 2018.

    MSC Class: 68T05; 68P30; 68U10

  38. arXiv:1808.04421  [pdf, other]

    math.GT math.QA

    Tribracket Modules

    Authors: Deanna Needell, Sam Nelson, Yingqi Shi

    Abstract: Niebrzydowski tribrackets are ternary operations on sets satisfying conditions obtained from the oriented Reidemeister moves such that the set of tribracket colorings of an oriented knot or link diagram is an invariant of oriented knots and links. We introduce tribracket modules analogous to quandle/biquandle/rack modules and use these structures to enhance the tribracket counting invariant. We pr…

    Submitted 26 August, 2019; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: 11 pages, v2 contains typo corrections and other small improvements

    MSC Class: 57M27; 57M25

  39. arXiv:1807.08825  [pdf, other]

    cs.LG stat.ML

    Hierarchical Classification using Binary Data

    Authors: Denali Molitor, Deanna Needell

    Abstract: In classification problems, especially those that categorize data into a large number of classes, the classes often naturally follow a hierarchical structure. That is, some classes are likely to share similar structures and features. Those characteristics can be captured by considering a hierarchical relationship among the class labels. Here, we extend a recent simple classification approach on bi…

    Submitted 23 July, 2018; originally announced July 2018.

    Comments: AAAI Magazine special Issue on Deep Models, Machine Learning and Artificial Intelligence Applications in National and International Security, June, 2018

  40. An Approximate Message Passing Framework for Side Information

    Authors: Anna Ma, You Zhou, Cynthia Rush, Dror Baron, Deanna Needell

    Abstract: Approximate message passing (AMP) methods have gained recent traction in sparse signal recovery. Additional information about the signal, or \emph{side information} (SI), is commonly available and can aid in efficient signal recovery. This work presents an AMP-based framework that exploits SI and can be readily implemented in various settings for which the SI results in separable distributions. To…

    Submitted 2 May, 2019; v1 submitted 12 July, 2018; originally announced July 2018.

  41. arXiv:1805.12529  [pdf, other]

    cs.LG stat.ML

    Analysis of Fast Structured Dictionary Learning

    Authors: Saiprasad Ravishankar, Anna Ma, Deanna Needell

    Abstract: Sparsity-based models and techniques have been exploited in many signal processing and imaging applications. Data-driven methods based on dictionary and sparsifying transform learning enable learning rich image features from data, and can outperform analytical models. In particular, alternating optimization algorithms have been popular for learning such models. In this work, we focus on alternatin…

    Submitted 23 September, 2019; v1 submitted 31 May, 2018; originally announced May 2018.

    Comments: This article has been accepted for publication in Information and Inference Published by Oxford University Press

  42. Randomized Projection Methods for Linear Systems with Arbitrarily Large Sparse Corruptions

    Authors: Jamie Haddock, Deanna Needell

    Abstract: In applications like medical imaging, error correction, and sensor networks, one needs to solve large-scale linear systems that may be corrupted by a small number of arbitrarily large corruptions. We consider solving such large-scale systems of linear equations $A\mathbf{x}=\mathbf{b}$ that are inconsistent due to corruptions in the measurement vector $\mathbf{b}$. With this as our motivating exam…

    Submitted 22 December, 2018; v1 submitted 21 March, 2018; originally announced March 2018.

    MSC Class: 65F10; 65F20; 65F22

  43. arXiv:1802.03126  [pdf, ps, other]

    math.NA

    On Motzkin's Method for Inconsistent Linear Systems

    Authors: Jamie Haddock, Deanna Needell

    Abstract: Iterative linear solvers have gained recent popularity due to their computational efficiency and low memory footprint for large-scale linear systems. The relaxation method, or Motzkin's method, can be viewed as an iterative method that projects the current estimation onto the solution hyperplane corresponding to the most violated constraint. Although this leads to an optimal selection strategy for…

    Submitted 26 October, 2018; v1 submitted 8 February, 2018; originally announced February 2018.

    MSC Class: 15A06; 65F10; 65F20; 65F22

  44. arXiv:1802.00518  [pdf, other]

    cs.LG

    Analysis of Fast Alternating Minimization for Structured Dictionary Learning

    Authors: Saiprasad Ravishankar, Anna Ma, Deanna Needell

    Abstract: Methods exploiting sparsity have been popular in imaging and signal processing applications including compression, denoising, and imaging inverse problems. Data-driven approaches such as dictionary learning and transform learning enable one to discover complex image features from datasets and provide promising performance over analytical models. Alternating minimization algorithms have been partic…

    Submitted 1 February, 2018; originally announced February 2018.

  45. arXiv:1801.10264  [pdf, other]

    cs.IT cs.DS eess.SP math.NA math.OC

    Compressed Anomaly Detection with Multiple Mixed Observations

    Authors: Natalie Durgin, Rachel Grotheer, Chenxi Huang, Shuang Li, Anna Ma, Deanna Needell, Jing Qin

    Abstract: We consider a collection of independent random variables that are identically distributed, except for a small subset which follows a different, anomalous distribution. We study the problem of detecting which random variables in the collection are governed by the anomalous distribution. Recent work proposes to solve this problem by conducting hypothesis tests based on mixed observations (e.g. linea…

    Submitted 19 June, 2018; v1 submitted 30 January, 2018; originally announced January 2018.

    Comments: 27 pages, 9 figures. Incorporates reviewer feedback, additional experiments, and additional figures

  46. arXiv:1801.09657  [pdf, other]

    math.NA cs.LG stat.ME

    Matrix Completion for Structured Observations

    Authors: Denali Molitor, Deanna Needell

    Abstract: The need to predict or fill-in missing data, often referred to as matrix completion, is a common challenge in today's data-driven world. Previous strategies typically assume that no structural difference between observed and missing entries exists. Unfortunately, this assumption is woefully unrealistic in many applications. For example, in the classic Netflix challenge, in which one hopes to predi…

    Submitted 29 January, 2018; originally announced January 2018.

  47. arXiv:1801.01526  [pdf, other]

    math.NA math.NT

    An algebraic perspective on integer sparse recovery

    Authors: Lenny Fukshansky, Deanna Needell, Benny Sudakov

    Abstract: Compressed sensing is a relatively new mathematical paradigm that shows a small number of linear measurements are enough to efficiently reconstruct a large dimensional signal under the assumption the signal is sparse. Applications for this technology are ubiquitous, ranging from wireless communications to medical imaging, and there is now a solid foundation of mathematical theory and algorithms to…

    Submitted 4 January, 2018; originally announced January 2018.

    MSC Class: 41A46; 68Q25; 68W20

  48. arXiv:1711.02743  [pdf, other]

    eess.SP cs.DS math.NA

    Sparse Randomized Kaczmarz for Support Recovery of Jointly Sparse Corrupted Multiple Measurement Vectors

    Authors: Natalie Durgin, Rachel Grotheer, Chenxi Huang, Shuang Li, Anna Ma, Deanna Needell, Jing Qin

    Abstract: While single measurement vector (SMV) models have been widely studied in signal processing, there is a surging interest in addressing the multiple measurement vectors (MMV) problem. In the MMV setting, more than one measurement vector is available and the multiple signals to be recovered share some commonalities such as a common support. Applications in which MMV is a naturally occurring phenomeno…

    Submitted 14 June, 2018; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: 13 pages, 6 figures

  49. arXiv:1711.01521  [pdf, other]

    math.OC cs.DS math.NA

    Stochastic Greedy Algorithms For Multiple Measurement Vectors

    Authors: Jing Qin, Shuang Li, Deanna Needell, Anna Ma, Rachel Grotheer, Chenxi Huang, Natalie Durgin

    Abstract: Sparse representation of a single measurement vector (SMV) has been explored in a variety of compressive sensing applications. Recently, SMV models have been extended to solve multiple measurement vectors (MMV) problems, where the underlying signal is assumed to have joint sparse structures. To circumvent the NP-hardness of the $\ell_0$ minimization problem, many deterministic MMV algorithms solve…

    Submitted 22 August, 2020; v1 submitted 4 November, 2017; originally announced November 2017.

    MSC Class: 68W20; 94A12; 47N10

  50. arXiv:1710.00034  [pdf, other]

    physics.app-ph physics.optics

    Micro-optical Tandem Luminescent Solar Concentrators

    Authors: David R. Needell, Ognjen Ilic, Colton R. Bukowsky, Zach Nett, Lu Xu, Junwen He, Haley Bauser, Benjamin G. Lee, John F. Geisz, Ralph G. Nuzzo, A. Paul Alivisatos, Harry A. Atwater

    Abstract: Traditional concentrating photovoltaic (CPV) systems utilize multijunction cells to minimize thermalization losses, but cannot efficiently capture diffuse sunlight, which contributes to a high levelized cost of energy (LCOE) and limits their use to geographical regions with high direct sunlight insolation. Luminescent solar concentrators (LSCs) harness light generated by luminophores embedded in a…

    Submitted 5 September, 2017; originally announced October 2017.