-
Tensor Network Space-Time Spectral Collocation Method for Solving the Nonlinear Convection Diffusion Equation
Authors:
Dibyendu Adak,
M. Engin Danis,
Duc P. Truong,
Kim Ø. Rasmussen,
Boian S. Alexandrov
Abstract:
Spectral methods provide highly accurate numerical solutions for partial differential equations, exhibiting exponential convergence with the number of spectral nodes. Traditionally, in addressing time-dependent nonlinear problems, attention has been on low-order finite difference schemes for time discretization and spectral element schemes for spatial variables. However, our recent developments ha…
▽ More
Spectral methods provide highly accurate numerical solutions for partial differential equations, exhibiting exponential convergence with the number of spectral nodes. Traditionally, in addressing time-dependent nonlinear problems, attention has been on low-order finite difference schemes for time discretization and spectral element schemes for spatial variables. However, our recent developments have resulted in the application of spectral methods to both space and time variables, preserving spectral convergence in both domains. Leveraging Tensor Train techniques, our approach tackles the curse of dimensionality inherent in space-time methods. Here, we extend this methodology to the nonlinear time-dependent convection-diffusion equation. Our discretization scheme exhibits a low-rank structure, facilitating translation to tensor-train (TT) format. Nevertheless, controlling the TT-rank across Newton's iterations, needed to deal with the nonlinearity, poses a challenge, leading us to devise the "Step Truncation TT-Newton" method. We demonstrate the exponential convergence of our methods through various benchmark examples. Importantly, our scheme offers significantly reduced memory requirement compared to the full-grid scheme.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Tensor-Train WENO Scheme for Compressible Flows
Authors:
Mustafa Engin Danis,
Duc Truong,
Ismael Boureima,
Oleg Korobkin,
Kim Rasmussen,
Boian Alexandrov
Abstract:
In this study, we introduce a tensor-train (TT) finite difference WENO method for solving compressible Euler equations. In a step-by-step manner, the tensorization of the governing equations is demonstrated. We also introduce \emph{LF-cross} and \emph{WENO-cross} methods to compute numerical fluxes and the WENO reconstruction using the cross interpolation technique. A tensor-train approach is deve…
▽ More
In this study, we introduce a tensor-train (TT) finite difference WENO method for solving compressible Euler equations. In a step-by-step manner, the tensorization of the governing equations is demonstrated. We also introduce \emph{LF-cross} and \emph{WENO-cross} methods to compute numerical fluxes and the WENO reconstruction using the cross interpolation technique. A tensor-train approach is developed for boundary condition types commonly encountered in Computational Fluid Dynamics (CFD). The performance of the proposed WENO-TT solver is investigated in a rich set of numerical experiments. We demonstrate that the WENO-TT method achieves the theoretical $\text{5}^{\text{th}}$-order accuracy of the classical WENO scheme in smooth problems while successfully capturing complicated shock structures. In an effort to avoid the growth of TT ranks, we propose a dynamic method to estimate the TT approximation error that governs the ranks and overall truncation error of the WENO-TT scheme. Finally, we show that the traditional WENO scheme can be accelerated up to 1000 times in the TT format, and the memory requirements can be significantly decreased for low-rank problems, demonstrating the potential of tensor-train approach for future CFD application. This paper is the first study that develops a finite difference WENO scheme using the tensor-train approach for compressible flows. It is also the first comprehensive work that provides a detailed perspective into the relationship between rank, truncation error, and the TT approximation error for compressible WENO solvers.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Cyber-Security Knowledge Graph Generation by Hierarchical Nonnegative Matrix Factorization
Authors:
Ryan Barron,
Maksim E. Eren,
Manish Bhattarai,
Selma Wanna,
Nicholas Solovyev,
Kim Rasmussen,
Boian S. Alexandrov,
Charles Nicholas,
Cynthia Matuszek
Abstract:
Much of human knowledge in cybersecurity is encapsulated within the ever-growing volume of scientific papers. As this textual data continues to expand, the importance of document organization methods becomes increasingly crucial for extracting actionable insights hidden within large text datasets. Knowledge Graphs (KGs) serve as a means to store factual information in a structured manner, providin…
▽ More
Much of human knowledge in cybersecurity is encapsulated within the ever-growing volume of scientific papers. As this textual data continues to expand, the importance of document organization methods becomes increasingly crucial for extracting actionable insights hidden within large text datasets. Knowledge Graphs (KGs) serve as a means to store factual information in a structured manner, providing explicit, interpretable knowledge that includes domain-specific information from the cybersecurity scientific literature. One of the challenges in constructing a KG from scientific literature is the extraction of ontology from unstructured text. In this paper, we address this topic and introduce a method for building a multi-modal KG by extracting structured ontology from scientific papers. We demonstrate this concept in the cybersecurity domain. One modality of the KG represents observable information from the papers, such as the categories in which they were published or the authors. The second modality uncovers latent (hidden) patterns of text extracted through hierarchical and semantic non-negative matrix factorization (NMF), such as named entities, topics or clusters, and keywords. We illustrate this concept by consolidating more than two million scientific papers uploaded to arXiv into the cyber-domain, using hierarchical and semantic NMF, and by building a cyber-domain-specific KG.
△ Less
Submitted 26 March, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
Catch'em all: Classification of Rare, Prominent, and Novel Malware Families
Authors:
Maksim E. Eren,
Ryan Barron,
Manish Bhattarai,
Selma Wanna,
Nicholas Solovyev,
Kim Rasmussen,
Boian S. Alexandrov,
Charles Nicholas
Abstract:
National security is threatened by malware, which remains one of the most dangerous and costly cyber threats. As of last year, researchers reported 1.3 billion known malware specimens, motivating the use of data-driven machine learning (ML) methods for analysis. However, shortcomings in existing ML approaches hinder their mass adoption. These challenges include detection of novel malware and the a…
▽ More
National security is threatened by malware, which remains one of the most dangerous and costly cyber threats. As of last year, researchers reported 1.3 billion known malware specimens, motivating the use of data-driven machine learning (ML) methods for analysis. However, shortcomings in existing ML approaches hinder their mass adoption. These challenges include detection of novel malware and the ability to perform malware classification in the face of class imbalance: a situation where malware families are not equally represented in the data. Our work addresses these shortcomings with MalwareDNA: an advanced dimensionality reduction and feature extraction framework. We demonstrate stable task performance under class imbalance for the following tasks: malware family classification and novel malware detection with a trade-off in increased abstention or reject-option rate.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Tensor Network Space-Time Spectral Collocation Method for Time Dependent Convection-Diffusion-Reaction Equations
Authors:
Dibyendu Adak,
Duc P. Truong,
Gianmarco Manzini,
Kim Ø. Rasmussen,
Boian S. Alexandrov
Abstract:
Emerging tensor network techniques for solutions of Partial Differential Equations (PDEs), known for their ability to break the curse of dimensionality, deliver new mathematical methods for ultrafast numerical solutions of high-dimensional problems. Here, we introduce a Tensor Train (TT) Chebyshev spectral collocation method, in both space and time, for solution of the time dependent convection-di…
▽ More
Emerging tensor network techniques for solutions of Partial Differential Equations (PDEs), known for their ability to break the curse of dimensionality, deliver new mathematical methods for ultrafast numerical solutions of high-dimensional problems. Here, we introduce a Tensor Train (TT) Chebyshev spectral collocation method, in both space and time, for solution of the time dependent convection-diffusion-reaction (CDR) equation with inhomogeneous boundary conditions, in Cartesian geometry. Previous methods for numerical solution of time dependent PDEs often use finite difference for time, and a spectral scheme for the spatial dimensions, which leads to slow linear convergence. Spectral collocation space-time methods show exponential convergence, however, for realistic problems they need to solve large four-dimensional systems. We overcome this difficulty by using a TT approach as its complexity only grows linearly with the number of dimensions. We show that our TT space-time Chebyshev spectral collocation method converges exponentially, when the solution of the CDR is smooth, and demonstrate that it leads to very high compression of linear operators from terabytes to kilobytes in TT-format, and tens of thousands times speedup when compared to full grid space-time spectral method. These advantages allow us to obtain the solutions at much higher resolutions.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Electrical Grid Anomaly Detection via Tensor Decomposition
Authors:
Alexander Most,
Maksim Eren,
Nigel Lawrence,
Boian Alexandrov
Abstract:
Supervisory Control and Data Acquisition (SCADA) systems often serve as the nervous system for substations within power grids. These systems facilitate real-time monitoring, data acquisition, control of equipment, and ensure smooth and efficient operation of the substation and its connected devices. Previous work has shown that dimensionality reduction-based approaches, such as Principal Component…
▽ More
Supervisory Control and Data Acquisition (SCADA) systems often serve as the nervous system for substations within power grids. These systems facilitate real-time monitoring, data acquisition, control of equipment, and ensure smooth and efficient operation of the substation and its connected devices. Previous work has shown that dimensionality reduction-based approaches, such as Principal Component Analysis (PCA), can be used for accurate identification of anomalies in SCADA systems. While not specifically applied to SCADA, non-negative matrix factorization (NMF) has shown strong results at detecting anomalies in wireless sensor networks. These unsupervised approaches model the normal or expected behavior and detect the unseen types of attacks or anomalies by identifying the events that deviate from the expected behavior. These approaches; however, do not model the complex and multi-dimensional interactions that are naturally present in SCADA systems. Differently, non-negative tensor decomposition is a powerful unsupervised machine learning (ML) method that can model the complex and multi-faceted activity details of SCADA events. In this work, we novelly apply the tensor decomposition method Canonical Polyadic Alternating Poisson Regression (CP-APR) with a probabilistic framework, which has previously shown state-of-the-art anomaly detection results on cyber network data, to identify anomalies in SCADA systems. We showcase that the use of statistical behavior analysis of SCADA communication with tensor decomposition improves the specificity and accuracy of identifying anomalies in electrical grid systems. In our experiments, we model real-world SCADA system data collected from the electrical grid operated by Los Alamos National Laboratory (LANL) which provides transmission and distribution service through a partnership with Los Alamos County, and detect synthetically generated anomalies.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
Interactive Distillation of Large Single-Topic Corpora of Scientific Papers
Authors:
Nicholas Solovyev,
Ryan Barron,
Manish Bhattarai,
Maksim E. Eren,
Kim O. Rasmussen,
Boian S. Alexandrov
Abstract:
Highly specific datasets of scientific literature are important for both research and education. However, it is difficult to build such datasets at scale. A common approach is to build these datasets reductively by applying topic modeling on an established corpus and selecting specific topics. A more robust but time-consuming approach is to build the dataset constructively in which a subject matte…
▽ More
Highly specific datasets of scientific literature are important for both research and education. However, it is difficult to build such datasets at scale. A common approach is to build these datasets reductively by applying topic modeling on an established corpus and selecting specific topics. A more robust but time-consuming approach is to build the dataset constructively in which a subject matter expert (SME) handpicks documents. This method does not scale and is prone to error as the dataset grows. Here we showcase a new tool, based on machine learning, for constructively generating targeted datasets of scientific literature. Given a small initial "core" corpus of papers, we build a citation network of documents. At each step of the citation network, we generate text embeddings and visualize the embeddings through dimensionality reduction. Papers are kept in the dataset if they are "similar" to the core or are otherwise pruned through human-in-the-loop selection. Additional insight into the papers is gained through sub-topic modeling using SeNMFk. We demonstrate our new tool for literature review by applying it to two different fields in machine learning.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Semi-supervised Classification of Malware Families Under Extreme Class Imbalance via Hierarchical Non-Negative Matrix Factorization with Automatic Model Selection
Authors:
Maksim E. Eren,
Manish Bhattarai,
Robert J. Joyce,
Edward Raff,
Charles Nicholas,
Boian S. Alexandrov
Abstract:
Identification of the family to which a malware specimen belongs is essential in understanding the behavior of the malware and develo** mitigation strategies. Solutions proposed by prior work, however, are often not practicable due to the lack of realistic evaluation factors. These factors include learning under class imbalance, the ability to identify new malware, and the cost of production-qua…
▽ More
Identification of the family to which a malware specimen belongs is essential in understanding the behavior of the malware and develo** mitigation strategies. Solutions proposed by prior work, however, are often not practicable due to the lack of realistic evaluation factors. These factors include learning under class imbalance, the ability to identify new malware, and the cost of production-quality labeled data. In practice, deployed models face prominent, rare, and new malware families. At the same time, obtaining a large quantity of up-to-date labeled malware for training a model can be expensive. In this paper, we address these problems and propose a novel hierarchical semi-supervised algorithm, which we call the HNMFk Classifier, that can be used in the early stages of the malware family labeling process. Our method is based on non-negative matrix factorization with automatic model selection, that is, with an estimation of the number of clusters. With HNMFk Classifier, we exploit the hierarchical structure of the malware data together with a semi-supervised setup, which enables us to classify malware families under conditions of extreme class imbalance. Our solution can perform abstaining predictions, or rejection option, which yields promising results in the identification of novel malware families and helps with maintaining the performance of the model when a low quantity of labeled data is used. We perform bulk classification of nearly 2,900 both rare and prominent malware families, through static analysis, using nearly 388,000 samples from the EMBER-2018 corpus. In our experiments, we surpass both supervised and semi-supervised baseline models with an F1 score of 0.80.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Tensor Networks for Solving Realistic Time-independent Boltzmann Neutron Transport Equation
Authors:
Duc P. Truong,
Mario I. Ortega,
Ismael Boureima,
Gianmarco Manzini,
Kim Ø. Rasmussen,
Boian S. Alexandrov
Abstract:
Tensor network techniques, known for their low-rank approximation ability that breaks the curse of dimensionality, are emerging as a foundation of new mathematical methods for ultra-fast numerical solutions of high-dimensional Partial Differential Equations (PDEs). Here, we present a mixed Tensor Train (TT)/Quantized Tensor Train (QTT) approach for the numerical solution of time-independent Boltzm…
▽ More
Tensor network techniques, known for their low-rank approximation ability that breaks the curse of dimensionality, are emerging as a foundation of new mathematical methods for ultra-fast numerical solutions of high-dimensional Partial Differential Equations (PDEs). Here, we present a mixed Tensor Train (TT)/Quantized Tensor Train (QTT) approach for the numerical solution of time-independent Boltzmann Neutron Transport equations (BNTEs) in Cartesian geometry. Discretizing a realistic three-dimensional (3D) BNTE by (i) diamond differencing, (ii) multigroup-in-energy, and (iii) discrete ordinate collocation leads to huge generalized eigenvalue problems that generally require a matrix-free approach and large computer clusters. Starting from this discretization, we construct a TT representation of the PDE fields and discrete operators, followed by a QTT representation of the TT cores and solving the tensorized generalized eigenvalue problem in a fixed-point scheme with tensor network optimization techniques. We validate our approach by applying it to two realistic examples of 3D neutron transport problems, currently solved by the PARallel TIme-dependent SN (PARTISN) solver. We demonstrate that our TT/QTT method, executed on a standard desktop computer, leads to a yottabyte compression of the memory storage, and more than 7500 times speedup with a discrepancy of less than 1e-5 when compared to the PARTISN solution.
△ Less
Submitted 13 September, 2023; v1 submitted 6 September, 2023;
originally announced September 2023.
-
MalwareDNA: Simultaneous Classification of Malware, Malware Families, and Novel Malware
Authors:
Maksim E. Eren,
Manish Bhattarai,
Kim Rasmussen,
Boian S. Alexandrov,
Charles Nicholas
Abstract:
Malware is one of the most dangerous and costly cyber threats to national security and a crucial factor in modern cyber-space. However, the adoption of machine learning (ML) based solutions against malware threats has been relatively slow. Shortcomings in the existing ML approaches are likely contributing to this problem. The majority of current ML approaches ignore real-world challenges such as t…
▽ More
Malware is one of the most dangerous and costly cyber threats to national security and a crucial factor in modern cyber-space. However, the adoption of machine learning (ML) based solutions against malware threats has been relatively slow. Shortcomings in the existing ML approaches are likely contributing to this problem. The majority of current ML approaches ignore real-world challenges such as the detection of novel malware. In addition, proposed ML approaches are often designed either for malware/benign-ware classification or malware family classification. Here we introduce and showcase preliminary capabilities of a new method that can perform precise identification of novel malware families, while also unifying the capability for malware/benign-ware classification and malware family classification into a single framework.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
Robust Adversarial Defense by Tensor Factorization
Authors:
Manish Bhattarai,
Mehmet Cagri Kaymak,
Ryan Barron,
Ben Nebgen,
Kim Rasmussen,
Boian Alexandrov
Abstract:
As machine learning techniques become increasingly prevalent in data analysis, the threat of adversarial attacks has surged, necessitating robust defense mechanisms. Among these defenses, methods exploiting low-rank approximations for input data preprocessing and neural network (NN) parameter factorization have shown potential. Our work advances this field further by integrating the tensorization…
▽ More
As machine learning techniques become increasingly prevalent in data analysis, the threat of adversarial attacks has surged, necessitating robust defense mechanisms. Among these defenses, methods exploiting low-rank approximations for input data preprocessing and neural network (NN) parameter factorization have shown potential. Our work advances this field further by integrating the tensorization of input data with low-rank decomposition and tensorization of NN parameters to enhance adversarial defense. The proposed approach demonstrates significant defense capabilities, maintaining robust accuracy even when subjected to the strongest known auto-attacks. Evaluations against leading-edge robust performance benchmarks reveal that our results not only hold their ground against the best defensive methods available but also exceed all current defense strategies that rely on tensor factorizations. This study underscores the potential of integrating tensorization and low-rank decomposition as a robust defense against adversarial attacks in machine learning.
△ Less
Submitted 3 September, 2023;
originally announced September 2023.
-
Generating Hidden Markov Models from Process Models Through Nonnegative Tensor Factorization
Authors:
Erik Skau,
Andrew Hollis,
Stephan Eidenbenz,
Kim Rasmussen,
Boian Alexandrov
Abstract:
Monitoring of industrial processes is a critical capability in industry and in government to ensure reliability of production cycles, quick emergency response, and national security. Process monitoring allows users to gauge the progress of an organization in an industrial process or predict the degradation or aging of machine parts in processes taking place at a remote location. Similar to many da…
▽ More
Monitoring of industrial processes is a critical capability in industry and in government to ensure reliability of production cycles, quick emergency response, and national security. Process monitoring allows users to gauge the progress of an organization in an industrial process or predict the degradation or aging of machine parts in processes taking place at a remote location. Similar to many data science applications, we usually only have access to limited raw data, such as satellite imagery, short video clips, event logs, and signatures captured by a small set of sensors. To combat data scarcity, we leverage the knowledge of Subject Matter Experts (SMEs) who are familiar with the actions of interest. SMEs provide expert knowledge of the essential activities required for task completion and the resources necessary to carry out each of these activities. Various process mining techniques have been developed for this type of analysis; typically such approaches combine theoretical process models built based on domain expert insights with ad-hoc integration of available pieces of raw data. Here, we introduce a novel mathematically sound method that integrates theoretical process models (as proposed by SMEs) with interrelated minimal Hidden Markov Models (HMM), built via nonnegative tensor factorization. Our method consolidates: (a) theoretical process models, (b) HMMs, (c) coupled nonnegative matrix-tensor factorizations, and (d) custom model selection. To demonstrate our methodology and its abilities, we apply it on simple synthetic and real world process models.
△ Less
Submitted 26 April, 2024; v1 submitted 3 October, 2022;
originally announced October 2022.
-
Patterns and Stability of Coupled Multi-Stable Nonlinear Oscillators
Authors:
Golan Bel,
Boian S. Alexandrov,
Alan R. Bishop,
Kim Ø. Rasmussen
Abstract:
Nonlinear isolated and coupled oscillators are extensively studied as prototypical nonlinear dynamics models. Much attention has been devoted to oscillator synchronization or the lack thereof. Here, we study the synchronization and stability of coupled driven-damped Helmholtz-Duffing oscillators in bi-stability regimes. We find that despite the fact that the system parameters and the driving force…
▽ More
Nonlinear isolated and coupled oscillators are extensively studied as prototypical nonlinear dynamics models. Much attention has been devoted to oscillator synchronization or the lack thereof. Here, we study the synchronization and stability of coupled driven-damped Helmholtz-Duffing oscillators in bi-stability regimes. We find that despite the fact that the system parameters and the driving force are identical, the stability of the two states to spatially non-uniform perturbations is very different. Moreover, the final stable states, resulting from these spatial perturbations, are not solely dictated by the wavelength of the perturbing mode and take different spatial configurations in terms of the coupled oscillator phases.
△ Less
Submitted 24 August, 2022;
originally announced August 2022.
-
SeNMFk-SPLIT: Large Corpora Topic Modeling by Semantic Non-negative Matrix Factorization with Automatic Model Selection
Authors:
Maksim E. Eren,
Nick Solovyev,
Manish Bhattarai,
Kim Rasmussen,
Charles Nicholas,
Boian S. Alexandrov
Abstract:
As the amount of text data continues to grow, topic modeling is serving an important role in understanding the content hidden by the overwhelming quantity of documents. One popular topic modeling approach is non-negative matrix factorization (NMF), an unsupervised machine learning (ML) method. Recently, Semantic NMF with automatic model selection (SeNMFk) has been proposed as a modification to NMF…
▽ More
As the amount of text data continues to grow, topic modeling is serving an important role in understanding the content hidden by the overwhelming quantity of documents. One popular topic modeling approach is non-negative matrix factorization (NMF), an unsupervised machine learning (ML) method. Recently, Semantic NMF with automatic model selection (SeNMFk) has been proposed as a modification to NMF. In addition to heuristically estimating the number of topics, SeNMFk also incorporates the semantic structure of the text. This is performed by jointly factorizing the term frequency-inverse document frequency (TF-IDF) matrix with the co-occurrence/word-context matrix, the values of which represent the number of times two words co-occur in a predetermined window of the text. In this paper, we introduce a novel distributed method, SeNMFk-SPLIT, for semantic topic extraction suitable for large corpora. Contrary to SeNMFk, our method enables the joint factorization of large documents by decomposing the word-context and term-document matrices separately. We demonstrate the capability of SeNMFk-SPLIT by applying it to the entire artificial intelligence (AI) and ML scientific literature uploaded on arXiv.
△ Less
Submitted 21 August, 2022;
originally announced August 2022.
-
Distributed Out-of-Memory SVD on CPU/GPU Architectures
Authors:
Ismael Boureima,
Manish Bhattarai,
Maksim E. Eren,
Nick Solovyev,
Hristo Djidjev,
Boian S. Alexandrov
Abstract:
We propose an efficient, distributed, out-of-memory implementation of the truncated singular value decomposition (t-SVD) for heterogeneous (CPU+GPU) high performance computing (HPC) systems. Various implementations of SVD have been proposed, but most only estimate the singular values as an estimation of the singular vectors which can significantly increase the time and memory complexity of the alg…
▽ More
We propose an efficient, distributed, out-of-memory implementation of the truncated singular value decomposition (t-SVD) for heterogeneous (CPU+GPU) high performance computing (HPC) systems. Various implementations of SVD have been proposed, but most only estimate the singular values as an estimation of the singular vectors which can significantly increase the time and memory complexity of the algorithm. In this work, we propose an implementation of SVD based on the power method, which is a truncated singular values and singular vectors estimation method. Memory utilization bottlenecks seen in the power method are typically associated with the computation of the Gram matrix $\mat{A}^T\mat{A}$, which can be significant when $\mat{A}$ is large and dense, or when $\mat{A}$ is super-large and sparse. The proposed implementation is optimized for out-of-memory problems where the memory required to factorize a given matrix is greater than the available GPU memory. We reduce the memory complexity of $\mat{A}^T\mat{A}$ by using a batching strategy where the intermediate factors are computed block by block. We also suppress I/O latency associated with both host-to-device (H2D) and device-to-host (D2H) batch copies by overlap** each batch copy with compute using CUDA streams. Furthermore, we use optimized \textit{NCCL} based communicators to reduce the latency associated with collective communications (both intra-node and inter-node). In addition, sparse and dense matrix multiplications are significantly accelerated with GPU cores (or tensors cores when available), resulting in an implementation with good scaling. We demonstrate the scalability of our distributed out of core SVD algorithm to successfully decompose dense matrix of size 1TB and sparse matrix of size 128PB with 1e-6 sparsity.
△ Less
Submitted 17 August, 2022;
originally announced August 2022.
-
FedSPLIT: One-Shot Federated Recommendation System Based on Non-negative Joint Matrix Factorization and Knowledge Distillation
Authors:
Maksim E. Eren,
Luke E. Richards,
Manish Bhattarai,
Roberto Yus,
Charles Nicholas,
Boian S. Alexandrov
Abstract:
Non-negative matrix factorization (NMF) with missing-value completion is a well-known effective Collaborative Filtering (CF) method used to provide personalized user recommendations. However, traditional CF relies on the privacy-invasive collection of users' explicit and implicit feedback to build a central recommender model. One-shot federated learning has recently emerged as a method to mitigate…
▽ More
Non-negative matrix factorization (NMF) with missing-value completion is a well-known effective Collaborative Filtering (CF) method used to provide personalized user recommendations. However, traditional CF relies on the privacy-invasive collection of users' explicit and implicit feedback to build a central recommender model. One-shot federated learning has recently emerged as a method to mitigate the privacy problem while addressing the traditional communication bottleneck of federated learning. In this paper, we present the first unsupervised one-shot federated CF implementation, named FedSPLIT, based on NMF joint factorization. In our solution, the clients first apply local CF in-parallel to build distinct client-specific recommenders. Then, the privacy-preserving local item patterns and biases from each client are shared with the processor to perform joint factorization in order to extract the global item patterns. Extracted patterns are then aggregated to each client to build the local models via knowledge distillation. In our experiments, we demonstrate the feasibility of our approach with standard recommendation datasets. FedSPLIT can obtain similar results than the state of the art (and even outperform it in certain situations) with a substantial decrease in the number of communications.
△ Less
Submitted 4 May, 2022;
originally announced May 2022.
-
Distributed Out-of-Memory NMF on CPU/GPU Architectures
Authors:
Ismael Boureima,
Manish Bhattarai,
Maksim Eren,
Erik Skau,
Philip Romero,
Stephan Eidenbenz,
Boian Alexandrov
Abstract:
We propose an efficient distributed out-of-memory implementation of the Non-negative Matrix Factorization (NMF) algorithm for heterogeneous high-performance-computing (HPC) systems. The proposed implementation is based on prior work on NMFk, which can perform automatic model selection and extract latent variables and patterns from data. In this work, we extend NMFk by adding support for dense and…
▽ More
We propose an efficient distributed out-of-memory implementation of the Non-negative Matrix Factorization (NMF) algorithm for heterogeneous high-performance-computing (HPC) systems. The proposed implementation is based on prior work on NMFk, which can perform automatic model selection and extract latent variables and patterns from data. In this work, we extend NMFk by adding support for dense and sparse matrix operation on multi-node, multi-GPU systems. The resulting algorithm is optimized for out-of-memory (OOM) problems where the memory required to factorize a given matrix is greater than the available GPU memory. Memory complexity is reduced by batching/tiling strategies, and sparse and dense matrix operations are significantly accelerated with GPU cores (or tensor cores when available). Input/Output (I/O) latency associated with batch copies between host and device is hidden using CUDA streams to overlap data transfers and compute asynchronously, and latency associated with collective communications (both intra-node and inter-node) is reduced using optimized NVIDIA Collective Communication Library NCCL based communicators. Benchmark results show significant improvement, from 32X to 76x speedup, with the new implementation using GPUs over the CPU-based NMFk. Good weak scaling was demonstrated on up to 4096 multi-GPU cluster nodes with approximately 25,000 GPUs when decomposing a dense 340 Terabyte-size matrix and an 11 Exabyte-size sparse matrix of density 10e-6.
△ Less
Submitted 12 September, 2023; v1 submitted 18 February, 2022;
originally announced February 2022.
-
Distributed non-negative RESCAL with Automatic Model Selection for Exascale Data
Authors:
Manish Bhattarai,
Namita Kharat,
Erik Skau,
Benjamin Nebgen,
Hristo Djidjev,
Sanjay Rajopadhye,
James P. Smith,
Boian Alexandrov
Abstract:
With the boom in the development of computer hardware and software, social media, IoT platforms, and communications, there has been an exponential growth in the volume of data produced around the world. Among these data, relational datasets are growing in popularity as they provide unique insights regarding the evolution of communities and their interactions. Relational datasets are naturally non-…
▽ More
With the boom in the development of computer hardware and software, social media, IoT platforms, and communications, there has been an exponential growth in the volume of data produced around the world. Among these data, relational datasets are growing in popularity as they provide unique insights regarding the evolution of communities and their interactions. Relational datasets are naturally non-negative, sparse, and extra-large. Relational data usually contain triples, (subject, relation, object), and are represented as graphs/multigraphs, called knowledge graphs, which need to be embedded into a low-dimensional dense vector space. Among various embedding models, RESCAL allows learning of relational data to extract the posterior distributions over the latent variables and to make predictions of missing relations. However, RESCAL is computationally demanding and requires a fast and distributed implementation to analyze extra-large real-world datasets. Here we introduce a distributed non-negative RESCAL algorithm for heterogeneous CPU/GPU architectures with automatic selection of the number of latent communities (model selection), called pyDRESCALk. We demonstrate the correctness of pyDRESCALk with real-world and large synthetic tensors, and the efficacy showing near-linear scaling that concurs with the theoretical complexities. Finally, pyDRESCALk determines the number of latent communities in an 11-terabyte dense and 9-exabyte sparse synthetic tensor.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Topic Analysis of Superconductivity Literature by Semantic Non-negative Matrix Factorization
Authors:
Valentin Stanev,
Erik Skau,
Ichiro Takeuchi,
Boian S. Alexandrov
Abstract:
We utilize a recently developed topic modeling method called SeNMFk, extending the standard Non-negative Matrix Factorization (NMF) methods by incorporating the semantic structure of the text, and adding a robust system for determining the number of topics. With SeNMFk, we were able to extract coherent topics validated by human experts. From these topics, a few are relatively general and cover bro…
▽ More
We utilize a recently developed topic modeling method called SeNMFk, extending the standard Non-negative Matrix Factorization (NMF) methods by incorporating the semantic structure of the text, and adding a robust system for determining the number of topics. With SeNMFk, we were able to extract coherent topics validated by human experts. From these topics, a few are relatively general and cover broad concepts, while the majority can be precisely mapped to specific scientific effects or measurement techniques. The topics also differ by ubiquity, with only three topics prevalent in almost 40 percent of the abstract, while each specific topic tends to dominate a small subset of the abstracts. These results demonstrate the ability of SeNMFk to produce a layered and nuanced analysis of large scientific corpora.
△ Less
Submitted 1 December, 2021;
originally announced January 2022.
-
Quantum Annealing Algorithms for Boolean Tensor Networks
Authors:
Elijah Pelofske,
Georg Hahn,
Daniel O'Malley,
Hristo N. Djidjev,
Boian S. Alexandrov
Abstract:
Quantum annealers manufactured by D-Wave Systems, Inc., are computational devices capable of finding high-quality solutions of NP-hard problems. In this contribution, we explore the potential and effectiveness of such quantum annealers for computing Boolean tensor networks. Tensors offer a natural way to model high-dimensional data commonplace in many scientific fields, and representing a binary t…
▽ More
Quantum annealers manufactured by D-Wave Systems, Inc., are computational devices capable of finding high-quality solutions of NP-hard problems. In this contribution, we explore the potential and effectiveness of such quantum annealers for computing Boolean tensor networks. Tensors offer a natural way to model high-dimensional data commonplace in many scientific fields, and representing a binary tensor as a Boolean tensor network is the task of expressing a tensor containing categorical (i.e., {0, 1}) values as a product of low dimensional binary tensors. A Boolean tensor network is computed by Boolean tensor decomposition, and it is usually not exact. The aim of such decomposition is to minimize the given distance measure between the high-dimensional input tensor and the product of lower-dimensional (usually three-dimensional) tensors and matrices representing the tensor network. In this paper, we introduce and analyze three general algorithms for Boolean tensor networks: Tucker, Tensor Train, and Hierarchical Tucker networks. The computation of a Boolean tensor network is reduced to a sequence of Boolean matrix factorizations, which we show can be expressed as a quadratic unconstrained binary optimization problem suitable for solving on a quantum annealer. By using a novel method we introduce called \textit{parallel quantum annealing}, we demonstrate that tensor with up to millions of elements can be decomposed efficiently using a DWave 2000Q quantum annealer.
△ Less
Submitted 27 March, 2022; v1 submitted 28 July, 2021;
originally announced July 2021.
-
COVID-19 Multidimensional Kaggle Literature Organization
Authors:
Maksim E. Eren,
Nick Solovyev,
Chris Hamer,
Renee McDonald,
Boian S. Alexandrov,
Charles Nicholas
Abstract:
The unprecedented outbreak of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), or COVID-19, continues to be a significant worldwide problem. As a result, a surge of new COVID-19 related research has followed suit. The growing number of publications requires document organization methods to identify relevant information. In this paper, we expand upon our previous work with clustering t…
▽ More
The unprecedented outbreak of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), or COVID-19, continues to be a significant worldwide problem. As a result, a surge of new COVID-19 related research has followed suit. The growing number of publications requires document organization methods to identify relevant information. In this paper, we expand upon our previous work with clustering the CORD-19 dataset by applying multi-dimensional analysis methods. Tensor factorization is a powerful unsupervised learning method capable of discovering hidden patterns in a document corpus. We show that a higher-order representation of the corpus allows for the simultaneous grou** of similar articles, relevant journals, authors with similar research interests, and topic keywords. These grou**s are identified within and among the latent components extracted via tensor decomposition. We further demonstrate the application of this method with a publicly available interactive visualization of the dataset.
△ Less
Submitted 19 July, 2021; v1 submitted 17 July, 2021;
originally announced July 2021.
-
Boolean Matrix Factorization via Nonnegative Auxiliary Optimization
Authors:
Duc P. Truong,
Erik Skau,
Derek Desantis,
Boian Alexandrov
Abstract:
A novel approach to Boolean matrix factorization (BMF) is presented. Instead of solving the BMF problem directly, this approach solves a nonnegative optimization problem with the constraint over an auxiliary matrix whose Boolean structure is identical to the initial Boolean data. Then the solution of the nonnegative auxiliary optimization problem is thresholded to provide a solution for the BMF pr…
▽ More
A novel approach to Boolean matrix factorization (BMF) is presented. Instead of solving the BMF problem directly, this approach solves a nonnegative optimization problem with the constraint over an auxiliary matrix whose Boolean structure is identical to the initial Boolean data. Then the solution of the nonnegative auxiliary optimization problem is thresholded to provide a solution for the BMF problem. We provide the proofs for the equivalencies of the two solution spaces under the existence of an exact solution. Moreover, the nonincreasing property of the algorithm is also proven. Experiments on synthetic and real datasets are conducted to show the effectiveness and complexity of the algorithm compared to other current methods.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
Boolean Hierarchical Tucker Networks on Quantum Annealers
Authors:
Elijah Pelofske,
Georg Hahn,
Daniel O'Malley,
Hristo N. Djidjev,
Boian S. Alexandrov
Abstract:
Quantum annealing is an emerging technology with the potential to solve some of the computational challenges that remain unresolved as we approach an era beyond Moore's Law. In this work, we investigate the capabilities of the quantum annealers of D-Wave Systems, Inc., for computing a certain type of Boolean tensor decomposition called Boolean Hierarchical Tucker Network (BHTN). Boolean tensor dec…
▽ More
Quantum annealing is an emerging technology with the potential to solve some of the computational challenges that remain unresolved as we approach an era beyond Moore's Law. In this work, we investigate the capabilities of the quantum annealers of D-Wave Systems, Inc., for computing a certain type of Boolean tensor decomposition called Boolean Hierarchical Tucker Network (BHTN). Boolean tensor decomposition problems ask for finding a decomposition of a high-dimensional tensor with categorical, [true, false], values, as a product of smaller Boolean core tensors. As the BHTN decompositions are usually not exact, we aim to approximate an input high-dimensional tensor by a product of lower-dimensional tensors such that the difference between both is minimized in some norm. We show that BHTN can be calculated as a sequence of optimization problems suitable for the D-Wave 2000Q quantum annealer. Although current technology is still fairly restricted in the problems they can address, we show that a complex problem such as BHTN can be solved efficiently and accurately.
△ Less
Submitted 12 March, 2021;
originally announced March 2021.
-
Factorization of Binary Matrices: Rank Relations, Uniqueness and Model Selection of Boolean Decomposition
Authors:
Derek DeSantis,
Erik Skau,
Duc P. Truong,
Boian Alexandrov
Abstract:
The application of binary matrices are numerous. Representing a matrix as a mixture of a small collection of latent vectors via low-rank decomposition is often seen as an advantageous method to interpret and analyze data. In this work, we examine the factorizations of binary matrices using standard arithmetic (real and nonnegative) and logical operations (Boolean and $\mathbb{Z}_2$). We examine th…
▽ More
The application of binary matrices are numerous. Representing a matrix as a mixture of a small collection of latent vectors via low-rank decomposition is often seen as an advantageous method to interpret and analyze data. In this work, we examine the factorizations of binary matrices using standard arithmetic (real and nonnegative) and logical operations (Boolean and $\mathbb{Z}_2$). We examine the relationships between the different ranks, and discuss when factorization is unique. In particular, we characterize when a Boolean factorization $X = W \land H$ has a unique $W$, a unique $H$ (for a fixed $W$), and when both $W$ and $H$ are unique, given a rank constraint. We introduce a method for robust Boolean model selection, called BMF$k$, and show on numerical examples that BMF$k$ not only accurately determines the correct number of Boolean latent features but reconstruct the pre-determined factors accurately.
△ Less
Submitted 1 November, 2021; v1 submitted 18 December, 2020;
originally announced December 2020.
-
Identification of Anomalous Diffusion Sources by Unsupervised Learning
Authors:
Raviteja Vangara,
Kim Ø. Rasmussen,
Dimiter N. Petsev,
Golan Bel,
Boian S. Alexandrov
Abstract:
Fractional Brownian motion (fBm) is a ubiquitous diffusion process in which the memory effects of the stochastic transport result in the mean squared particle displacement following a power law, $\langle {Δr}^2 \rangle \sim t^α$, where the diffusion exponent $α$ characterizes whether the transport is subdiffusive, ($α<1$), diffusive ($α= 1$), or superdiffusive, ($α>1$). Due to the abundance of fBm…
▽ More
Fractional Brownian motion (fBm) is a ubiquitous diffusion process in which the memory effects of the stochastic transport result in the mean squared particle displacement following a power law, $\langle {Δr}^2 \rangle \sim t^α$, where the diffusion exponent $α$ characterizes whether the transport is subdiffusive, ($α<1$), diffusive ($α= 1$), or superdiffusive, ($α>1$). Due to the abundance of fBm processes in nature, significant efforts have been devoted to the identification and characterization of fBm sources in various phenomena. In practice, the identification of the fBm sources often relies on solving a complex and ill-posed inverse problem based on limited observed data. In the general case, the detected signals are formed by an unknown number of release sources, located at different locations and with different strengths, that act simultaneously. This means that the observed data is composed of mixtures of releases from an unknown number of sources, which makes the traditional inverse modeling approaches unreliable. Here, we report an unsupervised learning method, based on Nonnegative Matrix Factorization, that enables the identification of the unknown number of release sources as well the anomalous diffusion characteristics based on limited observed data and the general form of the corresponding fBm Green's function. We show that our method performs accurately for different types of sources and configurations with a predetermined number of sources with specific characteristics and introduced noise.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Distributed Non-Negative Tensor Train Decomposition
Authors:
Manish Bhattarai,
Gopinath Chennupati,
Erik Skau,
Raviteja Vangara,
Hirsto Djidjev,
Boian Alexandrov
Abstract:
The era of exascale computing opens new venues for innovations and discoveries in many scientific, engineering, and commercial fields. However, with the exaflops also come the extra-large high-dimensional data generated by high-performance computing. High-dimensional data is presented as multidimensional arrays, aka tensors. The presence of latent (not directly observable) structures in the tensor…
▽ More
The era of exascale computing opens new venues for innovations and discoveries in many scientific, engineering, and commercial fields. However, with the exaflops also come the extra-large high-dimensional data generated by high-performance computing. High-dimensional data is presented as multidimensional arrays, aka tensors. The presence of latent (not directly observable) structures in the tensor allows a unique representation and compression of the data by classical tensor factorization techniques. However, the classical tensor methods are not always stable or they can be exponential in their memory requirements, which makes them not suitable for high-dimensional tensors. Tensor train (TT) is a state-of-the-art tensor network introduced for factorization of high-dimensional tensors. TT transforms the initial high-dimensional tensor in a network of three-dimensional tensors that requires only a linear storage. Many real-world data, such as, density, temperature, population, probability, etc., are non-negative and for an easy interpretation, the algorithms preserving non-negativity are preferred. Here, we introduce a distributed non-negative tensor-train and demonstrate its scalability and the compression on synthetic and real-world big datasets.
△ Less
Submitted 4 August, 2020;
originally announced August 2020.
-
A Neural Network for Determination of Latent Dimensionality in Nonnegative Matrix Factorization
Authors:
Benjamin T. Nebgen,
Raviteja Vangara,
Miguel A. Hombrados-Herrera,
Svetlana Kuksova,
Boian S. Alexandrov
Abstract:
Non-negative Matrix Factorization (NMF) has proven to be a powerful unsupervised learning method for uncovering hidden features in complex and noisy data sets with applications in data mining, text recognition, dimension reduction, face recognition, anomaly detection, blind source separation, and many other fields. An important input for NMF is the latent dimensionality of the data, that is, the n…
▽ More
Non-negative Matrix Factorization (NMF) has proven to be a powerful unsupervised learning method for uncovering hidden features in complex and noisy data sets with applications in data mining, text recognition, dimension reduction, face recognition, anomaly detection, blind source separation, and many other fields. An important input for NMF is the latent dimensionality of the data, that is, the number of hidden features, K, present in the explored data set. Unfortunately, this quantity is rarely known a priori. We utilize a supervised machine learning approach in combination with a recent method for model determination, called NMFk, to determine the number of hidden features automatically. NMFk performs a set of NMF simulations on an ensemble of matrices, obtained by bootstrap** the initial data set, and determines which K produces stable groups of latent features that reconstruct the initial data set well. We then train a Multi-Layer Perceptron (MLP) classifier network to determine the correct number of latent features utilizing the statistics and characteristics of the NMF solutions, obtained from NMFk. In order to train the MLP classifier, a training set of 58,660 matrices with predetermined latent features were factorized with NMFk. The MLP classifier in conjunction with NMFk maintains a greater than 95% success rate when applied to a held out test set. Additionally, when applied to two well-known benchmark data sets, the swimmer and MIT face data, NMFk/MLP correctly recovered the established number of hidden features. Finally, we compared the accuracy of our method to the ARD, AIC and Stability-based methods.
△ Less
Submitted 22 June, 2020;
originally announced June 2020.
-
Determination of Latent Dimensionality in International Trade Flow
Authors:
Duc P. Truong,
Erik Skau,
Vladimir I. Valtchinov,
Boian S. Alexandrov
Abstract:
Currently, high-dimensional data is ubiquitous in data science, which necessitates the development of techniques to decompose and interpret such multidimensional (aka tensor) datasets. Finding a low dimensional representation of the data, that is, its inherent structure, is one of the approaches that can serve to understand the dynamics of low dimensional latent features hidden in the data. Nonneg…
▽ More
Currently, high-dimensional data is ubiquitous in data science, which necessitates the development of techniques to decompose and interpret such multidimensional (aka tensor) datasets. Finding a low dimensional representation of the data, that is, its inherent structure, is one of the approaches that can serve to understand the dynamics of low dimensional latent features hidden in the data. Nonnegative RESCAL is one such technique, particularly well suited to analyze self-relational data, such as dynamic networks found in international trade flows. Nonnegative RESCAL computes a low dimensional tensor representation by finding the latent space containing multiple modalities. Estimating the dimensionality of this latent space is crucial for extracting meaningful latent features. Here, to determine the dimensionality of the latent space with nonnegative RESCAL, we propose a latent dimension determination method which is based on clustering of the solutions of multiple realizations of nonnegative RESCAL decompositions. We demonstrate the performance of our model selection method on synthetic data and then we apply our method to decompose a network of international trade flows data from International Monetary Fund and validate the resulting features against empirical facts from economic literature.
△ Less
Submitted 28 February, 2020;
originally announced March 2020.
-
Coarse-Grain Cluster Analysis of Tensors with Application to Climate Biome Identification
Authors:
Derek DeSantis,
Phillip J. Wolfram,
Katrina Bennett,
Boian Alexandrov
Abstract:
A tensor provides a concise way to codify the interdependence of complex data. Treating a tensor as a d-way array, each entry records the interaction between the different indices. Clustering provides a way to parse the complexity of the data into more readily understandable information. Clustering methods are heavily dependent on the algorithm of choice, as well as the chosen hyperparameters of t…
▽ More
A tensor provides a concise way to codify the interdependence of complex data. Treating a tensor as a d-way array, each entry records the interaction between the different indices. Clustering provides a way to parse the complexity of the data into more readily understandable information. Clustering methods are heavily dependent on the algorithm of choice, as well as the chosen hyperparameters of the algorithm. However, their sensitivity to data scales is largely unknown.
In this work, we apply the discrete wavelet transform to analyze the effects of coarse-graining on clustering tensor data. We are particularly interested in understanding how scale effects clustering of the Earth's climate system. The discrete wavelet transform allows classification of the Earth's climate across a multitude of spatial-temporal scales. The discrete wavelet transform is used to produce an ensemble of classification estimates, as opposed to a single classification. Information theoretic approaches are used to identify important scale lenghts in clustering The L15 Climate Datset. We also discover a sub-collection of the ensemble that spans the majority of the variance observed, allowing for efficient consensus clustering techniques that can be used to identify climate biomes.
△ Less
Submitted 22 May, 2020; v1 submitted 21 January, 2020;
originally announced January 2020.
-
Nonnegative Canonical Tensor Decomposition with Linear Constraints: nnCANDELINC
Authors:
Boian Alexandrov,
Derek DeSantis,
Gianmarco Manzini,
Erik Skau
Abstract:
There is an emerging interest in tensor factorization applications in big-data analytics and machine learning. To speed up the factorization of extra-large datasets, organized in multidimensional arrays (aka tensors), easy to compute compression-based tensor representations, such as Tucker and Tensor Train formats, are used to approximate the initial large-tensor. Further, tensor factorization is…
▽ More
There is an emerging interest in tensor factorization applications in big-data analytics and machine learning. To speed up the factorization of extra-large datasets, organized in multidimensional arrays (aka tensors), easy to compute compression-based tensor representations, such as Tucker and Tensor Train formats, are used to approximate the initial large-tensor. Further, tensor factorization is used to extract latent features that can facilitate discoveries of new mechanisms and signatures hidden in the data, where the explainability of the latent features is of principal importance. Nonnegative tensor factorization extracts latent features that are naturally sparse and parts of the data, which makes them easily interpretable. However, to take into account available domain knowledge and subject matter expertise, additional constraints often need to be imposed, which lead us to Canonical decomposition with linear constraints (CANDELINC), a Canonical Polyadic Decomposition with rank deficient factors. In CANDELINC, Tucker compression is used as a pre-processing step, which leads to a larger residual error but to more explainable latent features. Here, we propose a nonnegative CANDELINC (nnCANDELINC) accomplished via a specific nonnegative Tucker decomposition; we refer to as minimal or canonical nonnegative Tucker. We derive several results required to understand the specificity of nnCANDELINC, focusing on the difficulties of preserving the nonnegative rank to its Tucker core and comparing the real-valued to the nonnegative case. Finally, we demonstrate nnCANDELINC performance on synthetic and real-world examples.
△ Less
Submitted 21 March, 2022; v1 submitted 16 September, 2019;
originally announced September 2019.
-
Unsupervised Machine Learning Based on Non-Negative Tensor Factorization for Analyzing Reactive-Mixing
Authors:
V. V. Vesselinov,
M. K. Mudunuru,
S. Karra,
D. O. Malley,
B. S. Alexandrov
Abstract:
Analysis of reactive-diffusion simulations requires a large number of independent model runs. For each high-fidelity simulation, inputs are varied and the predicted mixing behavior is represented by changes in species concentration. It is then required to discern how the model inputs impact the mixing process. This task is challenging and typically involves interpretation of large model outputs. H…
▽ More
Analysis of reactive-diffusion simulations requires a large number of independent model runs. For each high-fidelity simulation, inputs are varied and the predicted mixing behavior is represented by changes in species concentration. It is then required to discern how the model inputs impact the mixing process. This task is challenging and typically involves interpretation of large model outputs. However, the task can be automated and substantially simplified by applying Machine Learning (ML) methods. In this paper, we present an application of an unsupervised ML method (called NTFk) using Non-negative Tensor Factorization (NTF) coupled with a custom clustering procedure based on k-means to reveal hidden features in product concentration. An attractive aspect of the proposed ML method is that it ensures the extracted features are non-negative, which are important to obtain a meaningful deconstruction of the mixing processes. The ML method is applied to a large set of high-resolution FEM simulations representing reaction-diffusion processes in perturbed vortex-based velocity fields. The applied FEM ensures that species concentration are always non-negative. The simulated reaction is a fast irreversible bimolecular reaction. The reactive-diffusion model input parameters that control mixing include properties of velocity field, anisotropic dispersion, and molecular diffusion. We demonstrate the applicability of the ML method to produce a meaningful deconstruction of model outputs to discriminate between different physical processes impacting the reactants, their mixing, and the spatial distribution of the product. The presented ML analysis allowed us to identify additive features that characterize mixing behavior.
△ Less
Submitted 21 February, 2019; v1 submitted 15 May, 2018;
originally announced May 2018.
-
Unsupervised Phase Map** of X-ray Diffraction Data by Nonnegative Matrix Factorization Integrated with Custom Clustering
Authors:
Valentin Stanev,
Velimir V. Vesselinov,
A. Gilad Kusne,
Graham Antoszewski,
Ichiro Takeuchi,
Boian S. Alexandrov
Abstract:
Analyzing large X-ray diffraction (XRD) datasets is a key step in high-throughput map** of the compositional phase diagrams of combinatorial materials libraries. Optimizing and automating this task can help accelerate the process of discovery of materials with novel and desirable properties. Here, we report a new method for pattern analysis and phase extraction of XRD datasets. The method expand…
▽ More
Analyzing large X-ray diffraction (XRD) datasets is a key step in high-throughput map** of the compositional phase diagrams of combinatorial materials libraries. Optimizing and automating this task can help accelerate the process of discovery of materials with novel and desirable properties. Here, we report a new method for pattern analysis and phase extraction of XRD datasets. The method expands the Nonnegative Matrix Factorization method, which has been used previously to analyze such datasets, by combining it with custom clustering and cross-correlation algorithms. This new method is capable of robust determination of the number of basis patterns present in the data which, in turn, enables straightforward identification of any possible peak-shifted patterns. Peak-shifting arises due to continuous change in the lattice constants as a function of composition, and is ubiquitous in XRD datasets from composition spread libraries. Successful identification of the peak-shifted patterns allows proper quantification and classification of the basis XRD patterns, which is necessary in order to decipher the contribution of each unique single-phase structure to the multi-phase regions. The process can be utilized to determine accurately the compositional phase diagram of a system under study. The presented method is applied to one synthetic and one experimental dataset, and demonstrates robust accuracy and identification abilities.
△ Less
Submitted 20 February, 2018;
originally announced February 2018.
-
Nonnegative/binary matrix factorization with a D-Wave quantum annealer
Authors:
Daniel O'Malley,
Velimir V. Vesselinov,
Boian S. Alexandrov,
Ludmil B. Alexandrov
Abstract:
D-Wave quantum annealers represent a novel computational architecture and have attracted significant interest, but have been used for few real-world computations. Machine learning has been identified as an area where quantum annealing may be useful. Here, we show that the D-Wave 2X can be effectively used as part of an unsupervised machine learning method. This method can be used to analyze large…
▽ More
D-Wave quantum annealers represent a novel computational architecture and have attracted significant interest, but have been used for few real-world computations. Machine learning has been identified as an area where quantum annealing may be useful. Here, we show that the D-Wave 2X can be effectively used as part of an unsupervised machine learning method. This method can be used to analyze large datasets. The D-Wave only limits the number of features that can be extracted from the dataset. We apply this method to learn the features from a set of facial images.
△ Less
Submitted 5 April, 2017;
originally announced April 2017.
-
Nonnegative Matrix Factorization for identification of unknown number of sources emitting delayed signals
Authors:
Filip L. Iliev,
Valentin G. Stanev,
Velimir V. Vesselinov,
Boian S. Alexandrov
Abstract:
Factor analysis is broadly used as a powerful unsupervised machine learning tool for reconstruction of hidden features in recorded mixtures of signals. In the case of a linear approximation, the mixtures can be decomposed by a variety of model-free Blind Source Separation (BSS) algorithms. Most of the available BSS algorithms consider an instantaneous mixing of signals, while the case when the mix…
▽ More
Factor analysis is broadly used as a powerful unsupervised machine learning tool for reconstruction of hidden features in recorded mixtures of signals. In the case of a linear approximation, the mixtures can be decomposed by a variety of model-free Blind Source Separation (BSS) algorithms. Most of the available BSS algorithms consider an instantaneous mixing of signals, while the case when the mixtures are linear combinations of signals with delays is less explored. Especially difficult is the case when the number of sources of the signals with delays is unknown and has to be determined from the data as well. To address this problem, in this paper, we present a new method based on Nonnegative Matrix Factorization (NMF) that is capable of identifying: (a) the unknown number of the sources, (b) the delays and speed of propagation of the signals, and (c) the locations of the sources. Our method can be used to decompose records of mixtures of signals with delays emitted by an unknown number of sources in a nondispersive medium, based only on recorded data. This is the case, for example, when electromagnetic signals from multiple antennas are received asynchronously; or mixtures of acoustic or seismic signals recorded by sensors located at different positions; or when a shift in frequency is induced by the Doppler effect. By applying our method to synthetic datasets, we demonstrate its ability to identify the unknown number of sources as well as the waveforms, the delays, and the strengths of the signals. Using Bayesian analysis, we also evaluate estimation uncertainties and identify the region of likelihood where the positions of the sources can be found.
△ Less
Submitted 23 March, 2018; v1 submitted 12 December, 2016;
originally announced December 2016.
-
Identification of release sources in advection-diffusion system by machine learning combined with Green function inverse method
Authors:
Valentin G. Stanev,
Filip L. Iliev,
Scott Hansen,
Velimir V. Vesselinov,
Boian S. Alexandrov
Abstract:
The identification of sources of advection-diffusion transport is based usually on solving complex ill-posed inverse models against the available state- variable data records. However, if there are several sources with different locations and strengths, the data records represent mixtures rather than the separate influences of the original sources. Importantly, the number of these original release…
▽ More
The identification of sources of advection-diffusion transport is based usually on solving complex ill-posed inverse models against the available state- variable data records. However, if there are several sources with different locations and strengths, the data records represent mixtures rather than the separate influences of the original sources. Importantly, the number of these original release sources is typically unknown, which hinders reliability of the classical inverse-model analyses. To address this challenge, we present here a novel hybrid method for identification of the unknown number of release sources. Our hybrid method, called HNMF, couples unsupervised learning based on Nonnegative Matrix Factorization (NMF) and inverse-analysis Green functions method. HNMF synergistically performs decomposition of the recorded mixtures, finds the number of the unknown sources and uses the Green function of advection-diffusion equation to identify their characteristics. In the paper, we introduce the method and demonstrate that it is capable of identifying the advection velocity and dispersivity of the medium as well as the unknown number, locations, and properties of various sets of synthetic release sources with different space and time dependencies, based only on the recorded data. HNMF can be applied directly to any problem controlled by a partial-differential parabolic equation where mixtures of an unknown number of sources are measured at multiple locations.
△ Less
Submitted 23 March, 2018; v1 submitted 12 December, 2016;
originally announced December 2016.
-
New measurements of neutron electric dipole moment with double chamber EDM spectrometer
Authors:
A. P. Serebrov,
E. A. Kolomenskiy,
A. N. Pirozhkov,
I. A. Krasnoshekova,
A. V. Vasiliev,
A. O. Polyushkin,
M. S. Lasakov,
A. N. Murashkin,
V. A. Solovey,
A. K. Fomin,
I. V. Shoka,
O. M. Zherebtsov,
P. Geltenbort,
S. N. Ivanov,
O. Zimmer,
E. B. Alexandrov,
S. P. Dmitriev,
N. A. Dovator
Abstract:
The article presents results on neutron electric dipole moment measurements (EDM), made by ILL reactor using PNPI experimental installation. Double chamber magnetic resonance spectrometer with prolonged holding of ultra cold neutrons has been employed. The obtained results at 90% confidence level determine the upper limit for EDM neutron quantity equal to $|d_n| < 5.5 \cdot 10^{-26}$ e$ \cdot$cm.
The article presents results on neutron electric dipole moment measurements (EDM), made by ILL reactor using PNPI experimental installation. Double chamber magnetic resonance spectrometer with prolonged holding of ultra cold neutrons has been employed. The obtained results at 90% confidence level determine the upper limit for EDM neutron quantity equal to $|d_n| < 5.5 \cdot 10^{-26}$ e$ \cdot$cm.
△ Less
Submitted 27 August, 2014;
originally announced August 2014.
-
New measurements of neutron electric dipole moment
Authors:
A. P. Serebrov,
E. A. Kolomenskiy,
A. N. Pirozhkov,
I. A. Krasnoshekova,
A. V. Vasiliev,
A. O. Polyushkin,
M. S. Lasakov,
A. K. Fomin,
I. V. Shoka,
V. A. Solovey,
O. M. Zherebtsov,
P. Geltenbort,
O. Zimmer,
S. N. Ivanov,
E. B. Alexandrov,
S. P. Dmitriev,
N. A. Dovator
Abstract:
We report a new measurement of the neutron electric dipole moment with the PNPI EDM spectrometer using the ultracold neutron source PF2 at the research reactor of the ILL. Its first results can be interpreted as a limit on the neutron EDM of $|d_{\rm n}| < 5.5 \times 10^{-26} \rm{e} \cdot \rm{cm}$ (90% confidence level).
We report a new measurement of the neutron electric dipole moment with the PNPI EDM spectrometer using the ultracold neutron source PF2 at the research reactor of the ILL. Its first results can be interpreted as a limit on the neutron EDM of $|d_{\rm n}| < 5.5 \times 10^{-26} \rm{e} \cdot \rm{cm}$ (90% confidence level).
△ Less
Submitted 16 December, 2013; v1 submitted 21 October, 2013;
originally announced October 2013.
-
Deformations of nearly parallel G_2-structures
Authors:
Bogdan Alexandrov,
Uwe Semmelmann
Abstract:
We study the infinitesimal deformations of a proper nearly parallel G_2-structure and prove that they are characterized by a certain first order differential equation. In particular we show that the space of infinitesimal deformations modulo the group of diffeomorphisms is isomorphic to a subspace of co-closed $Λ^3_{27}$-eigenforms of the Laplace operator for the eigenvalue 8 scal/21. We give a si…
▽ More
We study the infinitesimal deformations of a proper nearly parallel G_2-structure and prove that they are characterized by a certain first order differential equation. In particular we show that the space of infinitesimal deformations modulo the group of diffeomorphisms is isomorphic to a subspace of co-closed $Λ^3_{27}$-eigenforms of the Laplace operator for the eigenvalue 8 scal/21. We give a similar description for the space of infinitesimal Einstein deformations of a fixed nearly parallel G_2-structure. Moreover we show that there are no deformations on the squashed S^7 and on SO(5)/SO(3), but that there are infinitesimal deformations on the Aloff-Wallach manifold N(1,1) = SU(3)/U(1).
△ Less
Submitted 11 January, 2011;
originally announced January 2011.
-
Feigenbaum Cascade of Discrete Breathers in a Model of DNA
Authors:
P. Maniadis,
B. S. Alexandrov,
A. R. Bishop,
K. Ø. Rasmussen
Abstract:
We demonstrate that period-doubled discrete breathers appear from the anti-continuum limit of the driven Peyrard-Bishop-Dauxois model of DNA. These novel breathers result from a stability overlap between sub-harmonic solutions of the driven Morse oscillator. Sub-harmonic breathers exist whenever a stability overlap is present within the Feigenbaum cascade to chaos and therefore an entire cascade o…
▽ More
We demonstrate that period-doubled discrete breathers appear from the anti-continuum limit of the driven Peyrard-Bishop-Dauxois model of DNA. These novel breathers result from a stability overlap between sub-harmonic solutions of the driven Morse oscillator. Sub-harmonic breathers exist whenever a stability overlap is present within the Feigenbaum cascade to chaos and therefore an entire cascade of such breathers exists. This phenomenon is present in any driven lattice where the on-site potential admits sub-harmonic solutions. In DNA these breathers may have ramifications for cellular gene expression.
△ Less
Submitted 11 January, 2011; v1 submitted 12 December, 2010;
originally announced December 2010.
-
Robust accidental nodes and zeroes and critical quasiparticle scaling in iron-based multiband superconductors
Authors:
Valentin Stanev,
Boian S. Alexandrov,
Predrag Nikolic,
Zlatko Tesanovic
Abstract:
We study multigap superconductivity, with strong angular variations of one of the gaps, as appropriate for certain iron-based high-temperature superconductors. We solve the gap equations of this model and find that the nodes or zeroes in the gap function present at Tc - although purely accidental -- typically survive down to T=0. Based on this result, we investigate the line of quantum transitions…
▽ More
We study multigap superconductivity, with strong angular variations of one of the gaps, as appropriate for certain iron-based high-temperature superconductors. We solve the gap equations of this model and find that the nodes or zeroes in the gap function present at Tc - although purely accidental -- typically survive down to T=0. Based on this result, we investigate the line of quantum transitions at which gap zeroes first appear. The peculiar "zero-point" critical scaling emanating from this line dominates quasiparticle thermodynamics and transport properties over much of the phase diagram, and supplants more familiar forms of scaling associated with accidental nodes.
△ Less
Submitted 19 May, 2011; v1 submitted 2 June, 2010;
originally announced June 2010.
-
DNA Breathing Dynamics in the Presence of a Terahertz Field
Authors:
B. S. Alexandrov,
V. Gelev,
A. R. Bishop,
A. Usheva,
K. O. Rasmussen
Abstract:
We consider the influence of a terahertz field on the breathing dynamics of double-stranded DNA. We model the spontaneous formation of spatially localized openings of a damped and driven DNA chain, and find that linear instabilities lead to dynamic dimerization, while true local strand separations require a threshold amplitude mechanism. Based on our results we argue that a specific terahertz ra…
▽ More
We consider the influence of a terahertz field on the breathing dynamics of double-stranded DNA. We model the spontaneous formation of spatially localized openings of a damped and driven DNA chain, and find that linear instabilities lead to dynamic dimerization, while true local strand separations require a threshold amplitude mechanism. Based on our results we argue that a specific terahertz radiation exposure may significantly affect the natural dynamics of DNA, and thereby influence intricate molecular processes involved in gene expression and DNA replication.
△ Less
Submitted 28 October, 2009;
originally announced October 2009.
-
Controlling atomic vapor density in paraffin-coated cells using light-induced atomic desorption
Authors:
T. Karaulanov,
M. T. Graf,
D. English,
S. M. Rochester,
Y. Rosen,
K. Tsigutkin,
D. Budker,
E. B. Alexandrov,
M. V. Balabas,
D. F. Jackson Kimball,
F. A. Narducci,
S. Pustelny,
V. V. Yashchuk
Abstract:
Atomic-vapor density change due to light induced atomic desorption (LIAD) is studied in paraffin-coated rubidium, cesium, sodium and potassium cells. In the present experiment, low-intensity probe light is used to obtain an absorption spectrum and measure the vapor density, while light from an argon-ion laser, array of light emitting diodes, or discharge lamp is used for desorption. Potassium is…
▽ More
Atomic-vapor density change due to light induced atomic desorption (LIAD) is studied in paraffin-coated rubidium, cesium, sodium and potassium cells. In the present experiment, low-intensity probe light is used to obtain an absorption spectrum and measure the vapor density, while light from an argon-ion laser, array of light emitting diodes, or discharge lamp is used for desorption. Potassium is found to exhibit significantly weaker LIAD from paraffin compared to Rb and Cs, and we were unable to observe LIAD with sodium. A simple LIAD model is applied to describe the observed vapor-density dynamics, and the role of the cell's stem is explored through the use of cells with lockable stems. Stabilization of Cs vapor density above its equilibrium value over 25 minutes is demonstrated. The results of this work could be used to assess the use of LIAD for vapor-density control in magnetometers, clocks, and gyroscopes utilizing coated cells.
△ Less
Submitted 23 October, 2008; v1 submitted 3 June, 2008;
originally announced June 2008.
-
Branching Transport Model of Alkali-Halide Scintillators
Authors:
B. S. Alexandrov,
K. D. Ianakiev,
P. B. Littlewood
Abstract:
We measure the time dependence of the scintillator light-emission pulses in NaI(Tl) crystals at different temperatures, after activation by gamma rays. We confirm that there are two main nonexponential components to the time decay and find that their amplitude ratio shows Arrhenius temperature dependence. We explain these nonexponential components as arising from two competing mechanisms of carr…
▽ More
We measure the time dependence of the scintillator light-emission pulses in NaI(Tl) crystals at different temperatures, after activation by gamma rays. We confirm that there are two main nonexponential components to the time decay and find that their amplitude ratio shows Arrhenius temperature dependence. We explain these nonexponential components as arising from two competing mechanisms of carrier transport to the Tl activation levels. The total light output of the NaI(Tl) detectors shows a linear temperature dependence explained by our model.
△ Less
Submitted 25 May, 2007;
originally announced May 2007.
-
Classical Propagation of Light in Spatio-Temporal Periodic Media
Authors:
B. S. Alexandrov,
K. O. Rasmussen,
A. T. Findikoglu,
A. R. Bishop,
I. Z. Kostadinov
Abstract:
We analyze the propagation of electromagnetic waves in media where the dielectric constants undergo rapid temporal periodic modulation. Both spatially homogeneous and periodic media are studied. Fast periodic temporal modulation of the dielectric constant of a homogeneous medium leads to existence of photonic band-gap like phenomena. In the presence of both spatial and tem- poral periodicity the…
▽ More
We analyze the propagation of electromagnetic waves in media where the dielectric constants undergo rapid temporal periodic modulation. Both spatially homogeneous and periodic media are studied. Fast periodic temporal modulation of the dielectric constant of a homogeneous medium leads to existence of photonic band-gap like phenomena. In the presence of both spatial and tem- poral periodicity the electromagnetic spectrum is described in a four-dimensional cube, defining an effective Brillouin zone. In the case of incommensurability between space and time periodicities, completely dispersed point spectra exist.
△ Less
Submitted 18 October, 2006;
originally announced October 2006.
-
Temperature behavior of NaI (Tl) scintillation detectors
Authors:
K. D. Ianakiev,
B. S. Alexandrov,
P. B. Littlewood,
M. C. Browne
Abstract:
It is a familiar fact that the total measured light yield of NaI (Tl) detectors is a nonlinear function of temperature. Here we present new experimental data for the temperature behavior of doped NaI(Tl) scintillators that instead shows a linear dependence of light output over a wide temperature range- including that for outdoor applications. The shape of the light pulse shows in general two dec…
▽ More
It is a familiar fact that the total measured light yield of NaI (Tl) detectors is a nonlinear function of temperature. Here we present new experimental data for the temperature behavior of doped NaI(Tl) scintillators that instead shows a linear dependence of light output over a wide temperature range- including that for outdoor applications. The shape of the light pulse shows in general two decay processes: a single dominant process above room temperature and two decay time constants below. We show that redistribution of the intensities is temperature-dependent; the second (slow) decay component is negligible at room temperatures, but, by -20C, it contributes up to 40 percents of the total light and has a duration of several microseconds. We discuss the profound effect this new understanding of the light output has on the pulse height analysis instrumentation. We introduce a theoretical model to explain the experimental results. In addition, we describe a unique technique for correcting both amplitude and shape temperature changes inside the NaI(Tl) detector package.
△ Less
Submitted 30 May, 2006;
originally announced May 2006.
-
Bubble Statistics and Dynamics in Double-Stranded DNA
Authors:
B. S. Alexandrov,
L. T. Wille,
K. O. Rasmussen,
A. R. Bishop,
K. B. Blagoev
Abstract:
The dynamical properties of double-stranded DNA are studied in the framework of the Peyrard-Bishop-Dauxois model using Langevin dynamics. Our simulations are analyzed in terms of two probability functions describing coherently localized separations ("bubbles") of the double strand. We find that the resulting bubble distributions are more sharply peaked at the active sites than found in thermodyn…
▽ More
The dynamical properties of double-stranded DNA are studied in the framework of the Peyrard-Bishop-Dauxois model using Langevin dynamics. Our simulations are analyzed in terms of two probability functions describing coherently localized separations ("bubbles") of the double strand. We find that the resulting bubble distributions are more sharply peaked at the active sites than found in thermodynamically obtained distributions. Our analysis ascribes this to the fact that the bubble life-times significantly afects the distribution function. We find that certain base-pair sequences promote long-lived bubbles and we argue that this is due to a length scale competition between the nonlinearity and disorder present in the system.
△ Less
Submitted 24 January, 2006;
originally announced January 2006.
-
Relaxation of atomic polarization in paraffin-coated cesium vapor cells
Authors:
M. T. Graf,
D. F. Kimball,
S. M. Rochester,
K. Kerner,
C. Wong,
D. Budker,
E. B. Alexandrov,
M. V. Balabas
Abstract:
The relaxation of atomic polarization in buffer-gas-free, paraffin-coated cesium vapor cells is studied using a variation on Franzen's technique of ``relaxation in the dark'' [Franzen, Phys. Rev. {\bf 115}, 850 (1959)]. In the present experiment, narrow-band, circularly polarized pump light, resonant with the Cs D2 transition, orients atoms along a longitudinal magnetic field, and time-dependent…
▽ More
The relaxation of atomic polarization in buffer-gas-free, paraffin-coated cesium vapor cells is studied using a variation on Franzen's technique of ``relaxation in the dark'' [Franzen, Phys. Rev. {\bf 115}, 850 (1959)]. In the present experiment, narrow-band, circularly polarized pump light, resonant with the Cs D2 transition, orients atoms along a longitudinal magnetic field, and time-dependent optical rotation of linearly polarized probe light is measured to determine the relaxation rates of the atomic orientation of a particular hyperfine level. The change in relaxation rates during light-induced atomic desorption (LIAD) is studied. No significant change in the spin relaxation rate during LIAD is found beyond that expected from the faster rate of spin-exchange collisions due to the increase in Cs density.
△ Less
Submitted 25 March, 2005;
originally announced March 2005.
-
The first eigenvalue of the Dirac operator on locally reducible Riemannian manifolds
Authors:
Bogdan Alexandrov
Abstract:
We prove a lower estimate for the first eigenvalue of the Dirac operator on a compact locally reducible Riemannian spin manifold with positive scalar curvature. We determine also the universal covers of the manifolds on which the smallest possible eigenvalue is attained.
We prove a lower estimate for the first eigenvalue of the Dirac operator on a compact locally reducible Riemannian spin manifold with positive scalar curvature. We determine also the universal covers of the manifolds on which the smallest possible eigenvalue is attained.
△ Less
Submitted 25 February, 2005;
originally announced February 2005.
-
Erratum: Light-induced desorption of alkali-metal atoms from paraffin coating [Phys. Rev. A 66, 042903 (2002)]
Authors:
E. B. Alexandrov,
M. V. Balabas,
D. Budker,
D. English,
D. F. Kimball,
C. -H. Li,
V. V. Yashchuk
Abstract:
Erratum for paper "Light-induced desorption of alkali-metal atoms from paraffin coating" [Phys. Rev. A 66, 042903 (2002).
Erratum for paper "Light-induced desorption of alkali-metal atoms from paraffin coating" [Phys. Rev. A 66, 042903 (2002).
△ Less
Submitted 29 June, 2004;
originally announced June 2004.
-
Dynamic effects in nonlinear magneto-optics of atoms and molecules
Authors:
E. B. Alexandrov,
M. Auzinsh,
D. Budker,
D. F. Kimball,
S. M. Rochester,
V. V. Yashchuk
Abstract:
A brief review is given of topics relating to dynamical processes arising in nonlinear interactions between light and resonant systems (atoms or molecules) in the presence of a magnetic field.
A brief review is given of topics relating to dynamical processes arising in nonlinear interactions between light and resonant systems (atoms or molecules) in the presence of a magnetic field.
△ Less
Submitted 10 May, 2004;
originally announced May 2004.