-
Linear to multi-linear algebra and systems using tensors
Authors:
Divyanshu Pandey,
Adithya Venugopal,
Harry Leib
Abstract:
In past few decades, tensor algebra also known as multi-linear algebra has been developed and customized as a tool to be used for various engineering applications. In particular, with the help of a special form of tensor contracted product, known as the Einstein Product and its properties, many of the known concepts from Linear Algebra could be extended to a multi-linear setting. This enables to d…
▽ More
In past few decades, tensor algebra also known as multi-linear algebra has been developed and customized as a tool to be used for various engineering applications. In particular, with the help of a special form of tensor contracted product, known as the Einstein Product and its properties, many of the known concepts from Linear Algebra could be extended to a multi-linear setting. This enables to define the notions of multi-linear system theory where the input, output signals and the system are multi-domain in nature. This paper provides an overview of tensor algebra tools which can be seen as an extension of linear algebra, at the same time highlighting the difference and advantages that the multi-linear setting brings forth. In particular, the notion of tensor inversion, tensor singular value and tensor Eigenvalue decomposition using the Einstein product is explained. In addition, this paper also introduces the notion of contracted convolution in both discrete and continuous multi-linear system tensors. Tensor Networks representation of various tensor operations is also presented. Also, application of tensor tools in develo** transceiver schemes for multi-domain communication systems, with an example of MIMO CDMA systems, is presented. Thus this paper acts as an entry point tutorial for graduate students whose research involves multi-domain or multi-modal signals and systems.
△ Less
Submitted 29 December, 2023; v1 submitted 20 April, 2023;
originally announced April 2023.
-
EdnaML: A Declarative API and Framework for Reproducible Deep Learning
Authors:
Abhijit Suprem,
Sanjyot Vaidya,
Avinash Venugopal,
Joao Eduardo Ferreira,
Calton Pu
Abstract:
Machine Learning has become the bedrock of recent advances in text, image, video, and audio processing and generation. Most production systems deal with several models during deployment and training, each with a variety of tuned hyperparameters. Furthermore, data collection and processing aspects of ML pipelines are receiving increasing interest due to their importance in creating sustainable high…
▽ More
Machine Learning has become the bedrock of recent advances in text, image, video, and audio processing and generation. Most production systems deal with several models during deployment and training, each with a variety of tuned hyperparameters. Furthermore, data collection and processing aspects of ML pipelines are receiving increasing interest due to their importance in creating sustainable high-quality classifiers. We present EdnaML, a framework with a declarative API for reproducible deep learning. EdnaML provides low-level building blocks that can be composed manually, as well as a high-level pipeline orchestration API to automate data collection, data processing, classifier training, classifier deployment, and model monitoring. Our layered API allows users to manage ML pipelines at high-level component abstractions, while providing flexibility to modify any part of it through the building blocks. We present several examples of ML pipelines with EdnaML, including a large-scale fake news labeling and classification system with six sub-pipelines managed by EdnaML.
△ Less
Submitted 12 November, 2022;
originally announced November 2022.
-
N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses
Authors:
Karthik Ganesan,
Pakhi Bamdev,
Jaivarsan B,
Amresh Venugopal,
Abhinav Tushar
Abstract:
Spoken Language Understanding (SLU) systems parse speech into semantic structures like dialog acts and slots. This involves the use of an Automatic Speech Recognizer (ASR) to transcribe speech into multiple text alternatives (hypotheses). Transcription errors, common in ASRs, impact downstream SLU performance negatively. Approaches to mitigate such errors involve using richer information from the…
▽ More
Spoken Language Understanding (SLU) systems parse speech into semantic structures like dialog acts and slots. This involves the use of an Automatic Speech Recognizer (ASR) to transcribe speech into multiple text alternatives (hypotheses). Transcription errors, common in ASRs, impact downstream SLU performance negatively. Approaches to mitigate such errors involve using richer information from the ASR, either in form of N-best hypotheses or word-lattices. We hypothesize that transformer models learn better with a simpler utterance representation using the concatenation of the N-best ASR alternatives, where each alternative is separated by a special delimiter [SEP]. In our work, we test our hypothesis by using concatenated N-best ASR alternatives as the input to transformer encoder models, namely BERT and XLM-RoBERTa, and achieve performance equivalent to the prior state-of-the-art model on DSTC2 dataset. We also show that our approach significantly outperforms the prior state-of-the-art when subjected to the low data regime. Additionally, this methodology is accessible to users of third-party ASR APIs which do not provide word-lattice information.
△ Less
Submitted 11 June, 2021;
originally announced June 2021.