-
Multivariate Functional Linear Discriminant Analysis for the Classification of Short Time Series with Missing Data
Authors:
Rahul Bordoloi,
Clémence Réda,
Orell Trautmann,
Saptarshi Bej,
Olaf Wolkenhauer
Abstract:
Functional linear discriminant analysis (FLDA) is a powerful tool that extends LDA-mediated multiclass classification and dimension reduction to univariate time-series functions. However, in the age of large multivariate and incomplete data, statistical dependencies between features must be estimated in a computationally tractable way, while also dealing with missing data. There is a need for a co…
▽ More
Functional linear discriminant analysis (FLDA) is a powerful tool that extends LDA-mediated multiclass classification and dimension reduction to univariate time-series functions. However, in the age of large multivariate and incomplete data, statistical dependencies between features must be estimated in a computationally tractable way, while also dealing with missing data. There is a need for a computationally tractable approach that considers the statistical dependencies between features and can handle missing values. We here develop a multivariate version of FLDA (MUDRA) to tackle this issue and describe an efficient expectation/conditional-maximization (ECM) algorithm to infer its parameters. We assess its predictive power on the "Articulary Word Recognition" data set and show its improvement over the state-of-the-art, especially in the case of missing data. MUDRA allows interpretable classification of data sets with large proportions of missing data, which will be particularly useful for medical or psychological data sets.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
High-order aberrations of vortex constellations
Authors:
Rafael Barros,
Subhajit Bej,
Markus Hiekkamäki,
Marco Ornigotti,
Robert Fickler
Abstract:
When reflected from an interface, a laser beam generally drifts and tilts away from the path predicted by ray optics, an intriguing consequence of its finite transverse extent. Such beam shifts manifest more dramatically for structured light fields, and in particular for optical vortices. Upon reflection, a field containing a high-order optical vortex is expected to experience not only geometrical…
▽ More
When reflected from an interface, a laser beam generally drifts and tilts away from the path predicted by ray optics, an intriguing consequence of its finite transverse extent. Such beam shifts manifest more dramatically for structured light fields, and in particular for optical vortices. Upon reflection, a field containing a high-order optical vortex is expected to experience not only geometrical shifts, but an additional splitting of its high-order vortex into a constellation of unit-charge vortices, a phenomenon known as topological aberration. In this article, we report on the first direct observation of the topological aberration effect, measured through the transformation of a vortex constellation upon reflection. We develop a general theoretical framework to study topological aberrations in terms of the elementary symmetric polynomials of the coordinates of a vortex constellation, a mathematical abstraction which we prove to be the physical quantity of interest. Using this approach, we are able to verify experimentally the aberration of constellations of up to three vortices reflected from a thin metallic film. Our work not only deepens the understanding of the reflection of naturally occurring structured light fields such as vortex constellations but also sets forth a potential method for studying the interaction of twisted light fields with matter.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
High-Q guided-mode resonance of a crossed grating with near-flat dispersion
Authors:
Reuben Amedalor,
Petri Karvinen,
Henri Pesonen,
Jari Turunen,
Tapio Niemi,
Subhajit Bej
Abstract:
Guided-mode resonances in diffraction gratings are manifested as peaks (dips) in reflection (transmission) spectra. Smaller resonance line widths (higher Q-factors) ensure stronger light-matter interactions and are beneficial for field-dependent physical processes. However, strong angular and spectral dispersion are inherent to such high-Q resonances. We demonstrate that a class of high-Q resonant…
▽ More
Guided-mode resonances in diffraction gratings are manifested as peaks (dips) in reflection (transmission) spectra. Smaller resonance line widths (higher Q-factors) ensure stronger light-matter interactions and are beneficial for field-dependent physical processes. However, strong angular and spectral dispersion are inherent to such high-Q resonances. We demonstrate that a class of high-Q resonant modes (Q-factor >1000) exhibiting extraordinarily weak dispersion can be excited in crossed gratings simultaneously with the modes with well-known nearly linear dispersion. Furthermore, we show that the polarization of the incoming light can be adjusted to engineer the dispersion of these modes, and strong to near-flat dispersion or vice-versa can be achieved by switching between two mutually orthogonal linear polarization states. We introduce a semi-analytical model to explain the underlying physics behind these observations and perform full-wave numerical simulations and experiments to support our theoretical conjecture. The results presented here will benefit all applications that rely on resonances in free-space-coupled geometries.
△ Less
Submitted 22 December, 2022;
originally announced December 2022.
-
ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets
Authors:
Kristian Schultz,
Saptarshi Bej,
Waldemar Hahn,
Markus Wolfien,
Prashant Srivastava,
Olaf Wolkenhauer
Abstract:
Data is commonly stored in tabular format. Several fields of research are prone to small imbalanced tabular data. Supervised Machine Learning on such data is often difficult due to class imbalance. Synthetic data generation, i.e., oversampling, is a common remedy used to improve classifier performance. State-of-the-art linear interpolation approaches, such as LoRAS and ProWRAS can be used to gener…
▽ More
Data is commonly stored in tabular format. Several fields of research are prone to small imbalanced tabular data. Supervised Machine Learning on such data is often difficult due to class imbalance. Synthetic data generation, i.e., oversampling, is a common remedy used to improve classifier performance. State-of-the-art linear interpolation approaches, such as LoRAS and ProWRAS can be used to generate synthetic samples from the convex space of the minority class to improve classifier performance in such cases. Deep generative networks are common deep learning approaches for synthetic sample generation, widely used for synthetic image generation. However, their scope on synthetic tabular data generation in the context of imbalanced classification is not adequately explored. In this article, we show that existing deep generative models perform poorly compared to linear interpolation based approaches for imbalanced classification problems on smaller tabular datasets. To overcome this, we propose a deep generative model, ConvGeN that combines the idea of convex space learning with deep generative models. ConvGeN learns the coefficients for the convex combinations of the minority class samples, such that the synthetic data is distinct enough from the majority class. Our benchmarking experiments demonstrate that our proposed model ConvGeN improves imbalanced classification on such small datasets, as compared to existing deep generative models, while being at-par with the existing linear interpolation approaches. Moreover, we discuss how our model can be used for synthetic tabular data generation in general, even outside the scope of data imbalance and thus, improves the overall applicability of convex space learning.
△ Less
Submitted 13 July, 2022; v1 submitted 20 June, 2022;
originally announced June 2022.
-
A multi-schematic classifier-independent oversampling approach for imbalanced datasets
Authors:
Saptarshi Bej,
Kristian Schultz,
Prashant Srivastava,
Markus Wolfien,
Olaf Wolkenhauer
Abstract:
Over 85 oversampling algorithms, mostly extensions of the SMOTE algorithm, have been built over the past two decades, to solve the problem of imbalanced datasets. However, it has been evident from previous studies that different oversampling algorithms have different degrees of efficiency with different classifiers. With numerous algorithms available, it is difficult to decide on an oversampling a…
▽ More
Over 85 oversampling algorithms, mostly extensions of the SMOTE algorithm, have been built over the past two decades, to solve the problem of imbalanced datasets. However, it has been evident from previous studies that different oversampling algorithms have different degrees of efficiency with different classifiers. With numerous algorithms available, it is difficult to decide on an oversampling algorithm for a chosen classifier. Here, we overcome this problem with a multi-schematic and classifier-independent oversampling approach: ProWRAS(Proximity Weighted Random Affine Shadowsampling). ProWRAS integrates the Localized Random Affine Shadowsampling (LoRAS)algorithm and the Proximity Weighted Synthetic oversampling (ProWSyn) algorithm. By controlling the variance of the synthetic samples, as well as a proximity-weighted clustering system of the minority classdata, the ProWRAS algorithm improves performance, compared to algorithms that generate synthetic samples through modelling high dimensional convex spaces of the minority class. ProWRAS has four oversampling schemes, each of which has its unique way to model the variance of the generated data. Most importantly, the performance of ProWRAS with proper choice of oversampling schemes, is independent of the classifier used. We have benchmarked our newly developed ProWRAS algorithm against five sate-of-the-art oversampling models and four different classifiers on 20 publicly available datasets. ProWRAS outperforms other oversampling algorithms in a statistically significant way, in terms of both F1-score and Kappa-score. Moreover, we have introduced a novel measure for classifier independence I-score, and showed quantitatively that ProWRAS performs better, independent of the classifier used. In practice, ProWRAS customizes synthetic sample generation according to a classifier of choice and thereby reduces benchmarking efforts.
△ Less
Submitted 15 July, 2021;
originally announced July 2021.
-
Hamiltonian cycles in annular decomposable Barnette graphs
Authors:
Saptarshi Bej
Abstract:
Barnette's conjecture is an unsolved problem in graph theory. The problem states that every 3-regular (cubic), 3-connected, planar, bipartite (Barnette) graph is Hamiltonian. Partial results have been derived with restrictions on number of vertices, several properties of face-partitions and dual graphs of Barnette graphs while some studies focus just on structural characterizations of Barnette gra…
▽ More
Barnette's conjecture is an unsolved problem in graph theory. The problem states that every 3-regular (cubic), 3-connected, planar, bipartite (Barnette) graph is Hamiltonian. Partial results have been derived with restrictions on number of vertices, several properties of face-partitions and dual graphs of Barnette graphs while some studies focus just on structural characterizations of Barnette graphs. Noting that Spider web graphs are a subclass of Annular Decomposable Barnette (ADB graphs) graphs and are Hamiltonian, we study ADB graphs and their annular-connected subclass (ADB-AC graphs). We show that ADB-AC graphs can be generated from the smallest Barnette graph using recursive edge operations. We derive several conditions assuring the existence of Hamiltonian cycles in ADB-AC graphs without imposing restrictions on number of vertices, face size or any other constraints on the face partitions. We show that there can be two types of annuli in ADB-AC graphs, ring annuli and block annuli. Our main result is, ADB-AC graphs having non singular sequences of ring annuli are Hamiltonian.
△ Less
Submitted 15 August, 2020;
originally announced August 2020.
-
LoRAS: An oversampling approach for imbalanced datasets
Authors:
Saptarshi Bej,
Narek Davtyan,
Markus Wolfien,
Mariam Nassar,
Olaf Wolkenhauer
Abstract:
The Synthetic Minority Oversampling TEchnique (SMOTE) is widely-used for the analysis of imbalanced datasets. It is known that SMOTE frequently over-generalizes the minority class, leading to misclassifications for the majority class, and effecting the overall balance of the model.
In this article, we present an approach that overcomes this limitation of SMOTE, employing Localized Random Affine…
▽ More
The Synthetic Minority Oversampling TEchnique (SMOTE) is widely-used for the analysis of imbalanced datasets. It is known that SMOTE frequently over-generalizes the minority class, leading to misclassifications for the majority class, and effecting the overall balance of the model.
In this article, we present an approach that overcomes this limitation of SMOTE, employing Localized Random Affine Shadowsampling (LoRAS) to oversample from an approximated data manifold of the minority class.
We benchmarked our algorithm with 14 publicly available imbalanced datasets using three different Machine Learning (ML) algorithms and compared the performance of LoRAS, SMOTE and several SMOTE extensions that share the concept of using convex combinations of minority class data points for oversampling with LoRAS. We observed that LoRAS, on average generates better ML models in terms of F1-Score and Balanced accuracy. Another key observation is that while most of the extensions of SMOTE we have tested, improve the F1-Score with respect to SMOTE on an average, they compromise on the Balanced accuracy of a classification model. LoRAS on the contrary, improves both F1 Score and the Balanced accuracy thus produces better classification models.
Moreover, to explain the success of the algorithm, we have constructed a mathematical framework to prove that LoRAS oversampling technique provides a better estimate for the mean of the underlying local data distribution of the minority class data space.
△ Less
Submitted 15 August, 2020; v1 submitted 22 August, 2019;
originally announced August 2019.
-
Coloring Sums of Extensions of Certain Graphs
Authors:
Johan Kok,
Saptarshi Bej
Abstract:
Recall that the minimum number of colors that allow a proper coloring of graph $G$ is called the chromatic number of $G$ and denoted by $χ(G).$ In this paper the concepts of $χ$'-chromatic sum and $χ^+$-chromatic sum are introduced. The extended graph $G^x$ of a graph $G$ was recently introduced for certain regular graphs. We further the concepts of $χ$'-chromatic sum and $χ^+$-chromatic sum to ex…
▽ More
Recall that the minimum number of colors that allow a proper coloring of graph $G$ is called the chromatic number of $G$ and denoted by $χ(G).$ In this paper the concepts of $χ$'-chromatic sum and $χ^+$-chromatic sum are introduced. The extended graph $G^x$ of a graph $G$ was recently introduced for certain regular graphs. We further the concepts of $χ$'-chromatic sum and $χ^+$-chromatic sum to extended paths and cycles. The paper concludes with \emph{patterned structured} graphs.
△ Less
Submitted 1 February, 2016;
originally announced February 2016.
-
On Extension of Regular Graphs
Authors:
Anirban Banerjee,
Saptarshi Bej
Abstract:
In this article, we discuss when one can extend an r-regular graph to an r + 1 regular by adding edges. Different conditions on the num- ber of vertices n and regularity r are developed. We derive an upper bound of r, depending on n, for which, every regular graph G(n, r) can be extended to an r + 1-regular graph with n vertices. Presence of induced complete bipartite subgraph and complete subgrap…
▽ More
In this article, we discuss when one can extend an r-regular graph to an r + 1 regular by adding edges. Different conditions on the num- ber of vertices n and regularity r are developed. We derive an upper bound of r, depending on n, for which, every regular graph G(n, r) can be extended to an r + 1-regular graph with n vertices. Presence of induced complete bipartite subgraph and complete subgraph is dis- cussed, separately, for the extension of regularity.
△ Less
Submitted 17 September, 2015;
originally announced September 2015.
-
On Minimum Order of Odd Regular Graphs Without Perfect Matching
Authors:
Anirban Banerjee,
Saptarshi Bej
Abstract:
In this article we have derived the minimum order of an odd regular graph such that the graph has no matching. We have observed that how it is different from the case of even regular graphs. We have checked the consistency of the derived result with Petersen's theorem.
In this article we have derived the minimum order of an odd regular graph such that the graph has no matching. We have observed that how it is different from the case of even regular graphs. We have checked the consistency of the derived result with Petersen's theorem.
△ Less
Submitted 22 August, 2019; v1 submitted 23 July, 2014;
originally announced July 2014.