-
A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images
Authors:
László Kopácsi,
Áron Fóthi,
András Lőrincz
Abstract:
Recognition of individual components and keypoint detection supported by instance segmentation is crucial to analyze the behavior of agents on the scene. Such systems could be used for surveillance, self-driving cars, and also for medical research, where behavior analysis of laboratory animals is used to confirm the aftereffects of a given medicine. A method capable of solving the aforementioned t…
▽ More
Recognition of individual components and keypoint detection supported by instance segmentation is crucial to analyze the behavior of agents on the scene. Such systems could be used for surveillance, self-driving cars, and also for medical research, where behavior analysis of laboratory animals is used to confirm the aftereffects of a given medicine. A method capable of solving the aforementioned tasks usually requires a large amount of high-quality hand-annotated data, which takes time and money to produce. In this paper, we propose a method that alleviates the need for manual labeling of laboratory rats. To do so, first, we generate initial annotations with a computer vision-based approach, then through extensive augmentation, we train a deep neural network on the generated data. The final system is capable of instance segmentation, keypoint detection, and body part segmentation even when the objects are heavily occluded.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Enhancing Apparent Personality Trait Analysis with Cross-Modal Embeddings
Authors:
Ádám Fodor,
Rachid R. Saboundji,
András Lőrincz
Abstract:
Automatic personality trait assessment is essential for high-quality human-machine interactions. Systems capable of human behavior analysis could be used for self-driving cars, medical research, and surveillance, among many others. We present a multimodal deep neural network with a Siamese extension for apparent personality trait prediction trained on short video recordings and exploiting modality…
▽ More
Automatic personality trait assessment is essential for high-quality human-machine interactions. Systems capable of human behavior analysis could be used for self-driving cars, medical research, and surveillance, among many others. We present a multimodal deep neural network with a Siamese extension for apparent personality trait prediction trained on short video recordings and exploiting modality invariant embeddings. Acoustic, visual, and textual information are utilized to reach high-performance solutions in this task. Due to the highly centralized target distribution of the analyzed dataset, the changes in the third digit are relevant. Our proposed method addresses the challenge of under-represented extreme values, achieves 0.0033 MAE average improvement, and shows a clear advantage over the baseline multimodal DNN without the introduced module.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Singularities of orthogonal and symplectic determinantal varieties
Authors:
András Cristian Lőrincz
Abstract:
Let either $GL(E)\times SO(F)$ or $GL(E)\times Sp(F)$ act naturally on the space of matrices $E\otimes F$. There are only finitely many orbits, and the orbit closures are orthogonal and symplectic generalizations of determinantal varieties, which can be described similarly using rank conditions. In this paper, we study the singularities of these varieties and describe their defining equations. We…
▽ More
Let either $GL(E)\times SO(F)$ or $GL(E)\times Sp(F)$ act naturally on the space of matrices $E\otimes F$. There are only finitely many orbits, and the orbit closures are orthogonal and symplectic generalizations of determinantal varieties, which can be described similarly using rank conditions. In this paper, we study the singularities of these varieties and describe their defining equations. We prove that in the symplectic case, the orbit closures are normal with good filtrations, and in characteristic $0$ have rational singularities. In the orthogonal case we show that most orbit closures will have the same properties, and determine precisely the exceptions to this.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Equivariant D-modules on 2x2xn hypermatrices
Authors:
András C. Lőrincz,
Michael Perlman
Abstract:
We study D-modules and related invariants on the space of 2 x 2 x n hypermatrices for n >= 3, which has finitely many orbits under the action of G = GL_2 x GL_2 x GL_n. We describe the category of coherent G-equivariant D-modules as the category of representations of a quiver with relations. We classify the simple equivariant D-modules, determine their characteristic cycles and find special repres…
▽ More
We study D-modules and related invariants on the space of 2 x 2 x n hypermatrices for n >= 3, which has finitely many orbits under the action of G = GL_2 x GL_2 x GL_n. We describe the category of coherent G-equivariant D-modules as the category of representations of a quiver with relations. We classify the simple equivariant D-modules, determine their characteristic cycles and find special representations that appear in their G-structures. We determine the explicit D-module structure of the local cohomology groups with supports given by orbit closures. As a consequence, we calculate the Lyubeznik numbers and intersection cohomology groups of the orbit closures. All but one of the orbit closures have rational singularities: we use local cohomology to prove that the one exception is neither normal nor Cohen--Macaulay. While our results display special behavior in the cases n=3 and n=4, they are completely uniform for n >= 5.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Perceived personality state estimation in dyadic and small group interaction with deep learning methods
Authors:
Kristian Fenech,
Ádám Fodor,
Sean P. Bergeron,
Rachid R. Saboundji,
Catharine Oertel,
András Lőrincz
Abstract:
Dyadic and small group collaboration is an evolutionary advantageous behaviour and the need for such collaboration is a regular occurrence in day to day life. In this paper we estimate the perceived personality traits of individuals in dyadic and small groups over thin-slices of interaction on four multimodal datasets. We find that our transformer based predictive model performs similarly to human…
▽ More
Dyadic and small group collaboration is an evolutionary advantageous behaviour and the need for such collaboration is a regular occurrence in day to day life. In this paper we estimate the perceived personality traits of individuals in dyadic and small groups over thin-slices of interaction on four multimodal datasets. We find that our transformer based predictive model performs similarly to human annotators tasked with predicting the perceived big-five personality traits of participants. Using this model we analyse the estimated perceived personality traits of individuals performing tasks in small groups and dyads. Permutation analysis shows that in the case of small groups undergoing collaborative tasks, the perceived personality of group members clusters, this is also observed for dyads in a collaborative problem solving task, but not in dyads under non-collaborative task settings. Additionally, we find that the group level average perceived personality traits provide a better predictor of group performance than the group level average self-reported personality traits.
△ Less
Submitted 9 November, 2022;
originally announced November 2022.
-
Structural Extensions of Basis Pursuit: Guarantees on Adversarial Robustness
Authors:
Dávid Szeghy,
Mahmoud Aslan,
Áron Fóthi,
Balázs Mészáros,
Zoltán Ádám Milacski,
András Lőrincz
Abstract:
While deep neural networks are sensitive to adversarial noise, sparse coding using the Basis Pursuit (BP) method is robust against such attacks, including its multi-layer extensions. We prove that the stability theorem of BP holds upon the following generalizations: (i) the regularization procedure can be separated into disjoint groups with different weights, (ii) neurons or full layers may form g…
▽ More
While deep neural networks are sensitive to adversarial noise, sparse coding using the Basis Pursuit (BP) method is robust against such attacks, including its multi-layer extensions. We prove that the stability theorem of BP holds upon the following generalizations: (i) the regularization procedure can be separated into disjoint groups with different weights, (ii) neurons or full layers may form groups, and (iii) the regularizer takes various generalized forms of the $\ell_1$ norm. This result provides the proof for the architectural generalizations of Cazenavette et al. (2021), including (iv) an approximation of the complete architecture as a shallow sparse coding network. Due to this approximation, we settled to experimenting with shallow networks and studied their robustness against the Iterative Fast Gradient Sign Method on a synthetic dataset and MNIST. We introduce classification based on the $\ell_2$ norms of the groups and show numerically that it can be accurate and offers considerable speedups. In this family, linear transformer shows the best performance. Based on the theoretical results and the numerical simulations, we highlight numerical matters that may improve performance further.
△ Less
Submitted 5 May, 2022;
originally announced May 2022.
-
Borel-Moore homology of determinantal varieties
Authors:
András C. Lőrincz,
Claudiu Raicu
Abstract:
We compute the rational Borel-Moore homology groups for affine determinantal varieties in the spaces of general, symmetric, and skew-symmetric matrices, solving a problem suggested by the work of Pragacz and Ratajski. The main ingredient is the relation with Hartshorne's algebraic de Rham homology theory, and the calculation of the singular cohomology of matrix orbits, using the methods of Cartan…
▽ More
We compute the rational Borel-Moore homology groups for affine determinantal varieties in the spaces of general, symmetric, and skew-symmetric matrices, solving a problem suggested by the work of Pragacz and Ratajski. The main ingredient is the relation with Hartshorne's algebraic de Rham homology theory, and the calculation of the singular cohomology of matrix orbits, using the methods of Cartan and Borel. We also establish the degeneration of the Čech-de Rham spectral sequence for determinantal varieties, and compute explicitly the dimensions of de Rham cohomology groups of local cohomology with determinantal support, which are analogues of Lyubeznik numbers first introduced by Switala. Additionally, in the case of general matrices we further determine the Hodge numbers of the singular cohomology of matrix orbits and of the Borel-Moore homology of their closures, based on Saito's theory of mixed Hodge modules.
△ Less
Submitted 6 November, 2021; v1 submitted 15 October, 2021;
originally announced October 2021.
-
Local Euler obstructions for determinantal varieties
Authors:
András C. Lőrincz,
Claudiu Raicu
Abstract:
The goal of this note is to explain a derivation of the formulas for the local Euler obstructions of determinantal varieties of general, symmetric and skew-symmetric matrices, by studying the invariant de Rham complex and using character formulas for simple equivariant $D$-modules. These calculations are then combined with standard arguments involving Kashiwara's local index formula and the descri…
▽ More
The goal of this note is to explain a derivation of the formulas for the local Euler obstructions of determinantal varieties of general, symmetric and skew-symmetric matrices, by studying the invariant de Rham complex and using character formulas for simple equivariant $D$-modules. These calculations are then combined with standard arguments involving Kashiwara's local index formula and the description of characteristic cycles of simple equivariant $D$-modules. The formulas are implicit in the work of Boe and Fu, and in the case of general matrices they have also been obtained recently by Gaffney--Grulha--Ruas, for skew-symmetric matrices by Promtapan and Rimányi, and for all cases by Zhang.
△ Less
Submitted 1 September, 2021; v1 submitted 1 May, 2021;
originally announced May 2021.
-
Fast Interactive Video Object Segmentation with Graph Neural Networks
Authors:
Viktor Varga,
András Lőrincz
Abstract:
Pixelwise annotation of image sequences can be very tedious for humans. Interactive video object segmentation aims to utilize automatic methods to speed up the process and reduce the workload of the annotators. Most contemporary approaches rely on deep convolutional networks to collect and process information from human annotations throughout the video. However, such networks contain millions of p…
▽ More
Pixelwise annotation of image sequences can be very tedious for humans. Interactive video object segmentation aims to utilize automatic methods to speed up the process and reduce the workload of the annotators. Most contemporary approaches rely on deep convolutional networks to collect and process information from human annotations throughout the video. However, such networks contain millions of parameters and need huge amounts of labeled training data to avoid overfitting. Beyond that, label propagation is usually executed as a series of frame-by-frame inference steps, which is difficult to be parallelized and is thus time consuming. In this paper we present a graph neural network based approach for tackling the problem of interactive video object segmentation. Our network operates on superpixel-graphs which allow us to reduce the dimensionality of the problem by several magnitudes. We show, that our network possessing only a few thousand parameters is able to achieve state-of-the-art performance, while inference remains fast and can be trained quickly with very little data.
△ Less
Submitted 21 April, 2021; v1 submitted 5 March, 2021;
originally announced March 2021.
-
Minimizing false negative rate in melanoma detection and providing insight into the causes of classification
Authors:
Ellák Somfai,
Benjámin Baffy,
Kristian Fenech,
Changlu Guo,
Rita Hosszú,
Dorina Korózs,
Fabrizio Nunnari,
Marcell Pólik,
Daniel Sonntag,
Attila Ulbert,
András Lőrincz
Abstract:
Our goal is to bridge human and machine intelligence in melanoma detection. We develop a classification system exploiting a combination of visual pre-processing, deep learning, and ensembling for providing explanations to experts and to minimize false negative rate while maintaining high accuracy in melanoma detection. Source images are first automatically segmented using a U-net CNN. The result o…
▽ More
Our goal is to bridge human and machine intelligence in melanoma detection. We develop a classification system exploiting a combination of visual pre-processing, deep learning, and ensembling for providing explanations to experts and to minimize false negative rate while maintaining high accuracy in melanoma detection. Source images are first automatically segmented using a U-net CNN. The result of the segmentation is then used to extract image sub-areas and specific parameters relevant in human evaluation, namely center, border, and asymmetry measures. These data are then processed by tailored neural networks which include structure searching algorithms. Partial results are then ensembled by a committee machine. Our evaluation on the largest skin lesion dataset which is publicly available today, ISIC-2019, shows improvement in all evaluated metrics over a baseline using the original images only. We also showed that indicative scores computed by the feature classifiers can provide useful insight into the various features on which the decision can be based.
△ Less
Submitted 9 March, 2021; v1 submitted 18 February, 2021;
originally announced February 2021.
-
Holonomic functions and prehomogeneous spaces
Authors:
András Cristian Lőrincz
Abstract:
A function that is analytic on a domain of $\mathbb{C}^n$ is holonomic if it is the solution to a holonomic system of linear homogeneous differential equations with polynomial coefficients. We define and study the Bernstein-Sato polynomial of a holonomic function on a smooth algebraic variety. We analyze the structure of certain sheaves of holonomic functions, such as the algebraic functions along…
▽ More
A function that is analytic on a domain of $\mathbb{C}^n$ is holonomic if it is the solution to a holonomic system of linear homogeneous differential equations with polynomial coefficients. We define and study the Bernstein-Sato polynomial of a holonomic function on a smooth algebraic variety. We analyze the structure of certain sheaves of holonomic functions, such as the algebraic functions along a hypersurface, determining their direct sum decompositions into indecomposables, that further respect decompositions of Bernstein-Sato polynomials. When the space is endowed with the action of a linear algebraic group $G$, we study the class of $G$-finite analytic functions, i.e. functions that under the action of the Lie algebra of $G$ generate a finite dimensional rational $G$-module. These are automatically algebraic functions on a variety with a dense orbit. When $G$ is reductive, we give several representation-theoretic techniques toward the determination of Bernstein-Sato polynomials of $G$-finite functions. We classify the $G$-finite functions on all but one of the irreducible reduced prehomogeneous vector spaces, and compute the Bernstein-Sato polynomials for distinguished $G$-finite functions. The results can be used to construct explicitly equivariant $\mathcal{D}$-modules.
△ Less
Submitted 1 February, 2021;
originally announced February 2021.
-
Temporal Smoothing for 3D Human Pose Estimation and Localization for Occluded People
Authors:
Marton Veges,
Andras Lorincz
Abstract:
In multi-person pose estimation actors can be heavily occluded, even become fully invisible behind another person. While temporal methods can still predict a reasonable estimation for a temporarily disappeared pose using past and future frames, they exhibit large errors nevertheless. We present an energy minimization approach to generate smooth, valid trajectories in time, bridging gaps in visibil…
▽ More
In multi-person pose estimation actors can be heavily occluded, even become fully invisible behind another person. While temporal methods can still predict a reasonable estimation for a temporarily disappeared pose using past and future frames, they exhibit large errors nevertheless. We present an energy minimization approach to generate smooth, valid trajectories in time, bridging gaps in visibility. We show that it is better than other interpolation based approaches and achieves state of the art results. In addition, we present the synthetic MuCo-Temp dataset, a temporal extension of the MuCo-3DHP dataset. Our code is made publicly available.
△ Less
Submitted 31 October, 2020;
originally announced November 2020.
-
On the collapsing of homogeneous bundles in arbitrary characteristic
Authors:
András Cristian Lőrincz
Abstract:
We study the geometry of equivariant, proper maps from homogeneous bundles $G\times_P V$ over flag varieties $G/P$ to representations of $G$, called collapsing maps. Kempf showed that, provided the bundle is completely reducible, the image $G\cdot V$ of a collapsing map has rational singularities in characteristic zero. We extend this result to positive characteristic and show that for the analogo…
▽ More
We study the geometry of equivariant, proper maps from homogeneous bundles $G\times_P V$ over flag varieties $G/P$ to representations of $G$, called collapsing maps. Kempf showed that, provided the bundle is completely reducible, the image $G\cdot V$ of a collapsing map has rational singularities in characteristic zero. We extend this result to positive characteristic and show that for the analogous bundles the saturation $G\cdot V$ is strongly $F$-regular if its coordinate ring has a good filtration. We further show that in this case the images of collapsing maps of homogeneous bundles restricted to Schubert varieties are $F$-rational in positive characteristic, and have rational singularities in characteristic zero. We provide results on the singularities and defining equations of saturations $G\cdot X$ for $P$-stable closed subvarieties $X\subset V$. We give criteria for the existence of good filtrations for the coordinate ring of $G\cdot X$. Our results give a uniform, characteristic-free approach for the study of the geometry of a number of important varieties: multicones over Schubert varieties, determinantal varieties in the space of matrices, symmetric matrices, skew-symmetric matrices, and certain matrix Schubert varieties therein, representation varieties of radical square zero algebras (e.g. varieties of complexes), subspace varieties, higher rank varieties, etc.
△ Less
Submitted 5 October, 2021; v1 submitted 19 August, 2020;
originally announced August 2020.
-
Multi-Person Absolute 3D Human Pose Estimation with Weak Depth Supervision
Authors:
Marton Veges,
Andras Lorincz
Abstract:
In 3D human pose estimation one of the biggest problems is the lack of large, diverse datasets. This is especially true for multi-person 3D pose estimation, where, to our knowledge, there are only machine generated annotations available for training. To mitigate this issue, we introduce a network that can be trained with additional RGB-D images in a weakly supervised fashion. Due to the existence…
▽ More
In 3D human pose estimation one of the biggest problems is the lack of large, diverse datasets. This is especially true for multi-person 3D pose estimation, where, to our knowledge, there are only machine generated annotations available for training. To mitigate this issue, we introduce a network that can be trained with additional RGB-D images in a weakly supervised fashion. Due to the existence of cheap sensors, videos with depth maps are widely available, and our method can exploit a large, unannotated dataset. Our algorithm is a monocular, multi-person, absolute pose estimator. We evaluate the algorithm on several benchmarks, showing a consistent improvement in error rates. Also, our model achieves state-of-the-art results on the MuPoTS-3D dataset by a considerable margin.
△ Less
Submitted 8 April, 2020;
originally announced April 2020.
-
Minimal free resolutions of ideals of minors associated to pairs of matrices
Authors:
András Cristian Lőrincz
Abstract:
Consider the affine space consisting of pairs of matrices $(A,B)$ of fixed size, and its closed subvariety given by the rank conditions $\operatorname{rank} A \leq a$, $\operatorname{rank} B \leq b$ and $\operatorname{rank} (A\cdot B) \leq c$, for three non-negative integers $a,b,c$. These varieties are precisely the orbit closures of representations for the equioriented $A_3$ quiver. In this pape…
▽ More
Consider the affine space consisting of pairs of matrices $(A,B)$ of fixed size, and its closed subvariety given by the rank conditions $\operatorname{rank} A \leq a$, $\operatorname{rank} B \leq b$ and $\operatorname{rank} (A\cdot B) \leq c$, for three non-negative integers $a,b,c$. These varieties are precisely the orbit closures of representations for the equioriented $A_3$ quiver. In this paper we construct the (equivariant) minimal free resolutions of the defining ideals of such varieties. We show how this problem is equivalent to determining the cohomology groups of the tensor product of two Schur functors of tautological bundles on a 2-step flag variety. We provide several techniques for the determination of these groups, which is of independent interest.
△ Less
Submitted 3 August, 2020; v1 submitted 23 January, 2020;
originally announced January 2020.
-
Algebraic Analysis of Rotation Data
Authors:
Michael F. Adamer,
András C. Lőrincz,
Anna-Laura Sattelberger,
Bernd Sturmfels
Abstract:
We develop algebraic tools for statistical inference from samples of rotation matrices. This rests on the theory of D-modules in algebraic analysis. Noncommutative Gröbner bases are used to design numerical algorithms for maximum likelihood estimation, building on the holonomic gradient method of Sei, Shibata, Takemura, Ohara, and Takayama. We study the Fisher model for sampling from rotation matr…
▽ More
We develop algebraic tools for statistical inference from samples of rotation matrices. This rests on the theory of D-modules in algebraic analysis. Noncommutative Gröbner bases are used to design numerical algorithms for maximum likelihood estimation, building on the holonomic gradient method of Sei, Shibata, Takemura, Ohara, and Takayama. We study the Fisher model for sampling from rotation matrices, and we apply our algorithms for data from the applied sciences. On the theoretical side, we generalize the underlying equivariant D-modules from SO(3) to arbitrary Lie groups. For compact groups, our D-ideals encode the normalizing constant of the Fisher model.
△ Less
Submitted 1 December, 2019;
originally announced December 2019.
-
Local cohomology on a subexceptional series of representations
Authors:
András C. Lőrincz,
Jerzy Weyman
Abstract:
We consider a series of four subexceptional representations coming from the third line of the Freudenthal-Tits magic square; using Bourbaki notation, these are fundamental representations $(G',X)$ corresponding to $(C_3, ω_3),\, (A_5, ω_3), \, (D_6, ω_5)$ and $(E_7, ω_6)$. In each of these four cases, the group $G=G'\times \mathbb{C}^*$ acts on $X$ with five orbits, and many invariants display a u…
▽ More
We consider a series of four subexceptional representations coming from the third line of the Freudenthal-Tits magic square; using Bourbaki notation, these are fundamental representations $(G',X)$ corresponding to $(C_3, ω_3),\, (A_5, ω_3), \, (D_6, ω_5)$ and $(E_7, ω_6)$. In each of these four cases, the group $G=G'\times \mathbb{C}^*$ acts on $X$ with five orbits, and many invariants display a uniform behavior, e.g. dimension of orbits, their defining ideals and the character of their coordinate rings as $G$-modules. In this paper, we determine some more subtle invariants and analyze their uniformity within the series. We describe the category of $G$-equivariant coherent $\mathcal{D}_X$-modules as the category of representations of a quiver with relations. We construct explicitly the simple $G$-equivariant $\mathcal{D}_X$-modules and compute the characters of their underlying $G$-structures. We determine the local cohomology groups with supports given by orbit closures, determining their precise $\mathcal{D}_X$-module structure. As a consequence, we calculate the intersection cohomology groups and Lyubeznik numbers of the orbit closures. While our results for the cases $(A_5, ω_3), \, (D_6, ω_5)$ and $(E_7, ω_6)$ are still completely uniform, the case $(C_3, ω_3)$ displays a surprisingly different behavior. We give two explanations for this phenomenon: one topological, as the middle orbit of $(C_3, ω_3)$ is not simply-connected; one geometric, as the closure of the orbit is not Gorenstein.
△ Less
Submitted 23 June, 2021; v1 submitted 29 October, 2019;
originally announced October 2019.
-
Absolute Human Pose Estimation with Depth Prediction Network
Authors:
Márton Véges,
András Lőrincz
Abstract:
The common approach to 3D human pose estimation is predicting the body joint coordinates relative to the hip. This works well for a single person but is insufficient in the case of multiple interacting people. Methods predicting absolute coordinates first estimate a root-relative pose then calculate the translation via a secondary optimization task. We propose a neural network that predicts joints…
▽ More
The common approach to 3D human pose estimation is predicting the body joint coordinates relative to the hip. This works well for a single person but is insufficient in the case of multiple interacting people. Methods predicting absolute coordinates first estimate a root-relative pose then calculate the translation via a secondary optimization task. We propose a neural network that predicts joints in a camera centered coordinate system instead of a root-relative one. Unlike previous methods, our network works in a single step without any post-processing. Our network beats previous methods on the MuPoTS-3D dataset and achieves state-of-the-art results.
△ Less
Submitted 11 April, 2019;
originally announced April 2019.
-
Representation varieties of algebras with nodes
Authors:
Ryan Kinser,
András C. Lőrincz
Abstract:
We study the behavior of representation varieties of quivers with relations under the operation of node splitting. We show how splitting a node gives a correspondence between certain closed subvarieties of representation varieties for different algebras, which preserves properties like normality or having rational singularities. Furthermore, we describe how the defining equations of such closed su…
▽ More
We study the behavior of representation varieties of quivers with relations under the operation of node splitting. We show how splitting a node gives a correspondence between certain closed subvarieties of representation varieties for different algebras, which preserves properties like normality or having rational singularities. Furthermore, we describe how the defining equations of such closed subvarieties change under the correspondence. By working in the "relative setting" (splitting one node at a time), we demonstrate that there are many non-hereditary algebras whose irreducible components of representation varieties are all normal with rational singularities. We also obtain explicit generators of the prime defining ideals of these irreducible components. This class contains all radical square zero algebras, but also many others, as illustrated by examples throughout the paper. We also show the above is true when replacing irreducible components by orbit closures, for a more restrictive class of algebras. Lastly, we provide applications to decompositions of moduli spaces of semistable representations of certain algebras.
△ Less
Submitted 15 June, 2021; v1 submitted 25 October, 2018;
originally announced October 2018.
-
Equivariant D-modules on alternating senary 3-tensors
Authors:
András C. Lőrincz,
Michael Perlman
Abstract:
Let X be the third exterior power of a six-dimensional complex vector space, equipped with the natural action of the group GL_6(C) of invertible linear transformations of C^6. We describe explicitly the category of GL_6(C)-equivariant coherent D_X-modules as the category of representations of a quiver with relations, which has finite representation type. We give a construction of the six simple eq…
▽ More
Let X be the third exterior power of a six-dimensional complex vector space, equipped with the natural action of the group GL_6(C) of invertible linear transformations of C^6. We describe explicitly the category of GL_6(C)-equivariant coherent D_X-modules as the category of representations of a quiver with relations, which has finite representation type. We give a construction of the six simple equivariant D_X-modules and give formulas for the characters of their underlying GL_6(C)-structures. We describe the (iterated) local cohomology groups with supports given by orbit closures, determining, in particular, the Lyubeznik numbers associated to the orbit closures.
△ Less
Submitted 25 November, 2019; v1 submitted 24 September, 2018;
originally announced September 2018.
-
3D Human Pose Estimation with Siamese Equivariant Embedding
Authors:
Márton Véges,
Viktor Varga,
András Lőrincz
Abstract:
In monocular 3D human pose estimation a common setup is to first detect 2D positions and then lift the detection into 3D coordinates. Many algorithms suffer from overfitting to camera positions in the training set. We propose a siamese architecture that learns a rotation equivariant hidden representation to reduce the need for data augmentation. Our method is evaluated on multiple databases with d…
▽ More
In monocular 3D human pose estimation a common setup is to first detect 2D positions and then lift the detection into 3D coordinates. Many algorithms suffer from overfitting to camera positions in the training set. We propose a siamese architecture that learns a rotation equivariant hidden representation to reduce the need for data augmentation. Our method is evaluated on multiple databases with different base networks and shows a consistent improvement of error metrics. It achieves state-of-the-art cross-camera error rate among algorithms that use estimated 2D joint coordinates only.
△ Less
Submitted 16 February, 2019; v1 submitted 19 September, 2018;
originally announced September 2018.
-
On categories of equivariant D-modules
Authors:
András C. Lőrincz,
Uli Walther
Abstract:
Let $X$ be a variety with an action by an algebraic group $G$. In this paper we discuss various properties of $G$-equivariant $D$-modules on $X$, such as the decompositions of their global sections as representations of $G$ (when $G$ is reductive), and descriptions of the categories that they form. When $G$ acts on $X$ with finitely many orbits, the category of equivariant $D$-modules is isomorphi…
▽ More
Let $X$ be a variety with an action by an algebraic group $G$. In this paper we discuss various properties of $G$-equivariant $D$-modules on $X$, such as the decompositions of their global sections as representations of $G$ (when $G$ is reductive), and descriptions of the categories that they form. When $G$ acts on $X$ with finitely many orbits, the category of equivariant $D$-modules is isomorphic to the category of finite-dimensional representations of a finite quiver with relations. We describe explicitly these categories for irreducible $G$-modules $X$ that are spherical varieties, and show that in such cases the quivers are almost always representation-finite (i.e. with finitely many indecomposable representations).
△ Less
Submitted 10 April, 2019; v1 submitted 6 June, 2018;
originally announced June 2018.
-
Iterated local cohomology groups and Lyubeznik numbers for determinantal rings
Authors:
András C. Lőrincz,
Claudiu Raicu
Abstract:
We give an explicit recipe for determining iterated local cohomology groups with support in ideals of minors of a generic matrix in characteristic zero, expressing them as direct sums of indecomposable D-modules. For non-square matrices these indecomposables are simple, but this is no longer true for square matrices where the relevant indecomposables arise from the pole order filtration associated…
▽ More
We give an explicit recipe for determining iterated local cohomology groups with support in ideals of minors of a generic matrix in characteristic zero, expressing them as direct sums of indecomposable D-modules. For non-square matrices these indecomposables are simple, but this is no longer true for square matrices where the relevant indecomposables arise from the pole order filtration associated with the determinant hypersurface. Specializing our results to a single iteration, we determine the Lyubeznik numbers for all generic determinantal rings, thus answering a question of Hochster.
△ Less
Submitted 22 May, 2018;
originally announced May 2018.
-
Decompositions of Bernstein-Sato polynomials and slices
Authors:
András Cristian Lőrincz
Abstract:
Let $G$ be a linearly reductive group acting on a vector space $V$, and $f$ a (semi-)invariant polynomial on $V$. In this paper we study systematically decompositions of the Bernstein-Sato polynomial of $f$ in parallel with some representation-theoretic properties of the action of $G$ on $V$. We provide a technique based on a multiplicity one property, that we use to compute the Bernstein-Sato pol…
▽ More
Let $G$ be a linearly reductive group acting on a vector space $V$, and $f$ a (semi-)invariant polynomial on $V$. In this paper we study systematically decompositions of the Bernstein-Sato polynomial of $f$ in parallel with some representation-theoretic properties of the action of $G$ on $V$. We provide a technique based on a multiplicity one property, that we use to compute the Bernstein-Sato polynomials of several classical invariants in an elementary fashion. Furthermore, we derive a "slice method" which shows that the decomposition of $V$ as a representation of $G$ can induce a decomposition of the Bernstein-Sato polynomial of $f$ into a product of two Bernstein-Sato polynomials - that of an ideal and that of a semi-invariant of smaller degree. Using the slice method, we compute Bernstein-Sato polynomials for a large class of semi-invariants of quivers.
△ Less
Submitted 21 February, 2018;
originally announced February 2018.
-
Free resolutions of orbit closures of Dynkin quivers
Authors:
András C. Lőrincz,
Jerzy Weyman
Abstract:
We use the Kempf-Lascoux-Weyman geometric technique in order to determine the minimal free resolutions of some orbit closures of quivers. As a consequence, we obtain that for Dynkin quivers orbit closures of 1-step representations are normal with rational singularities. For Dynkin quivers of type $A$, we describe explicit minimal generators of the defining ideals of orbit closures of 1-step repres…
▽ More
We use the Kempf-Lascoux-Weyman geometric technique in order to determine the minimal free resolutions of some orbit closures of quivers. As a consequence, we obtain that for Dynkin quivers orbit closures of 1-step representations are normal with rational singularities. For Dynkin quivers of type $A$, we describe explicit minimal generators of the defining ideals of orbit closures of 1-step representations. Using this, we provide an algorithm for type $A$ quivers for describing an efficient set of generators of the defining ideal of the orbit closure of any representation.
△ Less
Submitted 30 December, 2017;
originally announced January 2018.
-
Equivariant D-modules on binary cubic forms
Authors:
András C. Lőrincz,
Claudiu Raicu,
Jerzy Weyman
Abstract:
We consider the space X = Sym^3(C^2) of binary cubic forms, equipped with the natural action of the group GL_2 of invertible linear transformations of C^2. We describe explicitly the category of GL_2-equivariant coherent D_X-modules as the category of representations of a quiver with relations. We show moreover that this quiver is of tame representation type and we classify its indecomposable repr…
▽ More
We consider the space X = Sym^3(C^2) of binary cubic forms, equipped with the natural action of the group GL_2 of invertible linear transformations of C^2. We describe explicitly the category of GL_2-equivariant coherent D_X-modules as the category of representations of a quiver with relations. We show moreover that this quiver is of tame representation type and we classify its indecomposable representations. We also give a construction of the simple equivariant D_X-modules (of which there are 14), and give formulas for the characters of their underlying GL_2-representations. We conclude the article with an explicit calculation of (iterated) local cohomology groups with supports given by orbit closures.
△ Less
Submitted 28 December, 2017;
originally announced December 2017.
-
Fine-tuning deep CNN models on specific MS COCO categories
Authors:
Daniel Sonntag,
Michael Barz,
Jan Zacharias,
Sven Stauden,
Vahid Rahmani,
Áron Fóthi,
András Lőrincz
Abstract:
Fine-tuning of a deep convolutional neural network (CNN) is often desired. This paper provides an overview of our publicly available py-faster-rcnn-ft software library that can be used to fine-tune the VGG_CNN_M_1024 model on custom subsets of the Microsoft Common Objects in Context (MS COCO) dataset. For example, we improved the procedure so that the user does not have to look for suitable image…
▽ More
Fine-tuning of a deep convolutional neural network (CNN) is often desired. This paper provides an overview of our publicly available py-faster-rcnn-ft software library that can be used to fine-tune the VGG_CNN_M_1024 model on custom subsets of the Microsoft Common Objects in Context (MS COCO) dataset. For example, we improved the procedure so that the user does not have to look for suitable image files in the dataset by hand which can then be used in the demo program. Our implementation randomly selects images that contain at least one object of the categories on which the model is fine-tuned.
△ Less
Submitted 5 September, 2017;
originally announced September 2017.
-
Cognitive Deep Machine Can Train Itself
Authors:
András Lőrincz,
Máté Csákvári,
Áron Fóthi,
Zoltán Ádám Milacski,
András Sárkány,
Zoltán Tősér
Abstract:
Machine learning is making substantial progress in diverse applications. The success is mostly due to advances in deep learning. However, deep learning can make mistakes and its generalization abilities to new tasks are questionable. We ask when and how one can combine network outputs, when (i) details of the observations are evaluated by learned deep components and (ii) facts and confirmation rul…
▽ More
Machine learning is making substantial progress in diverse applications. The success is mostly due to advances in deep learning. However, deep learning can make mistakes and its generalization abilities to new tasks are questionable. We ask when and how one can combine network outputs, when (i) details of the observations are evaluated by learned deep components and (ii) facts and confirmation rules are available in knowledge based systems. We show that in limited contexts the required number of training samples can be low and self-improvement of pre-trained networks in more general context is possible. We argue that the combination of sparse outlier detection with deep components that can support each other diminish the fragility of deep methods, an important requirement for engineering applications. We argue that supervised learning of labels may be fully eliminated under certain conditions: a component based architecture together with a knowledge based system can train itself and provide high quality answers. We demonstrate these concepts on the State Farm Distracted Driver Detection benchmark. We argue that the view of the Study Panel (2016) may overestimate the requirements on `years of focused research' and `careful, unique construction' for `AI systems'.
△ Less
Submitted 2 December, 2016;
originally announced December 2016.
-
Slip avalanches in metallic glasses and granular matter reveal universal dynamics
Authors:
D. V. Denisov,
K. A. Lorincz,
W. J. Wright,
T. C. Hufnagel,
A. Nawano,
X. J. Gu,
J. T. Uhl,
K. A. Dahmen,
P. Schall
Abstract:
Universality in materials deformation is of intense interest: universal scaling relations if exist would bridge the gap from microscopic deformation to macroscopic response in a single material-independent fashion. While recent agreement of the force statistics of deformed nanopillars, bulk metallic glasses, and granular materials with mean-field predictions supports the idea of universal scaling…
▽ More
Universality in materials deformation is of intense interest: universal scaling relations if exist would bridge the gap from microscopic deformation to macroscopic response in a single material-independent fashion. While recent agreement of the force statistics of deformed nanopillars, bulk metallic glasses, and granular materials with mean-field predictions supports the idea of universal scaling relations, here for the first time we demonstrate that the universality extends beyond the statistics, and applies to the slip dynamics as well. By rigorous comparison of two very different systems, bulk metallic glasses and granular materials in terms of both the statistics and dynamics of force fluctuations, we clearly establish a material-independent universal regime of deformation. We experimentally verify the predicted universal scaling function for the time evolution of individual avalanches, and show that both the slip statistics and dynamics are universal, i.e. independent of the scale and details of the material structure and interactions. These results are important for transferring experimental results across scales and material structures in a single theory of deformation.
△ Less
Submitted 19 May, 2016;
originally announced May 2016.
-
Bernstein-Sato polynomials for maximal minors and sub-maximal Pfaffians
Authors:
András C. Lőrincz,
Claudiu Raicu,
Uli Walther,
Jerzy Weyman
Abstract:
We determine the Bernstein-Sato polynomials for the ideal of maximal minors of a generic m x n matrix, as well as for that of sub-maximal Pfaffians of a generic skew-symmetric matrix of odd size. As a corollary, we obtain that the Strong Monodromy Conjecture holds in these two cases.
We determine the Bernstein-Sato polynomials for the ideal of maximal minors of a generic m x n matrix, as well as for that of sub-maximal Pfaffians of a generic skew-symmetric matrix of odd size. As a corollary, we obtain that the Strong Monodromy Conjecture holds in these two cases.
△ Less
Submitted 25 January, 2016;
originally announced January 2016.
-
Singularities of zero sets of semi-invariants for quivers
Authors:
András Cristian Lőrincz
Abstract:
Let $Q$ be a quiver with dimension vector $α$ prehomogeneous under the action of the product of general linear groups $\operatorname{GL}(α)$ on the representation variety $\operatorname{Rep}(Q,α)$. We study geometric properties of zero sets of semi-invariants of this space. It is known that for large numbers $N$, the nullcone in $\operatorname{Rep}(Q,N\cdot α)$ becomes a complete intersection. Fir…
▽ More
Let $Q$ be a quiver with dimension vector $α$ prehomogeneous under the action of the product of general linear groups $\operatorname{GL}(α)$ on the representation variety $\operatorname{Rep}(Q,α)$. We study geometric properties of zero sets of semi-invariants of this space. It is known that for large numbers $N$, the nullcone in $\operatorname{Rep}(Q,N\cdot α)$ becomes a complete intersection. First, we show that it also becomes reduced. Then, using Bernstein-Sato polynomials, we discuss some criteria for zero sets to have rational singularities. In particular, we show that for Dynkin quivers codimension $1$ orbit closures have rational singularities.
△ Less
Submitted 3 July, 2017; v1 submitted 14 September, 2015;
originally announced September 2015.
-
The b-functions of semi-invariants of quivers
Authors:
András Cristian Lőrincz
Abstract:
In this paper we compute b-functions (or Bernstein-Sato polynomials) of various semi-invariants of quivers. The main tool is an explicit relation for the b-functions between semi-invariants that correspond to each other under reflection functors (or castling transforms). This enables us to compute recursively the b-functions for all Dynkin quivers, and extended Dynkin quivers with prehomogeneous d…
▽ More
In this paper we compute b-functions (or Bernstein-Sato polynomials) of various semi-invariants of quivers. The main tool is an explicit relation for the b-functions between semi-invariants that correspond to each other under reflection functors (or castling transforms). This enables us to compute recursively the b-functions for all Dynkin quivers, and extended Dynkin quivers with prehomogeneous dimension vectors.
△ Less
Submitted 22 February, 2018; v1 submitted 14 October, 2013;
originally announced October 2013.
-
Emotional Expression Classification using Time-Series Kernels
Authors:
Andras Lorincz,
Laszlo Jeni,
Zoltan Szabo,
Jeffrey Cohn,
Takeo Kanade
Abstract:
Estimation of facial expressions, as spatio-temporal processes, can take advantage of kernel methods if one considers facial landmark positions and their motion in 3D space. We applied support vector classification with kernels derived from dynamic time-war** similarity measures. We achieved over 99% accuracy - measured by area under ROC curve - using only the 'motion pattern' of the PCA compres…
▽ More
Estimation of facial expressions, as spatio-temporal processes, can take advantage of kernel methods if one considers facial landmark positions and their motion in 3D space. We applied support vector classification with kernels derived from dynamic time-war** similarity measures. We achieved over 99% accuracy - measured by area under ROC curve - using only the 'motion pattern' of the PCA compressed representation of the marker point vector, the so-called shape parameters. Beyond the classification of full motion patterns, several expressions were recognized with over 90% accuracy in as few as 5-6 frames from their onset, about 200 milliseconds.
△ Less
Submitted 8 June, 2013;
originally announced June 2013.
-
The FuturICT Education Accelerator
Authors:
Jeffrey Johnson,
Simon Buckingham Shum,
Alistair Willis,
Steven Bishop,
Theodore Zamenopoulos,
Stephen Swithenby,
Robert MacKay,
Yasmin Merali,
Andras Lorincz,
Carmen Costea,
Paul Bourgine,
Jorge Loucas Atis Kapenieks,
Paul Kelley,
Sally Caird,
Jane Bromley,
Ruth Deakin Crick,
Chris Goldspink,
Pierre Collet,
Anna Carbone,
Dirk Helbing
Abstract:
Education is a major force for economic and social wellbeing. Despite high aspirations, education at all levels can be expensive and ineffective. Three Grand Challenges are identified: (1) enable people to learn orders of magnitude more effectively, (2) enable people to learn at orders of magnitude less cost, and (3) demonstrate success by exemplary interdisciplinary education in complex systems s…
▽ More
Education is a major force for economic and social wellbeing. Despite high aspirations, education at all levels can be expensive and ineffective. Three Grand Challenges are identified: (1) enable people to learn orders of magnitude more effectively, (2) enable people to learn at orders of magnitude less cost, and (3) demonstrate success by exemplary interdisciplinary education in complex systems science. A ten year `man-on-the-moon' project is proposed in which FuturICT's unique combination of Complexity, Social and Computing Sciences could provide an urgently needed transdisciplinary language for making sense of educational systems. In close dialogue with educational theory and practice, and grounded in the emerging data science and learning analytics paradigms, this will translate into practical tools (both analytical and computational) for researchers, practitioners and leaders; generative principles for resilient educational ecosystems; and innovation for radically scalable, yet personalised, learner engagement and assessment. The proposed {\em Education Accelerator} will serve as a `wind tunnel' for testing these ideas in the context of real educational programmes, with an international virtual campus delivering complex systems education exploiting the new understanding of complex, social, computationally enhanced organisational structure developed within FuturICT.
△ Less
Submitted 1 April, 2013;
originally announced April 2013.
-
Distributed High Dimensional Information Theoretical Image Registration via Random Projections
Authors:
Zoltan Szabo,
Andras Lorincz
Abstract:
Information theoretical measures, such as entropy, mutual information, and various divergences, exhibit robust characteristics in image registration applications. However, the estimation of these quantities is computationally intensive in high dimensions. On the other hand, consistent estimation from pairwise distances of the sample points is possible, which suits random projection (RP) based low…
▽ More
Information theoretical measures, such as entropy, mutual information, and various divergences, exhibit robust characteristics in image registration applications. However, the estimation of these quantities is computationally intensive in high dimensions. On the other hand, consistent estimation from pairwise distances of the sample points is possible, which suits random projection (RP) based low dimensional embeddings. We adapt the RP technique to this task by means of a simple ensemble method. To the best of our knowledge, this is the first distributed, RP based information theoretical image registration approach. The efficiency of the method is demonstrated through numerical examples.
△ Less
Submitted 2 October, 2012;
originally announced October 2012.
-
Automated Word Puzzle Generation via Topic Dictionaries
Authors:
Balazs Pinter,
Gyula Voros,
Zoltan Szabo,
Andras Lorincz
Abstract:
We propose a general method for automated word puzzle generation. Contrary to previous approaches in this novel field, the presented method does not rely on highly structured datasets obtained with serious human annotation effort: it only needs an unstructured and unannotated corpus (i.e., document collection) as input. The method builds upon two additional pillars: (i) a topic model, which induce…
▽ More
We propose a general method for automated word puzzle generation. Contrary to previous approaches in this novel field, the presented method does not rely on highly structured datasets obtained with serious human annotation effort: it only needs an unstructured and unannotated corpus (i.e., document collection) as input. The method builds upon two additional pillars: (i) a topic model, which induces a topic dictionary from the input corpus (examples include e.g., latent semantic analysis, group-structured dictionaries or latent Dirichlet allocation), and (ii) a semantic similarity measure of word pairs. Our method can (i) generate automatically a large number of proper word puzzles of different types, including the odd one out, choose the related word and separate the topics puzzle. (ii) It can easily create domain-specific puzzles by replacing the corpus component. (iii) It is also capable of automatically generating puzzles with parameterizable levels of difficulty suitable for, e.g., beginners or intermediate learners.
△ Less
Submitted 2 June, 2012;
originally announced June 2012.
-
Collaborative Filtering via Group-Structured Dictionary Learning
Authors:
Zoltan Szabo,
Barnabas Poczos,
Andras Lorincz
Abstract:
Structured sparse coding and the related structured dictionary learning problems are novel research areas in machine learning. In this paper we present a new application of structured dictionary learning for collaborative filtering based recommender systems. Our extensive numerical experiments demonstrate that the presented technique outperforms its state-of-the-art competitors and has several adv…
▽ More
Structured sparse coding and the related structured dictionary learning problems are novel research areas in machine learning. In this paper we present a new application of structured dictionary learning for collaborative filtering based recommender systems. Our extensive numerical experiments demonstrate that the presented technique outperforms its state-of-the-art competitors and has several advantages over approaches that do not put structured constraints on the dictionary elements.
△ Less
Submitted 1 January, 2012;
originally announced January 2012.
-
Edge superconductivity in Nb thin film microbridges revealed by integral and spatially resolved electric transport
Authors:
R. Werner,
A. Yu. Aladyshkin,
I. M. Nefedov,
A. V. Putilov,
M. Kemmler,
D. Bothner,
A. Loerincz,
K. Ilin,
M. Siegel,
R. Kleiner,
D. Koelle
Abstract:
The resistance $R$ vs perpendicular external magnetic field $H$ was measured for superconducting Nb thin--film microbridges with and without microholes [antidots (ADs)]. Well below the transition temperature, integral $R(H)$ measurements of the resistive transition to the normal state on the plain bridge show two distinct regions, which can be identified as bulk and edge superconductivity, respect…
▽ More
The resistance $R$ vs perpendicular external magnetic field $H$ was measured for superconducting Nb thin--film microbridges with and without microholes [antidots (ADs)]. Well below the transition temperature, integral $R(H)$ measurements of the resistive transition to the normal state on the plain bridge show two distinct regions, which can be identified as bulk and edge superconductivity, respectively. The latter case appears when bulk superconductivity becomes suppressed at the upper critical field $H_{c2}$ and below the critical field of edge superconductivity $H_{c3}\approx 1.7\, H_{c2}$. The presence of additional edges in the AD bridge leads to a different shape of the $R(H)$ curves. We used low-temperature scanning laser microscopy (LTSLM) to visualize the current distribution in the plain and AD bridge upon swee** $H$. While the plain bridge shows a dominant LTSLM signal at its edges for $H > H_{c2}$ the AD bridge also gives a signal from the inner parts of the bridge due to the additional edge states around the ADs. LTSLM reveals an asymmetry in the current distribution between left and right edges, which confirms theoretical predictions. Furthermore, the experimental results are in good agreement with our numerical simulations (based on the time-dependent Ginzburg--Landau model) yielding the spatial distribution of the order parameter and current density for different bias currents and $H$ values.
△ Less
Submitted 10 December, 2011;
originally announced December 2011.
-
Decision Making Agent Searching for Markov Models in Near-Deterministic World
Authors:
Gabor Matuz,
Andras Lorincz
Abstract:
Reinforcement learning has solid foundations, but becomes inefficient in partially observed (non-Markovian) environments. Thus, a learning agent -born with a representation and a policy- might wish to investigate to what extent the Markov property holds. We propose a learning architecture that utilizes combinatorial policy optimization to overcome non-Markovity and to develop efficient behaviors,…
▽ More
Reinforcement learning has solid foundations, but becomes inefficient in partially observed (non-Markovian) environments. Thus, a learning agent -born with a representation and a policy- might wish to investigate to what extent the Markov property holds. We propose a learning architecture that utilizes combinatorial policy optimization to overcome non-Markovity and to develop efficient behaviors, which are easy to inherit, tests the Markov property of the behavioral states, and corrects against non-Markovity by running a deterministic factored Finite State Model, which can be learned. We illustrate the properties of architecture in the near deterministic Ms. Pac-Man game. We analyze the architecture from the point of view of evolutionary, individual, and social learning.
△ Less
Submitted 1 March, 2011; v1 submitted 27 February, 2011;
originally announced February 2011.
-
Sparse and silent coding in neural circuits
Authors:
András Lőrincz,
Zsolt Palotai,
Gábor Szirtes
Abstract:
Sparse coding algorithms are about finding a linear basis in which signals can be represented by a small number of active (non-zero) coefficients. Such coding has many applications in science and engineering and is believed to play an important role in neural information processing. However, due to the computational complexity of the task, only approximate solutions provide the required efficiency…
▽ More
Sparse coding algorithms are about finding a linear basis in which signals can be represented by a small number of active (non-zero) coefficients. Such coding has many applications in science and engineering and is believed to play an important role in neural information processing. However, due to the computational complexity of the task, only approximate solutions provide the required efficiency (in terms of time). As new results show, under particular conditions there exist efficient solutions by minimizing the magnitude of the coefficients (`$l_1$-norm') instead of minimizing the size of the active subset of features (`$l_0$-norm'). Straightforward neural implementation of these solutions is not likely, as they require \emph{a priori} knowledge of the number of active features. Furthermore, these methods utilize iterative re-evaluation of the reconstruction error, which in turn implies that final sparse forms (featuring `population sparseness') can only be reached through the formation of a series of non-sparse representations, which is in contrast with the overall sparse functioning of the neural systems (`lifetime sparseness'). In this article we present a novel algorithm which integrates our previous `$l_0$-norm' model on spike based probabilistic optimization for sparse coding with ideas coming from novel `$l_1$-norm' solutions.
The resulting algorithm allows neurally plausible implementation and does not require an exactly defined sparseness level thus it is suitable for representing natural stimuli with a varying number of features. We also demonstrate that the combined method significantly extends the domain where optimal solutions can be found by `$l_1$-norm' based algorithms.
△ Less
Submitted 20 October, 2010;
originally announced October 2010.
-
Trapped electron coupled to superconducting devices
Authors:
P. Bushev,
D. Bothner,
J. Nagel,
M. Kemmler,
K. B. Konovalenko,
A. Loerincz,
K. Ilin,
M. Siegel,
D. Koelle,
R. Kleiner,
F. Schmidt-Kaler
Abstract:
We propose to couple a trapped single electron to superconducting structures located at a variable distance from the electron. The electron is captured in a cryogenic Penning trap using electric fields and a static magnetic field in the Tesla range. Measurements on the electron will allow investigating the properties of the superconductor such as vortex structure, dam** and decoherence. We propo…
▽ More
We propose to couple a trapped single electron to superconducting structures located at a variable distance from the electron. The electron is captured in a cryogenic Penning trap using electric fields and a static magnetic field in the Tesla range. Measurements on the electron will allow investigating the properties of the superconductor such as vortex structure, dam** and decoherence. We propose to couple a superconducting microwave resonator to the electron in order to realize a circuit QED-like experiment, as well as to couple superconducting Josephson junctions or superconducting quantum interferometers (SQUIDs) to the electron. The electron may also be coupled to a vortex which is situated in a double well potential, realized by nearby pinning centers in the superconductor, acting as a quantum mechanical two level system that can be controlled by a transport current tilting the double well potential. When the vortex is trapped in the interferometer arms of a SQUID, this would allow its detection both by the SQUID and by the electron.
△ Less
Submitted 17 September, 2010;
originally announced September 2010.
-
Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version
Authors:
Istvan Szita,
Andras Lorincz
Abstract:
In this paper we propose an algorithm for polynomial-time reinforcement learning in factored Markov decision processes (FMDPs). The factored optimistic initial model (FOIM) algorithm, maintains an empirical model of the FMDP in a conventional way, and always follows a greedy policy with respect to its model. The only trick of the algorithm is that the model is initialized optimistically. We prov…
▽ More
In this paper we propose an algorithm for polynomial-time reinforcement learning in factored Markov decision processes (FMDPs). The factored optimistic initial model (FOIM) algorithm, maintains an empirical model of the FMDP in a conventional way, and always follows a greedy policy with respect to its model. The only trick of the algorithm is that the model is initialized optimistically. We prove that with suitable initialization (i) FOIM converges to the fixed point of approximate value iteration (AVI); (ii) the number of steps when the agent makes non-near-optimal decisions (with respect to the solution of AVI) is polynomial in all relevant quantities; (iii) the per-step costs of the algorithm are also polynomial. To our best knowledge, FOIM is the first algorithm with these properties. This extended version contains the rigorous proofs of the main theorem. A version of this paper appeared in ICML'09.
△ Less
Submitted 21 April, 2009;
originally announced April 2009.
-
The many faces of optimism - Extended version
Authors:
István Szita,
András Lőrincz
Abstract:
The exploration-exploitation dilemma has been an intriguing and unsolved problem within the framework of reinforcement learning. "Optimism in the face of uncertainty" and model building play central roles in advanced exploration methods. Here, we integrate several concepts and obtain a fast and simple algorithm. We show that the proposed algorithm finds a near-optimal policy in polynomial time,…
▽ More
The exploration-exploitation dilemma has been an intriguing and unsolved problem within the framework of reinforcement learning. "Optimism in the face of uncertainty" and model building play central roles in advanced exploration methods. Here, we integrate several concepts and obtain a fast and simple algorithm. We show that the proposed algorithm finds a near-optimal policy in polynomial time, and give experimental evidence that it is robust and efficient compared to its ascendants.
△ Less
Submitted 19 October, 2008;
originally announced October 2008.
-
Model of the hippocampal formation explains the coexistence of grid cells and place cells
Authors:
Andras Lorincz,
Melinda Kiszlinger,
Gabor Szirtes
Abstract:
In this paper we explain the strikingly regular activity of the 'grid' cells in rodent dorsal medial entorhinal cortex (dMEC) and the spatially localized activity of the hippocampal place cells in CA3 and CA1 by assuming that the hippocampal region is constructed to support an internal dynamical model of the sensory information. The functioning of the different areas of the hippocampal-entorhina…
▽ More
In this paper we explain the strikingly regular activity of the 'grid' cells in rodent dorsal medial entorhinal cortex (dMEC) and the spatially localized activity of the hippocampal place cells in CA3 and CA1 by assuming that the hippocampal region is constructed to support an internal dynamical model of the sensory information. The functioning of the different areas of the hippocampal-entorhinal loop and their interaction are derived from a set of information theoretical principles. We demonstrate through simple transformations of the stimulus representations that the double form of space representation (i.e. place field and regular grid tiling) can be seen as a computational 'by-product' of the circuit. In contrast to other theoretical or computational models we can also explain how place and grid activity may emerge at the respective areas simultaneously. In accord with recent views, our results point toward a close relation between the formation of episodic memory and spatial navigation.
△ Less
Submitted 20 April, 2008;
originally announced April 2008.
-
Factored Value Iteration Converges
Authors:
Istvan Szita,
Andras Lorincz
Abstract:
In this paper we propose a novel algorithm, factored value iteration (FVI), for the approximate solution of factored Markov decision processes (fMDPs). The traditional approximate value iteration algorithm is modified in two ways. For one, the least-squares projection operator is modified so that it does not increase max-norm, and thus preserves convergence. The other modification is that we uni…
▽ More
In this paper we propose a novel algorithm, factored value iteration (FVI), for the approximate solution of factored Markov decision processes (fMDPs). The traditional approximate value iteration algorithm is modified in two ways. For one, the least-squares projection operator is modified so that it does not increase max-norm, and thus preserves convergence. The other modification is that we uniformly sample polynomially many samples from the (exponentially large) state space. This way, the complexity of our algorithm becomes polynomial in the size of the fMDP description length. We prove that the algorithm is convergent. We also derive an upper bound on the difference between our approximate solution and the optimal one, and also on the error introduced by sampling. We analyze various projection operators with respect to their computation complexity and their convergence when combined with approximate value iteration.
△ Less
Submitted 13 August, 2008; v1 submitted 14 January, 2008;
originally announced January 2008.
-
Online variants of the cross-entropy method
Authors:
Istvan Szita,
Andras Lorincz
Abstract:
The cross-entropy method is a simple but efficient method for global optimization. In this paper we provide two online variants of the basic CEM, together with a proof of convergence.
The cross-entropy method is a simple but efficient method for global optimization. In this paper we provide two online variants of the basic CEM, together with a proof of convergence.
△ Less
Submitted 14 January, 2008;
originally announced January 2008.
-
D-optimal Bayesian Interrogation for Parameter and Noise Identification of Recurrent Neural Networks
Authors:
Barnabas Poczos,
Andras Lorincz
Abstract:
We introduce a novel online Bayesian method for the identification of a family of noisy recurrent neural networks (RNNs). We develop Bayesian active learning technique in order to optimize the interrogating stimuli given past experiences. In particular, we consider the unknown parameters as stochastic variables and use the D-optimality principle, also known as `\emph{infomax method}', to choose…
▽ More
We introduce a novel online Bayesian method for the identification of a family of noisy recurrent neural networks (RNNs). We develop Bayesian active learning technique in order to optimize the interrogating stimuli given past experiences. In particular, we consider the unknown parameters as stochastic variables and use the D-optimality principle, also known as `\emph{infomax method}', to choose optimal stimuli. We apply a greedy technique to maximize the information gain concerning network parameters at each time step. We also derive the D-optimal estimation of the additive noise that perturbs the dynamical system of the RNN. Our analytical results are approximation-free. The analytic derivation gives rise to attractive quadratic update rules.
△ Less
Submitted 12 January, 2008;
originally announced January 2008.
-
Undercomplete Blind Subspace Deconvolution via Linear Prediction
Authors:
Zoltan Szabo,
Barnabas Poczos,
Andras Lorincz
Abstract:
We present a novel solution technique for the blind subspace deconvolution (BSSD) problem, where temporal convolution of multidimensional hidden independent components is observed and the task is to uncover the hidden components using the observation only. We carry out this task for the undercomplete case (uBSSD): we reduce the original uBSSD task via linear prediction to independent subspace an…
▽ More
We present a novel solution technique for the blind subspace deconvolution (BSSD) problem, where temporal convolution of multidimensional hidden independent components is observed and the task is to uncover the hidden components using the observation only. We carry out this task for the undercomplete case (uBSSD): we reduce the original uBSSD task via linear prediction to independent subspace analysis (ISA), which we can solve. As it has been shown recently, applying temporal concatenation can also reduce uBSSD to ISA, but the associated ISA problem can easily become `high dimensional' [1]. The new reduction method circumvents this dimensionality problem. We perform detailed studies on the efficiency of the proposed technique by means of numerical simulations. We have found several advantages: our method can achieve high quality estimations for smaller number of samples and it can cope with deeper temporal convolutions.
△ Less
Submitted 23 June, 2007;
originally announced June 2007.
-
Independent Process Analysis without A Priori Dimensional Information
Authors:
Barnabas Poczos,
Zoltan Szabo,
Melinda Kiszlinger,
Andras Lorincz
Abstract:
Recently, several algorithms have been proposed for independent subspace analysis where hidden variables are i.i.d. processes. We show that these methods can be extended to certain AR, MA, ARMA and ARIMA tasks. Central to our paper is that we introduce a cascade of algorithms, which aims to solve these tasks without previous knowledge about the number and the dimensions of the hidden processes.…
▽ More
Recently, several algorithms have been proposed for independent subspace analysis where hidden variables are i.i.d. processes. We show that these methods can be extended to certain AR, MA, ARMA and ARIMA tasks. Central to our paper is that we introduce a cascade of algorithms, which aims to solve these tasks without previous knowledge about the number and the dimensions of the hidden processes. Our claim is supported by numerical simulations. As a particular application, we search for subspaces of facial components.
△ Less
Submitted 20 March, 2007;
originally announced March 2007.
-
Undercomplete Blind Subspace Deconvolution
Authors:
Zoltan Szabo,
Barnabas Poczos,
Andras Lorincz
Abstract:
We introduce the blind subspace deconvolution (BSSD) problem, which is the extension of both the blind source deconvolution (BSD) and the independent subspace analysis (ISA) tasks. We examine the case of the undercomplete BSSD (uBSSD). Applying temporal concatenation we reduce this problem to ISA. The associated `high dimensional' ISA problem can be handled by a recent technique called joint f-d…
▽ More
We introduce the blind subspace deconvolution (BSSD) problem, which is the extension of both the blind source deconvolution (BSD) and the independent subspace analysis (ISA) tasks. We examine the case of the undercomplete BSSD (uBSSD). Applying temporal concatenation we reduce this problem to ISA. The associated `high dimensional' ISA problem can be handled by a recent technique called joint f-decorrelation (JFD). Similar decorrelation methods have been used previously for kernel independent component analysis (kernel-ICA). More precisely, the kernel canonical correlation (KCCA) technique is a member of this family, and, as is shown in this paper, the kernel generalized variance (KGV) method can also be seen as a decorrelation method in the feature space. These kernel based algorithms will be adapted to the ISA task. In the numerical examples, we (i) examine how efficiently the emerging higher dimensional ISA tasks can be tackled, and (ii) explore the working and advantages of the derived kernel-ISA methods.
△ Less
Submitted 20 May, 2007; v1 submitted 7 January, 2007;
originally announced January 2007.