-
A framework for optimisation based stochastic process discovery
Authors:
Pierre Cry,
András Horváth,
Paolo Ballarini,
Pascal Le Gall
Abstract:
Process mining is concerned with deriving formal models capable of reproducing the behaviour of a given organisational process by analysing observed executions collected in an event log. The elements of an event log are finite sequences (i.e., traces or words) of actions. Many effective algorithms have been introduced which issue a control flow model (commonly in Petri net form) aimed at reproduci…
▽ More
Process mining is concerned with deriving formal models capable of reproducing the behaviour of a given organisational process by analysing observed executions collected in an event log. The elements of an event log are finite sequences (i.e., traces or words) of actions. Many effective algorithms have been introduced which issue a control flow model (commonly in Petri net form) aimed at reproducing, as precisely as possible, the language of the considered event log. However, given that identical executions can be observed several times, traces of an event log are associated with a frequency and, hence, an event log inherently yields also a stochastic language. By exploiting the trace frequencies contained in the event log, the stochastic extension of process mining, therefore, consists in deriving stochastic (Petri nets) models capable of reproducing the likelihood of the observed executions. In this paper, we introduce a novel stochastic process mining approach. Starting from a "standard" Petri net model mined through classical mining algorithms, we employ optimization to identify optimal weights for the transitions of the mined net so that the stochastic language issued by the stochastic interpretation of the mined net closely resembles that of the event log. The optimization is either based on the maximum likelihood principle or on the earth moving distance. Experiments on some popular real system logs show an improved accuracy w.r.t. to alternative approaches.
△ Less
Submitted 2 July, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
Soft cells and the geometry of seashells
Authors:
Gábor Domokos,
Alain Goriely,
Ákos G. Horváth,
Krisztina Regős
Abstract:
A central problem of geometry is the tiling of space with simple structures. The classical solutions, such as triangles, squares, and hexagons in the plane and cubes and other polyhedra in three-dimensional space are built with sharp corners and flat faces. However, many tilings in Nature are characterized by shapes with curved edges, non-flat faces, and few, if any, sharp corners. An important qu…
▽ More
A central problem of geometry is the tiling of space with simple structures. The classical solutions, such as triangles, squares, and hexagons in the plane and cubes and other polyhedra in three-dimensional space are built with sharp corners and flat faces. However, many tilings in Nature are characterized by shapes with curved edges, non-flat faces, and few, if any, sharp corners. An important question is then to relate prototypical sharp tilings to softer natural shapes. Here, we solve this problem by introducing a new class of shapes, the \textit{soft cells}, minimizing the number of sharp corners and filling space as \emph{soft tilings}. We prove that an infinite class of polyhedral tilings can be smoothly deformed into soft tilings and we construct the soft versions of all Dirichlet-Voronoi cells associated with point lattices in two and three dimensions. Remarkably, these ideal soft shapes, born out of geometry, are found abundantly in nature, from cells to shells.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Forecasting Fold Bifurcations through Physics-Informed Convolutional Neural Networks
Authors:
Giuseppe Habib,
Ádám Horváth
Abstract:
This study proposes a physics-informed convolutional neural network (CNN) for identifying dynamical systems' time series near a fold bifurcation. The peculiarity of this work is that the CNN is trained with a relatively small amount of data and on a single, very simple system. In contrast, the CNN is validated on much more complicated systems. A similar task requires significant extrapolation capa…
▽ More
This study proposes a physics-informed convolutional neural network (CNN) for identifying dynamical systems' time series near a fold bifurcation. The peculiarity of this work is that the CNN is trained with a relatively small amount of data and on a single, very simple system. In contrast, the CNN is validated on much more complicated systems. A similar task requires significant extrapolation capabilities, which are obtained by exploiting physics-based information. Physics-based information is provided through a specific pre-processing of the input data, consisting mostly of a transformation into polar coordinates, normalization, transformation into the logarithmic scale, and filtering through a moving mean. The results illustrate that such data pre-processing enables the CNN to grasp the important features related to approaching a fold bifurcation, namely, the trend of the oscillation amplitude, and neglect other characteristics that are not particularly relevant, such as the vibration frequency. The developed CNN was able to correctly classify trajectories near a fold for a mass-on-moving-belt system, a van der Pol-Duffing oscillator with an attached tuned mass damper, and a pitch-and-plunge wing profile. The results obtained pave the way for the development of similar CNNs effective in real-life applications.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Elementary Constructions of conic sections
Authors:
Ákos G. Horváth
Abstract:
In classical geometry, there is no such well-known and much-studied topic as the construction of conic sections (or briefly conics) from its five points. Its importance in many applications of mechanical engineering, civil engineering and architectural engineering, as well as other applied sciences is clear. The beauty of the topic is that it raises difficult questions that can be approached with…
▽ More
In classical geometry, there is no such well-known and much-studied topic as the construction of conic sections (or briefly conics) from its five points. Its importance in many applications of mechanical engineering, civil engineering and architectural engineering, as well as other applied sciences is clear. The beauty of the topic is that it raises difficult questions that can be approached with basic tools. In this article, we provide constructions (and corresponding theories) that can be taught to high school and university students without knowledge of projective geometry. For this, we recall some important facts about conic sections that can be found in the rich literature. We use the concepts of power of a point on a circle, similarity, orthogonal affinity and inversion. We also mention famous constructions related to our questions. We begin our article at this point, where the standard teaching ends the discussion of conic sections. We therefore assume that the reader knows the basic definitions and constructions of conics, the concepts of focus, axis, tangent, leading circle and leading line.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Which model density is best in pair natural orbital local correlation theory?
Authors:
Reka A. Horvath,
Kesha Sorathia,
Isabelle Saint,
David P. Tew
Abstract:
Low-scaling electron correlation theory based on the pair natural orbital approximation, PNO-CCSD(T), has become a powerful computational tool. Motivated by the recent discovery of large errors for organometallic molecules, we assess the role of the model density used to discard unimportant contributions. We find that second-order perturbation theory provides the best compromise between cost and a…
▽ More
Low-scaling electron correlation theory based on the pair natural orbital approximation, PNO-CCSD(T), has become a powerful computational tool. Motivated by the recent discovery of large errors for organometallic molecules, we assess the role of the model density used to discard unimportant contributions. We find that second-order perturbation theory provides the best compromise between cost and accuracy, but coupling between localised occupied orbitals must be accounted for. Errors in the CCSD energy are then well below 1~kcal/mol, even for molecules with moderate multi-reference character, and the primary remaining source of errors lies in the treatment of the (T) energy contribution.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
Targeted Adversarial Attacks on Generalizable Neural Radiance Fields
Authors:
Andras Horvath,
Csaba M. Jozsa
Abstract:
Neural Radiance Fields (NeRFs) have recently emerged as a powerful tool for 3D scene representation and rendering. These data-driven models can learn to synthesize high-quality images from sparse 2D observations, enabling realistic and interactive scene reconstructions. However, the growing usage of NeRFs in critical applications such as augmented reality, robotics, and virtual environments could…
▽ More
Neural Radiance Fields (NeRFs) have recently emerged as a powerful tool for 3D scene representation and rendering. These data-driven models can learn to synthesize high-quality images from sparse 2D observations, enabling realistic and interactive scene reconstructions. However, the growing usage of NeRFs in critical applications such as augmented reality, robotics, and virtual environments could be threatened by adversarial attacks.
In this paper we present how generalizable NeRFs can be attacked by both low-intensity adversarial attacks and adversarial patches, where the later could be robust enough to be used in real world applications. We also demonstrate targeted attacks, where a specific, predefined output scene is generated by these attack with success.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Enhancing Cell Tracking with a Time-Symmetric Deep Learning Approach
Authors:
Gergely Szabó,
Paolo Bonaiuti,
Andrea Ciliberto,
András Horváth
Abstract:
The accurate tracking of live cells using video microscopy recordings remains a challenging task for popular state-of-the-art image processing based object tracking methods. In recent years, several existing and new applications have attempted to integrate deep-learning based frameworks for this task, but most of them still heavily rely on consecutive frame based tracking embedded in their archite…
▽ More
The accurate tracking of live cells using video microscopy recordings remains a challenging task for popular state-of-the-art image processing based object tracking methods. In recent years, several existing and new applications have attempted to integrate deep-learning based frameworks for this task, but most of them still heavily rely on consecutive frame based tracking embedded in their architecture or other premises that hinder generalized learning. To address this issue, we aimed to develop a new deep-learning based tracking method that relies solely on the assumption that cells can be tracked based on their spatio-temporal neighborhood, without restricting it to consecutive frames. The proposed method has the additional benefit that the motion patterns of the cells can be learned completely by the predictor without any prior assumptions, and it has the potential to handle a large number of video frames with heavy artifacts. The efficacy of the proposed method is demonstrated through biologically motivated validation strategies and compared against multiple state-of-the-art cell tracking methods.
△ Less
Submitted 18 September, 2023; v1 submitted 4 August, 2023;
originally announced August 2023.
-
Intruder configurations in $^{29}$Ne at the transition into the island of inversion: Detailed structure study of $^{28}$Ne
Authors:
H. Wang,
M. Yasuda,
Y. Kondo,
T. Nakamura,
J. A. Tostevin,
K. Ogata,
T. Otsuka,
A. Poves,
N. Shimizu,
K. Yoshida,
N. L. Achouri,
H. Al Falou,
L. Atar,
T. Aumann,
H. Baba,
K. Boretzky,
C. Caesar,
D. Calvet,
H. Chae,
N. Chiga,
A. Corsi,
H. L. Crawford,
F. Delaunay,
A. Delbart,
Q. Deshayes
, et al. (71 additional authors not shown)
Abstract:
Detailed $γ$-ray spectroscopy of the exotic neon isotope $^{28}$Ne has been performed for the first time using the one-neutron removal reaction from $^{29}$Ne on a liquid hydrogen target at 240~MeV/nucleon. Based on an analysis of parallel momentum distributions, a level scheme with spin-parity assignments has been constructed for $^{28}$Ne and the negative-parity states are identified for the fir…
▽ More
Detailed $γ$-ray spectroscopy of the exotic neon isotope $^{28}$Ne has been performed for the first time using the one-neutron removal reaction from $^{29}$Ne on a liquid hydrogen target at 240~MeV/nucleon. Based on an analysis of parallel momentum distributions, a level scheme with spin-parity assignments has been constructed for $^{28}$Ne and the negative-parity states are identified for the first time. The measured partial cross sections and momentum distributions reveal a significant intruder $p$-wave strength providing evidence of the breakdown of the $N=20$ and $N=28$ shell gaps. Only a weak, possible $f$-wave strength was observed to bound final states. Large-scale shell-model calculations with different effective interactions do not reproduce the large $p$-wave and small $f$-wave strength observed experimentally, indicating an ongoing challenge for a complete theoretical description of the transition into the island of inversion along the Ne isotopic chain.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Event-shape-dependent analysis of charm-anticharm azimuthal correlations in simulations
Authors:
Aniko Horvath,
Eszter Frajna,
Robert Vertesi
Abstract:
In high-energy collisions of small systems, by high-enough final-state multiplicities, a collective behaviour is present that is similar to the flow patterns observed in heavy-ion collisions. Recent studies connect this collectivity to semi-soft vacuum-QCD processes. Here we explore QCD production mechanisms using angular correlations of heavy flavour using simulated proton-proton collisions at…
▽ More
In high-energy collisions of small systems, by high-enough final-state multiplicities, a collective behaviour is present that is similar to the flow patterns observed in heavy-ion collisions. Recent studies connect this collectivity to semi-soft vacuum-QCD processes. Here we explore QCD production mechanisms using angular correlations of heavy flavour using simulated proton-proton collisions at $\sqrt{s} = 13$~TeV with the PYTHIA8 Monte Carlo event generator. We demonstrate that the event shape is strongly connected to the production mechanisms. Flattenicity, a novel event descriptor, can be used to separate events containing the final-state radiation from the rest of the events.
△ Less
Submitted 9 June, 2023;
originally announced June 2023.
-
The bridge between Desargues' and Pappus' theorems
Authors:
Ákos G. Horváth
Abstract:
In this paper, we investigate the configuration theorems of Desargues and Pappus in a synthetic geometric way. We provide a bridge between the two configurations with a third one that can be considered a specification for both. We do not use the theory of collineations or the analytic description of the plane over a ternary ring.
In this paper, we investigate the configuration theorems of Desargues and Pappus in a synthetic geometric way. We provide a bridge between the two configurations with a third one that can be considered a specification for both. We do not use the theory of collineations or the analytic description of the plane over a ternary ring.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
$p$-capacity with Bessel convolution
Authors:
Á. P. Horváth
Abstract:
We define and examine nonlinear potential by Bessel convolution with Bessel kernel. We investigate removable sets with respect to Laplace-Bessel inequality. By studying the maximal and fractional maximal measure, a Wolff type inequality is proved. Finally the relation of B-$p$ capacity and B-Lipschitz map**, and the B-$p$ capacity and weighted Hausdorff measure and the B-$p$ capacity of Cantor s…
▽ More
We define and examine nonlinear potential by Bessel convolution with Bessel kernel. We investigate removable sets with respect to Laplace-Bessel inequality. By studying the maximal and fractional maximal measure, a Wolff type inequality is proved. Finally the relation of B-$p$ capacity and B-Lipschitz map**, and the B-$p$ capacity and weighted Hausdorff measure and the B-$p$ capacity of Cantor sets are examined.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
Exploratory Analysis of Federated Learning Methods with Differential Privacy on MIMIC-III
Authors:
Aron N. Horvath,
Matteo Berchier,
Farhad Nooralahzadeh,
Ahmed Allam,
Michael Krauthammer
Abstract:
Background: Federated learning methods offer the possibility of training machine learning models on privacy-sensitive data sets, which cannot be easily shared. Multiple regulations pose strict requirements on the storage and usage of healthcare data, leading to data being in silos (i.e. locked-in at healthcare facilities). The application of federated algorithms on these datasets could accelerate…
▽ More
Background: Federated learning methods offer the possibility of training machine learning models on privacy-sensitive data sets, which cannot be easily shared. Multiple regulations pose strict requirements on the storage and usage of healthcare data, leading to data being in silos (i.e. locked-in at healthcare facilities). The application of federated algorithms on these datasets could accelerate disease diagnostic, drug development, as well as improve patient care.
Methods: We present an extensive evaluation of the impact of different federation and differential privacy techniques when training models on the open-source MIMIC-III dataset. We analyze a set of parameters influencing a federated model performance, namely data distribution (homogeneous and heterogeneous), communication strategies (communication rounds vs. local training epochs), federation strategies (FedAvg vs. FedProx). Furthermore, we assess and compare two differential privacy (DP) techniques during model training: a stochastic gradient descent-based differential privacy algorithm (DP-SGD), and a sparse vector differential privacy technique (DP-SVT).
Results: Our experiments show that extreme data distributions across sites (imbalance either in the number of patients or the positive label ratios between sites) lead to a deterioration of model performance when trained using the FedAvg strategy. This issue is resolved when using FedProx with the use of appropriate hyperparameter tuning. Furthermore, the results show that both differential privacy techniques can reach model performances similar to those of models trained without DP, however at the expense of a large quantifiable privacy leakage.
Conclusions: We evaluate empirically the benefits of two federation strategies and propose optimal strategies for the choice of parameters when using differential privacy techniques.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Translation beyond Delsarte
Authors:
Á. P. Horváth
Abstract:
We introduce general translations as solutions to Cauchy or Dirichlet problems. This point of view allows us to handle the heat-diffusion semigroup as a translation. With the given examples Kolmogorov-Riesz characterization of compact sets in certain $L^p_μ$ spaces are given. Pego-type characterizations are also derived. Finally for some examples the equivalence of the corresponding modulus of smo…
▽ More
We introduce general translations as solutions to Cauchy or Dirichlet problems. This point of view allows us to handle the heat-diffusion semigroup as a translation. With the given examples Kolmogorov-Riesz characterization of compact sets in certain $L^p_μ$ spaces are given. Pego-type characterizations are also derived. Finally for some examples the equivalence of the corresponding modulus of smoothness and K-functional is pointed out.
△ Less
Submitted 14 June, 2023; v1 submitted 1 July, 2022;
originally announced July 2022.
-
Saliency Map Based Data Augmentation
Authors:
Jalal Al-afandi,
Bálint Magyar,
András Horváth
Abstract:
Data augmentation is a commonly applied technique with two seemingly related advantages. With this method one can increase the size of the training set generating new samples and also increase the invariance of the network against the applied transformations. Unfortunately all images contain both relevant and irrelevant features for classification therefore this invariance has to be class specific…
▽ More
Data augmentation is a commonly applied technique with two seemingly related advantages. With this method one can increase the size of the training set generating new samples and also increase the invariance of the network against the applied transformations. Unfortunately all images contain both relevant and irrelevant features for classification therefore this invariance has to be class specific. In this paper we will present a new method which uses saliency maps to restrict the invariance of neural networks to certain regions, providing higher test accuracy in classification tasks.
△ Less
Submitted 29 May, 2022;
originally announced May 2022.
-
On the Feasibility and Generality of Patch-based Adversarial Attacks on Semantic Segmentation Problems
Authors:
Soma Kontar,
Andras Horvath
Abstract:
Deep neural networks were applied with success in a myriad of applications, but in safety critical use cases adversarial attacks still pose a significant threat. These attacks were demonstrated on various classification and detection tasks and are usually considered general in a sense that arbitrary network outputs can be generated by them.
In this paper we will demonstrate through simple case s…
▽ More
Deep neural networks were applied with success in a myriad of applications, but in safety critical use cases adversarial attacks still pose a significant threat. These attacks were demonstrated on various classification and detection tasks and are usually considered general in a sense that arbitrary network outputs can be generated by them.
In this paper we will demonstrate through simple case studies both in simulation and in real-life, that patch based attacks can be utilised to alter the output of segmentation networks. Through a few examples and the investigation of network complexity, we will also demonstrate that the number of possible output maps which can be generated via patch-based attacks of a given size is typically smaller than the area they effect or areas which should be attacked in case of practical applications.
We will prove that based on these results most patch-based attacks cannot be general in practice, namely they can not generate arbitrary output maps or if they could, they are spatially limited and this limit is significantly smaller than the receptive field of the patches.
△ Less
Submitted 21 May, 2022;
originally announced May 2022.
-
Observing Particle Energization above the Nyquist Frequency: An Application of the Field-Particle Correlation Technique
Authors:
Sarah A. Horvath,
Gregory G. Howes,
Andrew J. McCubbin
Abstract:
The field-particle correlation technique utilizes single-point measurements to uncover signatures of various particle energization mechanisms in turbulent space plasmas. The signature of Landau dam** by electrons has been found in both simulations and observations from Earth's magnetosheath using this technique, but instrumental limitations of spacecraft sampling rates present a challenge to dis…
▽ More
The field-particle correlation technique utilizes single-point measurements to uncover signatures of various particle energization mechanisms in turbulent space plasmas. The signature of Landau dam** by electrons has been found in both simulations and observations from Earth's magnetosheath using this technique, but instrumental limitations of spacecraft sampling rates present a challenge to discovering the full extent of the presence of Landau dam** in the solar wind. Theory predicts that field-particle correlations can recover velocity-space energization signatures even from data that is undersampled with respect to the characteristic frequencies at which the wave dam** occurs. To test this hypothesis, we perform a high-resoluation gyrokinetic simulation of space plasma turbulence, confirm that it contains signatures of electron Landau dam**, and then systematically reduce the time resolution of the data to identify the point at which the signatures become impossible to recover. We find results in support of our theoretical prediction and look for a rule of thumb that can be compared with the measurement capabilities of spacecraft missions to inform the process of applying field-particle correlations to low time resolution data.
△ Less
Submitted 31 March, 2022;
originally announced April 2022.
-
Border of the Island of Inversion: Unbound states in $^{29}$Ne
Authors:
M. Holl,
S. Lindberg,
A. Heinz,
Y. Kondo,
T. Nakamura,
J. A. Tostevin,
H. Wang,
T. Nilsson,
N. L. Achouri,
H. Al Falou,
L. Atar,
T. Aumann,
H. Baba,
K. Boretzky,
C. Caesar,
D. Calvet,
H. Chae,
N. Chiga,
A. Corsi,
H. L. Crawford,
F. Delaunay,
A. Delbart,
Q. Deshayes,
P. Díaz Fernández,
Z. Dombrádi
, et al. (67 additional authors not shown)
Abstract:
The nucleus $^{29}$Ne is situated at the border of the island of inversion. Despite significant efforts, no bound low-lying intruder $f_{7/2}$-state, which would place $^{29}$Ne firmly inside the island of inversion, has yet been observed. Here, the first investigation of unbound states of $^{29}$Ne is reported. The states were populated in $^{30}\mathrm{Ne}(p,pn)$ and $^{30}\mathrm{Na}(p,2p)$ rea…
▽ More
The nucleus $^{29}$Ne is situated at the border of the island of inversion. Despite significant efforts, no bound low-lying intruder $f_{7/2}$-state, which would place $^{29}$Ne firmly inside the island of inversion, has yet been observed. Here, the first investigation of unbound states of $^{29}$Ne is reported. The states were populated in $^{30}\mathrm{Ne}(p,pn)$ and $^{30}\mathrm{Na}(p,2p)$ reactions at a beam energy of around $230$ MeV/nucleon, and analyzed in terms of their resonance properties, partial cross sections and momentum distributions. The momentum distributions are compared to calculations using the eikonal, direct reaction model, allowing $\ell$-assignments for the observed states. The lowest-lying resonance at an excitation energy of 1.48(4) MeV shows clear signs of a significant $\ell$=3-component, giving first evidence for $f_{7/2}$ single particle strength in $^{29}$Ne. The excitation energies and strengths of the observed states are compared to shell-model calculations using the sdpf-u-mix interaction
△ Less
Submitted 11 February, 2022;
originally announced February 2022.
-
Mitigating the Bias of Centered Objects in Common Datasets
Authors:
Gergely Szabo,
Andras Horvath
Abstract:
Convolutional networks are considered shift invariant, but it was demonstrated that their response may vary according to the exact location of the objects. In this paper we will demonstrate that most commonly investigated datasets have a bias, where objects are over-represented at the center of the image during training. This bias and the boundary condition of these networks can have a significant…
▽ More
Convolutional networks are considered shift invariant, but it was demonstrated that their response may vary according to the exact location of the objects. In this paper we will demonstrate that most commonly investigated datasets have a bias, where objects are over-represented at the center of the image during training. This bias and the boundary condition of these networks can have a significant effect on the performance of these architectures and their accuracy drops significantly as an object approaches the boundary. We will also demonstrate how this effect can be mitigated with data augmentation techniques.
△ Less
Submitted 4 August, 2023; v1 submitted 16 December, 2021;
originally announced December 2021.
-
Understanding How Programmers Can Use Annotations on Documentation
Authors:
Amber Horvath,
Michael Xieyang Liu,
River Hendriksen,
Connor Shannon,
Emma Paterson,
Kazi Jawad,
Andrew Macvean,
Brad A. Myers
Abstract:
Modern software development requires developers to find and effectively utilize new APIs and their documentation, but documentation has many well-known issues. Despite this, developers eventually overcome these issues but have no way of sharing what they learned. We investigate sharing this documentation-specific information through \textit{annotations}, which have advantages over developer forums…
▽ More
Modern software development requires developers to find and effectively utilize new APIs and their documentation, but documentation has many well-known issues. Despite this, developers eventually overcome these issues but have no way of sharing what they learned. We investigate sharing this documentation-specific information through \textit{annotations}, which have advantages over developer forums as the information is contextualized, not disruptive, and is short, thus easy to author. Developers can also author annotations to support their own comprehension. In order to support the documentation usage behaviors we found, we built the Adamite annotation tool, which supports features such as multi-anchoring, annotation types, and pinning. In our user study, we found that developers are able to create annotations that are useful to themselves and are able to utilize annotations created by other developers when learning a new API, with readers of the annotations completing 67% more of the task, on average, than the baseline.
△ Less
Submitted 11 January, 2022; v1 submitted 16 November, 2021;
originally announced November 2021.
-
A two-vertex theorem for normal tilings
Authors:
Gábor Domokos,
Ákos G. Horváth,
Krisztina Regős
Abstract:
We regard a smooth, $d=2$-dimensional manifold $\mathcal{M}$ and its normal tiling $M$, the cells of which may have non-smooth or smooth vertices (at the latter, two edges meet at 180 degrees.) We denote the average number (per cell) of non-smooth vertices by $\bar v^{\star}$ and we prove that if $M$ is periodic then $v^{\star} \geq 2$ and we show the same result for the monohedral case by an enti…
▽ More
We regard a smooth, $d=2$-dimensional manifold $\mathcal{M}$ and its normal tiling $M$, the cells of which may have non-smooth or smooth vertices (at the latter, two edges meet at 180 degrees.) We denote the average number (per cell) of non-smooth vertices by $\bar v^{\star}$ and we prove that if $M$ is periodic then $v^{\star} \geq 2$ and we show the same result for the monohedral case by an entirely different argument. Our theory also makes a closely related prediction for non-periodic tilings. In 3 dimensions we show a monohedral construction with $\bar v^{\star}=0$.
△ Less
Submitted 5 January, 2022; v1 submitted 5 October, 2021;
originally announced October 2021.
-
Compactness criteria via Laguerre and Hankel transformations
Authors:
Á. P. Horváth
Abstract:
The aim of this paper is to prove Kolmogorov-Riesz type theorems via Bessel and Laguerre translations, and Pego-type theorems by the corresponding transformations.
The aim of this paper is to prove Kolmogorov-Riesz type theorems via Bessel and Laguerre translations, and Pego-type theorems by the corresponding transformations.
△ Less
Submitted 15 March, 2021;
originally announced March 2021.
-
A note on the low-dimensional Minkowski-reduction
Authors:
Ákos G. Horváth
Abstract:
In this paper, we recall the basic results of the reduction theory of positive definite quadratic forms. Since finding the shortest vectors in a lattice is an NP-hard problem, the low-dimensional results in lattice reduction theory have an important role. Using the result of Ryskov on admissible centerings and the result of Tammela about the determination of a Minkowski-reduced form, we prove that…
▽ More
In this paper, we recall the basic results of the reduction theory of positive definite quadratic forms. Since finding the shortest vectors in a lattice is an NP-hard problem, the low-dimensional results in lattice reduction theory have an important role. Using the result of Ryskov on admissible centerings and the result of Tammela about the determination of a Minkowski-reduced form, we prove that the absolute values of the coordinates of a minimum vector in a six-dimensional Minkowski-reduced basis are less or equal to three. To get this sharpening of P. Tammela's interesting works, we combine some elementary geometric reasonings with the mentioned theoretical results.
△ Less
Submitted 3 November, 2023; v1 submitted 9 February, 2021;
originally announced February 2021.
-
On the convex hull and homothetic convex hull functions of a convex body
Authors:
Ákos G. Horváth,
Zsolt Lángi
Abstract:
The aim of this note is to investigate the properties of the convex hull and the homothetic convex hull functions of a convex body $K$ in Euclidean $n$-space, defined as the volume of the union of $K$ and one of its translates, and the volume of $K$ and a translate of a homothetic copy of $K$, respectively, as functions of the translation vector. In particular, we prove that the convex hull functi…
▽ More
The aim of this note is to investigate the properties of the convex hull and the homothetic convex hull functions of a convex body $K$ in Euclidean $n$-space, defined as the volume of the union of $K$ and one of its translates, and the volume of $K$ and a translate of a homothetic copy of $K$, respectively, as functions of the translation vector. In particular, we prove that the convex hull function of the body $K$ does not determine $K$. Furthermore, we prove the equivalence of the polar projection body problem raised by Petty, and a conjecture of G.Horváth and Lángi about translative constant volume property of convex bodies. We give a short proof of some theorems of Jerónimo-Castro about the homothetic convex hull function, and prove a homothetic variant of the translative constant volume property conjecture for $3$-dimensional convex polyhedra. We also apply our results to describe the properties of the illumination bodies of convex bodies.
△ Less
Submitted 23 September, 2021; v1 submitted 16 December, 2020;
originally announced December 2020.
-
Diameter, width and thickness in the hyperbolic plane
Authors:
Ákos G. Horváth
Abstract:
This paper contains a new concept to measure the width and thickness of a convex body in the hyperbolic plane. We compare the known concepts with the new one and prove some results on bodies of constant width, constant diameter and given thickness.
This paper contains a new concept to measure the width and thickness of a convex body in the hyperbolic plane. We compare the known concepts with the new one and prove some results on bodies of constant width, constant diameter and given thickness.
△ Less
Submitted 30 November, 2020;
originally announced November 2020.
-
Note on the Equilibrium Measures of Julia sets of Exceptional Jacobi Polynomials
Authors:
Á. P. Horváth
Abstract:
We prove that similarly to the standard case, the equilibrium measure of Julia sets of exceptional Jacobi polynomials tends to the equilibrium measure of the interval of orthogonality in weak-star sense.
We prove that similarly to the standard case, the equilibrium measure of Julia sets of exceptional Jacobi polynomials tends to the equilibrium measure of the interval of orthogonality in weak-star sense.
△ Less
Submitted 16 November, 2020;
originally announced November 2020.
-
Receptive Field Size Optimization with Continuous Time Pooling
Authors:
Dóra Babicz,
Soma Kontár,
Márk Pető,
András Fülöp,
Gergely Szabó,
András Horváth
Abstract:
The pooling operation is a cornerstone element of convolutional neural networks. These elements generate receptive fields for neurons, in which local perturbations should have minimal effect on the output activations, increasing robustness and invariance of the network. In this paper we will present an altered version of the most commonly applied method, maximum pooling, where pooling in theory is…
▽ More
The pooling operation is a cornerstone element of convolutional neural networks. These elements generate receptive fields for neurons, in which local perturbations should have minimal effect on the output activations, increasing robustness and invariance of the network. In this paper we will present an altered version of the most commonly applied method, maximum pooling, where pooling in theory is substituted by a continuous time differential equation, which generates a location sensitive pooling operation, more similar to biological receptive fields. We will present how this continuous method can be approximated numerically using discrete operations which fit ideally on a GPU. In our approach the kernel size is substituted by diffusion strength which is a continuous valued parameter, this way it can be optimized by gradient descent algorithms. We will evaluate the effect of continuous pooling on accuracy and computational need using commonly applied network architectures and datasets.
△ Less
Submitted 6 November, 2020; v1 submitted 2 November, 2020;
originally announced November 2020.
-
Filtered Batch Normalization
Authors:
Andras Horvath,
Jalal Al-afandi
Abstract:
It is a common assumption that the activation of different layers in neural networks follow Gaussian distribution. This distribution can be transformed using normalization techniques, such as batch-normalization, increasing convergence speed and improving accuracy. In this paper we would like to demonstrate, that activations do not necessarily follow Gaussian distribution in all layers. Neurons in…
▽ More
It is a common assumption that the activation of different layers in neural networks follow Gaussian distribution. This distribution can be transformed using normalization techniques, such as batch-normalization, increasing convergence speed and improving accuracy. In this paper we would like to demonstrate, that activations do not necessarily follow Gaussian distribution in all layers. Neurons in deeper layers are more selective and specific which can result extremely large, out-of-distribution activations.
We will demonstrate that one can create more consistent mean and variance values for batch normalization during training by filtering out these activations which can further improve convergence speed and yield higher validation accuracy.
△ Less
Submitted 16 October, 2020;
originally announced October 2020.
-
3D Segmentation Networks for Excessive Numbers of Classes: Distinct Bone Segmentation in Upper Bodies
Authors:
Eva Schnider,
Antal Horváth,
Georg Rauter,
Azhar Zam,
Magdalena Müller-Gerbl,
Philippe C. Cattin
Abstract:
Segmentation of distinct bones plays a crucial role in diagnosis, planning, navigation, and the assessment of bone metastasis. It supplies semantic knowledge to visualisation tools for the planning of surgical interventions and the education of health professionals. Fully supervised segmentation of 3D data using Deep Learning methods has been extensively studied for many tasks but is usually restr…
▽ More
Segmentation of distinct bones plays a crucial role in diagnosis, planning, navigation, and the assessment of bone metastasis. It supplies semantic knowledge to visualisation tools for the planning of surgical interventions and the education of health professionals. Fully supervised segmentation of 3D data using Deep Learning methods has been extensively studied for many tasks but is usually restricted to distinguishing only a handful of classes. With 125 distinct bones, our case includes many more labels than typical 3D segmentation tasks. For this reason, the direct adaptation of most established methods is not possible. This paper discusses the intricacies of training a 3D segmentation network in a many-label setting and shows necessary modifications in network architecture, loss function, and data augmentation. As a result, we demonstrate the robustness of our method by automatically segmenting over one hundred distinct bones simultaneously in an end-to-end learnt fashion from a CT-scan.
△ Less
Submitted 14 October, 2020;
originally announced October 2020.
-
Electron Landau Dam** of Kinetic Alfvén Waves in Simulated Magnetosheath Turbulence
Authors:
Sarah A. Horvath,
Gregory G. Howes,
Andrew J. McCubbin
Abstract:
Turbulence is thought to play a role in the heating of the solar wind plasma, though many questions remain to be solved regarding the exact nature of the mechanisms driving this process in the heliosphere. In particular, the physics of the collisionless interactions between particles and turbulent electromagnetic fields in the kinetic dissipation range of the turbulent cascade remains incompletely…
▽ More
Turbulence is thought to play a role in the heating of the solar wind plasma, though many questions remain to be solved regarding the exact nature of the mechanisms driving this process in the heliosphere. In particular, the physics of the collisionless interactions between particles and turbulent electromagnetic fields in the kinetic dissipation range of the turbulent cascade remains incompletely understood. A recent analysis of an interval of Magnetosphere Multiscale (MMS) observations has used the field-particle correlation technique to demonstrate that electron Landau dam** is involved in the dissipation of turbulence in the Earth's magnetosheath. Motivated by this discovery, we perform a high-resolution gyrokinetic numerical simulation of the turbulence in the MMS interval to investigate the role of electron Landau dam** in the dissipation of turbulent energy. We employ the field-particle correlation technique on our simulation data, compare our results to the known velocity-space signatures of Landau dam** outside the dissipation range, and evaluate the net electron energization. We find qualitative agreement between the numerical and observational results for some key aspects of the energization and speculate on the nature of disagreements in light of experimental factors, such as differences in resolution, and of develo** insights into the nature of field-particle interactions in the presence of dispersive kinetic Alfvén waves.
△ Less
Submitted 10 September, 2020;
originally announced September 2020.
-
Sorted Pooling in Convolutional Networks for One-shot Learning
Authors:
András Horváth
Abstract:
We present generalized versions of the commonly used maximum pooling operation: $k$th maximum and sorted pooling operations which selects the $k$th largest response in each pooling region, selecting locally consistent features of the input images. This method is able to increase the generalization power of a network and can be used to decrease training time and error rate of networks and it can si…
▽ More
We present generalized versions of the commonly used maximum pooling operation: $k$th maximum and sorted pooling operations which selects the $k$th largest response in each pooling region, selecting locally consistent features of the input images. This method is able to increase the generalization power of a network and can be used to decrease training time and error rate of networks and it can significantly improve accuracy in case of training scenarios where the amount of available data is limited, like one-shot learning scenarios
△ Less
Submitted 20 July, 2020;
originally announced July 2020.
-
Discrete diffusion semigroups associated with Dunkl-Jacobi and exceptional Jacobi polynomials
Authors:
Á. P. Horváth
Abstract:
Some weighted inequalities for the maximal operator with respect to the discrete diffusion semigroups associated with exceptional Jacobi and Dunkl-Jacobi polynomials are given. This setup allows to extend the corresponding results obtained for discrete heat semigroup recently to richer class of differential-difference operators.
Some weighted inequalities for the maximal operator with respect to the discrete diffusion semigroups associated with exceptional Jacobi and Dunkl-Jacobi polynomials are given. This setup allows to extend the corresponding results obtained for discrete heat semigroup recently to richer class of differential-difference operators.
△ Less
Submitted 11 July, 2020;
originally announced July 2020.
-
Deposition distribution of the new coronavirus (SARS-CoV-2) in the human airways upon exposure to cough-generated aerosol
Authors:
Balázs G. Madas,
Péter Füri,
Árpád Farkas,
Attila Nagy,
Aladár Czitrovszky,
Imre Balásházy,
Gusztáv G. Schay,
Alpár Horváth
Abstract:
The new coronavirus disease 2019 (COVID-19) has been emerged as a rapidly spreading pandemic. The disease is thought to spread mainly from person-to-person through respiratory droplets produced when an infected person coughs, sneezes, or talks. The pathogen of COVID-19 is the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). It infects the cells binding to the angiotensin-converting en…
▽ More
The new coronavirus disease 2019 (COVID-19) has been emerged as a rapidly spreading pandemic. The disease is thought to spread mainly from person-to-person through respiratory droplets produced when an infected person coughs, sneezes, or talks. The pathogen of COVID-19 is the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). It infects the cells binding to the angiotensin-converting enzyme 2 receptor (ACE2) which is expressed by cells throughout the airways as targets for cellular entry. Although the majority of persons infected with SARS-CoV-2 experience symptoms of mild upper respiratory tract infection, in some people infections of the peripheral airways result in severe, potentially fatal pneumonia. However, the induction of COVID-19 pneumonia requires that SARS-CoV-2 reaches the peripheral airways. While huge efforts have been made to understand the spread of the disease as well as the pathogenesis following cellular entry, much less attention is paid how SARS-CoV-2 from the environment reach the receptors of the target cells. The aim of the present study is to characterize the deposition distribution of SARS-CoV-2 in the airways upon exposure to cough-generated aerosol. For this purpose, the Stochastic Lung Deposition Model has been applied. Aerosol size distribution and breathing parameters were taken from the literature supposing normal breathing through the nose. We found that the probability of direct infection of the peripheral airways due to inhalation of aerosol generated by a bystander cough is very low. As the number of pathogens deposited in the extrathoracic airways is ~10 times higher than in the peripheral airways, we concluded that in most cases COVID-19 pneumonia must be preceded by SARS-CoV-2 infection of the upper airways. Our results suggest that without the enhancement of viral load in the upper airways, COVID-19 would be much less dangerous...
△ Less
Submitted 12 May, 2020;
originally announced May 2020.
-
Extending the Southern Shore of the Island of Inversion to $^{28}$F
Authors:
A. Revel,
O. Sorlin,
F. M. Marques,
Y. Kondo,
J. Kahlbow,
T. Nakamura,
N. A. Orr,
F. Nowacki,
J. A. Tostevin,
C. X. Yuan,
N. L. Achouri,
H. Al Falou,
L. Atar,
T. Aumann,
H. Baba,
K. Boretzky,
C. Caesar,
D. Calvet,
H. Chae,
N. Chiga,
A. Corsi,
H. L. Crawford,
F. Delaunay,
A. Delbart,
Q. Deshayes
, et al. (67 additional authors not shown)
Abstract:
Detailed spectroscopy of the neutron-unbound nucleus $^{28}$F has been performed for the first time following proton/neutron removal from $^{29}$Ne/$^{29}$F beams at energies around 230 MeV/nucleon. The invariant-mass spectra were reconstructed for both the $^{27}$F$^{(*)}+n$ and $^{26}$F$^{(*)}+2n$ coincidences and revealed a series of well-defined resonances. A near-threshold state was observed…
▽ More
Detailed spectroscopy of the neutron-unbound nucleus $^{28}$F has been performed for the first time following proton/neutron removal from $^{29}$Ne/$^{29}$F beams at energies around 230 MeV/nucleon. The invariant-mass spectra were reconstructed for both the $^{27}$F$^{(*)}+n$ and $^{26}$F$^{(*)}+2n$ coincidences and revealed a series of well-defined resonances. A near-threshold state was observed in both reactions and is identified as the $^{28}$F ground state, with $S_n(^{28}$F$)=-199(6)$ keV, while analysis of the $2n$ decay channel allowed a considerably improved $S_n(^{27}$F$)=1620(60)$ keV to be deduced. Comparison with shell-model predictions and eikonal-model reaction calculations have allowed spin-parity assignments to be proposed for some of the lower-lying levels of $^{28}$F. Importantly, in the case of the ground state, the reconstructed $^{27}$F$+n$ momentum distribution following neutron removal from $^{29}$F indicates that it arises mainly from the $1p_{3/2}$ neutron intruder configuration. This demonstrates that the island of inversion around $N=20$ includes $^{28}$F, and most probably $^{29}$F, and suggests that $^{28}$O is not doubly magic.
△ Less
Submitted 2 April, 2020;
originally announced April 2020.
-
Multiplication operator and exceptional Jacobi polynomials
Authors:
Á. P. Horváth
Abstract:
Below the normalized weighted reciprocal of the Christoffel function with respect to exceptional Jacobi polynomials is investigated. It is proved that it tends to the equilibrium measure of the interval of orthogonality in weak-star sense. The main tool of this study is the multiplication operator and examination of behavior of zeros of the corresponding average characteristic polynomial. Finally,…
▽ More
Below the normalized weighted reciprocal of the Christoffel function with respect to exceptional Jacobi polynomials is investigated. It is proved that it tends to the equilibrium measure of the interval of orthogonality in weak-star sense. The main tool of this study is the multiplication operator and examination of behavior of zeros of the corresponding average characteristic polynomial. Finally, as an application of multiplication operator, location of zeros of certain self-inversive polynomials are examined.
△ Less
Submitted 16 November, 2020; v1 submitted 26 March, 2020;
originally announced March 2020.
-
Radiation detection and energy conversion in nuclear reactor environments by hybrid photovoltaic perovskites
Authors:
Gàbor Nàfràdi,
Endre Horvàth,
Màrton Kollàr,
Andràs Horvàth,
Pavao Andri čević,
Andrzej Sienkiewicz,
Làszlò Forrò,
Bàlint Nàfràdi
Abstract:
Detection and direct power conversion of high energy and high intensity ionizing radiation could be a key element in next generation nuclear reactor safety systems and space-born devices. For example, the Fukushima catastrophe in 2011 could have been largely prevented if 1\% of the reactor's remnant radiation ($γ$-rays of the nuclear fission) were directly converted within the reactor to electrici…
▽ More
Detection and direct power conversion of high energy and high intensity ionizing radiation could be a key element in next generation nuclear reactor safety systems and space-born devices. For example, the Fukushima catastrophe in 2011 could have been largely prevented if 1\% of the reactor's remnant radiation ($γ$-rays of the nuclear fission) were directly converted within the reactor to electricity to power the water cooling circuit. It is reported here that the hybrid halide perovskite methylammonium lead triiodide could perfectly play the role of a converter. Single crystals were irradiated by a typical shut-down $γ$-spectrum of a nuclear reactor with 7.61E14 Bq activity exhibit a high-efficiency of $γ$-ray to free charge carrier conversion with radiation hardening. The power density of 0.3 mW/kg of methylammonium lead triiodide at 50 Sv/h means a four times higher efficiency than that for silicon-based cells. The material was stable to the limits of the experiment without changing its performance up to 100 Sv/h dose rate and 57 Sv H*(10) ambient total $γ$-dose. Moreover, the γ-shielding performance of methylammonium lead triiodide was found to be superior to both ordinary and barite concrete.
△ Less
Submitted 28 January, 2020;
originally announced January 2020.
-
On convex bodies that are characterizable by volume function
Authors:
Ákos G. Horváth
Abstract:
The "old-new" concept of convex-hull function was investigated by several authors in the last seventy years. A recent research on it led to some other volume functions as the covariogram function, the widthness function or the so-called brightness functions, respectively. A very interesting fact that there are many long-standing open problems connected with these functions whose serious investigat…
▽ More
The "old-new" concept of convex-hull function was investigated by several authors in the last seventy years. A recent research on it led to some other volume functions as the covariogram function, the widthness function or the so-called brightness functions, respectively. A very interesting fact that there are many long-standing open problems connected with these functions whose serious investigation closed before the "age of computers". In this survey, we concentrate only on the three-dimensional case, we will mention the most important concepts, statements, and problems.
△ Less
Submitted 8 August, 2019;
originally announced August 2019.
-
MimosaNet: An Unrobust Neural Network Preventing Model Stealing
Authors:
Kálmán Szentannai,
Jalal Al-Afandi,
András Horváth
Abstract:
Deep Neural Networks are robust to minor perturbations of the learned network parameters and their minor modifications do not change the overall network response significantly. This allows space for model stealing, where a malevolent attacker can steal an already trained network, modify the weights and claim the new network his own intellectual property. In certain cases this can prevent the free…
▽ More
Deep Neural Networks are robust to minor perturbations of the learned network parameters and their minor modifications do not change the overall network response significantly. This allows space for model stealing, where a malevolent attacker can steal an already trained network, modify the weights and claim the new network his own intellectual property. In certain cases this can prevent the free distribution and application of networks in the embedded domain. In this paper, we propose a method for creating an equivalent version of an already trained fully connected deep neural network that can prevent network stealing: namely, it produces the same responses and classification accuracy, but it is extremely sensitive to weight changes.
△ Less
Submitted 2 July, 2019;
originally announced July 2019.
-
Asymptotics for Recurrence Coefficients of X1-Jacobi Polynomials and Christoffel Function
Authors:
Á. P. Horváth
Abstract:
Computing asymptotics of the recurrence coefficients of X1-Jacobi polynomials we investigate the limit of Christoffel function. We also study the relation between the normalized counting measure based on the zeros of the modified average characteristic polynomial and the Christoffel function in limit. The proofs of corresponding theorems with respect to ordinary orthogonal polynomials are based on…
▽ More
Computing asymptotics of the recurrence coefficients of X1-Jacobi polynomials we investigate the limit of Christoffel function. We also study the relation between the normalized counting measure based on the zeros of the modified average characteristic polynomial and the Christoffel function in limit. The proofs of corresponding theorems with respect to ordinary orthogonal polynomials are based on the three-term recurrence relation. The main point is that exceptional orthogonal polynomials possess at least five-term formulae and so the Christoffel-Darboux formula also fails. It seems that these difficulties can be handled in combinatorial way.
△ Less
Submitted 27 May, 2019;
originally announced May 2019.
-
Application-level Studies of Cellular Neural Network-based Hardware Accelerators
Authors:
Qiuwen Lou,
Indranil Palit,
Tang Li,
Andras Horvath,
Michael Niemier,
X. Sharon Hu
Abstract:
As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co…
▽ More
As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.
△ Less
Submitted 12 June, 2019; v1 submitted 28 February, 2019;
originally announced March 2019.
-
Domain Partitioning Network
Authors:
Botos Csaba,
Adnane Boukhayma,
Viveka Kulharia,
András Horváth,
Philip H. S. Torr
Abstract:
Standard adversarial training involves two agents, namely a generator and a discriminator, playing a mini-max game. However, even if the players converge to an equilibrium, the generator may only recover a part of the target data distribution, in a situation commonly referred to as mode collapse. In this work, we present the Domain Partitioning Network (DoPaNet), a new approach to deal with mode c…
▽ More
Standard adversarial training involves two agents, namely a generator and a discriminator, playing a mini-max game. However, even if the players converge to an equilibrium, the generator may only recover a part of the target data distribution, in a situation commonly referred to as mode collapse. In this work, we present the Domain Partitioning Network (DoPaNet), a new approach to deal with mode collapse in generative adversarial learning. We employ multiple discriminators, each encouraging the generator to cover a different part of the target distribution. To ensure these parts do not overlap and collapse into the same mode, we add a classifier as a third agent in the game. The classifier decides which discriminator the generator is trained against for each sample. Through experiments on toy examples and real images, we show the merits of DoPaNet in covering the real distribution and its superiority with respect to the competing methods. Besides, we also show that we can control the modes from which samples are generated using DoPaNet.
△ Less
Submitted 21 February, 2019;
originally announced February 2019.
-
An extremal problem of regular simplices-The higher dimensional case
Authors:
Ákos G. Horváth
Abstract:
The new result of this paper connected with the following problem: Consider a supporting hyperplane of a regular simplex and its re ected image at this hyperplane. When will be the volume of the convex hull of these two simplices maximal? We prove that in the case when the dimension is less or equal to 4, the maximal volume achieves in that case when the hyperplane goes through on a vertex and ort…
▽ More
The new result of this paper connected with the following problem: Consider a supporting hyperplane of a regular simplex and its re ected image at this hyperplane. When will be the volume of the convex hull of these two simplices maximal? We prove that in the case when the dimension is less or equal to 4, the maximal volume achieves in that case when the hyperplane goes through on a vertex and orthogonal to the height of the simplex at this vertex. More interesting that in the higher dimensional cases this position is not optimal. We also determine the optimal position of hyperplane in the 5-dimensional case. This corrects an erroneous statement in my paper [3].
△ Less
Submitted 29 November, 2018;
originally announced November 2018.
-
A mixed signal architecture for convolutional neural networks
Authors:
Qiuwen Lou,
Chenyun Pan,
John McGuiness,
Andras Horvath,
Azad Naeemi,
Michael Niemier,
X. Sharon Hu
Abstract:
Deep neural network (DNN) accelerators with improved energy and delay are desirable for meeting the requirements of hardware targeted for IoT and edge computing systems. Convolutional neural networks (CoNNs) belong to one of the most popular types of DNN architectures. This paper presents the design and evaluation of an accelerator for CoNNs. The system-level architecture is based on mixed-signal,…
▽ More
Deep neural network (DNN) accelerators with improved energy and delay are desirable for meeting the requirements of hardware targeted for IoT and edge computing systems. Convolutional neural networks (CoNNs) belong to one of the most popular types of DNN architectures. This paper presents the design and evaluation of an accelerator for CoNNs. The system-level architecture is based on mixed-signal, cellular neural networks (CeNNs). Specifically, we present (i) the implementation of different layers, including convolution, ReLU, and pooling, in a CoNN using CeNN, (ii) modified CoNN structures with CeNN-friendly layers to reduce computational overheads typically associated with a CoNN, (iii) a mixed-signal CeNN architecture that performs CoNN computations in the analog and mixed signal domain, and (iv) design space exploration that identifies what CeNN-based algorithm and architectural features fare best compared to existing algorithms and architectures when evaluated over common datasets -- MNIST and CIFAR-10. Notably, the proposed approach can lead to 8.7$\times$ improvements in energy-delay product (EDP) per digit classification for the MNIST dataset at iso-accuracy when compared with the state-of-the-art DNN engine, while our approach could offer 4.3$\times$ improvements in EDP when compared to other network implementations for the CIFAR-10 dataset.
△ Less
Submitted 2 May, 2019; v1 submitted 30 October, 2018;
originally announced November 2018.
-
Spinal Cord Gray Matter-White Matter Segmentation on Magnetic Resonance AMIRA Images with MD-GRU
Authors:
Antal Horvath,
Charidimos Tsagkas,
Simon Andermatt,
Simon Pezold,
Katrin Parmar,
Philippe Cattin
Abstract:
The small butterfly shaped structure of spinal cord (SC) gray matter (GM) is challenging to image and to delinate from its surrounding white matter (WM). Segmenting GM is up to a point a trade-off between accuracy and precision. We propose a new pipeline for GM-WM magnetic resonance (MR) image acquisition and segmentation. We report superior results as compared to the ones recently reported in the…
▽ More
The small butterfly shaped structure of spinal cord (SC) gray matter (GM) is challenging to image and to delinate from its surrounding white matter (WM). Segmenting GM is up to a point a trade-off between accuracy and precision. We propose a new pipeline for GM-WM magnetic resonance (MR) image acquisition and segmentation. We report superior results as compared to the ones recently reported in the SC GM segmentation challenge and show even better results using the averaged magnetization inversion recovery acquisitions (AMIRA) sequence. Scan-rescan experiments with the AMIRA sequence show high reproducibility in terms of Dice coefficient, Hausdorff distance and relative standard deviation. We use a recurrent neural network (RNN) with multi-dimensional gated recurrent units (MD-GRU) to train segmentation models on the AMIRA dataset of 855 slices. We added a generalized dice loss to the cross entropy loss that MD-GRU uses and were able to improve the results.
△ Less
Submitted 7 August, 2018;
originally announced August 2018.
-
Pathology Segmentation using Distributional Differences to Images of Healthy Origin
Authors:
Simon Andermatt,
Antal Horváth,
Simon Pezold,
Philippe Cattin
Abstract:
Fully supervised segmentation methods require a large training cohort of already segmented images, providing information at the pixel level of each image. We present a method to automatically segment and model pathologies in medical images, trained solely on data labelled on the image level as either healthy or containing a visual defect. We base our method on CycleGAN, an image-to-image translati…
▽ More
Fully supervised segmentation methods require a large training cohort of already segmented images, providing information at the pixel level of each image. We present a method to automatically segment and model pathologies in medical images, trained solely on data labelled on the image level as either healthy or containing a visual defect. We base our method on CycleGAN, an image-to-image translation technique, to translate images between the domains of healthy and pathological images. We extend the core idea with two key contributions. Implementing the generators as residual generators allows us to explicitly model the segmentation of the pathology. Realizing the translation from the healthy to the pathological domain using a variational autoencoder allows us to specify one representation of the pathology, as this transformation is otherwise not unique. Our model hence not only allows us to create pixelwise semantic segmentations, it is also able to create inpaintings for the segmentations to render the pathological image healthy. Furthermore, we can draw new unseen pathology samples from this model based on the distribution in the data. We show quantitatively, that our method is able to segment pathologies with a surprising accuracy being only slightly inferior to a state-of-the-art fully supervised method, although the latter has per-pixel rather than per-image training information. Moreover, we show qualitative results of both the segmentations and inpaintings. Our findings motivate further research into weakly-supervised segmentation using image level annotations, allowing for faster and cheaper acquisition of training data without a large sacrifice in segmentation accuracy.
△ Less
Submitted 21 August, 2019; v1 submitted 25 May, 2018;
originally announced May 2018.
-
A note on the centers of a closed chain of circles
Authors:
Ákos G. Horváth
Abstract:
In this note we prove that the centers of a closed chain of circles for which every two consecutive members meet in the points of two given circles form a tangent polygon of a conic.
In this note we prove that the centers of a closed chain of circles for which every two consecutive members meet in the points of two given circles form a tangent polygon of a conic.
△ Less
Submitted 30 November, 2018; v1 submitted 27 April, 2018;
originally announced April 2018.
-
LASR-Guided Stellar Photometric Variability Subtraction: The Linear Algorithm For Significance Reduction
Authors:
John P. Ahlers,
Jason W. Barnes,
Sarah A. Horvath,
Samuel A. Myers,
Matthew M. Hedman
Abstract:
We develop a technique for removing stellar variability in the light curves of $δ$-Scuti and similar stars. Our technique, which we name the Linear Algorithm for Significance Reduction (LASR), subtracts oscillations from a time series by minimizing their statistical significance in frequency space. We demonstrate that LASR can subtract variable signals of near-arbitrary complexity and can robustly…
▽ More
We develop a technique for removing stellar variability in the light curves of $δ$-Scuti and similar stars. Our technique, which we name the Linear Algorithm for Significance Reduction (LASR), subtracts oscillations from a time series by minimizing their statistical significance in frequency space. We demonstrate that LASR can subtract variable signals of near-arbitrary complexity and can robustly handle close frequency pairs and overtone frequencies. We demonstrate that our algorithm performs an equivalent fit as prewhitening to the straightforward variable signal of KIC 9700322. We also show that LASR provides a better fit to seismic activity than prewhitening in the case of the complex $δ$-Scuti KOI-976.
△ Less
Submitted 10 April, 2018;
originally announced April 2018.
-
Translation operator with exceptional Laguerre polynomials
Authors:
Á. P. Horváth
Abstract:
We extend the notion of general translation operator to exceptional Laguerre polynomials. To this we investigate the associated singular hyperbolic Cauchy problem. We derive a maximum principle with respect to this Cauchy problem and applying it we determine the norm of the translation operator. As an application we give Nikol'skii inequalities with respect to exceptional Laguerre polynomials.
We extend the notion of general translation operator to exceptional Laguerre polynomials. To this we investigate the associated singular hyperbolic Cauchy problem. We derive a maximum principle with respect to this Cauchy problem and applying it we determine the norm of the translation operator. As an application we give Nikol'skii inequalities with respect to exceptional Laguerre polynomials.
△ Less
Submitted 17 July, 2018; v1 submitted 14 March, 2018;
originally announced March 2018.
-
Gallucci's axiom revisited
Authors:
Ákos G. Horváth
Abstract:
In this paper we propose a well-justified synthetic approach of the projective space. We define the concepts of plane and space of incidence and also the Gallucci's axiom as an axiom to our classical projective space. To this purpose we prove from our space axioms, the theorems of Desargues, Pappus, the fundamental theorem of projectivities, and the fundamental theorem of central-axial collinearit…
▽ More
In this paper we propose a well-justified synthetic approach of the projective space. We define the concepts of plane and space of incidence and also the Gallucci's axiom as an axiom to our classical projective space. To this purpose we prove from our space axioms, the theorems of Desargues, Pappus, the fundamental theorem of projectivities, and the fundamental theorem of central-axial collinearities, respectively. Our building up do not use any information on analytical projective geometry, as the concept of cross-ratio and the homogeneous coordinates of points.
△ Less
Submitted 15 January, 2018; v1 submitted 12 December, 2017;
originally announced December 2017.
-
A review of the deterministic and diffusion approximations for stochastic chemical reaction networks
Authors:
Pavel Mozgunov,
Marco Beccuti,
Andras Horvath,
Thomas Jaki,
Roberta Sirovich,
Enrico Bibbona
Abstract:
This work reviews deterministic and diffusion approximations of the stochastic chemical reaction networks and explains their applications. We discuss the added value the diffusion approximation provides for systems with different phenomena, such as a deficiency and a bistability. It is advocated that the diffusion approximation can be considered as an alternative theoretical approach to study the…
▽ More
This work reviews deterministic and diffusion approximations of the stochastic chemical reaction networks and explains their applications. We discuss the added value the diffusion approximation provides for systems with different phenomena, such as a deficiency and a bistability. It is advocated that the diffusion approximation can be considered as an alternative theoretical approach to study the reaction networks rather than a simulation shortcut. We discuss two examples in which the diffusion approximation is able to catch qualitative properties of reaction networks that the deterministic model misses. We provide an explicit construction of the original process and the diffusion approximation such that the distance between their trajectories is controlled and demonstrate this construction for the examples. We also discuss the limitations and potential directions of the developments.
△ Less
Submitted 12 January, 2018; v1 submitted 7 November, 2017;
originally announced November 2017.
-
On the constructibility of the axes of an ellipsoid
Authors:
Ákos G. Horváth,
István Prok
Abstract:
In this paper we discuss Chasles's construction on ellipsoid to draw the semi-axes from a complete system of conjugate diameters. We prove that there is such situation when the construction is not planar (the needed points cannot be constructed with compasses and ruler) and give some others in which the construction is planar.In this paper we discuss Chasles's construction on ellipsoid to draw the…
▽ More
In this paper we discuss Chasles's construction on ellipsoid to draw the semi-axes from a complete system of conjugate diameters. We prove that there is such situation when the construction is not planar (the needed points cannot be constructed with compasses and ruler) and give some others in which the construction is planar.In this paper we discuss Chasles's construction on ellipsoid to draw the semi-axes from a complete system of conjugate diameters. We prove that there is such situation when the construction is not planar (the needed points cannot be constructed with compasses and ruler) and give some others in which the construction is planar.
△ Less
Submitted 19 October, 2017;
originally announced October 2017.