-
GraLMatch: Matching Groups of Entities with Graphs and Language Models
Authors:
Fernando De Meer Pardo,
Claude Lehmann,
Dennis Gehrig,
Andrea Nagy,
Stefano Nicoli,
Branka Hadji Misheva,
Martin Braschler,
Kurt Stockinger
Abstract:
In this paper, we present an end-to-end multi-source Entity Matching problem, which we call entity group matching, where the goal is to assign to the same group, records originating from multiple data sources but representing the same real-world entity. We focus on the effects of transitively matched records, i.e. the records connected by paths in the graph G = (V,E) whose nodes and edges represent the records and whether they are a match or not. We present a real-world instance of this problem, where the challenge is to match records of companies and financial securities originating from different data providers. We also introduce two new multi-source benchmark datasets that present similar matching challenges as real-world records. A distinctive characteristic of these records is that they are regularly updated following real-world events, but updates are not applied uniformly across data sources. This phenomenon makes the matching of certain groups of records only possible through the use of transitive information.
In our experiments, we illustrate how considering transitively matched records is challenging since a limited amount of false positive pairwise match predictions can throw off the group assignment of large quantities of records. Thus, we propose GraLMatch, a method that can partially detect and remove false positive pairwise predictions through graph-based properties. Finally, we showcase how fine-tuning a Transformer-based model (DistilBERT) on a reduced number of labeled samples yields a better final entity group matching than training on more samples and/or incorporating fine-tuning optimizations, illustrating how precision becomes the deciding factor in the entity group matching of large volumes of records.
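The effect of transitive matching described in the abstract can be illustrated with a minimal sketch that assigns groups as connected components of the graph of predicted pairwise matches. This is not the GraLMatch method itself; the record IDs and the function name `group_records` are hypothetical, and the union-find grouping is an assumption chosen for illustration.

```python
from collections import defaultdict


def group_records(records, matched_pairs):
    """Group records into connected components of the pairwise-match graph."""
    parent = {r: r for r in records}

    def find(x):
        # Walk to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in matched_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    groups = defaultdict(set)
    for r in records:
        groups[find(r)].add(r)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical records: three for entity "a", two for entity "b".
records = ["a1", "a2", "a3", "b1", "b2"]
true_pairs = [("a1", "a2"), ("a2", "a3"), ("b1", "b2")]

clean = group_records(records, true_pairs)
# One false positive pair is enough to merge both groups transitively.
noisy = group_records(records, true_pairs + [("a3", "b1")])
print(clean)
print(noisy)
```

In this toy case a single false positive edge collapses two entity groups into one, which is the sensitivity to precision that the abstract points to.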
Submitted 21 June, 2024;
originally announced June 2024.
-
The story of SN 2021aatd -- a peculiar 1987A-like supernova with an early-phase luminosity excess
Authors:
T. Szalai,
R. Könyves-Tóth,
A. P. Nagy,
D. Hiramatsu,
I. Arcavi,
A. Bostroem,
D. A. Howell,
J. Farah,
C. McCully,
M. Newsome,
E. Padilla Gonzalez,
C. Pellegrino,
G. Terreran,
E. Berger,
P. Blanchard,
S. Gomez,
P. Székely,
D. Bánhidi,
I. B. Bíró,
I. Csányi,
A. Pál,
J. Rho,
J. Vinkó
Abstract:
There is a growing number of peculiar events that cannot be assigned to any of the main supernova (SN) classes. SN 1987A and a handful of similar objects, thought to be explosive outcomes of blue supergiant stars, belong to them: while their spectra closely resemble those of H-rich (IIP) SNe, their light-curve (LC) evolution is very different. Here we present the detailed photometric and spectroscopic analysis of SN 2021aatd, a peculiar Type II explosion: while its early-time evolution resembles that of the slowly evolving, double-peaked SN 2020faa (however, at a lower luminosity scale), after $\sim$40 days, its LC shape becomes similar to that of SN 1987A-like explosions. Beyond comparing LCs, color curves, and spectra of SN 2021aatd to those of SNe 2020faa, 1987A, and other objects, we compare the observed spectra with our own SYN++ models and with the outputs of published radiative transfer models. We also modeled the pseudo-bolometric LCs of SNe 2021aatd and 1987A assuming a two-component (core+shell) ejecta, and involving the rotational energy of a newborn magnetar in addition to radioactive decay. We find that both the photometric and spectroscopic evolution of SN 2021aatd can be well described with the explosion of a $\sim$15 $M_\odot$ blue supergiant star. Nevertheless, SN 2021aatd shows higher temperatures and weaker Na ID and Ba II 6142 Å lines than SN 1987A, which is more reminiscent of IIP-like atmospheres. With the applied two-component ejecta model (accounting for both decay and magnetar energy), we can successfully describe the bolometric LC of SN 2021aatd, including the first $\sim$40-day long phase showing an excess compared to 87A-like SNe but being strikingly similar to that of the long-lived SN 2020faa. Nevertheless, finding a unified model that also explains the LCs of more luminous events (like SN 2020faa) is still a matter of concern.
Submitted 4 June, 2024;
originally announced June 2024.
-
Novel oracle constructions for quantum random access memory
Authors:
Ákos Nagy,
Cindy Zhang
Abstract:
We present new designs for quantum random access memory. More precisely, for each function, $f : \mathbb{F}_2^n \rightarrow \mathbb{F}_2^d$, we construct oracles, $\mathcal{O}_f$, with the property \begin{equation}
\mathcal{O}_f \left| x \right\rangle_n \left| 0 \right\rangle_d = \left| x \right\rangle_n \left| f(x) \right\rangle_d. \end{equation} Our methods are based on the Walsh-Hadamard Transform of $f$, viewed as an integer valued function. In general, the complexity of our method scales with the sparsity of the Walsh-Hadamard Transform and not the sparsity of $f$, yielding more favorable constructions in cases such as binary optimization problems and functions with low-degree Walsh-Hadamard Transforms. Furthermore, our design comes with a tuneable number of ancillas that can trade depth for size. In the ancilla-free design, these oracles can be $ε$-approximated so that the Clifford + $T$ depth is $O \left( \left( n + \log_2 \left( \tfrac{d}ε \right) \right) \mathcal{W}_f \right)$, where $\mathcal{W}_f$ is the number of nonzero components in the Walsh-Hadamard Transform. The depth of the shallowest version is $O \left( n + \log_2 \left( \tfrac{d}ε \right) \right)$, using $n + d \mathcal{W}_f$ qubits. The connectivity of these circuits is also only logarithmic in $\mathcal{W}_f$. As an application, we show that for boolean functions with low approximate degrees (as in the case of read-once formulas) the complexities of the corresponding QRAM oracles scale only as $2^{\widetilde{O} \left( \sqrt{n} \log_2 \left( n \right) \right)}$.
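The quantity $\mathcal{W}_f$ above, the number of nonzero Walsh-Hadamard components, can be computed classically for small examples. The sketch below is illustrative only and does not construct any oracle: it assumes $d = 1$, an unnormalized transform $\hat{f}(y) = \sum_x (-1)^{x \cdot y} f(x)$ (the abstract does not fix a normalization), and hypothetical function names.

```python
def walsh_hadamard(values):
    """Unnormalized fast Walsh-Hadamard transform of a list of length 2**n."""
    out = list(values)
    h = 1
    while h < len(out):
        for start in range(0, len(out), 2 * h):
            for i in range(start, start + h):
                a, b = out[i], out[i + h]
                out[i], out[i + h] = a + b, a - b
        h *= 2
    return out


def wh_sparsity(values):
    """Number of nonzero Walsh-Hadamard components (W_f in the abstract)."""
    return sum(1 for c in walsh_hadamard(values) if c != 0)


# Example: the parity function on n = 3 bits, tabulated as integers 0/1.
n = 3
parity = [bin(x).count("1") % 2 for x in range(2 ** n)]

spectrum = walsh_hadamard(parity)
nonzero_values = sum(1 for v in parity if v != 0)
print(spectrum, wh_sparsity(parity), nonzero_values)
```

Parity takes a nonzero value on half of all inputs, yet its transform has only two nonzero components, which is the kind of case where scaling with the sparsity of the transform rather than of $f$ is favorable.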
Submitted 13 June, 2024; v1 submitted 30 May, 2024;
originally announced May 2024.
-
From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization
Authors:
Botond Barta,
Dorina Lakatos,
Attila Nagy,
Milán Konor Nyist,
Judit Ács
Abstract:
Training summarization models requires substantial amounts of training data. However, for lower-resourced languages like Hungarian, openly available models and datasets are notably scarce. To address this gap, our paper introduces HunSum-2, an open-source Hungarian corpus suitable for training abstractive and extractive summarization models. The dataset is assembled from segments of the Common Crawl corpus that undergo thorough cleaning, preprocessing and deduplication. In addition to abstractive summarization, we generate sentence-level labels for extractive summarization using sentence similarity. We train baseline models for both extractive and abstractive summarization using the collected dataset. To demonstrate the effectiveness of the trained models, we perform both quantitative and qualitative evaluation. Our dataset, models and code are publicly available, encouraging replication, further research, and real-world applications across various domains.
Submitted 12 April, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
On locally symmetric polynomial metrics: Riemannian and Finslerian surfaces
Authors:
Csaba Vincze,
Márk Oláh,
Ábris Nagy
Abstract:
In the paper we investigate locally symmetric polynomial metrics in special cases of Riemannian and Finslerian surfaces. The Riemannian case will be presented by a collection of basic results (regularity of second root metrics) and formulas up to Gauss curvature. In case of Finslerian surfaces we formulate necessary and sufficient conditions for a locally symmetric fourth root metric in 2D to be positive definite. They are given in terms of the coefficients of the polynomial metric to make checking the positive definiteness as simple and direct as possible. Explicit examples are also presented. The situation is more complicated in case of spaces of dimension more than two. Some necessary conditions and an explicit example are given for a positive definite locally symmetric polynomial metric in 3D. Computations are supported by the MAPLE mathematics software (LinearAlgebra).
Submitted 14 March, 2024;
originally announced March 2024.
-
Bounded distributive lattices with strict implication and weak difference
Authors:
Sergio Celani,
Agustín Nagy,
William Zuluaga Botero
Abstract:
In this paper we introduce the class of weak Heyting Brouwer algebras (WHB-algebras, for short). We extend the well known duality between distributive lattices and Priestley spaces, in order to exhibit a relational Priestley-like duality for WHB-algebras. Finally, as an application of the duality, we build the tense extension of a WHB-algebra and we employ it as a tool for proving structural properties of the variety such as the finite model property, the amalgamation property, the congruence extension property and the Maehara interpolation property.
Submitted 17 December, 2023;
originally announced December 2023.
-
Fixed-point Grover Adaptive Search for Binary Optimization Problems
Authors:
Ákos Nagy,
Jaime Park,
Cindy Zhang,
Atithi Acharya,
Alex Khan
Abstract:
We study a Grover-type method for Quadratic Binary Optimization problems. In the unconstrained (QUBO) case, for an $n$-dimensional problem with $m$ nonzero terms, we construct a marker oracle for such problems with a tuneable parameter, $Λ\in \left[ 1, m \right] \cap \mathbb{Z}$. At $d \in \mathbb{Z}_+$ precision, the oracle uses $O \left( n + Λd \right)$ qubits, has total depth $O \left( \tfrac{m}Λ \log_2 \left( n \right) + \log_2 \left( d \right) \right)$, and non-Clifford depth of $O \left( \tfrac{m}Λ \right)$. Moreover, each qubit is required to be connected to at most $O \left( \log_2 \left( Λ+ d \right) \right)$ other qubits. In the case of maximum graph cuts, as $d = 2 \log_2 \left( n \right)$ always suffices, the depth of the marker oracle can be made as shallow as $O \left( \log_2 \left( n \right) \right)$. For all values of $Λ$, the non-Clifford gate count of these oracles is strictly lower (by a factor of $\sim 2$) than previous constructions.
We then introduce a novel \emph{Fixed-point Grover Adaptive Search for QUBO Problems}, using our oracle design and a hybrid Fixed-point Grover Search of Li et al. This method has better performance guarantees than previous Grover Adaptive Search methods. Finally, we give a heuristic argument that, with high probability and in $O \left( \tfrac{\log_2 \left( n \right)}{\sqrtε} \right)$ time, this adaptive method finds a configuration that is among the best $ε2^n$ ones.
Submitted 16 May, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
TreeSwap: Data Augmentation for Machine Translation via Dependency Subtree Swapping
Authors:
Attila Nagy,
Dorina Lakatos,
Botond Barta,
Judit Ács
Abstract:
Data augmentation methods for neural machine translation are particularly useful when a limited amount of training data is available, which is often the case when dealing with low-resource languages. We introduce a novel augmentation method, which generates new sentences by swapping objects and subjects across bisentences. This is performed simultaneously based on the dependency parse trees of the source and target sentences. We name this method TreeSwap. Our results show that TreeSwap achieves consistent improvements over baseline models in 4 language pairs in both directions on resource-constrained datasets. We also explore domain-specific corpora, but find that our method does not make significant improvements on law, medical and IT data. We report the scores of similar augmentation methods and find that TreeSwap performs comparably. We also analyze the generated sentences qualitatively and find that the augmentation produces a correct translation in most cases. Our code is available on GitHub.
Submitted 4 November, 2023;
originally announced November 2023.
-
On a probabilistic problem on finite semigroups
Authors:
Attila Nagy,
Csaba Tóth
Abstract:
In this paper we deal with the following problem: how does the structure of a finite semigroup $S$ depend on the probability that two elements selected at random from $S$, with replacement, define the same inner right translation of $S$. We solve a subcase of this problem. As the main result of the paper, we show how to construct not necessarily finite medial semigroups in which the index of the kernel of the right regular representation equals two.
Submitted 16 October, 2023;
originally announced October 2023.
-
TensorBank: Tensor Lakehouse for Foundation Model Training
Authors:
Romeo Kienzler,
Leonardo Pondian Tizzei,
Benedikt Blumenstiel,
Zoltan Arnold Nagy,
S. Karthik Mukkavilli,
Johannes Schmude,
Marcus Freitag,
Michael Behrendt,
Daniel Salles Civitarese,
Naomi Simumba,
Daiki Kimura,
Hendrik Hamann
Abstract:
Storing and streaming high dimensional data for foundation model training became a critical requirement with the rise of foundation models beyond natural language. In this paper we introduce TensorBank, a petabyte scale tensor lakehouse capable of streaming tensors from Cloud Object Store (COS) to GPU memory at wire speed based on complex relational queries. We use Hierarchical Statistical Indices (HSI) for query acceleration. Our architecture allows tensors to be addressed directly at block level using HTTP range reads. Once in GPU memory, data can be transformed using PyTorch transforms. We provide a generic PyTorch dataset type with a corresponding dataset factory translating relational queries and requested transformations as an instance. By making use of the HSI, irrelevant blocks can be skipped without reading them, as those indices contain statistics on their content at different hierarchical resolution levels. This is an opinionated architecture powered by open standards and making heavy use of open-source technology. Although hardened for production use with geospatial-temporal data, this architecture generalizes to other use cases such as computer vision, computational neuroscience, biological sequence analysis and more.
Submitted 21 March, 2024; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Data Augmentation for Machine Translation via Dependency Subtree Swapping
Authors:
Attila Nagy,
Dorina Petra Lakatos,
Botond Barta,
Patrick Nanys,
Judit Ács
Abstract:
We present a generic framework for data augmentation via dependency subtree swapping that is applicable to machine translation. We extract corresponding subtrees from the dependency parse trees of the source and target sentences and swap these across bisentences to create augmented samples. We perform thorough filtering based on graph-based similarities of the dependency trees and additional heuristics to ensure that extracted subtrees correspond to the same meaning. We conduct resource-constrained experiments on 4 language pairs in both directions using the IWSLT text translation datasets and the Hunglish2 corpus. The results demonstrate consistent improvements in BLEU score over our baseline models in 3 out of 4 language pairs. Our code is available on GitHub.
Submitted 13 July, 2023;
originally announced July 2023.
-
Three is the magic number -- distance measurement of NGC 3147 using SN 2021hpr and its siblings
Authors:
Barnabas Barna,
Andrea P. Nagy,
Zsofia Bora,
Donat R. Czavalinga,
Reka Konyves-Toth,
Tamas Szalai,
Peter Szekely,
Szanna Zsiros,
Dominik Banhidi,
Barna I. Biro,
Istvan Csanyi,
Levente Kriskovics,
Andras Pal,
Zsofia M. Szabo,
Robert Szakats,
Krisztian Vida,
Zsofia Bodola,
Jozsef Vinko
Abstract:
The nearby spiral galaxy NGC 3147 hosted three Type Ia supernovae (SNe Ia) in the past decades, which have been subjects of intense follow-up observations. Simultaneous analysis of their data provides a unique opportunity for testing the different light curve fitting methods and distance estimations. The detailed optical follow-up of SN 2021hpr allows us to revise the previous distance estimations to NGC 3147, and compare the widely used light curve fitting algorithms to each other. After the combination of the available and newly published data of SN 2021hpr, its physical properties can also be estimated with higher accuracy. We present and analyse new BVgriz and Swift photometry of SN 2021hpr to constrain its general physical properties. Together with its siblings, SNe 1997bq and 2008fv, we cross-compare the individual distance estimates of these three SNe given by the SALT code, and also check their consistency with the results from the MLCS2k2 method. The early spectral series of SN 2021hpr are also fit with the radiative spectral code TARDIS in order to verify the explosion properties and constrain the chemical distribution of the outer ejecta. After combining the distance estimates for the three SNe, the mean distance to their host galaxy, NGC 3147, is 42.5 $\pm$ 1.0 Mpc, which matches the distance inferred by the most up-to-date LC fitters, SALT3 and BayeSN. We confirm that SN 2021hpr is a Branch-normal Type Ia SN that ejected $\sim 1.12 \pm 0.28$ M$_\odot$ from its progenitor white dwarf, and synthesized $\sim 0.44 \pm 0.14$ M$_\odot$ of radioactive $^{56}$Ni.
Submitted 3 July, 2023;
originally announced July 2023.
-
Evaluation of AI-Supported Input Methods in Augmented Reality Environment
Authors:
Akos Nagy,
Thomas Lagkas,
Panagiotis Sarigiannidis,
Vasileios Argyriou
Abstract:
Augmented Reality (AR) solutions are providing tools that could improve applications in the medical and industrial fields. Augmentation can provide additional information in training, visualization, and work scenarios, to increase efficiency, reliability, and safety, while improving communication with other devices and systems on the network. Unfortunately, tasks in these fields often require both hands to execute, reducing the variety of input methods suitable to control AR applications. People with certain physical disabilities who are not able to use their hands are also negatively impacted when using these devices. The goal of this work is to provide novel hands-free interfacing methods, using AR technology, in association with AI support approaches to produce an improved Human-Computer interaction solution.
Submitted 29 June, 2023;
originally announced June 2023.
-
AI-Powered Interfaces for Extended Reality to support Remote Maintenance
Authors:
Akos Nagy,
George Amponis,
Konstantinos Kyranou,
Thomas Lagkas,
Alexandros Apostolos Boulogeorgos,
Panagiotis Sarigiannidis,
Vasileios Argyriou
Abstract:
High-end components that conduct complicated tasks automatically are a part of modern industrial systems. However, in order for these parts to function at the desired level, they need to be maintained by qualified experts. Solutions based on Augmented Reality (AR) have been established with the goal of raising production rates and quality while lowering maintenance costs. With the introduction of two unique interaction interfaces based on wearable targets and human face orientation, we propose hands-free advanced interactive solutions in this study with the goal of reducing the bias towards certain users. Using traditional devices in real time, a comparative investigation using alternative interaction interfaces is conducted. The suggested solutions are supported by various AI-powered methods, such as a novel gravity-map based motion adjustment that is made possible by predictive deep models that reduce the bias of traditional hand- or finger-based interaction interfaces.
Submitted 29 June, 2023;
originally announced June 2023.
-
MAC, a novel stochastic optimization method
Authors:
Attila László Nagy,
Goitom Simret Kidane,
Tamás Turányi,
János Tóth
Abstract:
A novel stochastic optimization method called MAC was suggested. The method is based on the calculation of the objective function at several random points and then an empirical expected value and an empirical covariance matrix are calculated. The empirical expected value is proven to converge to the optimum value of the problem. The MAC algorithm was encoded in Matlab and the code was tested on 20 test problems. Its performance was compared with those of the interior point method (Matlab name: fmincon), simplex, pattern search (PS), simulated annealing (SA), particle swarm optimization (PSO), and genetic algorithm (GA) methods. The MAC method failed two test functions and provided inaccurate results on four other test functions. However, it provided accurate results and required much less CPU time than the widely used optimization methods on the other 14 test functions.
Submitted 14 April, 2023;
originally announced April 2023.
-
On taxicab distance mean functions and their geometric applications: methods, implementations and examples
Authors:
Csaba Vincze,
Ábris Nagy
Abstract:
A distance mean function measures the average distance of points from the elements of a given set of points (focal set) in the space. The level sets of a distance mean function are called generalized conics. In the case of infinitely many focal points, the average distance is typically given by integration over the focal set. The paper contains a survey on the applications of taxicab distance mean functions and the theory of generalized conics in geometric tomography: bisection of the focal set and reconstruction problems by coordinate X-rays. The theoretical results are illustrated by implementations in Maple, as well as by methods and examples.
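For a finite focal set, the definition above reduces to an arithmetic mean of taxicab distances. The following minimal sketch assumes a finite focal set in the plane; the function name `taxicab_mean` and the sample points are hypothetical, and the paper's own implementations are in Maple rather than Python.

```python
def taxicab_mean(point, focal_set):
    """Average taxicab (L1) distance from `point` to the points of `focal_set`."""
    total = 0.0
    for focus in focal_set:
        total += sum(abs(p - q) for p, q in zip(point, focus))
    return total / len(focal_set)


# Hypothetical focal set with two foci on the x-axis.
foci = [(-1.0, 0.0), (1.0, 0.0)]

at_origin = taxicab_mean((0.0, 0.0), foci)
on_level_a = taxicab_mean((0.0, 1.0), foci)
on_level_b = taxicab_mean((2.0, 0.0), foci)
print(at_origin, on_level_a, on_level_b)
```

The last two points give the same value, so they lie on the same level set, that is, on the same generalized conic in the abstract's terminology.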
Submitted 24 July, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
In-situ coating of silicon-rich films on tokamak plasma-facing components with real-time Si material injection
Authors:
Florian Effenberg,
Shota Abe,
Gregory Sinclair,
Tyler Abrams,
Alessandro Bortolon,
William R. Wampler,
Florian M. Laggner,
Dmitry L. Rudakov,
Igor Bykov,
Charles J. Lasnier,
David Mauzey,
Alexander Nagy,
Raffi Nazikian,
Filippo Scotti,
Huiqian Wang,
Robert S. Wilcox,
the DIII-D Team
Abstract:
Experiments have been conducted in the DIII-D tokamak to explore the in-situ growth of silicon-rich layers as a potential technique for real-time replenishment of surface coatings on plasma-facing components (PFCs) during steady-state long-pulse reactor operation. Silicon (Si) pellets of 1 mm diameter were injected into low- and high-confinement (L-mode and H-mode) plasma discharges with densities ranging from $3.9-7.5\times10^{19}$ m$^{-3}$ and input powers ranging from $5.5-9$ MW. The small Si pellets were delivered with the impurity granule injector (IGI) at frequencies ranging from 4-16 Hz corresponding to mass flow rates of $5-19$ mg/s ($1-4.2\times10^{20}$ Si/s) at cumulative amounts of up to 34 mg of Si per five-second discharge. Graphite samples were exposed to the scrape-off layer and private flux region plasmas through the divertor material evaluation system (DiMES) to evaluate the Si deposition on the divertor targets. The Si II emission at the sample correlates with silicon injection and suggests net surface Si-deposition in measurable amounts. Post-mortem analysis showed Si-rich coatings containing silicon oxides, of which SiO$_2$ is the dominant component. No evidence of SiC was found, which is attributed to low divertor surface temperatures. The in-situ and ex-situ analysis found that Si-rich coatings of at least $0.4-1.2$ nm thickness have been deposited at $0.4-0.7$ nm/s. The technique is estimated to coat a surface area of at least 0.94 m$^2$ on the outer divertor. These results demonstrate the potential of using real-time material injection to form Si-enriched layers on divertor PFCs during reactor operation.
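The quoted injection figures can be cross-checked with a short calculation. The sketch below assumes solid spherical pellets with the bulk density of crystalline silicon (about 2.33 g/cm³), which the abstract does not state; the function names are hypothetical.

```python
import math

SI_DENSITY_G_PER_CM3 = 2.33        # assumed: bulk crystalline silicon
SI_MOLAR_MASS_G_PER_MOL = 28.0855  # standard atomic weight of Si
AVOGADRO_PER_MOL = 6.02214076e23


def pellet_mass_mg(diameter_mm):
    """Mass of one solid spherical Si pellet, in milligrams."""
    radius_cm = diameter_mm / 20.0  # mm diameter -> cm radius
    volume_cm3 = 4.0 / 3.0 * math.pi * radius_cm ** 3
    return SI_DENSITY_G_PER_CM3 * volume_cm3 * 1000.0


def mass_rate_mg_per_s(diameter_mm, frequency_hz):
    """Injected mass per second at a given pellet frequency."""
    return pellet_mass_mg(diameter_mm) * frequency_hz


def atom_rate_per_s(diameter_mm, frequency_hz):
    """Injected Si atoms per second at a given pellet frequency."""
    grams_per_s = mass_rate_mg_per_s(diameter_mm, frequency_hz) / 1000.0
    return grams_per_s / SI_MOLAR_MASS_G_PER_MOL * AVOGADRO_PER_MOL


for hz in (4, 16):
    print(hz, mass_rate_mg_per_s(1.0, hz), atom_rate_per_s(1.0, hz))
```

Under these assumptions, 1 mm pellets at 4 to 16 Hz give roughly 5 to 19.5 mg/s and roughly 1.0 to 4.2 × 10^20 atoms/s, in line with the ranges quoted in the abstract.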
Submitted 9 August, 2023; v1 submitted 8 April, 2023;
originally announced April 2023.
-
HunSum-1: an Abstractive Summarization Dataset for Hungarian
Authors:
Botond Barta,
Dorina Lakatos,
Attila Nagy,
Milán Konor Nyist,
Judit Ács
Abstract:
We introduce HunSum-1: a dataset for Hungarian abstractive summarization, consisting of 1.14M news articles. The dataset is built by collecting, cleaning and deduplicating data from 9 major Hungarian news sites through CommonCrawl. Using this dataset, we build abstractive summarizer models based on huBERT and mT5. We demonstrate the value of the created dataset by performing a quantitative and qualitative analysis on the models' results. The HunSum-1 dataset, all models used in our experiments and our code are available open source.
Submitted 1 February, 2023;
originally announced February 2023.
-
On left legal semigroups
Authors:
Attila Nagy
Abstract:
In this paper we study semigroups satisfying the identity $aba=ab$.
Submitted 15 July, 2023; v1 submitted 20 January, 2023;
originally announced January 2023.
-
Leukemia detection based on microscopic blood smear images using deep learning
Authors:
Abdelmageed Ahmed,
Alaa Nagy,
Ahmed Kamal,
Daila Farghl
Abstract:
In this paper we discuss a new method for detecting leukemia early in microscopic blood smear images using deep neural networks. Leukemia is considered one of the most dangerous causes of mortality in humans. The traditional process of diagnosing leukemia in blood is complex, costly, and time-consuming, so patients may not receive medical treatment on time. Computer vision classification using deep learning can overcome the problems of traditional blood smear analysis. Our system for leukemia detection provides 97.3% accuracy in classifying samples as cancerous or normal: an image of the blood smear is taken and passed as input to the system, which checks whether it contains cancer cells. If it does, the hematological expert passes the sample to a more complex device, such as a flow cytometer, to generate complete information about the progress of cancer in the blood.
Submitted 19 December, 2022;
originally announced January 2023.
-
Ising Model Partition Function Computation as a Weighted Counting Problem
Authors:
Shaan A. Nagy,
Roger Paredes,
Jeffrey M. Dudek,
Leonardo Dueñas-Osorio,
Moshe Y. Vardi
Abstract:
While the Ising model remains essential to understand physical phenomena, its natural connection to combinatorial reasoning makes it also one of the best models to probe complex systems in science and engineering. We bring a computational lens to the study of Ising models, where our computer-science perspective is two-fold: On the one hand, we consider the computational complexity of the Ising partition-function problem, or #Ising, and relate it to the logic-based counting of constraint-satisfaction problems, or #CSP. We show that known dichotomy results for #CSP give an easy proof of the hardness of #Ising and provide new intuition on where the difficulty of #Ising comes from. On the other hand, we also show that #Ising can be reduced to Weighted Model Counting (WMC). This enables us to take off-the-shelf model counters and apply them to #Ising. We show that this WMC approach outperforms state-of-the-art specialized tools for #Ising, thereby expanding the range of solvable problems in computational physics.
Submitted 24 December, 2022;
originally announced December 2022.
-
How to solve the mass-discrepancy problem of SESNe -- I. Testing model approximations
Authors:
Andrea P. Nagy
Abstract:
Here, we present a systematic study of 59 stripped-envelope supernovae (SESNe) (including Type IIb, Ib, Ic, and transitional events) to map a possible reason for the so-called mass-discrepancy problem. In this scenario, we assume the tension between the estimated ejected masses from early- and late-time light curves (LC) is due to approximations generally used in analytical models. First, we examine the assumption that the R-band light curve is indeed a good approximation of the bolometric light curve. Next, we test the generally used assumption that rise-time to maximum brightness is equal to the effective diffusion time-scale that can be used to derive the ejecta mass from the early LC. In addition, we analyze the effect of gamma-ray and positron-leakage, which play an important role in forming the shape of the tails of SESNe, and also can be crucial to gaining the ejecta masses from the late-time LC data. Finally, we consider the effect of the different definitions of velocity that are needed for the ejecta mass calculations.
Submitted 7 November, 2022; v1 submitted 19 October, 2022;
originally announced October 2022.
-
On the bifurcation theory of the Ginzburg-Landau equations
Authors:
Ákos Nagy,
Gonçalo Oliveira
Abstract:
We construct nonminimal and irreducible solutions to the Ginzburg-Landau equations on closed manifolds of arbitrary dimension with trivial first real cohomology. Our method uses bifurcation theory where the "bifurcation points" are characterized by the eigenvalues of a Laplace-type operator. To our knowledge these are the first such examples on nontrivial line bundles.
Submitted 7 April, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Modal expansions of ririgs
Authors:
Agustín L. Nagy,
William J. Zuluaga Botero
Abstract:
In this paper we introduce the variety of I-modal ririgs. We characterize the congruence lattice of its members by means of I-filters and we provide a description on I-filter generation. We also provide an axiomatic presentation for the variety generated by chains of the subvariety of contractive I-modal ririgs. Finally, we introduce a Hilbert-style calculus of a logic with I-modal ririgs as an equivalent algebraic semantics and we prove that such a logic has the parametrized local deduction-detachment theorem.
Submitted 9 August, 2022;
originally announced August 2022.
-
On the hyperbolic Bloch transform
Authors:
Ákos Nagy,
Steven Rayan
Abstract:
Motivated by recent theoretical and experimental developments in the physics of hyperbolic crystals, we study the noncommutative Bloch transform of Fuchsian groups that we call the hyperbolic Bloch transform. First, we prove that the hyperbolic Bloch transform is injective and "asymptotically unitary" already in the simplest case, that is when the Hilbert space is the regular representation of the Fuchsian group, $Γ$. Second, when $Γ\subset \mathrm{PSU} (1, 1)$ acts isometrically on the hyperbolic plane, $\mathbb{H}$, and the Hilbert space is $L^2 \left( \mathbb{H} \right)$, then we define a modified, geometric Bloch transform, that sends wave functions to sections of stable, flat bundles over $Σ= \mathbb{H} / Γ$ and transforms the hyperbolic Laplacian into the covariant Laplacian.
Submitted 9 August, 2022; v1 submitted 4 August, 2022;
originally announced August 2022.
-
On the construction of monopoles with arbitrary symmetry breaking
Authors:
Benoit Charbonneau,
Ákos Nagy
Abstract:
We produce finite energy BPS monopoles with prescribed arbitrary symmetry breaking from a new class of solutions to Nahm's equation.
Submitted 7 June, 2022; v1 submitted 30 May, 2022;
originally announced May 2022.
-
SN 2019va: A Type IIP Supernova with Large Influence of Nickel-56 Decay on the Plateau-phase Light Curve
Authors:
Xinghan Zhang,
Xiaofeng Wang,
Hanna Sai,
Jun Mo,
A. P. Nagy,
Jicheng Zhang,
Yongzhi Cai,
Han Lin,
Jujia Zhang,
E. Baron,
J. M. DerKacy,
T. -M. Zhang,
Zhitong Li,
Melissa Graham,
F. Huang
Abstract:
We present multi-band photometric and spectroscopic observations of the type II supernova, (SN) 2019va, which shows an unusually flat plateau-phase evolution in its V-band light curve. Its pseudo-bolometric light curve even shows a weak brightening towards the end of the plateau phase. These uncommon features are related to the influence of 56Ni decay on the light curve during the plateau phase, when the SN emission is usually dominated by cooling of the envelope. The inferred 56Ni mass of SN 2019va is 0.088+/-0.018 solar mass, which is significantly larger than most SNe II. To estimate the influence of 56Ni decay on the plateau-phase light curve, we calculate the ratio (dubbed as eta_Ni) between the integrated time-weighted energy from 56Ni decay and that from envelope cooling within the plateau phase, obtaining a value of 0.8 for SN 2019va, which is the second largest value among SNe II that have been measured. After removing the influence of 56Ni decay on the plateau-phase light curve, we found that the progenitor/explosion parameters derived for SN 2019va are more reasonable. In addition, SN 2019va is found to have weaker metal lines in its spectra compared to other SNe IIP at similar epochs, implying a low-metallicity progenitor, which is consistent with the metal-poor environment inferred from the host-galaxy spectrum. We further discuss the possible reasons that might lead to SN 2019va-like events.
Submitted 29 April, 2022;
originally announced April 2022.
-
Mitigation of plasma-wall interactions with low-Z powders in DIII-D high confinement plasmas
Authors:
Florian Effenberg,
Alessandro Bortolon,
Livia Casali,
Raffi Nazikian,
Igor Bykov,
Filippo Scotti,
Huiqian Q. Wang,
Max E. Fenstermacher,
Robert Lunsford,
Alexander Nagy,
Brian A. Grierson,
Florian M. Laggner,
Rajesh Maingi,
the DIII-D Team
Abstract:
Experiments with low-Z powder injection in DIII-D high confinement discharges demonstrated increased divertor dissipation and detachment while maintaining good core energy confinement. Lithium (Li), boron (B), and boron nitride (BN) powders were injected in high-confinement mode plasmas ($I_p=$1 MA, $B_t=$2 T, $P_{NB}=$6 MW, $\langle n_e\rangle=3.6-5.0\cdot10^{19}$ m$^{-3}$) into the upper small-angle slot (SAS) divertor for 2-s intervals at constant rates of 3-204 mg/s. The multi-species BN powders at a rate of 54 mg/s showed the most substantial increase in divertor neutral compression by more than an order of magnitude and lasting detachment with minor degradation of the stored magnetic energy $W_{mhd}$ by 5%. Rates of 204 mg/s of boron nitride powder further reduce ELM-fluxes on the divertor but also cause a drop in confinement performance by 24% due to the onset of an $n=2$ tearing mode. The application of powders also showed a substantial improvement of wall conditions manifesting in reduced wall fueling source and intrinsic carbon and oxygen content in response to the cumulative injection of non-recycling materials. The results suggest that low-Z powder injection, including mixed element compounds, is a promising new core-edge compatible technique that simultaneously enables divertor detachment and improves wall conditions during high confinement operation.
Submitted 16 August, 2022; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Syntax-based data augmentation for Hungarian-English machine translation
Authors:
Attila Nagy,
Patrick Nanys,
Balázs Frey Konrád,
Bence Bial,
Judit Ács
Abstract:
We train Transformer-based neural machine translation models for Hungarian-English and English-Hungarian using the Hunglish2 corpus. Our best models achieve a BLEU score of 40.0 on Hungarian-English and 33.4 on English-Hungarian. Furthermore, we present results from ongoing work on syntax-based augmentation for neural machine translation. Both our code and models are publicly available.
Submitted 18 January, 2022;
originally announced January 2022.
-
Developing neural machine translation models for Hungarian-English
Authors:
Attila Nagy
Abstract:
I train models for the task of neural machine translation for English-Hungarian and Hungarian-English, using the Hunglish2 corpus. The main contribution of this work is evaluating different data augmentation methods during the training of NMT models. I propose 5 different augmentation methods that are structure-aware, meaning that instead of randomly selecting words for blanking or replacement, the dependency tree of sentences is used as a basis for augmentation. I start my thesis with a detailed literature review on neural networks, sequential modeling, neural machine translation, dependency parsing and data augmentation. After a detailed exploratory data analysis and preprocessing of the Hunglish2 corpus, I perform experiments with the proposed data augmentation techniques. The best model for Hungarian-English achieves a BLEU score of 33.9, while the best model for English-Hungarian achieves a BLEU score of 28.6.
Submitted 7 November, 2021;
originally announced November 2021.
-
Rescued from oblivion: detailed analysis of archival Spitzer data of SN 1993J
Authors:
Sz. Zsíros,
A. P. Nagy,
T. Szalai
Abstract:
We present an extensive analysis of the late-time mid-infrared (mid-IR) evolution of Type IIb SN 1993J from 10 up to 26 years post-explosion based on archival $-$ mostly previously unpublished $-$ photometric data of Spitzer Space Telescope in conjunction with an archival IRS spectrum. SN 1993J is one of the best-studied supernovae (SNe) with an extensive, decade-long multi-wavelength dataset published in various papers; however, its detailed late-time mid-IR analysis is still missing from the literature. Mid-IR data follows not just the continuously cooling SN ejecta but also late-time dust formation and circumstellar interaction processes. We provide evidence that the observed late-time mid-IR excess of SN 1993J can be described by the presence of two-component local dust with a dust mass of $\sim(3.5-6.0)\times 10^{-3} M_{\odot}$ in case of a partly silicate-based dust composition. Source of these components can be either newly-formed dust grains, or heating of pre-existing dust via ongoing CSM interaction detected also at other wavelengths. If it is newly-formed, dust is assumed to be located both in the unshocked inner ejecta and in the outer cold dense shell, just as found in the Cassiopeia A remnant and also assumed in other dust-forming SNe in a few years after explosion.
Submitted 20 October, 2021;
originally announced October 2021.
-
Improving the sample-efficiency of neural architecture search with reinforcement learning
Authors:
Attila Nagy,
Ábel Boros
Abstract:
Designing complex architectures has been an essential cogwheel in the revolution deep learning has brought about in the past decade. When solving difficult problems in a data-driven manner, a well-tried approach is to take an architecture discovered by renowned deep learning scientists as a basis (e.g. Inception) and try to apply it to a specific problem. This might be sufficient, but as of now, achieving very high accuracy on a complex or yet unsolved task requires the knowledge of highly trained deep learning experts. In this work, we would like to contribute to the area of Automated Machine Learning (AutoML), specifically Neural Architecture Search (NAS), which intends to make deep learning methods available to a wider range of society by designing neural topologies automatically. Although several different approaches exist (e.g. gradient-based or evolutionary algorithms), our focus is on one of the most promising research directions, reinforcement learning. In this scenario, a recurrent neural network (controller) is trained to create problem-specific neural network architectures (child). The validation accuracies of the child networks serve as a reward signal for training the controller with reinforcement learning. The basis of our proposed work is Efficient Neural Architecture Search (ENAS), where parameter sharing is applied among the child networks. ENAS, like many other RL-based algorithms, emphasizes the learning of child networks, as improving their convergence results in a denser reward signal for the controller, therefore significantly reducing training times. The controller was originally trained with REINFORCE. In our research, we propose to replace this with a more modern and complex algorithm, PPO, which has been shown to be faster and more stable in other environments. Then, we briefly discuss and evaluate our results.
Submitted 13 October, 2021;
originally announced October 2021.
-
Conjugate linear perturbations of Dirac operators and Majorana fermions
Authors:
Ákos Nagy
Abstract:
We study a canonical class of perturbations of Dirac operators that are defined in any dimension and on any Hermitian Clifford module bundle. These operators generalize the 2-dimensional Jackiw-Rossi operator, which describes electronic excitations on topological superconductors. We also describe the low energy spectrum of these operators on complete surfaces, under mild hypotheses.
Submitted 18 October, 2021; v1 submitted 11 October, 2021;
originally announced October 2021.
-
Stationary solutions to the Keller-Segel equation on curved planes
Authors:
Ákos Nagy
Abstract:
We study stationary solutions to the Keller--Segel equation on curved planes.
We prove the necessity of the mass being $8 π$ and a sharp decay bound. Notably, our results do not require the solutions to have a finite second moment, and thus are novel already in the flat case.
Furthermore, we provide a correspondence between stationary solutions to the static Keller--Segel equation on curved planes and positively curved Riemannian metrics on the sphere. We use this duality to show the nonexistence of solutions in certain situations. In particular, we show the existence of metrics, arbitrarily close to the flat one on the plane, that do not support stationary solutions to the static Keller--Segel equation (with any mass).
Finally, as a complementary result, we prove a curved version of the logarithmic Hardy--Littlewood--Sobolev inequality and use it to show that the Keller--Segel free energy is bounded from below exactly when the mass is $8 π$, even in the curved case.
Submitted 3 January, 2022; v1 submitted 26 July, 2021;
originally announced July 2021.
-
Nonminimal solutions to the Ginzburg-Landau equations on surfaces
Authors:
Ákos Nagy,
Gonçalo Oliveira
Abstract:
We prove the existence of novel, nonminimal and irreducible solutions to the (self-dual) Ginzburg-Landau equations on closed surfaces. To our knowledge these are the first such examples on nontrivial line bundles, that is, with nonzero total magnetic flux. Our method works with the 2-dimensional, critically coupled Ginzburg-Landau theory and uses the topology of the moduli space. The method is nonconstructive, but works for all values of the remaining coupling constant. We also prove the instability of these solutions.
Submitted 5 October, 2022; v1 submitted 9 March, 2021;
originally announced March 2021.
-
Construction of Nahm data and BPS monopoles with continuous symmetries
Authors:
Benoit Charbonneau,
Anuk Dayaprema,
C. J. Lang,
Ákos Nagy,
Haoyang Yu
Abstract:
We study solutions to Nahm's equations with continuous symmetries and, under certain (mild) hypotheses, we classify the corresponding Ansätze. Using our classification, we construct novel Nahm data, and prescribe methods for generating further solutions. Finally, we use these results to construct new BPS monopoles with spherical symmetry.
Submitted 14 November, 2022; v1 submitted 2 February, 2021;
originally announced February 2021.
-
Automatic punctuation restoration with BERT models
Authors:
Attila Nagy,
Bence Bial,
Judit Ács
Abstract:
We present an approach for automatic punctuation restoration with BERT models for English and Hungarian. For English, we conduct our experiments on Ted Talks, a commonly used benchmark for punctuation restoration, while for Hungarian we evaluate our models on the Szeged Treebank dataset. Our best models achieve a macro-averaged $F_1$-score of 79.8 in English and 82.2 in Hungarian. Our code is publicly available.
Submitted 18 January, 2021;
originally announced January 2021.
-
Spherical Density Functional Theory
Authors:
Ágnes Nagy,
Kalevi Kokko,
Jesse Huhtala,
Torbjörn Björkman,
Levente Vitos
Abstract:
Recently, Theophilou (J. Chem. Phys. 149, 074104 (2018)) showed that a set of spherically symmetric densities uniquely determines the external potential in molecules and solids. Here, spherically symmetric Kohn-Sham-like equations are derived. The spherical densities can be expressed with radial wave functions. An expression for the total energy is also presented.
Submitted 22 September, 2020;
originally announced September 2020.
-
Spherically symmetric density and potential of a hydrogen molecule
Authors:
K. Kokko,
Á. Nagy,
J. Huhtala,
T. Björkman,
L. Vitos
Abstract:
Using a hydrogen molecule as a test system, we demonstrate how to compute the effective potential according to the formalism of the new density functional theory (DFT), in which the basic variable is the set of spherically averaged densities instead of the total density used in traditional DFT. The effective potential, together with the external potential (the nuclear Coulomb potential), can be substituted into the Schrödinger-like differential equation to obtain the spherically averaged electron density of the system. In the new method, instead of one three-dimensional low-symmetry equation, one has to solve as many spherically symmetric equations as there are atoms in the system.
Submitted 19 September, 2020;
originally announced September 2020.
-
The asymptotic geometry of $\rm{G}_2$-monopoles
Authors:
Daniel Fadel,
Ákos Nagy,
Gonçalo Oliveira
Abstract:
This article investigates the asymptotics of $\rm{G}_2$-monopoles. First, we prove that when the underlying $\rm{G}_2$-manifold is nonparabolic (i.e. admits a positive Green's function), finite intermediate energy monopoles with bounded curvature have finite mass. The second main result restricts to the case when the underlying $\rm{G}_2$-manifold is asymptotically conical. In this situation, we deduce sharp decay estimates and that the connection converges, along the end, to a pseudo-Hermitian--Yang--Mills connection over the asymptotic cone. Finally, our last result exhibits a Fredholm setup describing the moduli space of finite intermediate energy monopoles on an asymptotically conical $\rm{G}_2$-manifold.
Submitted 12 September, 2022; v1 submitted 14 September, 2020;
originally announced September 2020.
-
A low-luminosity core-collapse supernova very similar to SN 2005cs
Authors:
Zoltán Jäger Jr.,
József Vinkó,
Barna I. Bíró,
Tibor Hegedüs,
Tamás Borkovits,
Zoltán Jäger Sr.,
Andrea P. Nagy,
László Molnár,
Levente Kriskovics
Abstract:
We present observations and analysis of PSN J17292918+7542390, a low-luminosity Type II-P supernova (LL SN IIP). The observed sample of such events is still low, and their nature is still under debate. Such supernovae are similar to SN 2005cs, a well-observed low-luminosity Type II-P event, having low expansion velocities, and small ejected $^{56}$Ni mass. We have developed a robust and relatively fast Monte-Carlo code that fits semi-analytic models to light curves of core collapse supernovae. This allows the estimation of the most important physical parameters, like the radius of the progenitor star, the mass of the ejected envelope, the mass of the radioactive nickel synthesized during the explosion, among others. PSN J17292918+7542390 has $R_0 = 91_{-70}^{+119} \cdot 10^{11} \;\text{cm}$, $M_\text{ej} = 9.89_{-1.00}^{+2.10} \; M_{\odot}$, $E_{\mathrm{kin}} = 0.65_{-0.18}^{+0.19} \;\text{foe}$, $v_{\mathrm{exp}} = 3332_{-347}^{+216}$ km s$^{-1}$, for its progenitor radius, ejecta mass, kinetic energy and expansion velocity, respectively. The initial nickel mass of the PSN J17292918+7542390 turned out to be $1.55_{-0.70}^{+0.75} \cdot 10^{-3} M_{\odot}$. The measured photospheric velocity at the earliest observed phase is 7000 km s$^{-1}$. As far as we can tell based on the small population of observed low-luminosity Type II-P supernovae, the determined values are typical for these events.
Submitted 15 July, 2020;
originally announced July 2020.
-
Deposition distribution of the new coronavirus (SARS-CoV-2) in the human airways upon exposure to cough-generated aerosol
Authors:
Balázs G. Madas,
Péter Füri,
Árpád Farkas,
Attila Nagy,
Aladár Czitrovszky,
Imre Balásházy,
Gusztáv G. Schay,
Alpár Horváth
Abstract:
The new coronavirus disease 2019 (COVID-19) has emerged as a rapidly spreading pandemic. The disease is thought to spread mainly from person to person through respiratory droplets produced when an infected person coughs, sneezes, or talks. The pathogen of COVID-19 is the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). It infects cells by binding to the angiotensin-converting enzyme 2 receptor (ACE2), which is expressed by cells throughout the airways as a target for cellular entry. Although the majority of persons infected with SARS-CoV-2 experience symptoms of mild upper respiratory tract infection, in some people infections of the peripheral airways result in severe, potentially fatal pneumonia. However, the induction of COVID-19 pneumonia requires that SARS-CoV-2 reaches the peripheral airways. While huge efforts have been made to understand the spread of the disease as well as the pathogenesis following cellular entry, much less attention has been paid to how SARS-CoV-2 from the environment reaches the receptors of the target cells. The aim of the present study is to characterize the deposition distribution of SARS-CoV-2 in the airways upon exposure to cough-generated aerosol. For this purpose, the Stochastic Lung Deposition Model has been applied. Aerosol size distribution and breathing parameters were taken from the literature, assuming normal breathing through the nose. We found that the probability of direct infection of the peripheral airways due to inhalation of aerosol generated by a bystander's cough is very low. As the number of pathogens deposited in the extrathoracic airways is ~10 times higher than in the peripheral airways, we concluded that in most cases COVID-19 pneumonia must be preceded by SARS-CoV-2 infection of the upper airways. Our results suggest that without the enhancement of viral load in the upper airways, COVID-19 would be much less dangerous...
Submitted 12 May, 2020;
originally announced May 2020.
-
New evidence supporting the existence of the hypothetic X17 particle
Authors:
A. J. Krasznahorkay,
M. Csatlos,
L. Csige,
J. Gulyas,
M. Koszta,
B. Szihalmi,
J. Timar,
D. S. Firak,
A. Nagy,
N. J. Sas,
A. Krasznahorkay
Abstract:
We observed electron-positron pairs from the electro-magnetically forbidden M0 transition depopulating the 21.01 MeV 0$^-$ state in $^4$He. A peak was observed in their $e^+e^-$ angular correlations at 115$^\circ$ with 7.2$σ$ significance, and could be described by assuming the creation and subsequent decay of a light particle with mass of $m_\mathrm{X}c^2$=16.84$\pm0.16 (stat) \pm 0.20 (syst)$ MeV and $Γ_\mathrm{X}$= $3.9\times 10^{-5}$ eV. According to the mass, it is likely the same X17 particle, which we recently suggested [Phys. Rev. Lett. 116, 052501 (2016)] for describing the anomaly observed in $^8$Be.
Submitted 23 October, 2019;
originally announced October 2019.
-
The Kapustin--Witten equations on ALE and ALF gravitational instantons
Authors:
Ákos Nagy,
Gonçalo Oliveira
Abstract:
We study solutions to the Kapustin--Witten equations on ALE and ALF gravitational instantons. On any such space and for any compact structure group, we prove asymptotic estimates for the Higgs field. We then use these estimates to prove a vanishing theorem in the case when the underlying manifold is $\mathbb{R}^4$ or $\mathbb{R}^3 \times \mathbb{S}^1$ and the structure group is $\mathrm{SU} (2)$.
Submitted 1 August, 2022; v1 submitted 12 June, 2019;
originally announced June 2019.
-
The Haydys monopole equation
Authors:
Ákos Nagy,
Gonçalo Oliveira
Abstract:
We study complexified Bogomolny monopoles using the complex linear extension of the Hodge star operator; these monopoles can be interpreted as solutions to the Bogomolny equation with a complex gauge group. Alternatively, these equations can be obtained from dimensional reduction of the Haydys instanton equations to three dimensions, thus we call them Haydys monopoles.
We find that (under mild hypotheses) the smooth locus of the moduli space of finite energy Haydys monopoles on $\mathbb{R}^3$ is a Kähler manifold containing the ordinary Bogomolny moduli space as a minimal Lagrangian submanifold -- an $A$-brane. Moreover, using a gluing construction we construct an open neighborhood of this submanifold modeled on a neighborhood of the zero section in the tangent bundle to the Bogomolny moduli space. This is analogous to the case of Higgs bundles over a Riemann surface, where the (co)tangent bundle of holomorphic bundles canonically embeds into the Hitchin moduli space.
These results contrast immensely with the case of finite energy Kapustin--Witten monopoles for which we have shown a vanishing theorem in [12].
Submitted 20 July, 2022; v1 submitted 12 June, 2019;
originally announced June 2019.
-
The Type II-P Supernova 2017eaw: from explosion to the nebular phase
Authors:
Tamás Szalai,
József Vinkó,
Réka Könyves-Tóth,
Andrea P. Nagy,
K. Azalee Bostroem,
Krisztián Sárneczky,
Peter J. Brown,
Ondrej Pejcha,
Attila Bódi,
Borbála Cseh,
Géza Csörnyei,
Zoltán Dencs,
Ottó Hanyecz,
Bernadett Ignácz,
Csilla Kalup,
Levente Kriskovics,
András Ordasi,
András Pál,
Bálint Seli,
Ádám Sódor,
Róbert Szakáts,
Krisztián Vida,
Gabriella Zsidi,
Iair Arcavi,
Chris Ashall
, et al. (14 additional authors not shown)
Abstract:
The nearby SN 2017eaw is a Type II-P (``plateau'') supernova showing early-time, moderate CSM interaction. We present a comprehensive study of this SN including the analysis of high-quality optical photometry and spectroscopy covering the very early epochs up to the nebular phase, as well as near-UV and near-infrared spectra, and early-time X-ray and radio data. The combined data of SNe 2017eaw and 2004et allow us to get an improved distance to the host galaxy, NGC 6946, as $D \sim 6.85$ $\pm 0.63$ Mpc; this is consistent with recent independent results on the distance to the host and disfavors the previously derived (30% shorter) distances based on SN 2004et. From modeling the nebular spectra and the quasi-bolometric light curve, we estimate the progenitor mass and some basic physical parameters for the explosion and the ejecta. Our results agree well with previous reports on an RSG progenitor star with a mass of $\sim15-16$ M$_\odot$. Our estimate of the pre-explosion mass-loss rate ($\dot{M} \sim3 \times 10^{-7} -$ $1\times 10^{-6} M_{\odot}$ yr$^{-1}$) agrees well with previous results based on the opacity of the dust shell enshrouding the progenitor, but it is orders of magnitude lower than previous estimates based on general light-curve modeling of Type II-P SNe. Combining late-time optical and mid-infrared data, a clear excess at 4.5 $μ$m can be seen, supporting the previous statements on the (moderate) dust formation in the vicinity of SN 2017eaw.
Submitted 21 March, 2019;
originally announced March 2019.
-
Variational Quantum Monte Carlo Method with a Neural-Network Ansatz for Open Quantum Systems
Authors:
Alexandra Nagy,
Vincenzo Savona
Abstract:
The possibility to simulate the properties of many-body open quantum systems with a large number of degrees of freedom is the premise to the solution of several outstanding problems in quantum science and quantum information. The challenge posed by this task lies in the complexity of the density matrix increasing exponentially with the system size. Here, we develop a variational method to efficiently simulate the non-equilibrium steady state of Markovian open quantum systems based on variational Monte Carlo and on a neural network representation of the density matrix. Thanks to the stochastic reconfiguration scheme, the application of the variational principle is translated into the actual integration of the quantum master equation. We test the effectiveness of the method by modeling the two-dimensional dissipative XYZ spin model on a lattice.
Submitted 29 June, 2019; v1 submitted 25 February, 2019;
originally announced February 2019.
-
Topological Analysis of Bitcoin's Lightning Network
Authors:
István András Seres,
László Gulyás,
Dániel A. Nagy,
Péter Burcsi
Abstract:
Bitcoin's Lightning Network (LN) is a scalability solution for Bitcoin allowing transactions to be issued with negligible fees and settled instantly at scale. In order to use LN, funds need to be locked in payment channels on the Bitcoin blockchain (Layer-1) for subsequent use in LN (Layer-2). LN is comprised of many payment channels forming a payment channel network. LN's promise is that relatively few payment channels already enable anyone to efficiently, securely and privately route payments across the whole network. In this paper, we quantify the structural properties of LN and argue that LN's current topological properties can be ameliorated in order to improve the security of LN, enabling it to reach its true potential.
Submitted 14 April, 2019; v1 submitted 15 January, 2019;
originally announced January 2019.
-
Average opacity calculation for core-collapse supernovae
Authors:
Andrea P. Nagy
Abstract:
Supernovae (SNe) are among the most intensely studied objects of modern astrophysics, but due to their complex physical nature, theoretical models are essential to better understand these exploding stars, as well as the properties of the variation of the emitted radiation. One possibility for modeling SN light curves is the construction of a simplified semi-analytic model, which can be used for getting order-of-magnitude estimates of the SN properties. One of the strongest simplifications in most of these light curve models is the assumption of a constant Thomson-scattering opacity that can be determined as the average opacity of the ejecta. Here we present a systematic analysis for estimating the average opacity in different types of core-collapse supernovae (CCSNe) that can be used as the constant opacity of the ejecta in simplified semi-analytic models. To use these average opacities self-consistently during light curve (LC) fitting, we estimate their values from hydrodynamic simulations. In this analysis we first generate MESA (Paxton et al. 2011, 2013, 2015, 2018) stellar models with different physical parameters (initial mass, metallicity, rotation), which determine the mass-loss history of the model star. Then we synthesize SN LCs from these models with the SNEC hydrodynamic code (Morozova et al. 2015) and calculate the Rosseland mean opacity in every mass element. Finally, we compute the average opacities by integrating these Rosseland mean opacities. As a result we find that the average opacities from our calculations show adequate agreement with the opacities generally used in previous studies.
Submitted 19 June, 2018;
originally announced June 2018.
-
Modeling Martian Atmospheric Losses over Time: Implications for Exoplanetary Climate Evolution and Habitability
Authors:
Chuanfei Dong,
Yuni Lee,
Yingjuan Ma,
Manasvi Lingam,
Stephen Bougher,
Janet Luhmann,
Shannon Curry,
Gabor Toth,
Andrew Nagy,
Valeriy Tenishev,
Xiaohua Fang,
David Mitchell,
David Brain,
Bruce Jakosky
Abstract:
In this Letter, we make use of sophisticated 3D numerical simulations to assess the extent of atmospheric ion and photochemical losses from Mars over time. We demonstrate that the atmospheric ion escape rates were significantly higher (by more than two orders of magnitude) in the past at $\sim 4$ Ga compared to the present-day value owing to the stronger solar wind and higher ultraviolet fluxes from the young Sun. We found that the photochemical loss of atomic hot oxygen dominates over the total ion loss at the current epoch whilst the atmospheric ion loss is likely much more important at ancient times. We briefly discuss the ensuing implications of high atmospheric ion escape rates in the context of ancient Mars, and exoplanets with similar atmospheric compositions around young solar-type stars and M-dwarfs.
Submitted 14 May, 2018;
originally announced May 2018.