-
NoPropaganda at SemEval-2020 Task 11: A Borrowed Approach to Sequence Tagging and Text Classification
Authors:
Ilya Dimov,
Vladislav Korzun,
Ivan Smurov
Abstract:
This paper describes our contribution to SemEval-2020 Task 11: Detection Of Propaganda Techniques In News Articles. We start with simple LSTM baselines and move to an autoregressive transformer decoder to predict long continuous propaganda spans for the first subtask. We also adopt an approach from relation extraction by enveloping the spans mentioned above with special tokens for the second subtask of propaganda technique classification. Our models report an F-score of 44.6% and a micro-averaged F-score of 58.2% for those tasks, respectively.
Submitted 25 July, 2020;
originally announced July 2020.
-
arXiv:1910.13325
physics.data-an
cond-mat.mtrl-sci
cs.LG
physics.app-ph
physics.comp-ph
stat.ML
Fragment Graphical Variational AutoEncoding for Screening Molecules with Small Data
Authors:
John Armitage,
Leszek J. Spalek,
Malgorzata Nguyen,
Mark Nikolka,
Ian E. Jacobs,
Lorena Marañón,
Iyad Nasrallah,
Guillaume Schweicher,
Ivan Dimov,
Dimitrios Simatos,
Iain McCulloch,
Christian B. Nielsen,
Gareth Conduit,
Henning Sirringhaus
Abstract:
In the majority of molecular optimization tasks, predictive machine learning (ML) models are limited due to the unavailability and cost of generating big experimental datasets on the specific task. To circumvent this limitation, ML models are trained on big theoretical datasets or experimental indicators of molecular suitability that are either publicly available or inexpensive to acquire. These approaches produce a set of candidate molecules which have to be ranked using limited experimental data or expert knowledge. Under the assumption that structure is related to functionality, here we use a molecular fragment-based graphical autoencoder to generate unique structural fingerprints to efficiently search through the candidate set. We demonstrate that fragment-based graphical autoencoding reduces the error in predicting physical characteristics such as the solubility and partition coefficient in the small data regime compared to other extended circular fingerprints and string-based approaches. We further demonstrate that this approach is capable of providing insight into real-world molecular optimization problems, such as searching for stabilization additives in organic semiconductors by accurately predicting 92% of test molecules given 69 training examples. This task is a model example of black-box molecular optimization as there is minimal theoretical and experimental knowledge to accurately predict the suitability of the additives.
Submitted 30 October, 2019; v1 submitted 21 October, 2019;
originally announced October 2019.
-
Numerical solutions of ordinary fractional differential equations with singularities
Authors:
Yuri Dimitrov,
Ivan Dimov,
Venelin Todorov
Abstract:
The solutions of fractional differential equations (FDEs) have a natural singularity at the initial point. The accuracy of their numerical solutions is lower than the accuracy of the numerical solutions of FDEs whose solutions are differentiable functions. In the present paper we propose a method for improving the accuracy of the numerical solutions of ordinary linear FDEs with constant coefficients which uses the fractional Taylor polynomials of the solutions. The numerical solutions of the two-term and three-term FDEs are studied in the paper.
Submitted 8 June, 2018;
originally announced June 2018.
-
Multidimensional Sensitivity Analysis of Large-scale Mathematical Models
Authors:
Ivan Dimov,
Rayna Georgieva
Abstract:
Sensitivity analysis (SA) is a procedure for studying how sensitive the output results of large-scale mathematical models are to uncertainties in the input data. The models are described as a system of partial differential equations. Often such systems contain a large number of input parameters. It is therefore important to know how sensitive the solution is to uncontrolled variations or uncertainties in the input parameters of the model. Algorithms based on the analysis of variance (ANOVA) technique for calculating numerical indicators of sensitivity and computationally efficient Monte Carlo integration techniques have recently been developed by the authors. They have been successfully applied to sensitivity studies of air pollution levels calculated by the Unified Danish Eulerian Model (UNI-DEM) with respect to several important input parameters. In this paper a comprehensive theoretical and experimental study of the Monte Carlo algorithm based on symmetrised shaking of Sobol sequences has been carried out. It has been proven that this algorithm has an optimal rate of convergence for functions with continuous and bounded second derivatives in terms of probability and mean square error. Extensive numerical experiments with Monte Carlo, quasi-Monte Carlo (QMC) and scrambled quasi-Monte Carlo algorithms based on Sobol sequences are performed to support the theoretical studies and to analyze the applicability of the algorithms to various classes of problems. The numerical tests show that the Monte Carlo algorithm based on symmetrised shaking of Sobol sequences gives reliable results for the multidimensional integration problems under consideration.
Submitted 19 January, 2017;
originally announced January 2017.
-
A Comparison Study of Two High Accuracy Numerical Methods for a Parabolic System in Air Pollution Modelling
Authors:
Ivan Dimov,
Juri Kandilarov,
Venelin Todorov,
Lubin Vulkov
Abstract:
We present two approaches for enhancing the accuracy of second order finite difference approximations of two-dimensional semilinear parabolic systems. These are the fourth order compact difference scheme and the fourth order scheme based on Richardson extrapolation. Our interest is concentrated on a system of ten parabolic partial differential equations in air pollution modeling. We analyze numerical experiments to compare the two approaches with respect to accuracy, computational complexity, non-negativity preservation, etc. A sixth-order approximation based on the fourth-order compact difference scheme combined with Richardson extrapolation is also discussed numerically.
Submitted 11 January, 2017;
originally announced January 2017.
-
On randomization of neural networks as a form of post-learning strategy
Authors:
K. G. Kapanova,
I. Dimov,
J. M. Sellier
Abstract:
Today artificial neural networks are applied in various fields: engineering, data analysis, robotics. While they represent a successful tool for a variety of relevant applications, mathematically speaking they are still far from being conclusive. In particular, they suffer from being unable to find the best configuration possible during the training process (local minimum problem). In this paper, we focus on this issue and suggest a simple, but effective, post-learning strategy to allow the search for an improved set of weights at a relatively small extra computational cost. Therefore, we introduce a novel technique based on analogy with quantum effects occurring in nature as a way to improve (and sometimes overcome) this problem. Several numerical experiments are presented to validate the approach.
Submitted 26 November, 2015;
originally announced November 2015.
-
Hidden Noise Structure and Random Matrix Models of Stock Correlations
Authors:
Ivailo I. Dimov,
Petter N. Kolm,
Lee Maclin,
Dan Y. C. Shiber
Abstract:
We find a novel correlation structure in the residual noise of stock market returns that is remarkably linked to the composition and stability of the top few significant factors driving the returns, and moreover indicates that the noise band is composed of multiple subbands that do not fully mix. Our findings allow us to construct effective generalized random matrix theory market models that are closely related to correlation and eigenvector clustering. We show how to use these models in a simulation that incorporates heavy tails. Finally, we demonstrate how a subtle purely stationary risk estimation bias can arise in the conventional cleaning prescription.
Submitted 14 December, 2009; v1 submitted 8 September, 2009;
originally announced September 2009.
-
Competing order, Fermi surface reconstruction, and quantum oscillations in underdoped high temperature superconductors
Authors:
Ivailo Dimov,
Pallab Goswami,
Xun Jia,
Sudip Chakravarty
Abstract:
We consider incommensurate $d$-density wave order in underdoped high temperature superconductors. We find that Fermi surface reconstruction can correctly capture the phenomenology of the recent quantum oscillation experiments that suggest incommensurate order. The predicted frequencies are a frequency around 530 T arising from the electron pocket, a hole frequency at around 1650 T, and a new low frequency from a smaller hole pocket at 250 T for which there are some indications that require further investigation. The oscillation corresponding to the electron pocket will be further split due to bilayer coupling but the splitting is sufficiently small to require more refined measurements. The truly incommensurate $d$-density wave breaks both time reversal and inversion but the product of these two symmetry operations is preserved. There is some similarity of our results with the spiral spin density wave order, which, as pointed out by Overhauser, also breaks time reversal and inversion. Calculations corresponding to higher order commensuration produce results similar to anti-phase spin stripes, but appear to us to be an unlikely explanation of the experiments. The analysis of the Gorkov equation in the mixed state shows that the oscillation frequencies are unshifted from the putative normal state and the additional Dingle factor arising from the presence of the mixed state can provide a subtle distinction between the spiral spin density wave and the $d$-density wave.
Submitted 27 July, 2008;
originally announced July 2008.
-
Hidden order revealed in quantum oscillations in cuprate superconductors
Authors:
Xun Jia,
Ivailo Dimov,
Pallab Goswami,
Sudip Chakravarty
Abstract:
We follow the line of reasoning that hidden broken symmetries are the root of quantum oscillations observed in underdoped superconductors and examine the role of bilayer splitting and incommensuration. This is a view that eschews the notion of a featureless Mott liquid as the source of complexity. Instead, our view is grounded in a conventional Fermi surface and quasiparticles. We show that bilayer splitting and/or incommensurate $d$-density wave order can lead to many interesting results, in particular a splitting of the main frequency of the quantum oscillations.
Submitted 25 June, 2008; v1 submitted 23 June, 2008;
originally announced June 2008.
-
Spin Order in Paired Quantum Hall States
Authors:
Ivailo Dimov,
Bertrand I. Halperin,
Chetan Nayak
Abstract:
We consider quantum Hall states at even-denominator filling fractions, especially $ν=5/2$, in the limit of small Zeeman energy. Assuming that a paired quantum Hall state forms, we study spin ordering and its interplay with pairing. We give numerical evidence that at $ν= 5/2$ an incompressible ground state will exhibit spontaneous ferromagnetism. The Ginzburg-Landau theory for the spin degrees of freedom of paired Hall states is a perturbed CP$^2$ model. We compute the coefficients in the Ginzburg-Landau theory by a BCS-Stoner mean field theory for coexisting order parameters, and show that even if repulsion is smaller than that required for a Stoner instability, ferromagnetic fluctuations can induce a partially or fully polarized superconducting state.
Submitted 10 October, 2007;
originally announced October 2007.
-
Incommensurate DDW order
Authors:
Ivailo Dimov,
Chetan Nayak
Abstract:
We consider various incommensurate (IC) order parameters for electrons on a square lattice which reduce to $d_{x^2-y^2}$-density wave (DDW) order when the ordering wavevector ${\bf Q}\to (π,π)$. We describe the associated charge and current distributions and their experimental signatures. Such orders can arise at the mean-field level in extended Hubbard models. We compare the phase diagrams of these models with experiments in the underdoped cuprates, where (1) DDW order is a possible explanation of the pseudogap, and (2) there are experimental indications of incommensurability. We find various types of IC DDW and discuss their possible relevance to the physics of the cuprates. Our main finding is that IC DDW order is generally accompanied by superconducting order, but the magnitude of the IC wavevector can be small. A comparison with the analogous AF-ICSDW transition is given.
Submitted 16 March, 2006; v1 submitted 23 December, 2005;
originally announced December 2005.