-
Document Layout Annotation: Database and Benchmark in the Domain of Public Affairs
Authors:
Alejandro Peña,
Aythami Morales,
Julian Fierrez,
Javier Ortega-Garcia,
Marcos Grande,
Iñigo Puente,
Jorge Cordova,
Gonzalo Cordova
Abstract:
Every day, thousands of digital documents are generated with useful information for companies, public organizations, and citizens. Given the impossibility of processing them manually, the automatic processing of these documents is becoming increasingly necessary in certain sectors. However, this task remains challenging, since in most cases a text-only based parsing is not enough to fully understa…
▽ More
Every day, thousands of digital documents are generated with useful information for companies, public organizations, and citizens. Given the impossibility of processing them manually, the automatic processing of these documents is becoming increasingly necessary in certain sectors. However, this task remains challenging, since in most cases a text-only based parsing is not enough to fully understand the information presented through different components of varying significance. In this regard, Document Layout Analysis (DLA) has been an interesting research field for many years, which aims to detect and classify the basic components of a document. In this work, we used a procedure to semi-automatically annotate digital documents with different layout labels, including 4 basic layout blocks and 4 text categories. We apply this procedure to collect a novel database for DLA in the public affairs domain, using a set of 24 data sources from the Spanish Administration. The database comprises 37.9K documents with more than 441K document pages, and more than 8M labels associated to 8 layout block units. The results of our experiments validate the proposed text labeling procedure with accuracy up to 99%.
△ Less
Submitted 8 August, 2023; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Leveraging Large Language Models for Topic Classification in the Domain of Public Affairs
Authors:
Alejandro Peña,
Aythami Morales,
Julian Fierrez,
Ignacio Serna,
Javier Ortega-Garcia,
Iñigo Puente,
Jorge Cordova,
Gonzalo Cordova
Abstract:
The analysis of public affairs documents is crucial for citizens as it promotes transparency, accountability, and informed decision-making. It allows citizens to understand government policies, participate in public discourse, and hold representatives accountable. This is crucial, and sometimes a matter of life or death, for companies whose operation depend on certain regulations. Large Language M…
▽ More
The analysis of public affairs documents is crucial for citizens as it promotes transparency, accountability, and informed decision-making. It allows citizens to understand government policies, participate in public discourse, and hold representatives accountable. This is crucial, and sometimes a matter of life or death, for companies whose operation depend on certain regulations. Large Language Models (LLMs) have the potential to greatly enhance the analysis of public affairs documents by effectively processing and understanding the complex language used in such documents. In this work, we analyze the performance of LLMs in classifying public affairs documents. As a natural multi-label task, the classification of these documents presents important challenges. In this work, we use a regex-powered tool to collect a database of public affairs documents with more than 33K samples and 22.5M tokens. Our experiments assess the performance of 4 different Spanish LLMs to classify up to 30 different topics in the data in different configurations. The results shows that LLMs can be of great use to process domain-specific documents, such as those in the domain of public affairs.
△ Less
Submitted 8 August, 2023; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Levinson theorem for discrete Schrödinger operators on the line with matrix potentials having a first moment
Authors:
Miguel Ballesteros,
Gerardo Franco Córdova,
Ivan Naumkin,
Hermann Schulz-Baldes
Abstract:
This paper proves new results on spectral and scattering theory for matrix-valued Schrödinger operators on the discrete line with non-compactly supported perturbations whose first moments are assumed to exist. In particular, a Levinson theorem is proved, in which a relation between scattering data and spectral properties (bound and half bound states) of the corresponding Hamiltonians is derived. T…
▽ More
This paper proves new results on spectral and scattering theory for matrix-valued Schrödinger operators on the discrete line with non-compactly supported perturbations whose first moments are assumed to exist. In particular, a Levinson theorem is proved, in which a relation between scattering data and spectral properties (bound and half bound states) of the corresponding Hamiltonians is derived. The proof is based on stationary scattering theory with prominent use of Jost solutions at complex energies that are controlled by Volterra-type integral equations.
△ Less
Submitted 9 November, 2022;
originally announced November 2022.
-
Band edge limit of the scattering matrix for quasi-one-dimensional discrete Schrödinger operators
Authors:
Miguel Ballesteros,
Gerardo Franco Córdova,
Guillermo Garro,
Hermann Schulz-Baldes
Abstract:
This paper is about the scattering theory for one-dimensional matrix Schrödinger operators with a matrix potential having a finite first moment. The transmission coefficients are analytically continued and extended to the band edges. An explicit expression is given for these extensions. The limits of the reflection coefficients at the band edges is also calculated.
This paper is about the scattering theory for one-dimensional matrix Schrödinger operators with a matrix potential having a finite first moment. The transmission coefficients are analytically continued and extended to the band edges. An explicit expression is given for these extensions. The limits of the reflection coefficients at the band edges is also calculated.
△ Less
Submitted 28 March, 2022; v1 submitted 5 August, 2020;
originally announced August 2020.
-
Analyticity properties of the scattering matrix for matrix Schrödinger operators on the discrete line
Authors:
Miguel Ballesteros,
Gerardo Franco Córdova,
Hermann Schulz-Baldes
Abstract:
Explicit formulas for the analytic extensions of the scattering matrix and the time delay of a quasi-one-dimensional discrete Schrödinger operator with a potential of finite support are derived. This includes a careful analysis of the band edge singularities and allows to prove a Levinson-type theorem. The main algebraic tool are the plane wave transfer matrices.
Explicit formulas for the analytic extensions of the scattering matrix and the time delay of a quasi-one-dimensional discrete Schrödinger operator with a potential of finite support are derived. This includes a careful analysis of the band edge singularities and allows to prove a Levinson-type theorem. The main algebraic tool are the plane wave transfer matrices.
△ Less
Submitted 22 January, 2021; v1 submitted 27 April, 2020;
originally announced April 2020.
-
Magnetic Force Microscopy Characterization of Superparamagnetic Iron Oxide Nanoparticles (SPIONs)
Authors:
Gustavo Cordova,
Simon Attwood,
Ravi Gaikwad,
Frank Gu,
Zoya Leonenko
Abstract:
Superparamagnetic iron oxide nanoparticles (SPIONs), due to their controllable sizes, relatively long in vivo half-life and limited agglomeration, are ideal for biomedical applications such as magnetic labeling, hyperthermia cancer treatment, targeted drug delivery and for magnetic resonance imaging (MRI) as contrast enhancement agents. In order to understand how SPIONs interact with cells and cel…
▽ More
Superparamagnetic iron oxide nanoparticles (SPIONs), due to their controllable sizes, relatively long in vivo half-life and limited agglomeration, are ideal for biomedical applications such as magnetic labeling, hyperthermia cancer treatment, targeted drug delivery and for magnetic resonance imaging (MRI) as contrast enhancement agents. In order to understand how SPIONs interact with cells and cellular membranes it would be of interest to characterize individual SPIONs at the nanoscale in physiologically relevant conditions without labeling them. We demonstrate that Magnetic Force Microscopy (MFM) can be used to image SPIONs in air as well as in liquid. The magnetic properties of bare and SiO2 coated SPIONs are compared using MFM. We report that surface modification using (3-mercaptopropyl)-trimethoxysilane significantly improves adsorption and distribution of SPIONs on gold surfaces. To obtain proof of principle that SPIONS can be imaged with MFM inside the cell we imaged SPIONs buried in thin polymer films (polystyrene (PS) and poly methyl-methacrylate (PMMA)). This opens the possibility of visualizing SPIONs inside the cell without any labeling or modifications and present MFM as a potential magnetic analogue for fluorescence microscopy. The results of these studies may have a valuable impact for characterization and further development of biomedical applications of SPIONs and other magnetic nanoparticles.
△ Less
Submitted 26 April, 2017;
originally announced April 2017.
-
Magnetic Force Microscopy for Nanoparticle Characterization
Authors:
Gustavo Cordova,
Brenda Yasie Lee,
Zoya Leonenko
Abstract:
Since the invention of the atomic force microscope (AFM) in 1986, there has been a drive to apply this scanning probe technique or a form of this technique to various disciplines in nanoscale science. Magnetic force microscopy (MFM) is a member of a growing family of scanning probe methods and has been widely used for the study of magnetic materials. In MFM a magnetic probe is used to raster-scan…
▽ More
Since the invention of the atomic force microscope (AFM) in 1986, there has been a drive to apply this scanning probe technique or a form of this technique to various disciplines in nanoscale science. Magnetic force microscopy (MFM) is a member of a growing family of scanning probe methods and has been widely used for the study of magnetic materials. In MFM a magnetic probe is used to raster-scan the surface of the sample, of which its magnetic field interacts with the magnetic tip to offer insight into its magnetic properties. This review will focus on the use of MFM in relation to nanoparticle characterization, including superparamagnetic iron oxide nanoparticles, covering MFM imaging in air and in liquid environments.
△ Less
Submitted 26 April, 2017;
originally announced April 2017.