Search | arXiv e-print repository

doi 10.1016/j.eswa.2023.122035

Disambiguation of Company names via Deep Recurrent Networks

Authors: Alessandro Basile, Riccardo Crupi, Michele Grasso, Alessandro Mercanti, Daniele Regoli, Simone Scarsi, Shuyi Yang, Andrea Cosentini

Abstract: Name Entity Disambiguation is the Natural Language Processing task of identifying textual records corresponding to the same Named Entity, i.e. real-world entities represented as a list of attributes (names, places, organisations, etc.). In this work, we face the task of disambiguating companies on the basis of their written names. We propose a Siamese LSTM Network approach to extract -- via superv… ▽ More Name Entity Disambiguation is the Natural Language Processing task of identifying textual records corresponding to the same Named Entity, i.e. real-world entities represented as a list of attributes (names, places, organisations, etc.). In this work, we face the task of disambiguating companies on the basis of their written names. We propose a Siamese LSTM Network approach to extract -- via supervised learning -- an embedding of company name strings in a (relatively) low dimensional vector space and use this representation to identify pairs of company names that actually represent the same company (i.e. the same Entity). Given that the manual labelling of string pairs is a rather onerous task, we analyse how an Active Learning approach to prioritise the samples to be labelled leads to a more efficient overall learning pipeline. With empirical investigations, we show that our proposed Siamese Network outperforms several benchmark approaches based on standard string matching algorithms when enough labelled data are available. Moreover, we show that Active Learning prioritisation is indeed helpful when labelling resources are limited, and let the learning models reach the out-of-sample performance saturation with less labelled data with respect to standard (random) data labelling approaches. △ Less

Submitted 15 April, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: submitted to Elsevier. 26 pages, 6 figures, 4 tables

Journal ref: updated version is published by Expert Systems with Applications, Volume 238, Part C, 2024, 122035, ISSN 0957-4174

arXiv:2208.01594 [pdf, other]

The character of non-manipulable collective choices between two alternatives

Authors: Achille Basile, K. P. S. Bhaskara Rao, Surekha Rao

Abstract: We consider classes of non-manipulable social choice functions with range of cardinality at most two within a set of at least two alternatives. We provide the functional form for each of the classes we consider. This functional form is a characterization that explicitly describes how a social choice function of that particular class selects the collective choice corresponding to a profile. We… ▽ More We consider classes of non-manipulable social choice functions with range of cardinality at most two within a set of at least two alternatives. We provide the functional form for each of the classes we consider. This functional form is a characterization that explicitly describes how a social choice function of that particular class selects the collective choice corresponding to a profile. We provide a unified formulation of these characterizations using the new concept of "character". The choice of the character, depending on the class of social choice functions, gives the functional form of all social choice functions of the class. △ Less

Submitted 27 February, 2024; v1 submitted 2 August, 2022; originally announced August 2022.

Comments: JEL Code: D71

MSC Class: 91B14

arXiv:2204.09481 [pdf, other]

Unsupervised Ranking and Aggregation of Label Descriptions for Zero-Shot Classifiers

Authors: Angelo Basile, Marc Franco-Salvador, Paolo Rosso

Abstract: Zero-shot text classifiers based on label descriptions embed an input text and a set of labels into the same space: measures such as cosine similarity can then be used to select the most similar label description to the input text as the predicted label. In a true zero-shot setup, designing good label descriptions is challenging because no development set is available. Inspired by the literature o… ▽ More Zero-shot text classifiers based on label descriptions embed an input text and a set of labels into the same space: measures such as cosine similarity can then be used to select the most similar label description to the input text as the predicted label. In a true zero-shot setup, designing good label descriptions is challenging because no development set is available. Inspired by the literature on Learning with Disagreements, we look at how probabilistic models of repeated rating analysis can be used for selecting the best label descriptions in an unsupervised fashion. We evaluate our method on a set of diverse datasets and tasks (sentiment, topic and stance). Furthermore, we show that multiple, noisy label descriptions can be aggregated to boost the performance. △ Less

Submitted 24 May, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

Comments: 6 pages, 2 figures

MSC Class: I.2.7

arXiv:2204.09347 [pdf, other]

Active Few-Shot Learning with FASL

Authors: Thomas Müller, Guillermo Pérez-Torró, Angelo Basile, Marc Franco-Salvador

Abstract: Recent advances in natural language processing (NLP) have led to strong text classification models for many tasks. However, still often thousands of examples are needed to train models with good quality. This makes it challenging to quickly develop and deploy new models for real world problems and business needs. Few-shot learning and active learning are two lines of research, aimed at tackling th… ▽ More Recent advances in natural language processing (NLP) have led to strong text classification models for many tasks. However, still often thousands of examples are needed to train models with good quality. This makes it challenging to quickly develop and deploy new models for real world problems and business needs. Few-shot learning and active learning are two lines of research, aimed at tackling this problem. In this work, we combine both lines into FASL, a platform that allows training text classification models using an iterative and fast process. We investigate which active learning methods work best in our few-shot setup. Additionally, we develop a model to predict when to stop annotating. This is relevant as in a few-shot setup we do not have access to a large validation set. △ Less

Submitted 17 May, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

arXiv:2104.10205 [pdf, ps, other]

On the relation between Preference Reversal and Strategy-Proofness

Authors: K. P. S. Bhaskara Rao, Achille Basile, Surekha Rao

Abstract: We analyze the relation between strategy-proofness and preference reversal in the case that agents may declare indifference. Interestingly, Berga and Moreno (2020), have recently derived preference reversal from group strategy-proofness of social choice functions on strict preferences domains if the range has no more than three elements. We extend this result and at the same time simplify it. Our… ▽ More We analyze the relation between strategy-proofness and preference reversal in the case that agents may declare indifference. Interestingly, Berga and Moreno (2020), have recently derived preference reversal from group strategy-proofness of social choice functions on strict preferences domains if the range has no more than three elements. We extend this result and at the same time simplify it. Our analysis points out the role of individual strategy-proofness in deriving the preference reversal property, giving back to the latter its original individual nature (cfr. Eliaz, 2004). Moreover, we show that the difficulties Berga and Moreno highlighted relaxing the assumption on the cardinality of the range, disappear under a proper assumption on the domain. We introduce the concept of complete sets of preferences and show that individual strategy-proofness is sufficient to obtain the preference reversal property when the agents' feasible set of orderings is complete. This covers interesting cases like single peaked preferences, rich domains admitting regular social choice functions, and universal domains. The fact that we use individual rather than group strategy-proofness, allows to get immediately some of the known, and some new, equivalences between individual and group strategy-proofness. Finally, we show that group strategy-proofness is only really needed to obtain preference reversal if there are infinitely many voters. △ Less

Submitted 22 April, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

Comments: JEL Code: D71

MSC Class: 91B14

arXiv:2008.02041 [pdf, ps, other]

Geometry of anonymous binary social choices that are strategy-proof

Authors: Achille Basile, Surekha Rao, K. P. S. Bhaskara Rao

Abstract: Let $V$ be society whose members express preferences about two alternatives, indifference included. Identifying anonymous binary social choice functions with binary functions $f=f(k,m)$ defined over the integer triangular grid $G=\{(k,m)\in \mathbb{N}_0\times\mathbb{N}_0 : k+m\le |V|\} $, we show that every strategy-proof, anonymous social choice function can be described geometrically by listing,… ▽ More Let $V$ be society whose members express preferences about two alternatives, indifference included. Identifying anonymous binary social choice functions with binary functions $f=f(k,m)$ defined over the integer triangular grid $G=\{(k,m)\in \mathbb{N}_0\times\mathbb{N}_0 : k+m\le |V|\} $, we show that every strategy-proof, anonymous social choice function can be described geometrically by listing, in a sequential manner, groups of segments of G, of equal (maximum possible) length, alternately horizontal and vertical, representative of preference profiles that determine the collective choice of one of the two alternatives. Indeed, we show that every function which is anonymous and strategy-proof can be described in terms of a sequence of nonnegative integers $(q_1, q_2, \cdots, q_s)$ corresponding to the cardinalities of the mentioned groups of segments. We also analyze the connections between our present representation with another of our earlier representations involving sequences of majority quotas. A Python code is available with the authors for the implementation of any such social choice function. △ Less

Submitted 5 August, 2020; originally announced August 2020.

Comments: JEL Code: D71

MSC Class: 91B14

arXiv:2007.01552 [pdf, ps, other]

Anonymous, non-manipulable, binary social choice

Authors: Achille Basile, Surekha Rao, K. P. S. Bhaskara Rao

Abstract: Let V be a finite society whose members express weak orderings (hence also indifference, possibly) about two alternatives. We show a simple representation formula that is valid for all, and only, anonymous, non-manipulable, binary social choice functions on V . The number of such functions is $2^{n+1}$ if V contains $n$ agents. Let V be a finite society whose members express weak orderings (hence also indifference, possibly) about two alternatives. We show a simple representation formula that is valid for all, and only, anonymous, non-manipulable, binary social choice functions on V . The number of such functions is $2^{n+1}$ if V contains $n$ agents. △ Less

Submitted 3 July, 2020; originally announced July 2020.

Comments: JEL Code: D71

MSC Class: 91B14

arXiv:2002.06341 [pdf, ps, other]

The structure of two-valued strategy-proof social choice functions with indifference

Authors: Achille Basile, Surekha Rao, K. P. S. Bhaskara Rao

Abstract: We give a structure theorem for all coalitionally strategy-proof social choice functions whose range is a subset of cardinality two of a given larger set of alternatives. We provide this in the case where the voters/agents are allowed to express indifference and the domain consists of profiles of preferences over a society of arbitrary cardinality. The theorem, that takes the form of a represent… ▽ More We give a structure theorem for all coalitionally strategy-proof social choice functions whose range is a subset of cardinality two of a given larger set of alternatives. We provide this in the case where the voters/agents are allowed to express indifference and the domain consists of profiles of preferences over a society of arbitrary cardinality. The theorem, that takes the form of a representation formula, can be used to construct all functions under consideration. △ Less

Submitted 3 July, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

Comments: JEL Code: D71

MSC Class: 91B14

arXiv:1907.07265 [pdf, other]

You Write Like You Eat: Stylistic variation as a predictor of social stratification

Authors: Angelo Basile, Albert Gatt, Malvina Nissim

Abstract: Inspired by Labov's seminal work on stylistic variation as a function of social stratification, we develop and compare neural models that predict a person's presumed socio-economic status, obtained through distant supervision,from their writing style on social media. The focus of our work is on identifying the most important stylistic parameters to predict socio-economic group. In particular, we s… ▽ More Inspired by Labov's seminal work on stylistic variation as a function of social stratification, we develop and compare neural models that predict a person's presumed socio-economic status, obtained through distant supervision,from their writing style on social media. The focus of our work is on identifying the most important stylistic parameters to predict socio-economic group. In particular, we show the effectiveness of morpho-syntactic features as stylistic predictors of socio-economic group,in contrast to lexical features, which are good predictors of topic. △ Less

Submitted 16 July, 2019; originally announced July 2019.

Comments: 11 pages, 5 figures, ACL Conference 2019

ACM Class: I.2.7

arXiv:1808.07570 [pdf, ps, other]

$n$-H-closed spaces

Authors: Fortunata Aurora Basile, Maddalena Bonanzinga, Nathan Carlson, Jack Porter

Abstract: In this paper we extend the theory of H-closed extensions of Hausdorff spaces to a class of non-Hausdorff spaces, defined in \cite{B}, called $n$-Hausdorff spaces. The notion of H-closed is generalized to an $n$-H-closed space. Known construction for Hausdorff spaces $X$, such as the Katětov H-closed extension $κX$, are generalized to a maximal $n$-H-closed extension denoted by $n$-$κX$. In this paper we extend the theory of H-closed extensions of Hausdorff spaces to a class of non-Hausdorff spaces, defined in \cite{B}, called $n$-Hausdorff spaces. The notion of H-closed is generalized to an $n$-H-closed space. Known construction for Hausdorff spaces $X$, such as the Katětov H-closed extension $κX$, are generalized to a maximal $n$-H-closed extension denoted by $n$-$κX$. △ Less

Submitted 22 August, 2018; originally announced August 2018.

Comments: 15 pages

MSC Class: 54D10

arXiv:1808.06712 [pdf, ps, other]

On cardinality bounds for $θ^n$-Urysohn spaces

Authors: Fortunata Aurora Basile, Nathan Carlson, Jack Porter

Abstract: We introduce the class of $θ^{n}$-Urysohn spaces and the $n$-$θ$-closure operator. $θ^n$-Urysohn spaces generalize the notion of a Urysohn space. We estabilish bounds on the cardinality of these spaces and cardinality bounds if the space is additionally homogeneous. We introduce the class of $θ^{n}$-Urysohn spaces and the $n$-$θ$-closure operator. $θ^n$-Urysohn spaces generalize the notion of a Urysohn space. We estabilish bounds on the cardinality of these spaces and cardinality bounds if the space is additionally homogeneous. △ Less

Submitted 20 August, 2018; originally announced August 2018.

Comments: 14 pages

MSC Class: 54A25

arXiv:1801.00065 [pdf, other]

An Unsupervised Homogenization Pipeline for Clustering Similar Patients using Electronic Health Record Data

Authors: Alvaro Ulloa, Anna Basile, Gregory J. Wehner, Linyuan **g, Marylyn D. Ritchie, Brett Beaulieu-Jones, Christopher M. Haggerty, Brandon K. Fornwalt

Abstract: Electronic health records (EHR) contain a large variety of information on the clinical history of patients such as vital signs, demographics, diagnostic codes and imaging data. The enormous potential for discovery in this rich dataset is hampered by its complexity and heterogeneity. We present the first study to assess unsupervised homogenization pipelines designed for EHR clustering. To identif… ▽ More Electronic health records (EHR) contain a large variety of information on the clinical history of patients such as vital signs, demographics, diagnostic codes and imaging data. The enormous potential for discovery in this rich dataset is hampered by its complexity and heterogeneity. We present the first study to assess unsupervised homogenization pipelines designed for EHR clustering. To identify the optimal pipeline, we tested accuracy on simulated data with varying amounts of redundancy, heterogeneity, and missingness. We identified two optimal pipelines: 1) Multiple Imputation by Chained Equations (MICE) combined with Local Linear Embedding; and 2) MICE, Z-scoring, and Deep Autoencoders. △ Less

Submitted 21 March, 2018; v1 submitted 29 December, 2017; originally announced January 2018.

Comments: conference

arXiv:1709.10497 [pdf, ps, other]

Variations on known and recent cardinality bounds

Authors: Fortunata Aurora Basile, Maddalena Bonanzinga, Nathan Carlson

Abstract: Sapirovskii [18] proved that $|X|\leqπχ(X)^{c(X)ψ(X)}$, for a regular space $X$. We introduce the $θ$-pseudocharacter of a Urysohn space $X$, denoted by $ψ_θ(X)$, and prove that the previous inequality holds for Urysohn spaces replacing the bounds on celluarity $c(X)\leqκ$ and on pseudocharacter $ψ(X)\leqκ$ with a bound on Urysohn cellularity $Uc(X)\leqκ$ (which is a weaker conditon because… ▽ More Sapirovskii [18] proved that $|X|\leqπχ(X)^{c(X)ψ(X)}$, for a regular space $X$. We introduce the $θ$-pseudocharacter of a Urysohn space $X$, denoted by $ψ_θ(X)$, and prove that the previous inequality holds for Urysohn spaces replacing the bounds on celluarity $c(X)\leqκ$ and on pseudocharacter $ψ(X)\leqκ$ with a bound on Urysohn cellularity $Uc(X)\leqκ$ (which is a weaker conditon because $Uc(X)\leq c(X)$) and on $θ$-pseudocharacter $ψ_θ(X)\leqκ$ respectivly (note that in general $ψ(\cdot)\leqψ_θ(\cdot)$ and in the class of regular spaces $ψ(\cdot)=ψ_θ(\cdot)$). Further, in [6] the authors generalized the Dissanayake and Willard's inequality: $|X|\leq 2^{aL_{c}(X)χ(X)}$, for Hausdorff spaces $X$ [25], in the class of $n$-Hausdorff spaces and de Groot's result: $|X|\leq 2^{hL(X)}$, for Hausdorff spaces [11], in the class of $T_1$ spaces (see Theorems 2.22 and 2.23 in [6]). In this paper we restate Theorem 2.22 in [6] in the class of $n$-Urysohn spaces and give a variation of Theorem 2.23 in [6] using new cardinal functions, denoted by $UW(X)$, $ψw_θ(X)$, $θ\hbox{-}aL(X)$, $hθ\hbox{-}aL(X)$, $θ\hbox{-}aL_c(X)$ and $θ\hbox{-}aL_θ(X)$. In [5] the authors introduced the Hausdorff point separating weight of a space $X$ denoted by $Hpsw(X)$ and proved a Hausdorff version of Charlesworth's inequality $|X|\leq psw(X)^{L(X)ψ(X)}$ [7]. In this paper, we introduce the Urysohn point separating weight of a space $X$, denoted by $Upsw(X)$, and prove that $|X|\leq Upsw(X)^{θ\hbox{-}aL_{c}(X)ψ(X)}$, for a Urysohn space $X$. △ Less

Submitted 29 September, 2017; originally announced September 2017.

Comments: 14 pages

MSC Class: 54A25

arXiv:1707.03764 [pdf, other]

N-GrAM: New Groningen Author-profiling Model

Authors: Angelo Basile, Gareth Dwyer, Maria Medvedeva, Josine Rawee, Hessel Haagsma, Malvina Nissim

Abstract: We describe our participation in the PAN 2017 shared task on Author Profiling, identifying authors' gender and language variety for English, Spanish, Arabic and Portuguese. We describe both the final, submitted system, and a series of negative results. Our aim was to create a single model for both gender and language, and for all language varieties. Our best-performing system (on cross-validated r… ▽ More We describe our participation in the PAN 2017 shared task on Author Profiling, identifying authors' gender and language variety for English, Spanish, Arabic and Portuguese. We describe both the final, submitted system, and a series of negative results. Our aim was to create a single model for both gender and language, and for all language varieties. Our best-performing system (on cross-validated results) is a linear support vector machine (SVM) with word unigrams and character 3- to 5-grams as features. A set of additional features, including POS tags, additional datasets, geographic entities, and Twitter handles, hurt, rather than improve, performance. Results from cross-validation indicated high performance overall and results on the test set confirmed them, at 0.86 averaged accuracy, with performance on sub-tasks ranging from 0.68 to 0.98. △ Less

Submitted 12 July, 2017; originally announced July 2017.

arXiv:1103.1157 [pdf, ps, other]

GRASP and path-relinking for Coalition Structure Generation

Authors: Nicola Di Mauro, Teresa M. A. Basile, Stefano Ferilli, Floriana Esposito

Abstract: In Artificial Intelligence with Coalition Structure Generation (CSG) one refers to those cooperative complex problems that require to find an optimal partition, maximising a social welfare, of a set of entities involved in a system into exhaustive and disjoint coalitions. The solution of the CSG problem finds applications in many fields such as Machine Learning (covering machines, clustering), Dat… ▽ More In Artificial Intelligence with Coalition Structure Generation (CSG) one refers to those cooperative complex problems that require to find an optimal partition, maximising a social welfare, of a set of entities involved in a system into exhaustive and disjoint coalitions. The solution of the CSG problem finds applications in many fields such as Machine Learning (covering machines, clustering), Data Mining (decision tree, discretization), Graph Theory, Natural Language Processing (aggregation), Semantic Web (service composition), and Bioinformatics. The problem of finding the optimal coalition structure is NP-complete. In this paper we present a greedy adaptive search procedure (GRASP) with path-relinking to efficiently search the space of coalition structures. Experiments and comparisons to other algorithms prove the validity of the proposed method in solving this hard combinatorial problem. △ Less

Submitted 9 March, 2011; v1 submitted 6 March, 2011; originally announced March 2011.

arXiv:1006.5188 [pdf, ps, other]

Feature Construction for Relational Sequence Learning

Authors: Nicola Di Mauro, Teresa M. A. Basile, Stefano Ferilli, Floriana Esposito

Abstract: We tackle the problem of multi-class relational sequence learning using relevant patterns discovered from a set of labelled sequences. To deal with this problem, firstly each relational sequence is mapped into a feature vector using the result of a feature construction method. Since, the efficacy of sequence learning algorithms strongly depends on the features used to represent the sequences, the… ▽ More We tackle the problem of multi-class relational sequence learning using relevant patterns discovered from a set of labelled sequences. To deal with this problem, firstly each relational sequence is mapped into a feature vector using the result of a feature construction method. Since, the efficacy of sequence learning algorithms strongly depends on the features used to represent the sequences, the second step is to find an optimal subset of the constructed features leading to high classification accuracy. This feature selection task has been solved adopting a wrapper approach that uses a stochastic local search algorithm embedding a naive Bayes classifier. The performance of the proposed method applied to a real-world dataset shows an improvement when compared to other established methods, such as hidden Markov models, Fisher kernels and conditional random fields for relational sequences. △ Less

Submitted 27 June, 2010; originally announced June 2010.

Comments: 15 pages

arXiv:1004.2880 [pdf, ps, other]

GRASP for the Coalition Structure Formation Problem

Authors: Nicola Di Mauro, Teresa M. A. Basile, Stefano Ferilli, Floriana Esposito

Abstract: The coalition structure formation problem represents an active research area in multi-agent systems. A coalition structure is defined as a partition of the agents involved in a system into disjoint coalitions. The problem of finding the optimal coalition structure is NP-complete. In order to find the optimal solution in a combinatorial optimization problem it is theoretically possib… ▽ More The coalition structure formation problem represents an active research area in multi-agent systems. A coalition structure is defined as a partition of the agents involved in a system into disjoint coalitions. The problem of finding the optimal coalition structure is NP-complete. In order to find the optimal solution in a combinatorial optimization problem it is theoretically possible to enumerate the solutions and evaluate each. But this approach is infeasible since the number of solutions often grows exponentially with the size of the problem. In this paper we present a greedy adaptive search procedure (GRASP) to efficiently search the space of coalition structures in order to find an optimal one. Experiments and comparisons to other algorithms prove the validity of the proposed method in solving this hard combinatorial problem. △ Less

Submitted 16 April, 2010; originally announced April 2010.

Comments: 12 pages, Submitted to an International Conference

arXiv:0810.3721 [pdf]

Second maximal subgroups of the finite alternating and symmetric groups

Authors: Alberto Basile

Abstract: A subgroup of a finite group G is said to be second maximal if it is maximal in every maximal subgroup of G that contains it. A question which has received considerable attention asks: can every positive integer occur as the number of the maximal subgroups that contain a given second maximal subgroup in some finite group G? Various reduction arguments are available except when G is almost simple… ▽ More A subgroup of a finite group G is said to be second maximal if it is maximal in every maximal subgroup of G that contains it. A question which has received considerable attention asks: can every positive integer occur as the number of the maximal subgroups that contain a given second maximal subgroup in some finite group G? Various reduction arguments are available except when G is almost simple. Following the classification of the finite simple groups, finite almost simple groups fall into three categories: alternating and symmetric groups, almost simple groups of Lie type, sporadic groups and automorphism groups of sporadic groups. This thesis investigates the finite alternating and symmetric groups, and finds that in such groups, except three well known examples, no second maximal subgroup can be contained in more than 3 maximal subgroups. △ Less

Submitted 20 October, 2008; originally announced October 2008.

Comments: The PhD thesis of the author (The Australian National University, Canberra)

MSC Class: 20B05; 20B15; 20B30; 20B35; 20D06; 20D30

Showing 1–18 of 18 results for author: Basile, A