Skip to main content

Showing 1–7 of 7 results for author: Firmani, D

.
  1. arXiv:2203.12978  [pdf, other

    cs.DB cs.LG

    Effective Explanations for Entity Resolution Models

    Authors: Tommaso Teofili, Donatella Firmani, Nick Koudas, Vincenzo Martello, Paolo Merialdo, Divesh Srivastava

    Abstract: Entity resolution (ER) aims at matching records that refer to the same real-world entity. Although widely studied for the last 50 years, ER still represents a challenging data management problem, and several recent works have started to investigate the opportunity of applying deep learning (DL) techniques to solve this problem. In this paper, we study the fundamental problem of explainability of t… ▽ More

    Submitted 1 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

  2. arXiv:2101.11259  [pdf, other

    cs.DB

    Alaska: A Flexible Benchmark for Data Integration Tasks

    Authors: Valter Crescenzi, Andrea De Angelis, Donatella Firmani, Maurizio Mazzei, Paolo Merialdo, Federico Piai, Divesh Srivastava

    Abstract: Data integration is a long-standing interest of the data management community and has many disparate applications, including business, science and government. We have recently witnessed impressive results in specific data integration tasks, such as Entity Resolution, thanks to the increasing availability of benchmarks. A limitation of such benchmarks is that they typically come with their own task… ▽ More

    Submitted 3 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  3. Efficient and Effective ER with Progressive Blocking

    Authors: Sainyam Galhotra, Donatella Firmani, Barna Saha, Divesh Srivastava

    Abstract: Blocking is a mechanism to improve the efficiency of Entity Resolution (ER) which aims to quickly prune out all non-matching record pairs. However, depending on the distributions of entity cluster sizes, existing techniques can be either (a) too aggressive, such that they help scale but can adversely affect the ER effectiveness, or (b) too permissive, potentially harming ER efficiency. In this pap… ▽ More

    Submitted 16 March, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: Galhotra, S., Firmani, D., Saha, B. et al. Efficient and effective ER with progressive blocking. The VLDB Journal (2021)

  4. arXiv:2002.00819  [pdf, other

    cs.LG cs.DB stat.ML

    Knowledge Graph Embedding for Link Prediction: A Comparative Analysis

    Authors: Andrea Rossi, Donatella Firmani, Antonio Matinata, Paolo Merialdo, Denilson Barbosa

    Abstract: Knowledge Graphs (KGs) have found many applications in industry and academic settings, which in turn, have motivated considerable research efforts towards large-scale information extraction from a variety of sources. Despite such efforts, it is well known that even state-of-the-art KGs suffer from incompleteness. Link Prediction (LP), the task of predicting missing facts among entities already a K… ▽ More

    Submitted 21 January, 2021; v1 submitted 3 February, 2020; originally announced February 2020.

    Comments: Andrea Rossi, Donatella Firmani, Antonio Matinata, Paolo Merialdo, Denilson Barbosa. 2020. Knowledge Graph Embedding for Link Prediction: A Comparative Analysis. In ACM Transactions on Knowledge Discovery from Data. January 2021. (TKDD 2021). ACM, New York, NY, USA

  5. arXiv:1901.10232  [pdf, other

    cs.LG stat.ML

    Multikernel activation functions: formulation and a case study

    Authors: Simone Scardapane, Elena Nieddu, Donatella Firmani, Paolo Merialdo

    Abstract: The design of activation functions is a growing research area in the field of neural networks. In particular, instead of using fixed point-wise functions (e.g., the rectified linear unit), several authors have proposed ways of learning these functions directly from the data in a non-parametric fashion. In this paper we focus on the kernel activation function (KAF), a recently proposed framework wh… ▽ More

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: Accepted for presentation at INNS BDDL 2019 (https://innsbddl2019.org)

  6. Towards Knowledge Discovery from the Vatican Secret Archives. In Codice Ratio -- Episode 1: Machine Transcription of the Manuscripts

    Authors: Donatella Firmani, Marco Maiorino, Paolo Merialdo, Elena Nieddu

    Abstract: In Codice Ratio is a research project to study tools and techniques for analyzing the contents of historical documents conserved in the Vatican Secret Archives (VSA). In this paper, we present our efforts to develop a system to support the transcription of medieval manuscripts. The goal is to provide paleographers with a tool to reduce their efforts in transcribing large volumes, as those stored i… ▽ More

    Submitted 12 September, 2018; v1 submitted 8 March, 2018; originally announced March 2018.

    Comments: Donatella Firmani, Marco Maiorino, Paolo Merialdo, and Elena Nieddu. 2018. Towards Knowledge Discovery from the Vatican Secret Archives. In Codice Ratio - Episode 1: Machine Transcription of the Manuscripts. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '18). ACM, New York, NY, USA, 263-272

  7. Real-Time Monitoring of Undirected Networks: Articulation Points, Bridges, and Connected and Biconnected Components

    Authors: Giorgio Ausiello, Donatella Firmani, Luigi Laura

    Abstract: In this paper we present the first algorithm in the streaming model to characterize completely the biconnectivity properties of undirected networks: articulation points, bridges, and connected and biconnected components. The motivation of our work was the development of a real-time algorithm to monitor the connectivity of the Autonomous Systems (AS) Network, but the solution provided is general en… ▽ More

    Submitted 1 February, 2012; originally announced February 2012.