Skip to main content

Showing 1–4 of 4 results for author: Vinciguerra, G

.
  1. arXiv:2311.15380  [pdf, other

    cs.DS cs.DB

    Grafite: Taming Adversarial Queries with Optimal Range Filters

    Authors: Marco Costa, Paolo Ferragina, Giorgio Vinciguerra

    Abstract: Range filters allow checking whether a query range intersects a given set of keys with a chance of returning a false positive answer, thus generalising the functionality of Bloom filters from point to range queries. Existing practical range filters have addressed this problem heuristically, resulting in high false positive rates and query times when dealing with adversarial inputs, such as in the… ▽ More

    Submitted 19 March, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in Proceedings of the ACM on Management of Data (SIGMOD 2024)

    Journal ref: Proceedings of the ACM on Management of Data, Volume 2, Issue 1 (2024), Article No. 3, pp 1-23

  2. arXiv:2304.11012  [pdf, other

    cs.DS

    Learned Monotone Minimal Perfect Hashing

    Authors: Paolo Ferragina, Hans-Peter Lehmann, Peter Sanders, Giorgio Vinciguerra

    Abstract: A Monotone Minimal Perfect Hash Function (MMPHF) constructed on a set S of keys is a function that maps each key in S to its rank. On keys not in S, the function returns an arbitrary value. Applications range from databases, search engines, data encryption, to pattern-matching algorithms. In this paper, we describe LeMonHash, a new technique for constructing MMPHFs for integers. The core idea of… ▽ More

    Submitted 30 August, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  3. arXiv:1910.06169  [pdf, ps, other

    cs.DS cs.DB cs.IR cs.LG

    The PGM-index: a multicriteria, compressed and learned approach to data indexing

    Authors: Paolo Ferragina, Giorgio Vinciguerra

    Abstract: The recent introduction of learned indexes has shaken the foundations of the decades-old field of indexing data structures. Combining, or even replacing, classic design elements such as B-tree nodes with machine learning models has proven to give outstanding improvements in the space footprint and time efficiency of data systems. However, these novel approaches are based on heuristics, thus they l… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: We remark to the reader that this paper is an extended and improved version of our previous paper titled "Superseding traditional indexes by orchestrating learning and geometry" (arXiv:1903.00507)

    ACM Class: E.1; E.4; I.2.6

    Journal ref: PVLDB, 13(8): 1162-1175, 2020

  4. arXiv:1903.00507  [pdf

    cs.DS

    Superseding traditional indexes by orchestrating learning and geometry

    Authors: Giorgio Vinciguerra, Paolo Ferragina, Michele Miccinesi

    Abstract: We design the first learned index that solves the dictionary problem with time and space complexity provably better than classic data structures for hierarchical memories, such as B-trees, and modern learned indexes. We call our solution the Piecewise Geometric Model index (PGM-index) because it turns the indexing of a sequence of keys into the coverage of a sequence of 2D-points via linear models… ▽ More

    Submitted 9 March, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

    ACM Class: E.1; E.4; I.2.6