Skip to main content

Showing 1–7 of 7 results for author: Ottaviano, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.10628  [pdf, other

    cs.CL cs.IR

    Autoregressive Search Engines: Generating Substrings as Document Identifiers

    Authors: Michele Bevilacqua, Giuseppe Ottaviano, Patrick Lewis, Wen-tau Yih, Sebastian Riedel, Fabio Petroni

    Abstract: Knowledge-intensive language tasks require NLP systems to both provide the correct answer and retrieve supporting evidence for it in a given corpus. Autoregressive language models are emerging as the de-facto standard for generating answers, with newer and more powerful systems emerging at an astonishing pace. In this paper we argue that all this (and future) progress can be directly applied to th… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: 9 pages

  2. Embedding-based Retrieval in Facebook Search

    Authors: Jui-Ting Huang, Ashish Sharma, Shuying Sun, Li Xia, David Zhang, Philip Pronin, Janani Padmanabhan, Giuseppe Ottaviano, Linjun Yang

    Abstract: Search in social networks such as Facebook poses different challenges than in classical web search: besides the query text, it is important to take into account the searcher's context to provide relevant results. Their social graph is an integral part of this context and is a unique aspect of Facebook search. While embedding-based retrieval (EBR) has been applied in eb search engines for years, Fa… ▽ More

    Submitted 29 July, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: 9 pages, 3 figures, 3 tables, to be published in KDD '20

  3. arXiv:1603.04330  [pdf, other

    cs.DS

    Fast Scalable Construction of (Minimal Perfect Hash) Functions

    Authors: Marco Genuzio, Giuseppe Ottaviano, Sebastiano Vigna

    Abstract: Recent advances in random linear systems on finite fields have paved the way for the construction of constant-time data structures representing static functions and minimal perfect hash functions using less space with respect to existing techniques. The main obstruction for any practical application of these results is the cubic-time Gaussian elimination required to solve these linear systems: des… ▽ More

    Submitted 22 March, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

  4. Compressing Graphs and Indexes with Recursive Graph Bisection

    Authors: Laxman Dhulipala, Igor Kabiljo, Brian Karrer, Giuseppe Ottaviano, Sergey Pupyrev, Alon Shalita

    Abstract: Graph reordering is a powerful technique to increase the locality of the representations of graphs, which can be helpful in several applications. We study how the technique can be used to improve compression of graphs and inverted indexes. We extend the recent theoretical model of Chierichetti et al. (KDD 2009) for graph compression, and show how it can be employed for compression-friendly reord… ▽ More

    Submitted 28 February, 2016; originally announced February 2016.

  5. arXiv:1312.0526  [pdf, other

    cs.DS

    Cache-Oblivious Peeling of Random Hypergraphs

    Authors: Djamal Belazzougui, Paolo Boldi, Giuseppe Ottaviano, Rossano Venturini, Sebastiano Vigna

    Abstract: The computation of a peeling order in a randomly generated hypergraph is the most time-consuming step in a number of constructions, such as perfect hashing schemes, random $r$-SAT solvers, error-correcting codes, and approximate set encodings. While there exists a straightforward linear time algorithm, its poor I/O performance makes it impractical for hypergraphs whose size exceeds the available i… ▽ More

    Submitted 2 December, 2013; originally announced December 2013.

  6. arXiv:1204.3581  [pdf, other

    cs.DS cs.DB

    The Wavelet Trie: Maintaining an Indexed Sequence of Strings in Compressed Space

    Authors: Roberto Grossi, Giuseppe Ottaviano

    Abstract: An indexed sequence of strings is a data structure for storing a string sequence that supports random access, searching, range counting and analytics operations, both for exact matches and prefix search. String sequences lie at the core of column-oriented databases, log processing, and other storage and query tasks. In these applications each string can appear several times and the order of the st… ▽ More

    Submitted 16 April, 2012; originally announced April 2012.

  7. arXiv:1111.5220  [pdf, ps, other

    cs.DS

    Fast Compressed Tries through Path Decompositions

    Authors: Roberto Grossi, Giuseppe Ottaviano

    Abstract: Tries are popular data structures for storing a set of strings, where common prefixes are represented by common root-to-node paths. Over fifty years of usage have produced many variants and implementations to overcome some of their limitations. We explore new succinct representations of path-decomposed tries and experimentally evaluate the corresponding reduction in space usage and memory latency,… ▽ More

    Submitted 3 December, 2011; v1 submitted 22 November, 2011; originally announced November 2011.