Skip to main content

Showing 1–14 of 14 results for author: de Bernardo, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2009.10045  [pdf, other

    cs.DS

    Space/time-efficient RDF stores based on circular suffix sorting

    Authors: Nieves R. Brisaboa, Ana Cerdeira-Pena, Guillermo de Bernardo, Antonio Fariña, Gonzalo Navarro

    Abstract: In recent years, RDF has gained popularity as a format for the standardized publication and exchange of information in the Web of Data. In this paper we introduce RDFCSA, a data structure that is able to self-index an RDF dataset in small space and supports efficient querying. RDFCSA regards the triples of the RDF store as short circular strings and applies suffix sorting on those strings, so that… ▽ More

    Submitted 13 April, 2022; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: This work has been submitted to a Journal for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. arXiv:2002.11622  [pdf, ps, other

    cs.DB cs.DS

    Revisiting compact RDF stores based on k2-trees

    Authors: Nieves R. Brisaboa, Ana Cerdeira-Pena, Guillermo de Bernardo, Antonio Fariña

    Abstract: We present a new compact representation to efficiently store and query large RDF datasets in main memory. Our proposal, called BMatrix, is based on the k2-tree, a data structure devised to represent binary matrices in a compressed way, and aims at improving the results of previous state-of-the-art alternatives, especially in datasets with a relatively large number of predicates. We introduce our t… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941

  3. Faster Dynamic Compressed d-ary Relations

    Authors: Diego Arroyuelo, Guillermo de Bernardo, Travis Gagie, Gonzalo Navarro

    Abstract: The $k^2$-tree is a successful compact representation of binary relations that exhibit sparseness and/or clustering properties. It can be extended to $d$ dimensions, where it is called a $k^d$-tree. The representation boils down to a long bitvector. We show that interpreting the $k^d$-tree as a dynamic trie on the Morton codes of the points, instead of as a dynamic representation of the bitvector… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941

    Journal ref: Proc. SPIRE 2019

  4. Extending General Compact Querieable Representations to GIS Applications

    Authors: Nieves R. Brisaboa, Ana Cerdeira-Pena, Guillermo de Bernardo, Gonzalo Navarro, Oscar Pedreira

    Abstract: The raster model is commonly used for the representation of images in many domains, and is especially useful in Geographic Information Systems (GIS) to store information about continuous variables of the space (elevation, temperature, etc.). Current representations of raster data are usually designed for external memory or, when stored in main memory, lack efficient query capabilities. In this pap… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941,

    Journal ref: Information Sciences 2020

  5. Improved Compressed String Dictionaries

    Authors: Nieves R. Brisaboa, Ana Cerdeira-Pena, Guillermo de Bernardo, Gonzalo Navarro

    Abstract: We introduce a new family of compressed data structures to efficiently store and query large string dictionaries in main memory. Our main technique is a combination of hierarchical Front-coding with ideas from longest-common-prefix computation in suffix arrays. Our data structures yield relevant space-time tradeoffs in real-world dictionaries. We focus on two domains where string dictionaries are… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941

    Journal ref: Proc. 28th ACM International Conference on Information and Knowledge Management (CIKM 2019)

  6. arXiv:1911.03195  [pdf, other

    cs.DS

    On dynamic succinct graph representations

    Authors: Miguel E. Coimbra, Alexandre P. Francisco, Luís M. S. Russo, Guillermo de Bernardo, Susana Ladra, Gonzalo Navarro

    Abstract: We address the problem of representing dynamic graphs using $k^2$-trees. The $k^2$-tree data structure is one of the succinct data structures proposed for representing static graphs, and binary relations in general. It relies on compact representations of bit vectors. Hence, by relying on compact representations of dynamic bit vectors, we can also represent dynamic graphs. In this paper we follow… ▽ More

    Submitted 6 December, 2019; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941

  7. arXiv:1810.10965  [pdf, other

    cs.DS

    Towards a compact representation of temporal rasters

    Authors: Ana Cerdeira-Pena, Guillermo de Bernardo, Antonio Fariña, Jose R. Parama, Fernando Silva-Coira

    Abstract: Big research efforts have been devoted to efficiently manage spatio-temporal data. However, most works focused on vectorial data, and much less, on raster data. This work presents a new representation for raster data that evolve along time named Temporal k^2 raster. It faces the two main issues that arise when dealing with spatio-temporal data: the space consumption and the query response times. I… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941. Published in SPIRE 2018

    Journal ref: SPIRE 2018

  8. About BIRDS project (Bioinformatics and Information Retrieval Data Structures Analysis and Design)

    Authors: Guillermo de Bernardo, Susana Ladra

    Abstract: BIRDS stands for "Bioinformatics and Information Retrieval Data Structures analysis and design" and is a 4-year project (2016--2019) that has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 690941. The overall goal of BIRDS is to establish a long term international network involving leading researchers… ▽ More

    Submitted 19 July, 2018; originally announced July 2018.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941. CERI 2018

    MSC Class: 68

    Journal ref: Proceedings of the 5th Spanish Conference on Information Retrieval (CERI '18), 2018

  9. arXiv:1803.02576  [pdf, ps, other

    cs.DS

    Compact Representations of Event Sequences

    Authors: Nieves R. Brisaboa, Guillermo de Bernardo, Gonzalo Navarro, Tirso V. Rodeiro, Diego Seco

    Abstract: We introduce a new technique for the efficient management of large sequences of multidimensional data, which takes advantage of regularities that arise in real-world datasets and supports different types of aggregation queries. More importantly, our representation is flexible in the sense that the relevant dimensions and queries may be used to guide the construction process, easily providing a spa… ▽ More

    Submitted 7 March, 2018; originally announced March 2018.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941

  10. arXiv:1707.02769  [pdf, ps, other

    cs.DS

    Compressed Representation of Dynamic Binary Relations with Applications

    Authors: Nieves R. Brisaboa, Ana Cerdeira-Pena, Guillermo de Bernardo, Gonzalo Navarro

    Abstract: We introduce a dynamic data structure for the compact representation of binary relations $\mathcal{R} \subseteq A \times B$. The data structure is a dynamic variant of the k$^2$-tree, a static compact representation that takes advantage of clustering in the binary relation to achieve compression. Our structure can efficiently check whether two objects $(a,b) \in A \times B$ are related, and list t… ▽ More

    Submitted 10 July, 2017; originally announced July 2017.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941, Information Systems (2017)

  11. A succinct data structure for self-indexing ternary relations

    Authors: Sandra Alvarez-Garcia, Guillermo de Bernardo, Nieves R. Brisaboa, Gonzalo Navarro

    Abstract: The representation of binary relations has been intensively studied and many different theoretical and practical representations have been proposed to answer the usual queries in multiple domains. However, ternary relations have not received as much attention, even though many real-world applications require the processing of ternary relations. In this paper we present a new compressed and self-in… ▽ More

    Submitted 10 July, 2017; originally announced July 2017.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941, Journal of Discrete Algorithms (2017)

  12. arXiv:1611.05247  [pdf

    cs.DS

    A new method to index and store spatio-temporal data

    Authors: Guillermo de Bernardo, Ramón Casares, Adrián Gómez-Brandón, José R. Paramá

    Abstract: We propose a data structure that stores, in a compressed way, object trajectories, which at the same time, allow to efficiently response queries without the need to decompress the data. We use a data structure, called $k^{2}$-tree, to store the full position of all objects at regular time intervals. For storing the positions of objects between two time instants represented with $k^{2}$-trees, we o… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941

    Journal ref: Proceeding of the 20th Pacific Asia Conference on Information Systems (PACIS 2016). Association for Information Systems. AIS Electronic Library (AISeL). Paper 93. ISBN: 9789860491029

  13. Aggregated 2D Range Queries on Clustered Points

    Authors: Nieves R. Brisaboa, Guillermo De Bernardo, Roberto Konow, Gonzalo Navarro, Diego Seco

    Abstract: Efficient processing of aggregated range queries on two-dimensional grids is a common requirement in information retrieval and data mining systems, for example in Geographic Information Systems and OLAP cubes. We introduce a technique to represent grids supporting aggregated range queries that requires little space when the data points in the grid are clustered, which is common in practice. We sho… ▽ More

    Submitted 30 March, 2016; v1 submitted 7 March, 2016; originally announced March 2016.

    Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941

    Journal ref: Information Systems, Volume 60, Pages 34-49, 2016

  14. arXiv:1411.2785  [pdf, other

    cs.DS

    Faster Compressed Quadtrees

    Authors: Guillermo de Bernardo, Travis Gagie, Susana Ladra, Gonzalo Navarro, Diego Seco

    Abstract: Real-world point sets tend to be clustered, so using a machine word for each point is wasteful. In this paper we first show how a compact representation of quadtrees using $\Oh{1}$ bits per node can break this bound on clustered point sets, while offering efficient range searches. We then describe a new compact quadtree representation based on heavy path decompositions, which supports queries fast… ▽ More

    Submitted 8 December, 2021; v1 submitted 11 November, 2014; originally announced November 2014.

    Comments: Journal version of DCC '15 paper