Showing 1–12 of 12 results for author: Papastefanatos, G

  1. arXiv:2302.12333  [pdf, other]

    cs.LG cs.CY cs.DB

    Auditing for Spatial Fairness

    Authors: Dimitris Sacharidis, Giorgos Giannopoulos, George Papastefanatos, Kostas Stefanidis

    Abstract: This paper studies algorithmic fairness when the protected attribute is location. To handle protected attributes that are continuous, such as age or income, the standard approach is to discretize the domain into predefined groups, and compare algorithmic outcomes across groups. However, applying this idea to location raises concerns of gerrymandering and may introduce statistical bias. Prior work…

    Submitted 23 February, 2023; originally announced February 2023.

  2. arXiv:2301.12939  [pdf, other]

    eess.SP cs.CV cs.LG

    Data-driven soiling detection in PV modules

    Authors: Alexandros Kalimeris, Ioannis Psarros, Giorgos Giannopoulos, Manolis Terrovitis, George Papastefanatos, Gregory Kotsis

    Abstract: Soiling is the accumulation of dirt in solar panels which leads to a decreasing trend in solar energy yield and may be the cause of vast revenue losses. The effect of soiling can be reduced by washing the panels, which is, however, a procedure of non-negligible cost. Moreover, soiling monitoring systems are often unreliable or very costly. We study the problem of estimating the soiling ratio in ph…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: 12 pages, 4 figures

  3. arXiv:2202.01546  [pdf, other]

    cs.DB

    QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data

    Authors: Giorgos Alexiou, George Papastefanatos, Vassilis Stamatopoulos, Georgia Koutrika, Nectarios Koziris

    Abstract: In this work, we explore the problem of correctly and efficiently answering complex SPJ queries issued directly on top of dirty data. We introduce QueryER, a framework that seamlessly integrates Entity Resolution into Query Processing. QueryER executes analysis-aware deduplication by weaving ER operators into the query plan. The experimental evaluation of our approach exhibits that it adapts to th…

    Submitted 3 February, 2022; originally announced February 2022.

  4. arXiv:1809.02345  [pdf, other]

    cs.DB

    Hierarchical Characteristic Set Merging for Optimizing SPARQL Queries in Heterogeneous RDF

    Authors: Marios Meimaris, George Papastefanatos

    Abstract: Characteristic sets (CS) organize RDF triples based on the set of properties characterizing their subject nodes. This concept is recently used in indexing techniques, as it can capture the implicit schema of RDF data. While most CS-based approaches yield significant improvements in space and query performance, they fail to perform well in the presence of schema heterogeneity, i.e., when the number…

    Submitted 7 September, 2018; originally announced September 2018.

  5. graphVizdb: A Scalable Platform for Interactive Large Graph Visualization

    Authors: Nikos Bikakis, John Liagouris, Maria Krommyda, George Papastefanatos, Timos Sellis

    Abstract: We present a novel platform for the interactive visualization of very large graphs. The platform enables the user to interact with the visualized graph in a way that is very similar to the exploration of maps at multiple levels. Our approach involves an offline preprocessing phase that builds the layout of the graph by assigning coordinates to its nodes with respect to a Euclidean plane. The respe…

    Submitted 20 February, 2016; originally announced February 2016.

    Comments: 32nd IEEE International Conference on Data Engineering (ICDE '16)

    MSC Class: 97R50; 68P05; 68P15 ACM Class: E.1; H.2.8; H.5.2; H.4

  6. arXiv:1511.04750  [pdf, other]

    cs.HC cs.DB cs.DS

    A Hierarchical Aggregation Framework for Efficient Multilevel Visual Exploration and Analysis

    Authors: Nikos Bikakis, George Papastefanatos, Melina Skourla, Timos Sellis

    Abstract: Data exploration and visualization systems are of great importance in the Big Data era, in which the volume and heterogeneity of available information make it difficult for humans to manually explore and analyse data. Most traditional systems operate in an offline way, limited to accessing preprocessed (static) sets of data. They also restrict themselves to dealing with small dataset sizes, which…

    Submitted 19 February, 2016; v1 submitted 15 November, 2015; originally announced November 2015.

    Comments: Semantic Web Journal 2016 (to appear)

    MSC Class: 97R50; 68P05; 68P15 ACM Class: E.1; H.2.8; H.5.2; H.4

  7. arXiv:1506.04333  [pdf, other]

    cs.HC cs.DB

    Towards Scalable Visual Exploration of Very Large RDF Graphs

    Authors: Nikos Bikakis, John Liagouris, Maria Krommyda, George Papastefanatos, Timos Sellis

    Abstract: In this paper, we outline our work on developing a disk-based infrastructure for efficient visualization and graph exploration operations over very large graphs. The proposed platform, called graphVizdb, is based on a novel technique for indexing and storing the graph. Particularly, the graph layout is indexed with a spatial data structure, i.e., an R-tree, and stored in a database. In runtime, us…

    Submitted 16 June, 2015; v1 submitted 13 June, 2015; originally announced June 2015.

    Comments: 12th Extended Semantic Web Conference (ESWC 2015)

  8. arXiv:1504.06451  [pdf]

    cs.DB

    A Framework for Managing Evolving Information Resources on the Data Web

    Authors: Marios Meimaris, George Papastefanatos, Christos Pateritsas, Theodora Galani, Yannis Stavrakas

    Abstract: The web of data has brought forth the need to preserve and sustain evolving information within linked datasets; however, a basic requirement of data preservation is the maintenance of the datasets' structural characteristics as well. As open data are often found using different and/or heterogeneous data models and schemata from one source to another, there is a need to reconcile these mismatches a…

    Submitted 5 May, 2015; v1 submitted 24 April, 2015; originally announced April 2015.

    Comments: arXiv admin note: text overlap with arXiv:1504.01891

  9. arXiv:1504.01891  [pdf]

    cs.DB

    A Query Language for Multi-version Data Web Archives

    Authors: Marios Meimaris, George Papastefanatos, Stratis Viglas, Yannis Stavrakas, Christos Pateritsas, Ioannis Anagnostopoulos

    Abstract: The Data Web refers to the vast and rapidly increasing quantity of scientific, corporate, government and crowd-sourced data published in the form of Linked Open Data, which encourages the uniform representation of heterogeneous data items on the web and the creation of links between them. The growing availability of open linked datasets has brought forth significant new challenges regarding their…

    Submitted 12 May, 2016; v1 submitted 8 April, 2015; originally announced April 2015.

  10. arXiv:1408.3148  [pdf, other]

    cs.DB

    rdf:SynopsViz - A Framework for Hierarchical Linked Data Visual Exploration and Analysis

    Authors: Nikos Bikakis, Melina Skourla, George Papastefanatos

    Abstract: The purpose of data visualization is to offer intuitive ways for information perception and manipulation, especially for non-expert users. The Web of Data has realized the availability of a huge amount of datasets. However, the volume and heterogeneity of available information make it difficult for humans to manually explore and analyse large datasets. In this paper, we present rdf:SynopsViz, a to…

    Submitted 27 June, 2017; v1 submitted 13 August, 2014; originally announced August 2014.

    Comments: 11th Extended Semantic Web Conference (ESWC '14)

  11. arXiv:1205.2320  [pdf, other]

    cs.DB

    Publishing Life Science Data as Linked Open Data: the Case Study of miRBase

    Authors: Theodore Dalamagas, Nikos Bikakis, George Papastefanatos, Yannis Stavrakas, Artemis G. Hatzigeorgiou

    Abstract: This paper presents our Linked Open Data (LOD) infrastructures for genomic and experimental data related to microRNA biomolecules. Legacy data from two well-known microRNA databases with experimental data and observations, as well as change and version information about microRNA entities, are fused and exported as LOD. Our LOD server assists biologists to explore biological entities and their evol…

    Submitted 10 May, 2012; originally announced May 2012.

    Comments: Presented at the First International Workshop On Open Data, WOD-2012 (arXiv:1204.3726)

    Report number: WOD/2012/NANTES/7

  12. arXiv:1205.2292  [pdf]

    cs.DB cs.DL

    Diachronic Linked Data: Towards Long-Term Preservation of Structured Interrelated Information

    Authors: Yannis Stavrakas, George Papastefanatos, Theodore Dalamagas, Vassilis Christophides

    Abstract: The Linked Data Paradigm is one of the most promising technologies for publishing, sharing, and connecting data on the Web, and offers a new way for data integration and interoperability. However, the proliferation of distributed, inter-connected sources of information and services on the Web poses significant new challenges for managing consistently a huge number of large datasets and their inter…

    Submitted 10 May, 2012; originally announced May 2012.

    Comments: Presented at the First International Workshop On Open Data, WOD-2012 (arXiv:1204.3726)

    Report number: WOD/2012/NANTES/10