Skip to main content

Showing 1–6 of 6 results for author: Koutrika, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.16547  [pdf, other

    cs.CR cs.DB

    FreqyWM: Frequency Watermarking for the New Data Economy

    Authors: Devriş İşler, Elisa Cabana, Alvaro Garcia-Recuero, Georgia Koutrika, Nikolaos Laoutaris

    Abstract: We present a novel technique for modulating the appearance frequency of a few tokens within a dataset for encoding an invisible watermark that can be used to protect ownership rights upon data. We develop optimal as well as fast heuristic algorithms for creating and verifying such watermarks. We also demonstrate the robustness of our technique against various attacks and derive analytical bounds f… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: Accepted at ICDE 2024

  2. arXiv:2306.04743  [pdf, other

    cs.DB cs.AI cs.CL

    ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems

    Authors: Yi Zhang, Jan Deriu, George Katsogiannis-Meimarakis, Catherine Kosten, Georgia Koutrika, Kurt Stockinger

    Abstract: Natural Language to SQL systems (NL-to-SQL) have recently shown a significant increase in accuracy for natural language to SQL query translation. This improvement is due to the emergence of transformer-based language models, and the popularity of the Spider benchmark - the de-facto standard for evaluating NL-to-SQL systems. The top NL-to-SQL systems reach accuracies of up to 85\%. However, Spider… ▽ More

    Submitted 5 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 12 pages, 2 figures, 5 tables

    ACM Class: H.2.4; I.2.7

    Journal ref: PVLDB Volume 17, 2023-2024

  3. arXiv:2202.01546  [pdf, other

    cs.DB

    QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data

    Authors: Giorgos Alexiou, George Papastefanatos, Vassilis Stamatopoulos, Georgia Koutrika, Nectarios Koziris

    Abstract: In this work, we explore the problem of correctly and efficiently answering complex SPJ queries issued directly on top of dirty data. We introduce QueryER, a framework that seamlessly integrates Entity Resolution into Query Processing. QueryER executes analysis-aware deduplication by weaving ER operators into the query plan. The experimental evaluation of our approach exhibits that it adapts to th… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  4. arXiv:2104.05994  [pdf, other

    cs.DB

    Fairness in Rankings and Recommendations: An Overview

    Authors: Evaggelia Pitoura, Kostas Stefanidis, Georgia Koutrika

    Abstract: We increasingly depend on a variety of data-driven algorithmic systems to assist us in many aspects of life. Search engines and recommender systems amongst others are used as sources of information and to help us in making all sort of decisions from selecting restaurants and books, to choosing friends and careers. This has given rise to important concerns regarding the fairness of such systems. In… ▽ More

    Submitted 31 August, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

  5. arXiv:2104.04194  [pdf, other

    cs.LG cs.AI cs.DB

    INODE: Building an End-to-End Data Exploration System in Practice [Extended Vision]

    Authors: Sihem Amer-Yahia, Georgia Koutrika, Frederic Bastian, Theofilos Belmpas, Martin Braschler, Ursin Brunner, Diego Calvanese, Maximilian Fabricius, Orest Gkini, Catherine Kosten, Davide Lanti, Antonis Litke, Hendrik Lücke-Tieke, Francesco Alessandro Massucci, Tarcisio Mendes de Farias, Alessandro Mosca, Francesco Multari, Nikolaos Papadakis, Dimitris Papadopoulos, Yogendra Patil, Aurélien Personnaz, Guillem Rull, Ana Sima, Ellery Smith, Dimitrios Skoutas , et al. (3 additional authors not shown)

    Abstract: A full-fledged data exploration system must combine different access modalities with a powerful concept of guiding the user in the exploration process, by being reactive and anticipative both for data discovery and for data linking. Such systems are a real opportunity for our community to cater to users with different domain and data science expertise. We introduce INODE -- an end-to-end data expl… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: 8 pages, 5 figures

    ACM Class: I.2; H.2

  6. arXiv:0909.1774  [pdf

    cs.DB cs.CY

    Social Systems: Can we Do More Than Just Poke Friends?

    Authors: Georgia Koutrika, Benjamin Bercovitz, Robert Ikeda, Filip Kaliszan, Henry Liou, Zahra Mohammadi Zadeh, Hector Garcia-Molina

    Abstract: Social sites have become extremely popular among users but have they attracted equal attention from the research community? Are they good only for simple tasks, such as tagging and poking friends? Do they present any new or interesting research challenges? In this paper, we describe the insights we have obtained implementing CourseRank, a course evaluation and planning social system. We argue th… ▽ More

    Submitted 9 September, 2009; originally announced September 2009.

    Comments: CIDR 2009