Skip to main content

Showing 1–4 of 4 results for author: Kosten, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.08349  [pdf, other

    cs.DB cs.AI cs.CL

    Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries

    Authors: Jonathan Fürst, Catherine Kosten, Farhard Nooralahzadeh, Yi Zhang, Kurt Stockinger

    Abstract: Text-to-SQL systems (also known as NL-to-SQL systems) have become an increasingly popular solution for bridging the gap between user capabilities and SQL-based data access. These systems translate user requests in natural language to valid SQL statements for a specific database. Recent Text-to-SQL systems have benefited from the rapid improvement of transformer-based language models. However, whil… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  2. Spider4SPARQL: A Complex Benchmark for Evaluating Knowledge Graph Question Answering Systems

    Authors: Catherine Kosten, Philippe Cudré-Mauroux, Kurt Stockinger

    Abstract: With the recent spike in the number and availability of Large Language Models (LLMs), it has become increasingly important to provide large and realistic benchmarks for evaluating Knowledge Graph Question Answering (KGQA) systems. So far the majority of benchmarks rely on pattern-based SPARQL query generation approaches. The subsequent natural language (NL) question generation is conducted through… ▽ More

    Submitted 8 December, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 10 pages, 5 figures, accepted at IEEE BigData Conference 2023, 8th IEEE Special Session on Machine Learning on Big Data (MLBD 2023)

    Journal ref: IEEE International Conference on Big Data 2023

  3. arXiv:2306.04743  [pdf, other

    cs.DB cs.AI cs.CL

    ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems

    Authors: Yi Zhang, Jan Deriu, George Katsogiannis-Meimarakis, Catherine Kosten, Georgia Koutrika, Kurt Stockinger

    Abstract: Natural Language to SQL systems (NL-to-SQL) have recently shown a significant increase in accuracy for natural language to SQL query translation. This improvement is due to the emergence of transformer-based language models, and the popularity of the Spider benchmark - the de-facto standard for evaluating NL-to-SQL systems. The top NL-to-SQL systems reach accuracies of up to 85\%. However, Spider… ▽ More

    Submitted 5 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 12 pages, 2 figures, 5 tables

    ACM Class: H.2.4; I.2.7

    Journal ref: PVLDB Volume 17, 2023-2024

  4. arXiv:2104.04194  [pdf, other

    cs.LG cs.AI cs.DB

    INODE: Building an End-to-End Data Exploration System in Practice [Extended Vision]

    Authors: Sihem Amer-Yahia, Georgia Koutrika, Frederic Bastian, Theofilos Belmpas, Martin Braschler, Ursin Brunner, Diego Calvanese, Maximilian Fabricius, Orest Gkini, Catherine Kosten, Davide Lanti, Antonis Litke, Hendrik Lücke-Tieke, Francesco Alessandro Massucci, Tarcisio Mendes de Farias, Alessandro Mosca, Francesco Multari, Nikolaos Papadakis, Dimitris Papadopoulos, Yogendra Patil, Aurélien Personnaz, Guillem Rull, Ana Sima, Ellery Smith, Dimitrios Skoutas , et al. (3 additional authors not shown)

    Abstract: A full-fledged data exploration system must combine different access modalities with a powerful concept of guiding the user in the exploration process, by being reactive and anticipative both for data discovery and for data linking. Such systems are a real opportunity for our community to cater to users with different domain and data science expertise. We introduce INODE -- an end-to-end data expl… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: 8 pages, 5 figures

    ACM Class: I.2; H.2