Showing 1–2 of 2 results for author: Esmailoghli, M

  1. arXiv:2310.02656  [pdf, other]

    cs.DB

    Blend: A Unified Data Discovery System

    Authors: Mahdi Esmailoghli, Christoph Schnell, Renée J. Miller, Ziawasch Abedjan

    Abstract: Data discovery is an iterative and incremental process that necessitates the execution of multiple data discovery queries to identify the desired tables from large and diverse data lakes. Current methodologies concentrate on single discovery tasks such as join, correlation, or union discovery. However, in practice, a series of these approaches and their corresponding index structures are necessary…

    Submitted 4 October, 2023; originally announced October 2023.

  2. arXiv:2110.00318  [pdf, other]

    cs.DB

    MATE: Multi-Attribute Table Extraction

    Authors: Mahdi Esmailoghli, Jorge-Arnulfo Quiané-Ruiz, Ziawasch Abedjan

    Abstract: A core operation in data discovery is to find joinable tables for a given table. Real-world tables include both unary and n-ary join keys. However, existing table discovery systems are optimized for unary joins and are ineffective and slow in the existence of n-ary keys. In this paper, we introduce MATE, a table discovery system that leverages a novel hash-based index that enables n-ary join disco…

    Submitted 25 April, 2022; v1 submitted 1 October, 2021; originally announced October 2021.