Showing 1–3 of 3 results for author: Bobrov, N

  1. arXiv:2307.14935  [pdf, ps, other]

    cs.DB cs.AI cs.CE cs.LG

    Solving Data Quality Problems with Desbordante: a Demo

    Authors: George Chernishev, Michael Polyntsov, Anton Chizhov, Kirill Stupakov, Ilya Shchuckin, Alexander Smirnov, Maxim Strutovsky, Alexey Shlyonskikh, Mikhail Firsov, Stepan Manannikov, Nikita Bobrov, Daniil Goncharov, Ilia Barutkin, Vladislav Shalnev, Kirill Muraviev, Anna Rakhmukova, Dmitriy Shcheka, Anton Chernikov, Mikhail Vyrodov, Yaroslav Kurbatov, Maxim Fofanov, Sergei Belokonnyi, Pavel Anosov, Arthur Saliou, Eduard Gaisin, et al. (1 additional author not shown)

    Abstract: Data profiling is an essential process in modern data-driven industries. One of its critical components is the discovery and validation of complex statistics, including functional dependencies, data constraints, association rules, and others. However, most existing data profiling systems that focus on complex statistics do not provide proper integration with the tools used by contemporary data s…

    Submitted 28 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    ACM Class: H.3; I.5; J.0

  2. arXiv:2301.05965  [pdf, other]

    cs.DB cs.AI cs.LG

    Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)

    Authors: George Chernishev, Michael Polyntsov, Anton Chizhov, Kirill Stupakov, Ilya Shchuckin, Alexander Smirnov, Maxim Strutovsky, Alexey Shlyonskikh, Mikhail Firsov, Stepan Manannikov, Nikita Bobrov, Daniil Goncharov, Ilia Barutkin, Vladislav Shalnev, Kirill Muraviev, Anna Rakhmukova, Dmitriy Shcheka, Anton Chernikov, Dmitrii Mandelshtam, Mikhail Vyrodov, Arthur Saliou, Eduard Gaisin, Kirill Smirnov

    Abstract: Pioneering data profiling systems such as Metanome and OpenClean brought public attention to science-intensive data profiling. This type of profiling aims to extract complex patterns (primitives) such as functional dependencies, data constraints, association rules, and others. However, these tools are research prototypes rather than production-ready systems. The following work presents Desbordan…

    Submitted 14 January, 2023; originally announced January 2023.

    ACM Class: H.3; I.5; J.0

  3. arXiv:2005.07992  [pdf, other]

    cs.DB

    Extending Databases to Support Data Manipulation with Functional Dependencies: a Vision Paper

    Authors: Nikita Bobrov, Kirill Smirnov, George Chernishev

    Abstract: In the current paper, we propose to fuse together stored data (tables) and their functional dependencies (FDs) inside a DBMS. We aim to make FDs first-class citizens: objects which can be queried and used to query data. Our idea is to allow analysts to explore both data and functional dependencies using the database interface. For example, an analyst may be interested in such tasks as: "find all r…

    Submitted 16 May, 2020; originally announced May 2020.

    ACM Class: H.2.0