Showing 1–7 of 7 results for author: Smirnov, K

Searching in archive cs.
  1. arXiv:2308.08702  [pdf, other]

    cs.DB cs.PF

    Finding a Second Wind: Speeding Up Graph Traversal Queries in RDBMSs Using Column-Oriented Processing

    Authors: Mikhail Firsov, Michael Polyntsov, Kirill Smirnov, George Chernishev

    Abstract: Recursive queries and recursive derived tables constitute an important part of the SQL standard. Their efficient processing is important for many real-life applications that rely on graph or hierarchy traversal. Position-enabled column-stores offer a novel opportunity to improve run times for this type of query. Such systems allow the engine to explicitly use data positions (row ids) inside its…

    Submitted 16 August, 2023; originally announced August 2023.

    ACM Class: H.2.4; E.2

  2. arXiv:2307.14935  [pdf, ps, other]

    cs.DB cs.AI cs.CE cs.LG

    Solving Data Quality Problems with Desbordante: a Demo

    Authors: George Chernishev, Michael Polyntsov, Anton Chizhov, Kirill Stupakov, Ilya Shchuckin, Alexander Smirnov, Maxim Strutovsky, Alexey Shlyonskikh, Mikhail Firsov, Stepan Manannikov, Nikita Bobrov, Daniil Goncharov, Ilia Barutkin, Vladislav Shalnev, Kirill Muraviev, Anna Rakhmukova, Dmitriy Shcheka, Anton Chernikov, Mikhail Vyrodov, Yaroslav Kurbatov, Maxim Fofanov, Sergei Belokonnyi, Pavel Anosov, Arthur Saliou, Eduard Gaisin, et al. (1 additional author not shown)

    Abstract: Data profiling is an essential process in modern data-driven industries. One of its critical components is the discovery and validation of complex statistics, including functional dependencies, data constraints, association rules, and others. However, most existing data profiling systems that focus on complex statistics do not provide proper integration with the tools used by contemporary data s…

    Submitted 28 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    ACM Class: H.3; I.5; J.0

  3. Simple method for detecting sleep episodes in rats ECoG using machine learning

    Authors: Konstantin Sergeev, Anastasiya Runnova, Maxim Zhuravlev, Evgenia Sitnikova, Elizaveta Rutskova, Kirill Smirnov, Andrei Slepnev, Nadezhda Semenova

    Abstract: In this paper we propose a new method for the automatic recognition of the state of behavioral sleep (BS) and waking state (WS) in freely moving rats using their electrocorticographic (ECoG) data. Three-channel ECoG signals were recorded from frontal left, frontal right and occipital right cortical areas. We employed a simple artificial neural network (ANN), in which the mean values and standard…

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: Supplementary materials are given in pp. 8–9. 4 figures in the main text and 1 figure in supplementary materials. The manuscript is under consideration at Pattern Recognition Letters

    Report number: Chaos, Solitons & Fractals, 2023, Vol. 173, P. 113608

    MSC Class: 62M45; 92B20

  4. arXiv:2301.05965  [pdf, other]

    cs.DB cs.AI cs.LG

    Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)

    Authors: George Chernishev, Michael Polyntsov, Anton Chizhov, Kirill Stupakov, Ilya Shchuckin, Alexander Smirnov, Maxim Strutovsky, Alexey Shlyonskikh, Mikhail Firsov, Stepan Manannikov, Nikita Bobrov, Daniil Goncharov, Ilia Barutkin, Vladislav Shalnev, Kirill Muraviev, Anna Rakhmukova, Dmitriy Shcheka, Anton Chernikov, Dmitrii Mandelshtam, Mikhail Vyrodov, Arthur Saliou, Eduard Gaisin, Kirill Smirnov

    Abstract: Pioneering data profiling systems such as Metanome and OpenClean brought public attention to science-intensive data profiling. This type of profiling aims to extract complex patterns (primitives) such as functional dependencies, data constraints, association rules, and others. However, these tools are research prototypes rather than production-ready systems. The following work presents Desbordan…

    Submitted 14 January, 2023; originally announced January 2023.

    ACM Class: H.3; I.5; J.0

  5. arXiv:2207.12713  [pdf, other]

    cs.DB cs.DS cs.PF

    Implementing the Comparison-Based External Sort

    Authors: Michael Polyntsov, Valentin Grigorev, Kirill Smirnov, George Chernishev

    Abstract: In the age of big data, sorting is an indispensable operation for DBMSes and similar systems. Having data sorted can help produce query plans with significantly lower run times. It also can provide other benefits like having non-blocking operators which will produce data steadily (without bursts), or operators with reduced memory footprint. Sorting may be required on any step of query processing…

    Submitted 26 July, 2022; originally announced July 2022.

    ACM Class: H.2; E.5

  6. arXiv:2105.09058  [pdf, other]

    cs.DB cs.DC cs.PF

    Revisiting Data Compression in Column-Stores

    Authors: Alexander Slesarev, Evgeniy Klyuchikov, Kirill Smirnov, George Chernishev

    Abstract: Data compression is widely used in contemporary column-oriented DBMSes to lower space usage and to speed up query processing. Pioneering systems have introduced compression to tackle the disk bandwidth bottleneck by trading CPU processing power for it. The main issue of this is a trade-off between the compression ratio and the decompression CPU cost. Existing results state that light-weight compre…

    Submitted 19 May, 2021; originally announced May 2021.

    ACM Class: H.2; E.2

  7. arXiv:2005.07992  [pdf, other]

    cs.DB

    Extending Databases to Support Data Manipulation with Functional Dependencies: a Vision Paper

    Authors: Nikita Bobrov, Kirill Smirnov, George Chernishev

    Abstract: In the current paper, we propose to fuse together stored data (tables) and their functional dependencies (FDs) inside a DBMS. We aim to make FDs first-class citizens: objects which can be queried and used to query data. Our idea is to allow analysts to explore both data and functional dependencies using the database interface. For example, an analyst may be interested in such tasks as: "find all r…

    Submitted 16 May, 2020; originally announced May 2020.

    ACM Class: H.2.0