Showing 1–9 of 9 results for author: Chertok, A

  1. arXiv:2309.15552  [pdf, other]

    cs.LG cs.CE q-fin.CP

    Startup success prediction and VC portfolio simulation using CrunchBase data

    Authors: Mark Potanin, Andrey Chertok, Konstantin Zorin, Cyril Shtabtsovsky

    Abstract: Predicting startup success presents a formidable challenge due to the inherently volatile landscape of the entrepreneurial ecosystem. The advent of extensive databases like Crunchbase jointly with available open data enables the application of machine learning and artificial intelligence for more accurate predictive analytics. This paper focuses on startups at their Series B and Series C investmen…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 13 pages, preprint

    ACM Class: I.2.1; J.4

  2. arXiv:2206.12514  [pdf, other]

    cs.CL

    DetIE: Multilingual Open Information Extraction Inspired by Object Detection

    Authors: Michael Vasilkovsky, Anton Alekseev, Valentin Malykh, Ilya Shenbin, Elena Tutubalina, Dmitriy Salikhov, Mikhail Stepnov, Andrey Chertok, Sergey Nikolenko

    Abstract: State of the art neural methods for open information extraction (OpenIE) usually extract triplets (or tuples) iteratively in an autoregressive or predicate-based manner in order not to produce duplicates. In this work, we propose a different approach to the problem that can be equally or more successful. Namely, we present a novel single-pass method for OpenIE inspired by object detection algorith…

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  3. arXiv:2202.10784  [pdf, other]

    cs.CV cs.AI

    RuCLIP -- new models and experiments: a technical report

    Authors: Alex Shonenkov, Andrey Kuznetsov, Denis Dimitrov, Tatyana Shavrina, Daniil Chesakov, Anastasia Maltseva, Alena Fenogenova, Igor Pavlov, Anton Emelyanov, Sergey Markov, Daria Bakshandaeva, Vera Shybaeva, Andrey Chertok

    Abstract: In the report we propose six new implementations of ruCLIP model trained on our 240M pairs. The accuracy results are compared with original CLIP model with Ru-En translation (OPUS-MT) on 16 datasets from different domains. Our best implementations outperform CLIP + OPUS-MT solution on most of the datasets in few-shot and zero-shot tasks. In the report we briefly describe the implementations and co…

    Submitted 22 February, 2022; originally announced February 2022.

  4. arXiv:2112.07395  [pdf, other]

    cs.CV

    Handwritten text generation and strikethrough characters augmentation

    Authors: Alex Shonenkov, Denis Karachev, Max Novopoltsev, Mark Potanin, Denis Dimitrov, Andrey Chertok

    Abstract: We introduce two data augmentation techniques, which, used with a Resnet-BiLSTM-CTC network, significantly reduce Word Error Rate (WER) and Character Error Rate (CER) beyond best-reported results on handwriting text recognition (HTR) tasks. We apply a novel augmentation that simulates strikethrough text (HandWritten Blots) and a handwritten text generation method based on printed text (StackMix),…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 16 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2108.11667

    MSC Class: 68-04 ACM Class: I.7.5; I.4.6

  5. arXiv:2110.04228  [pdf, ps, other]

    cs.LG

    Hybrid Graph Embedding Techniques in Estimated Time of Arrival Task

    Authors: Vadim Porvatov, Natalia Semenova, Andrey Chertok

    Abstract: Recently, deep learning has achieved promising results in the calculation of Estimated Time of Arrival (ETA), which is considered as predicting the travel time from the start point to a certain place along a given path. ETA plays an essential role in intelligent taxi services or automotive navigation systems. A common practice is to use embedding vectors to represent the elements of a road network…

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: Accepted in ICCNA 2021

  6. RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark

    Authors: Tatiana Shavrina, Alena Fenogenova, Anton Emelyanov, Denis Shevelev, Ekaterina Artemova, Valentin Malykh, Vladislav Mikhailov, Maria Tikhonova, Andrey Chertok, Andrey Evlampiev

    Abstract: In this paper, we introduce an advanced Russian general language understanding evaluation benchmark -- RussianGLUE. Recent advances in the field of universal language models and transformers require the development of a methodology for their broad diagnostics and testing for general intellectual skills - detection of natural language inference, commonsense reasoning, ability to perform simple logi…

    Submitted 2 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: to appear in EMNLP 2020

  7. SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis

    Authors: Pavel Efimov, Andrey Chertok, Leonid Boytsov, Pavel Braslavski

    Abstract: SberQuAD, a large-scale analog of Stanford SQuAD in the Russian language, is a valuable resource that has not been properly presented to the scientific community. We fill this gap by providing a description, a thorough analysis, and baseline experimental results.

    Submitted 2 May, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

  8. A note on functional limit theorems for compound Cox processes

    Authors: V. Yu. Korolev, A. V. Chertok, A. Yu. Korchagin, E. V. Kossova, A. I. Zeifman

    Abstract: An improved version of the functional limit theorem is proved establishing weak convergence of random walks generated by compound doubly stochastic Poisson processes (compound Cox processes) to Lévy processes in the Skorokhod space under more realistic moment conditions. As corollaries, theorems are proved on convergence of random walks with jumps having finite variances to Lévy processes with…

    Submitted 9 July, 2015; originally announced July 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1410.1900

  9. arXiv:1410.1900  [pdf, ps, other]

    math.PR

    Modeling high-frequency order flow imbalance by functional limit theorems for two-sided risk processes

    Authors: V. Yu. Korolev, A. V. Chertok, A. Yu. Korchagin, A. I. Zeifman

    Abstract: A micro-scale model is proposed for the evolution of the limit order book. Within this model, the flows of orders (claims) are described by doubly stochastic Poisson processes taking account of the stochastic character of intensities of bid and ask orders that determine the price discovery mechanism in financial markets. The process of order flow imbalance (OFI) is studied. This process is a…

    Submitted 8 December, 2014; v1 submitted 6 October, 2014; originally announced October 2014.