Skip to main content

Showing 1–6 of 6 results for author: Pavlichenko, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.16511  [pdf, other

    cs.CV cs.AI cs.CL cs.HC

    Toloka Visual Question Answering Benchmark

    Authors: Dmitry Ustalov, Nikita Pavlichenko, Sergey Koshelev, Daniil Likhobaba, Alisa Smirnova

    Abstract: In this paper, we present Toloka Visual Question Answering, a new crowdsourced dataset allowing comparing performance of machine learning systems against human level of expertise in the grounding visual question answering task. In this task, given an image and a textual question, one has to draw the bounding box around the object correctly responding to that question. Every image-question pair con… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 16 pages; see https://toloka.ai/challenges/wsdm2023/ for more details

    MSC Class: 68-11 ACM Class: C.4

  2. arXiv:2209.11711  [pdf, other

    cs.HC cs.CL cs.CV

    Best Prompts for Text-to-Image Models and How to Find Them

    Authors: Nikita Pavlichenko, Dmitry Ustalov

    Abstract: Recent progress in generative models, especially in text-guided diffusion models, has enabled the production of aesthetically-pleasing imagery resembling the works of professional human artists. However, one has to carefully compose the textual description, called the prompt, and augment it with a set of clarifying keywords. Since aesthetics are challenging to evaluate computationally, human feedb… ▽ More

    Submitted 1 June, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 13 pages (6 main pages), 7 figures, 4 tables, accepted at SIGIR '23 Short Paper Track

    ACM Class: H.5.2; H.3.3

  3. arXiv:2110.14990  [pdf, other

    cs.HC

    IMDB-WIKI-SbS: An Evaluation Dataset for Crowdsourced Pairwise Comparisons

    Authors: Nikita Pavlichenko, Dmitry Ustalov

    Abstract: Today, comprehensive evaluation of large-scale machine learning models is possible thanks to the open datasets produced using crowdsourcing, such as SQuAD, MS COCO, ImageNet, SuperGLUE, etc. These datasets capture objective responses, assuming the single correct answer, which does not allow to capture the subjective human perception. In turn, pairwise comparison tasks, in which one has to choose b… ▽ More

    Submitted 26 November, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted to NeurIPS Data-Centric AI Workshop

  4. Learning from Crowds with Crowd-Kit

    Authors: Dmitry Ustalov, Nikita Pavlichenko, Boris Tseitlin

    Abstract: This paper presents Crowd-Kit, a general-purpose computational quality control toolkit for crowdsourcing. Crowd-Kit provides efficient and convenient implementations of popular quality control algorithms in Python, including methods for truth inference, deep learning from crowds, and data quality estimation. Our toolkit supports multiple modalities of answers and provides dataset loaders and examp… ▽ More

    Submitted 6 April, 2024; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: published at JOSS

    ACM Class: G.4

    Journal ref: Journal of Open Source Software (2024), 9(96), 6227

  5. arXiv:2107.01091  [pdf, other

    cs.SD cs.HC cs.LG eess.AS

    CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription

    Authors: Nikita Pavlichenko, Ivan Stelmakh, Dmitry Ustalov

    Abstract: Domain-specific data is the crux of the successful transfer of machine learning systems from benchmarks to real life. In simple problems such as image classification, crowdsourcing has become one of the standard tools for cheap and time-efficient data collection: thanks in large part to advances in research on aggregation methods. However, the applicability of crowdsourcing to more complex tasks (… ▽ More

    Submitted 20 October, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

  6. arXiv:2011.07980  [pdf, other

    q-bio.QM cs.LG

    Spherical convolutions on molecular graphs for protein model quality assessment

    Authors: Ilia Igashov, Nikita Pavlichenko, Sergei Grudinin

    Abstract: Processing information on 3D objects requires methods stable to rigid-body transformations, in particular rotations, of the input data. In image processing tasks, convolutional neural networks achieve this property using rotation-equivariant operations. However, contrary to images, graphs generally have irregular topology. This makes it challenging to define a rotation-equivariant convolution oper… ▽ More

    Submitted 6 January, 2021; v1 submitted 16 November, 2020; originally announced November 2020.