Showing 1–4 of 4 results for author: Burnyshev, P

  1. arXiv:2309.06527  [pdf, other]

    cs.CL cs.CR cs.LG

    Machine Translation Models Stand Strong in the Face of Adversarial Attacks

    Authors: Pavel Burnyshev, Elizaveta Kostenok, Alexey Zaytsev

    Abstract: Adversarial attacks expose vulnerabilities of deep learning models by introducing minor perturbations to the input, which lead to substantial alterations in the output. Our research focuses on the impact of such adversarial attacks on sequence-to-sequence (seq2seq) models, specifically machine translation models. We introduce algorithms that incorporate basic text perturbation heuristics and more…

    Submitted 10 September, 2023; originally announced September 2023.

    Journal ref: AIST-2023

  2. arXiv:2206.10914  [pdf, other]

    cs.CL

    Template-based Approach to Zero-shot Intent Recognition

    Authors: Dmitry Lamanov, Pavel Burnyshev, Ekaterina Artemova, Valentin Malykh, Andrey Bout, Irina Piontkovskaya

    Abstract: The recent advances in transfer learning techniques and pre-training of large contextualized encoders foster innovation in real-life applications, including dialog assistants. Practical needs of intent recognition require effective data usage and the ability to constantly update supported intents, adopting new ones, and abandoning outdated ones. In particular, the generalized zero-shot paradigm, i…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: accepted to INLG 2022

  3. arXiv:2108.06991  [pdf, other]

    cs.CL

    A Single Example Can Improve Zero-Shot Data Generation

    Authors: Pavel Burnyshev, Valentin Malykh, Andrey Bout, Ekaterina Artemova, Irina Piontkovskaya

    Abstract: Sub-tasks of intent classification, such as robustness to distribution shift, adaptation to specific user groups and personalization, out-of-domain detection, require extensive and flexible datasets for experiments and evaluation. As collecting such datasets is time- and labor-consuming, we propose to use text generation methods to gather datasets. The generator should be trained to generate utter…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: To appear in INLG2021 proceedings

  4. arXiv:2107.11275  [pdf, other]

    cs.CL cs.LG

    A Differentiable Language Model Adversarial Attack on Text Classifiers

    Authors: Ivan Fursov, Alexey Zaytsev, Pavel Burnyshev, Ekaterina Dmitrieva, Nikita Klyuchnikov, Andrey Kravchenko, Ekaterina Artemova, Evgeny Burnaev

    Abstract: Robustness of huge Transformer-based models for natural language processing is an important issue due to their capabilities and wide adoption. One way to understand and improve robustness of these models is an exploration of an adversarial attack scenario: check if a small perturbation of an input can fool a model. Due to the discrete nature of textual data, gradient-based adversarial methods, w…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2006.11078