Showing 1–16 of 16 results for author: Piontkovskaya, I

  1. arXiv:2406.15035  [pdf, other]

    cs.CV

    Improving Interpretability and Robustness for the Detection of AI-Generated Images

    Authors: Tatiana Gaintseva, Laida Kushnareva, German Magai, Irina Piontkovskaya, Sergey Nikolenko, Martin Benning, Serguei Barannikov, Gregory Slabaugh

    Abstract: With growing abilities of generative models, artificial content detection becomes an increasingly important and difficult task. However, all popular approaches to this problem suffer from poor generalization across domains and generative models. In this work, we focus on the robustness of AI-generated image (AIGI) detectors. We analyze existing state-of-the-art AIGI detection methods based on froz…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2311.11813  [pdf, other]

    cs.CL

    Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule

    Authors: Andrey Bout, Alexander Podolskiy, Sergey Nikolenko, Irina Piontkovskaya

    Abstract: Progress in neural grammatical error correction (GEC) is hindered by the lack of annotated training data. Sufficient amounts of high-quality manually annotated data are not available, so recent research has relied on generating synthetic data, pretraining on it, and then fine-tuning on real datasets; performance gains have been achieved either by ensembling or by using huge pretrained models such…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  3. arXiv:2311.08349  [pdf, other]

    cs.CL

    AI-generated text boundary detection with RoFT

    Authors: Laida Kushnareva, Tatiana Gaintseva, German Magai, Serguei Barannikov, Dmitry Abulkhanov, Kristian Kuznetsov, Eduard Tulchinskii, Irina Piontkovskaya, Sergey Nikolenko

    Abstract: Due to the rapid development of large language models, people increasingly often encounter texts that may start as written by a human but continue as machine-generated. Detecting the boundary between human-written and machine-generated parts of such texts is a challenging problem that has not received much attention in literature. We attempt to bridge this gap and examine several ways to adapt sta…

    Submitted 2 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  4. arXiv:2311.08191  [pdf, other]

    cs.CL

    GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding

    Authors: Konstantin Yakovlev, Alexander Podolskiy, Andrey Bout, Sergey Nikolenko, Irina Piontkovskaya

    Abstract: Grammatical error correction (GEC) is an important NLP task that is currently usually solved with autoregressive sequence-to-sequence models. However, approaches of this class are inherently slow due to one-by-one token generation, so non-autoregressive alternatives are needed. In this work, we propose a novel non-autoregressive approach to GEC that decouples the architecture into a permutation ne…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: ACL 2023

  5. arXiv:2311.08143  [pdf, other]

    cs.CL

    Sinkhorn Transformations for Single-Query Postprocessing in Text-Video Retrieval

    Authors: Konstantin Yakovlev, Gregory Polyakov, Ilseyar Alimova, Alexander Podolskiy, Andrey Bout, Sergey Nikolenko, Irina Piontkovskaya

    Abstract: A recent trend in multimodal retrieval is related to postprocessing test set results via the dual-softmax loss (DSL). While this approach can bring significant improvements, it usually presumes that an entire matrix of test samples is available as DSL input. This work introduces a new postprocessing approach based on Sinkhorn transformations that outperforms DSL. Further, we propose a new postproc…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: SIGIR 2023

  6. arXiv:2306.04723  [pdf, other]

    cs.CL cs.AI cs.LG math.AT

    Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts

    Authors: Eduard Tulchinskii, Kristian Kuznetsov, Laida Kushnareva, Daniil Cherniavskii, Serguei Barannikov, Irina Piontkovskaya, Sergey Nikolenko, Evgeny Burnaev

    Abstract: Rapidly increasing quality of AI-generated content makes it difficult to distinguish between human and AI-generated texts, which may lead to undesirable consequences for society. Therefore, it becomes increasingly important to study the properties of human texts that are invariant over different text domains and varying proficiency of human writers, can be easily calculated for any language, and c…

    Submitted 31 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    MSC Class: 68T50

  7. Can BERT eat RuCoLA? Topological Data Analysis to Explain

    Authors: Irina Proskurina, Irina Piontkovskaya, Ekaterina Artemova

    Abstract: This paper investigates how Transformer language models (LMs) fine-tuned for acceptability classification capture linguistic features. Our approach uses the best practices of topological data analysis (TDA) in NLP: we construct directed attention graphs from attention matrices, derive topological features from them, and feed them to linear classifiers. We introduce two novel features, chordality,…

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted to the Workshop on Slavic NLP @ EACL 2023

  8. arXiv:2303.10845  [pdf, other]

    cs.CL

    PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

    Authors: Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao

    Abstract: The scaling of large language models has greatly improved natural language understanding, generation, and reasoning. In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors and MindSpore framework, and present the language model with 1.085T parameters named PanGu-Σ. With parameter inherent from PanGu-α, we extend the dense Transfo…

    Submitted 19 March, 2023; originally announced March 2023.

  9. arXiv:2211.17223  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS math.AT

    Topological Data Analysis for Speech Processing

    Authors: Eduard Tulchinskii, Kristian Kuznetsov, Laida Kushnareva, Daniil Cherniavskii, Serguei Barannikov, Irina Piontkovskaya, Sergey Nikolenko, Evgeny Burnaev

    Abstract: We apply topological data analysis (TDA) to speech classification problems and to the introspection of a pretrained speech model, HuBERT. To this end, we introduce a number of topological and algebraic features derived from Transformer attention maps and embeddings. We show that a simple linear classifier built on top of such features outperforms a fine-tuned classification head. In particular, we…

    Submitted 6 June, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: Accepted to INTERSPEECH 2023 conference

    Journal ref: Proc. INTERSPEECH 2023, pages 311--315

  10. arXiv:2207.01903  [pdf, other]

    cs.CL

    Betti numbers of attention graphs is all you really need

    Authors: Laida Kushnareva, Dmitri Piontkovski, Irina Piontkovskaya

    Abstract: We apply methods of topological analysis to the attention graphs, calculated on the attention heads of the BERT model (arXiv:1810.04805v2). Our research shows that the classifier built upon basic persistent topological features (namely, Betti numbers) of the trained neural network can achieve classification results on par with the conventional classification method. We show the relevance of such…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: This short paper was submitted to "Topological Data Analysis and Beyond" Workshop at NeurIPS 2020 at July 2020, but wasn't accepted. Later the ideas from this short paper found a rich development in arXiv:2109.04825 and arXiv:2205.09630

  11. arXiv:2206.10914  [pdf, other]

    cs.CL

    Template-based Approach to Zero-shot Intent Recognition

    Authors: Dmitry Lamanov, Pavel Burnyshev, Ekaterina Artemova, Valentin Malykh, Andrey Bout, Irina Piontkovskaya

    Abstract: The recent advances in transfer learning techniques and pre-training of large contextualized encoders foster innovation in real-life applications, including dialog assistants. Practical needs of intent recognition require effective data usage and the ability to constantly update supported intents, adopting new ones, and abandoning outdated ones. In particular, the generalized zero-shot paradigm, i…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: accepted to INLG 2022

  12. arXiv:2205.09630  [pdf, other]

    cs.CL cs.AI cs.LG math.AT

    Acceptability Judgements via Examining the Topology of Attention Maps

    Authors: Daniil Cherniavskii, Eduard Tulchinskii, Vladislav Mikhailov, Irina Proskurina, Laida Kushnareva, Ekaterina Artemova, Serguei Barannikov, Irina Piontkovskaya, Dmitri Piontkovski, Evgeny Burnaev

    Abstract: The role of the attention mechanism in encoding linguistic knowledge has received special interest in NLP. However, the ability of the attention heads to judge the grammatical acceptability of a sentence has been underexplored. This paper approaches the paradigm of acceptability judgments with topological data analysis (TDA), showing that the geometric properties of the attention graph can be effi…

    Submitted 23 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted to EMNLP 2022 Findings

    Journal ref: Findings of the Association for Computational Linguistics: EMNLP 2022, 88-107

  13. Artificial Text Detection via Examining the Topology of Attention Maps

    Authors: Laida Kushnareva, Daniil Cherniavskii, Vladislav Mikhailov, Ekaterina Artemova, Serguei Barannikov, Alexander Bernstein, Irina Piontkovskaya, Dmitri Piontkovski, Evgeny Burnaev

    Abstract: The impressive capabilities of recent generative models to create texts that are challenging to distinguish from the human-written ones can be misused for generating fake news, product reviews, and even abusive content. Despite the prominent performance of existing methods for artificial text detection, they still lack interpretability and robustness towards unseen models. To this end, we propose…

    Submitted 28 April, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021

    Journal ref: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 635-649

  14. arXiv:2108.06991  [pdf, other]

    cs.CL

    A Single Example Can Improve Zero-Shot Data Generation

    Authors: Pavel Burnyshev, Valentin Malykh, Andrey Bout, Ekaterina Artemova, Irina Piontkovskaya

    Abstract: Sub-tasks of intent classification, such as robustness to distribution shift, adaptation to specific user groups and personalization, out-of-domain detection, require extensive and flexible datasets for experiments and evaluation. As collecting such datasets is time- and labor-consuming, we propose to use text generation methods to gather datasets. The generator should be trained to generate utter…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: To appear in INLG2021 proceedings

  15. arXiv:2101.03778  [pdf, other]

    cs.CL cs.LG

    Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

    Authors: Alexander Podolskiy, Dmitry Lipin, Andrey Bout, Ekaterina Artemova, Irina Piontkovskaya

    Abstract: Real-life applications, heavily relying on machine learning, such as dialog systems, demand out-of-domain detection methods. Intent classification models should be equipped with a mechanism to distinguish seen intents from unseen ones so that the dialog agent is capable of rejecting the latter and avoiding undesired behavior. However, despite increasing attention paid to the task, the best practic…

    Submitted 23 May, 2022; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: AAAI 2021

  16. arXiv:1712.07473  [pdf, ps, other]

    cs.CL cs.CR cs.LG

    Differentially Private Distributed Learning for Language Modeling Tasks

    Authors: Vadim Popov, Mikhail Kudinov, Irina Piontkovskaya, Petr Vytovtov, Alex Nevidomsky

    Abstract: One of the big challenges in machine learning applications is that training data can be different from the real-world data faced by the algorithm. In language modeling, users' language (e.g. in private messaging) could change in a year and be completely different from what we observe in publicly available data. At the same time, public data can be used for obtaining general knowledge (i.e. general…

    Submitted 6 March, 2018; v1 submitted 20 December, 2017; originally announced December 2017.