Skip to main content

Showing 1–8 of 8 results for author: Kushnareva, L

.
  1. arXiv:2406.15035  [pdf, other

    cs.CV

    Improving Interpretability and Robustness for the Detection of AI-Generated Images

    Authors: Tatiana Gaintseva, Laida Kushnareva, German Magai, Irina Piontkovskaya, Sergey Nikolenko, Martin Benning, Serguei Barannikov, Gregory Slabaugh

    Abstract: With growing abilities of generative models, artificial content detection becomes an increasingly important and difficult task. However, all popular approaches to this problem suffer from poor generalization across domains and generative models. In this work, we focus on the robustness of AI-generated image (AIGI) detectors. We analyze existing state-of-the-art AIGI detection methods based on froz… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2311.08349  [pdf, other

    cs.CL

    AI-generated text boundary detection with RoFT

    Authors: Laida Kushnareva, Tatiana Gaintseva, German Magai, Serguei Barannikov, Dmitry Abulkhanov, Kristian Kuznetsov, Eduard Tulchinskii, Irina Piontkovskaya, Sergey Nikolenko

    Abstract: Due to the rapid development of large language models, people increasingly often encounter texts that may start as written by a human but continue as machine-generated. Detecting the boundary between human-written and machine-generated parts of such texts is a challenging problem that has not received much attention in literature. We attempt to bridge this gap and examine several ways to adapt sta… ▽ More

    Submitted 2 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  3. arXiv:2306.04723  [pdf, other

    cs.CL cs.AI cs.LG math.AT

    Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts

    Authors: Eduard Tulchinskii, Kristian Kuznetsov, Laida Kushnareva, Daniil Cherniavskii, Serguei Barannikov, Irina Piontkovskaya, Sergey Nikolenko, Evgeny Burnaev

    Abstract: Rapidly increasing quality of AI-generated content makes it difficult to distinguish between human and AI-generated texts, which may lead to undesirable consequences for society. Therefore, it becomes increasingly important to study the properties of human texts that are invariant over different text domains and varying proficiency of human writers, can be easily calculated for any language, and c… ▽ More

    Submitted 31 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    MSC Class: 68T50

  4. arXiv:2211.17223  [pdf, other

    cs.SD cs.CL cs.LG eess.AS math.AT

    Topological Data Analysis for Speech Processing

    Authors: Eduard Tulchinskii, Kristian Kuznetsov, Laida Kushnareva, Daniil Cherniavskii, Serguei Barannikov, Irina Piontkovskaya, Sergey Nikolenko, Evgeny Burnaev

    Abstract: We apply topological data analysis (TDA) to speech classification problems and to the introspection of a pretrained speech model, HuBERT. To this end, we introduce a number of topological and algebraic features derived from Transformer attention maps and embeddings. We show that a simple linear classifier built on top of such features outperforms a fine-tuned classification head. In particular, we… ▽ More

    Submitted 6 June, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: Accepted to INTERSPEECH 2023 conference

    Journal ref: Proc. INTERSPEECH 2023, pages 311--315

  5. arXiv:2207.01903  [pdf, other

    cs.CL

    Betti numbers of attention graphs is all you really need

    Authors: Laida Kushnareva, Dmitri Piontkovski, Irina Piontkovskaya

    Abstract: We apply methods of topological analysis to the attention graphs, calculated on the attention heads of the BERT model ( arXiv:1810.04805v2 ). Our research shows that the classifier built upon basic persistent topological features (namely, Betti numbers) of the trained neural network can achieve classification results on par with the conventional classification method. We show the relevance of such… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: This short paper was submitted to "Topological Data Analysis and Beyond" Workshop at NeurIPS 2020 at July 2020, but wasn't accepted. Later the ideas from this short paper found a rich development in arXiv:2109.04825 and arXiv:2205.09630

  6. arXiv:2205.09630  [pdf, other

    cs.CL cs.AI cs.LG math.AT

    Acceptability Judgements via Examining the Topology of Attention Maps

    Authors: Daniil Cherniavskii, Eduard Tulchinskii, Vladislav Mikhailov, Irina Proskurina, Laida Kushnareva, Ekaterina Artemova, Serguei Barannikov, Irina Piontkovskaya, Dmitri Piontkovski, Evgeny Burnaev

    Abstract: The role of the attention mechanism in encoding linguistic knowledge has received special interest in NLP. However, the ability of the attention heads to judge the grammatical acceptability of a sentence has been underexplored. This paper approaches the paradigm of acceptability judgments with topological data analysis (TDA), showing that the geometric properties of the attention graph can be effi… ▽ More

    Submitted 23 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted to EMNLP 2022 Findings

    Journal ref: Findings of the Association for Computational Linguistics: EMNLP 2022, 88-107

  7. Artificial Text Detection via Examining the Topology of Attention Maps

    Authors: Laida Kushnareva, Daniil Cherniavskii, Vladislav Mikhailov, Ekaterina Artemova, Serguei Barannikov, Alexander Bernstein, Irina Piontkovskaya, Dmitri Piontkovski, Evgeny Burnaev

    Abstract: The impressive capabilities of recent generative models to create texts that are challenging to distinguish from the human-written ones can be misused for generating fake news, product reviews, and even abusive content. Despite the prominent performance of existing methods for artificial text detection, they still lack interpretability and robustness towards unseen models. To this end, we propose… ▽ More

    Submitted 28 April, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021

    Journal ref: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 635-649

  8. arXiv:2010.05007  [pdf, other

    cs.LG cs.CV

    Category-Learning with Context-Augmented Autoencoder

    Authors: Denis Kuzminykh, Laida Kushnareva, Timofey Grigoryev, Alexander Zatolokin

    Abstract: Finding an interpretable non-redundant representation of real-world data is one of the key problems in Machine Learning. Biological neural networks are known to solve this problem quite well in unsupervised manner, yet unsupervised artificial neural networks either struggle to do it or require fine tuning for each task individually. We associate this with the fact that a biological brain learns in… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: 11 pages, 12 figures

    Journal ref: Information Technologies and Computing Systems 3/2020, pp. 30-39