Showing 1–5 of 5 results for author: Petrakov, S

  1. arXiv:2406.15627  [pdf, other]

    cs.CL cs.LG

    Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph

    Authors: Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev, Akim Tsvigun, Daniil Vasilev, Rui Xing, Abdelrahman Boda Sadallah, Lyudmila Rvanova, Sergey Petrakov, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov, Artem Shelmanov

    Abstract: Uncertainty quantification (UQ) is becoming increasingly recognized as a critical component of applications that rely on machine learning (ML). The rapid proliferation of large language models (LLMs) has stimulated researchers to seek efficient and effective approaches to UQ in text generation tasks, as in addition to their emerging capabilities, these models have introduced new challenges for bui…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev contributed equally

  2. arXiv:2404.06137  [pdf, other]

    cs.CL cs.AI

    SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection

    Authors: Elisei Rykov, Yana Shishkina, Kseniia Petrushina, Kseniia Titova, Sergey Petrakov, Alexander Panchenko

    Abstract: In this paper, we present our novel systems developed for the SemEval-2024 hallucination detection task. Our investigation spans a range of strategies to compare model predictions with reference standards, encompassing diverse baselines, the refinement of pre-trained encoders through supervised learning, and ensemble approaches utilizing several high-performing models. Through these exploration…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 12 pages, 10 tables, 3 figures

  3. arXiv:2403.04696  [pdf, other]

    cs.CL cs.AI cs.LG

    Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification

    Authors: Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov

    Abstract: Large language models (LLMs) are notorious for hallucinating, i.e., producing erroneous claims in their output. Such hallucinations can be dangerous, as occasional factual inaccuracies in the generated text might be obscured by the rest of the output being generally factually correct, making it extremely hard for the users to spot them. Current services that leverage LLMs usually do not provide an…

    Submitted 6 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted to ACL-2024 (Findings). Ekaterina Fadeeva, Aleksandr Rubashevskii, and Artem Shelmanov contributed equally

  4. arXiv:2311.07383  [pdf, other]

    cs.CL cs.LG

    LM-Polygraph: Uncertainty Estimation for Language Models

    Authors: Ekaterina Fadeeva, Roman Vashurin, Akim Tsvigun, Artem Vazhentsev, Sergey Petrakov, Kirill Fedyanin, Daniil Vasilev, Elizaveta Goncharova, Alexander Panchenko, Maxim Panov, Timothy Baldwin, Artem Shelmanov

    Abstract: Recent advancements in the capabilities of large language models (LLMs) have paved the way for a myriad of groundbreaking applications in various fields. However, a significant challenge arises as these models often "hallucinate", i.e., fabricate facts without providing users an apparent means to discern the veracity of their statements. Uncertainty estimation (UE) methods are one path to safer, m…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP-2023

  5. arXiv:2212.14246  [pdf, other]

    cs.LG

    Robust representations of oil wells' intervals via sparse attention mechanism

    Authors: Alina Ermilova, Nikita Baramiia, Valerii Kornilov, Sergey Petrakov, Alexey Zaytsev

    Abstract: Transformer-based neural network architectures achieve state-of-the-art results in different domains, from natural language processing (NLP) to computer vision (CV). The key idea of Transformers, the attention mechanism, has already led to significant breakthroughs in many areas. Attention has been applied to time series data as well. However, due to the quadratic complexity of…

    Submitted 6 November, 2023; v1 submitted 29 December, 2022; originally announced December 2022.