Skip to main content

Showing 1–32 of 32 results for author: Malykh, V

.
  1. arXiv:2406.08215  [pdf, other

    cs.CL

    SumHiS: Extractive Summarization Exploiting Hidden Structure

    Authors: Tikhonov Pavel, Anastasiya Ianina, Valentin Malykh

    Abstract: Extractive summarization is a task of highlighting the most important parts of the text. We introduce a new approach to extractive summarization task using hidden clustering structure of the text. Experimental results on CNN/DailyMail demonstrate that our approach generates more accurate summaries than both extractive and abstractive methods, achieving state-of-the-art results in terms of ROUGE-2… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2310.07008  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Answer Candidate Type Selection: Text-to-Text Language Model for Closed Book Question Answering Meets Knowledge Graphs

    Authors: Mikhail Salnikov, Maria Lysyuk, Pavel Braslavski, Anton Razzhigaev, Valentin Malykh, Alexander Panchenko

    Abstract: Pre-trained Text-to-Text Language Models (LMs), such as T5 or BART yield promising results in the Knowledge Graph Question Answering (KGQA) task. However, the capacity of the models is limited and the quality decreases for questions with less popular entities. In this paper, we present a novel approach which works on top of the pre-trained Text-to-Text QA system to address this issue. Our simple y… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  3. arXiv:2310.02166  [pdf, other

    cs.CL

    Large Language Models Meet Knowledge Graphs to Answer Factoid Questions

    Authors: Mikhail Salnikov, Hai Le, Prateek Rajput, Irina Nikishina, Pavel Braslavski, Valentin Malykh, Alexander Panchenko

    Abstract: Recently, it has been shown that the incorporation of structured knowledge into Large Language Models significantly improves the results for a variety of NLP tasks. In this paper, we propose a method for exploring pre-trained Text-to-Text Language Models enriched with additional information from Knowledge Graphs for answering factoid questions. More specifically, we propose an algorithm for subgra… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  4. arXiv:2305.11626  [pdf, other

    cs.CL cs.SE

    CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search

    Authors: Nikita Sorokin, Dmitry Abulkhanov, Sergey Nikolenko, Valentin Malykh

    Abstract: We consider the clone detection and information retrieval problems for source code, well-known tasks important for any programming language. Although it is also an important and interesting problem to find code snippets that operate identically but are written in different programming languages, to the best of our knowledge multilingual clone detection has not been studied in literature. In this w… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  5. arXiv:2305.11625  [pdf, other

    cs.CL cs.SE

    Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets

    Authors: Ivan Sedykh, Dmitry Abulkhanov, Nikita Sorokin, Sergey Nikolenko, Valentin Malykh

    Abstract: Code search is an important and well-studied task, but it usually means searching for code by a text query. We argue that using a code snippet (and possibly an error traceback) as a query while looking for bugfixing instructions and code samples is a natural use case not covered by prior art. Moreover, existing datasets use code comments rather than full-text descriptions as text, making them unsu… ▽ More

    Submitted 27 May, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: COLING 2024

  6. arXiv:2206.12514  [pdf, other

    cs.CL

    DetIE: Multilingual Open Information Extraction Inspired by Object Detection

    Authors: Michael Vasilkovsky, Anton Alekseev, Valentin Malykh, Ilya Shenbin, Elena Tutubalina, Dmitriy Salikhov, Mikhail Stepnov, Andrey Chertok, Sergey Nikolenko

    Abstract: State of the art neural methods for open information extraction (OpenIE) usually extract triplets (or tuples) iteratively in an autoregressive or predicate-based manner in order not to produce duplicates. In this work, we propose a different approach to the problem that can be equally or more successful. Namely, we present a novel single-pass method for OpenIE inspired by object detection algorith… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  7. arXiv:2206.10914  [pdf, other

    cs.CL

    Template-based Approach to Zero-shot Intent Recognition

    Authors: Dmitry Lamanov, Pavel Burnyshev, Ekaterina Artemova, Valentin Malykh, Andrey Bout, Irina Piontkovskaya

    Abstract: The recent advances in transfer learning techniques and pre-training of large contextualized encoders foster innovation in real-life applications, including dialog assistants. Practical needs of intent recognition require effective data usage and the ability to constantly update supported intents, adopting new ones, and abandoning outdated ones. In particular, the generalized zero-shot paradigm, i… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: accepted to INLG 2022

  8. arXiv:2205.01245  [pdf, ps, other

    cond-mat.quant-gas physics.atom-ph

    Mass-ratio condition for non-binding of three two-component particles with contact interactions

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: Binding of two heavy fermions interacting with a light particle via the contact interaction is possible only for sufficiently large heavy-light mass ratio. In this work, the two-variable inequality is derived to determine a specific value $ μ^* $ providing that there are no three-body bound states for the mass ratio smaller than $ μ^* $. The value $ μ^* = 5.26 $ is obtained by analyzing this inequ… ▽ More

    Submitted 23 June, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: 11 pages, 1 figure

    MSC Class: 70F07; 34L15

    Journal ref: Eur. Phys. J. Plus 138, 147 (2023)

  9. arXiv:2204.11997  [pdf, ps, other

    physics.atom-ph cond-mat.quant-gas

    Minlos-Faddeev regularization of zero-range interactions in the three-body problem

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: To regularize the three-body problem, Minlos and Faddeev suggested a modification of zero-range model, which diminishes interaction at the triple-collision point. The analysis reveals that this regularization results in four alternatives depending on the regularization parameter $ σ$. Explicitly, Efimov or Thomas effects remain for $ σ< σ_c $, the additional boundary conditions of two types should… ▽ More

    Submitted 16 June, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: 12 pages, 3 figures. Accepted in JETP Letters

    MSC Class: 70F07; 81Vxx

    Journal ref: JOURNAL OF EXPERIMENTAL AND THEORETICAL PHYSICS LETTERS 116 (2022) 179-184

  10. arXiv:2204.11104  [pdf, other

    cs.CL

    WikiMulti: a Corpus for Cross-Lingual Summarization

    Authors: Pavel Tikhonov, Valentin Malykh

    Abstract: Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. We introduce WikiMulti - a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our datas… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

  11. arXiv:2202.07791  [pdf, other

    cs.CL cs.AI

    Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP models

    Authors: Alena Fenogenova, Maria Tikhonova, Vladislav Mikhailov, Tatiana Shavrina, Anton Emelyanov, Denis Shevelev, Alexandr Kukushkin, Valentin Malykh, Ekaterina Artemova

    Abstract: In the last year, new neural architectures and multilingual pre-trained models have been released for Russian, which led to performance evaluation problems across a range of language understanding tasks. This paper presents Russian SuperGLUE 1.1, an updated benchmark styled after GLUE for Russian NLP models. The new version includes a number of technical, user experience and methodological impro… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: Computational Linguistics and Intellectual Technologies Papers from the Annual International Conference "Dialogue" (2021) Issue 20

    MSC Class: 68-06; 68T50; 68T01 ACM Class: G.3; I.2.7

  12. arXiv:2108.06991  [pdf, other

    cs.CL

    A Single Example Can Improve Zero-Shot Data Generation

    Authors: Pavel Burnyshev, Valentin Malykh, Andrey Bout, Ekaterina Artemova, Irina Piontkovskaya

    Abstract: Sub-tasks of intent classification, such as robustness to distribution shift, adaptation to specific user groups and personalization, out-of-domain detection, require extensive and flexible datasets for experiments and evaluation. As collecting such datasets is time- and labor-consuming, we propose to use text generation methods to gather datasets. The generator should be trained to generate utter… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: To appear in INLG2021 proceedings

  13. arXiv:2104.14314  [pdf, other

    cs.CL

    MOROCCO: Model Resource Comparison Framework

    Authors: Valentin Malykh, Alexander Kukushkin, Ekaterina Artemova, Vladislav Mikhailov, Maria Tikhonova, Tatiana Shavrina

    Abstract: The new generation of pre-trained NLP models push the SOTA to the new limits, but at the cost of computational resources, to the point that their use in real production environments is often prohibitively expensive. We tackle this problem by evaluating not only the standard quality metrics on downstream tasks but also the memory footprint and inference time. We present MOROCCO, a framework to comp… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  14. RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark

    Authors: Tatiana Shavrina, Alena Fenogenova, Anton Emelyanov, Denis Shevelev, Ekaterina Artemova, Valentin Malykh, Vladislav Mikhailov, Maria Tikhonova, Andrey Chertok, Andrey Evlampiev

    Abstract: In this paper, we introduce an advanced Russian general language understanding evaluation benchmark -- RussianGLUE. Recent advances in the field of universal language models and transformers require the development of a methodology for their broad diagnostics and testing for general intellectual skills - detection of natural language inference, commonsense reasoning, ability to perform simple logi… ▽ More

    Submitted 2 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: to appear in EMNLP 2020

  15. Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification

    Authors: Anton Alekseev, Elena Tutubalina, Valentin Malykh, Sergey Nikolenko

    Abstract: Deep learning architectures based on self-attention have recently achieved and surpassed state of the art results in the task of unsupervised aspect extraction and topic modeling. While models such as neural attention-based aspect extraction (ABAE) have been successfully applied to user-generated texts, they are less coherent when applied to traditional data sources such as news articles and newsg… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: Journal of Intelligent & Fuzzy Systems, pre-press, https://content.iospress.com/articles/journal-of-intelligent-and-fuzzy-systems/ifs179908

  16. The Russian Drug Reaction Corpus and Neural Models for Drug Reactions and Effectiveness Detection in User Reviews

    Authors: Elena Tutubalina, Ilseyar Alimova, Zulfat Miftahutdinov, Andrey Sakhovskiy, Valentin Malykh, Sergey Nikolenko

    Abstract: The Russian Drug Reaction Corpus (RuDReC) is a new partially annotated corpus of consumer reviews in Russian about pharmaceutical products for the detection of health-related named entities and the effectiveness of pharmaceutical products. The corpus itself consists of two parts, the raw one and the labelled one. The raw part includes 1.4 million health-related user-generated texts collected from… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: 9 pages, 9 tables, 4 figures

    Journal ref: Bioinformatics, 2020

  17. RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback

    Authors: Ilya Shenbin, Anton Alekseev, Elena Tutubalina, Valentin Malykh, Sergey I. Nikolenko

    Abstract: Recent research has shown the advantages of using autoencoders based on deep neural networks for collaborative filtering. In particular, the recently proposed Mult-VAE model, which used the multinomial likelihood variational autoencoders, has shown excellent results for top-N recommendations. In this work, we propose the Recommender VAE (RecVAE) model that originates from our research on regulariz… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: In The Thirteenth ACM International Conference on Web Search and Data Mining (WSDM '20), February 3-7, 2020, Houston, TX, USA. ACM, New York, NY, USA, 9 pages

  18. arXiv:1904.04943  [pdf, ps, other

    cond-mat.quant-gas physics.atom-ph

    Three two-component fermions with contact interactions: correct formulation and energy spectrum

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: Properties of two identical particles of mass $m$ and a distinct particle of mass $m_1$ in the universal low-energy limit of zero-range two-body interaction are studied in different sectors of total angular momentum $L$ and parity $P$. For the unambiguous formulation of the problem in the interval $μ_r(L^P) < m/m_1 \le μ_c(L^P)$ ($μ_r(1^-) \approx 8.619$ and $μ_c(1^-) \approx 13.607$,… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: 32 pages, 14 figures

  19. arXiv:1902.00098  [pdf, other

    cs.AI cs.CL cs.HC

    The Second Conversational Intelligence Challenge (ConvAI2)

    Authors: Emily Dinan, Varvara Logacheva, Valentin Malykh, Alexander Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, Alan W Black, Alexander Rudnicky, Jason Williams, Joelle Pineau, Mikhail Burtsev, Jason Weston

    Abstract: We describe the setting and results of the ConvAI2 NeurIPS competition that aims to further the state-of-the-art in open-domain chatbots. Some key takeaways from the competition are: (i) pretrained Transformer variants are currently the best performing models on this task, (ii) but to improve performance on multi-turn conversations with humans, future systems must go beyond single word metrics lik… ▽ More

    Submitted 31 January, 2019; originally announced February 2019.

  20. arXiv:1901.07829  [pdf, other

    cs.CL cs.AI

    AspeRa: Aspect-based Rating Prediction Model

    Authors: Sergey I. Nikolenko, Elena Tutubalina, Valentin Malykh, Ilya Shenbin, Anton Alekseev

    Abstract: We propose a novel end-to-end Aspect-based Rating Prediction model (AspeRa) that estimates user rating based on review texts for the items and at the same time discovers coherent aspects of reviews that can be used to explain predictions or profile users. The AspeRa model uses max-margin losses for joint item and user embedding learning and a dual-headed architecture; it significantly outperforms… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: accepted to ECIR 2019

  21. arXiv:1901.07786  [pdf, ps, other

    cs.CL cs.AI

    Self-Attentive Model for Headline Generation

    Authors: Daniil Gavrilov, Pavel Kalaidin, Valentin Malykh

    Abstract: Headline generation is a special type of text summarization task. While the amount of available training data for this task is almost unlimited, it still remains challenging, as learning to generate headlines for news articles implies that the model has strong reasoning about natural language. To overcome this issue, we applied recent Universal Transformer architecture paired with byte-pair encodi… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: accepted for ECIR 2019

  22. Sequence Learning with RNNs for Medical Concept Normalization in User-Generated Texts

    Authors: Elena Tutubalina, Zulfat Miftahutdinov, Sergey Nikolenko, Valentin Malykh

    Abstract: In this work, we consider the medical concept normalization problem, i.e., the problem of map** a disease mention in free-form text to a concept in a controlled vocabulary, usually to the standard thesaurus in the Unified Medical Language System (UMLS). This task is challenging since medical terminology is very different when coming from health care professionals or from the general public in th… ▽ More

    Submitted 29 November, 2018; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/117

    Journal ref: Journal of Biomedical Informatics. - 2018. - Vol.84, Is.. - P.93-102

  23. arXiv:1512.06786  [pdf, ps, other

    cond-mat.quant-gas

    Universal description of three two-component fermions

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: A quantum mechanical three-body problem for two identical fermions of mass $m$ and a distinct particle of mass $m_1$ in the universal limit of zero-range two-body interaction is studied. For the unambiguous formulation of the problem in the interval $μ_r < m/m_1 \le μ_c$ ($μ_r \approx 8.619$ and $μ_c \approx 13.607$) an additional parameter $b$ determining the wave function near the triple-collisi… ▽ More

    Submitted 25 January, 2016; v1 submitted 18 December, 2015; originally announced December 2015.

    Report number: INT-PUB-15-078

    Journal ref: EPL 115, 36005 (2016)

  24. arXiv:1211.5557  [pdf, ps, other

    physics.atom-ph cond-mat.quant-gas

    Recent advances in description of few two-component fermions

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: Overview of the recent advances in description of the few two-component fermions is presented. The model of zero-range interaction is generally considered to discuss the principal aspects of the few-body dynamics. Particular attention is paid to detailed description of two identical fermions of mass $m$ and a distinct particle of mass $m_1$: it turns out that two $L^P = 1^-$ three-body bound state… ▽ More

    Submitted 25 January, 2013; v1 submitted 23 November, 2012; originally announced November 2012.

    Comments: 16 pages, 1 figure

    Journal ref: Yad. Fiz. 77, N4, 458-465 (2014)

  25. arXiv:1009.1726  [pdf, ps, other

    nucl-th astro-ph.SR

    Consistent alpha-cluster description of the 12C (0^+_2) resonance

    Authors: S. I. Fedotov, O. I. Kartavtsev, A. V. Malykh

    Abstract: The near-threshold 12C (0^+_2) resonance provides unique possibility for fast helium burning in stars, as predicted by Hoyle to explain the observed abundance of elements in the Universe. Properties of this resonance are calculated within the framework of the alpha-cluster model whose two-body and three-body effective potentials are tuned to describe the alpha - alpha scattering data, the energies… ▽ More

    Submitted 9 September, 2010; originally announced September 2010.

    Journal ref: Pisma Zh.Eksp.Teor.Fiz.92:715-719,2010

  26. Bound states and scattering lengths of three two-component particles with zero-range interactions under one-dimensional confinement

    Authors: O. I. Kartavtsev, A. V. Malykh, S. A. Sofianos

    Abstract: The universal three-body dynamics in ultra-cold binary gases confined to one-dimensional motion are studied. The three-body binding energies and the (2 + 1)-scattering lengths are calculated for two identical particles of mass $m$ and a different one of mass $m_1$, which interactions is described in the low-energy limit by zero-range potentials. The critical values of the mass ratio $m/m_1$, at… ▽ More

    Submitted 24 October, 2008; v1 submitted 20 August, 2008; originally announced August 2008.

    Journal ref: ZhETF 135, 419 (2009)

  27. Universal description of the rotational-vibrational spectrum of three particles with zero-range interactions

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: A comprehensive universal description of the rotational-vibrational spectrum for two identical particles of mass $m$ and the third particle of the mass $m_1$ in the zero-range limit of the interaction between different particles is given for arbitrary values of the mass ratio $m/m_1$ and the total angular momentum $L$. If the two-body scattering length is positive, a number of vibrational states… ▽ More

    Submitted 19 October, 2007; v1 submitted 26 September, 2007; originally announced September 2007.

    Journal ref: Pis'ma ZhETF 86, 713 (2007)

  28. Low-energy three-body dynamics in binary quantum gases

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: The universal three-body dynamics in ultra-cold binary Fermi and Fermi-Bose mixtures is studied. Two identical fermions of the mass $m$ and a particle of the mass $m_1$ with the zero-range two-body interaction in the states of the total angular momentum L=1 are considered. Using the boundary condition model for the s-wave interaction of different particles, both eigenvalue and scattering problem… ▽ More

    Submitted 28 October, 2006; originally announced October 2006.

    Comments: 16 pages

    Journal ref: J. Phys. B: At. Mol. Opt. Phys. 40 (2007) 1429-1441

  29. Universal low-energy properties of three two-dimensional particles

    Authors: O. I. Kartavtsev, A. V. Malykh

    Abstract: Universal low-energy properties are studied for three identical bosons confined in two dimensions. The short-range pair-wise interaction in the low-energy limit is described by means of the boundary condition model. The wave function is expanded in a set of eigenfunctions on the hypersphere and the system of hyper-radial equations is used to obtain analytical and numerical results. Within the fr… ▽ More

    Submitted 1 June, 2006; originally announced June 2006.

    Comments: 30 pages with 13 figures

    Journal ref: Phys. Rev. A74 (2006) 042506

  30. Effective three-body interactions in the alpha-cluster model for the ^{12}C nucleus

    Authors: S. I. Fedotov, O. I. Kartavtsev, A. V. Malykh

    Abstract: Properties of the lowest $0^{+}$ states of $^{12}\mathrm{C}$ are calculated to study the role of three-body interactions in the $α$-cluster model. An additional short-range part of the local three-body potential is introduced to incorporate the effects beyond the $α$-cluster model. There is enough freedom in this potential to reproduce the experimental values of the ground-state and excited-stat… ▽ More

    Submitted 9 September, 2005; originally announced September 2005.

    Journal ref: Eur.Phys.J. A26 (2005) 201-207

  31. Three-alpha-cluster structure of the 0^+ states in ^{12}C and the effective alpha-alpha interactions

    Authors: S. I. Fedotov, O. I. Kartavtsev, V. I. Kochkin, A. V. Malykh

    Abstract: The $0^{+}$ states of $^{12}\mathrm{C}$ are considered within the framework of the microscopic three-$α$-cluster model. The main attention is paid to accurate calculation of the width of the extremely narrow near-threshold $0^+_2$ state which plays a key role in stellar nucleosynthesis. It is shown that the $0^{+}_2$-state decays by means of the sequential mechanism… ▽ More

    Submitted 2 June, 2004; v1 submitted 7 April, 2004; originally announced April 2004.

    Journal ref: Phys.Rev. C70 (2004) 014006

  32. Effect of dtμquasi-nucleus structure on energy levels of the (dtμ)Xee exotic molecule

    Authors: O. I. Kartavtsev, A. V. Malykh, V. P. Permyakov

    Abstract: Precise energies of rovibrational states of the exotic hydrogen-like molecule $(dtμ)Xee$ are of importance for $dtμ$ resonant formation, which is a key process in the muon-catalyzed fusion cycle. The effect of the internal structure and motion of the $dtμ$ quasi-nucleus on energy levels is studied using the three-body description of the $(dtμ)Xee$ molecule based on the hierarchy of scales and co… ▽ More

    Submitted 31 March, 2004; v1 submitted 24 March, 2004; originally announced March 2004.

    Journal ref: Phys.Rev. A70 (2004) 022504