Skip to main content

Showing 1–2 of 2 results for author: Vorontsov, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10402  [pdf, other

    cs.CL cs.IR math.PR

    Determination of the Number of Topics Intrinsically: Is It Possible?

    Authors: Victor Bulatov, Vasiliy Alekseev, Konstantin Vorontsov

    Abstract: The number of topics might be the most important parameter of a topic model. The topic modelling community has developed a set of various procedures to estimate the number of topics in a dataset, but there has not yet been a sufficiently complete comparison of existing practices. This study attempts to partially fill this gap by investigating the performance of various methods applied to several t… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: This is the first full draft version of the article. The camera-ready version was accepted at the 11th International Conference on Analysis of Images, Social Networks and Texts (AIST 2023). Presented on September 30, 2023. Expected to be published in the conference proceedings, as part of the Communications in Computer and Information Science series (CCIS, Vol. 1905)

  2. arXiv:1711.04154  [pdf, other

    cs.CL

    Interpretable probabilistic embeddings: bridging the gap between topic models and neural networks

    Authors: Anna Potapenko, Artem Popov, Konstantin Vorontsov

    Abstract: We consider probabilistic topic models and more recent word embedding techniques from a perspective of learning hidden semantic representations. Inspired by a striking similarity of the two approaches, we merge them and learn probabilistic embeddings with online EM-algorithm on word co-occurrence data. The resulting embeddings perform on par with Skip-Gram Negative Sampling (SGNS) on word similari… ▽ More

    Submitted 11 November, 2017; originally announced November 2017.

    Comments: Appeared in AINL-2017