Skip to main content

Showing 1–24 of 24 results for author: Bielikova, M

.
  1. arXiv:2406.12471  [pdf, other

    cs.CL

    Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation

    Authors: Branislav Pecher, Jan Cegin, Robert Belanec, Jakub Simko, Ivan Srba, Maria Bielikova

    Abstract: While fine-tuning of pre-trained language models generally helps to overcome the lack of labelled training samples, it also displays model performance instability. This instability mainly originates from randomness in initialisation or data shuffling. To address this, researchers either modify the training process or augment the available samples, which typically results in increased computational… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2402.12819  [pdf, other

    cs.CL cs.AI cs.LG

    Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even Performance

    Authors: Branislav Pecher, Ivan Srba, Maria Bielikova

    Abstract: When solving NLP tasks with limited labelled data, researchers can either use a general large language model without further update, or use a small number of labelled examples to tune a specialised smaller model. In this work, we address the research gap of how many labelled samples are required for the specialised small models to outperform general large models, while taking the performance varia… ▽ More

    Submitted 26 April, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  3. arXiv:2402.12817  [pdf, other

    cs.CL cs.AI cs.LG

    On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices

    Authors: Branislav Pecher, Ivan Srba, Maria Bielikova

    Abstract: While learning with limited labelled data can improve performance when the labels are lacking, it is also sensitive to the effects of uncontrolled randomness introduced by so-called randomness factors (e.g., varying order of data). We propose a method to systematically investigate the effects of randomness factors while taking the interactions between them into consideration. To measure the true e… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  4. arXiv:2402.03038  [pdf, other

    cs.LG cs.AI cs.CL

    Automatic Combination of Sample Selection Strategies for Few-Shot Learning

    Authors: Branislav Pecher, Ivan Srba, Maria Bielikova, Joaquin Vanschoren

    Abstract: In few-shot learning, such as meta-learning, few-shot fine-tuning or in-context learning, the limited number of samples used to train a model have a significant impact on the overall success. Although a large number of sample selection strategies exist, their impact on the performance of few-shot learning is not extensively known, as most of them have been so far evaluated in typical supervised se… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  5. arXiv:2401.07867  [pdf, other

    cs.CL

    Authorship Obfuscation in Multilingual Machine-Generated Text Detection

    Authors: Dominik Macko, Robert Moro, Adaku Uchendu, Ivan Srba, Jason Samuel Lucas, Michiharu Yamashita, Nafis Irtiza Tripto, Dongwon Lee, Jakub Simko, Maria Bielikova

    Abstract: High-quality text generation capability of recent Large Language Models (LLMs) causes concerns about their misuse (e.g., in massive generation/spread of disinformation). Machine-generated text (MGT) detection is important to cope with such threats. However, it is susceptible to authorship obfuscation (AO) methods, such as paraphrasing, which can cause MGTs to evade detection. So far, this was eval… ▽ More

    Submitted 18 June, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  6. arXiv:2401.06643  [pdf, other

    cs.CL

    Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation

    Authors: Jan Cegin, Branislav Pecher, Jakub Simko, Ivan Srba, Maria Bielikova, Peter Brusilovsky

    Abstract: The latest generative large language models (LLMs) have found their application in data augmentation tasks, where small numbers of text samples are LLM-paraphrased and then used to fine-tune downstream models. However, more research is needed to assess how different prompts, seed data selection strategies, filtering methods, or model settings affect the quality of paraphrased data (and downstream… ▽ More

    Submitted 15 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: 24 pages, updated with new experimets - Mistral as downstream task classifier and new method combination (of taboo and hints methods)

  7. arXiv:2312.01082  [pdf, other

    cs.LG cs.AI cs.CL

    On the Effects of Randomness on Stability of Learning with Limited Labelled Data: A Systematic Literature Review

    Authors: Branislav Pecher, Ivan Srba, Maria Bielikova

    Abstract: Learning with limited labelled data, such as few-shot learning, meta-learning or transfer learning, aims to effectively train a model using only small amount of labelled samples. However, these approaches were observed to be excessively sensitive to the effects of uncontrolled randomness caused by non-determinism in the training process. The randomness negatively affects the stability of the model… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  8. arXiv:2311.08838  [pdf, other

    cs.CL

    Disinformation Capabilities of Large Language Models

    Authors: Ivan Vykopal, Matúš Pikuliak, Ivan Srba, Robert Moro, Dominik Macko, Maria Bielikova

    Abstract: Automated disinformation generation is often listed as an important risk associated with large language models (LLMs). The theoretical ability to flood the information space with disinformation content might have dramatic consequences for societies around the world. This paper presents a comprehensive study of the disinformation capabilities of the current generation of LLMs to generate false news… ▽ More

    Submitted 23 February, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  9. MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark

    Authors: Dominik Macko, Robert Moro, Adaku Uchendu, Jason Samuel Lucas, Michiharu Yamashita, Matúš Pikuliak, Ivan Srba, Thai Le, Dongwon Lee, Jakub Simko, Maria Bielikova

    Abstract: There is a lack of research into capabilities of recent LLMs to generate convincing text in languages other than English and into performance of detectors of machine-generated text in multilingual settings. This is also reflected in the available benchmarks which lack authentic texts in languages other than English and predominantly cover older generators. To fill this gap, we introduce MULTITuDE,… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

  10. arXiv:2309.12325  [pdf

    cs.CY cs.AI cs.CV cs.LG

    FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare

    Authors: Karim Lekadir, Aasa Feragen, Abdul Joseph Fofanah, Alejandro F Frangi, Alena Buyx, Anais Emelie, Andrea Lara, Antonio R Porras, An-Wen Chan, Arcadi Navarro, Ben Glocker, Benard O Botwe, Bishesh Khanal, Brigit Beger, Carol C Wu, Celia Cintas, Curtis P Langlotz, Daniel Rueckert, Deogratias Mzurikwao, Dimitrios I Fotiadis, Doszhan Zhussupov, Enzo Ferrante, Erik Meijering, Eva Weicken, Fabio A González , et al. (93 additional authors not shown)

    Abstract: Despite major advances in artificial intelligence (AI) for medicine and healthcare, the deployment and adoption of AI technologies remain limited in real-world clinical practice. In recent years, concerns have been raised about the technical, clinical, ethical and legal risks associated with medical AI. To increase real world adoption, it is essential that medical AI tools are trusted and accepted… ▽ More

    Submitted 5 July, 2024; v1 submitted 11 August, 2023; originally announced September 2023.

    ACM Class: I.2.0; I.4.0; I.5.0

  11. Multilingual Previously Fact-Checked Claim Retrieval

    Authors: Matúš Pikuliak, Ivan Srba, Robert Moro, Timo Hromadka, Timotej Smolen, Martin Melisek, Ivan Vykopal, Jakub Simko, Juraj Podrouzek, Maria Bielikova

    Abstract: Fact-checkers are often hampered by the sheer amount of online content that needs to be fact-checked. NLP can help them by retrieving already existing fact-checks relevant to the content being investigated. This paper introduces a new multilingual dataset -- MultiClaim -- for previously fact-checked claim retrieval. We collected 28k posts in 27 languages from social media, 206k fact-checks in 39 l… ▽ More

    Submitted 13 October, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023

    Journal ref: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

  12. Eye Tracking as a Source of Implicit Feedback in Recommender Systems: A Preliminary Analysis

    Authors: Santiago de Leon-Martinez, Robert Moro, Maria Bielikova

    Abstract: Eye tracking in recommender systems can provide an additional source of implicit feedback, while hel** to evaluate other sources of feedback. In this study, we use eye tracking data to inform a collaborative filtering model for movie recommendation providing an improvement over the click-based implementations and additionally analyze the area of interest (AOI) duration as related to the known in… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: Paper accepted to Eyes4ICU workshop at ETRA 2023

  13. arXiv:2211.14631  [pdf, other

    cs.CL cs.AI cs.IR cs.NE

    Searching for Discriminative Words in Multidimensional Continuous Feature Space

    Authors: Marius Sajgalik, Michal Barla, Maria Bielikova

    Abstract: Word feature vectors have been proven to improve many NLP tasks. With recent advances in unsupervised learning of these feature vectors, it became possible to train it with much more data, which also resulted in better quality of learned features. Since it learns joint probability of latent features of words, it has the advantage that we can train it without any prior knowledge about the goal task… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

    ACM Class: I.2.7

    Journal ref: Computer Speech & Language, Volume 53, 2019, Pages 276-301

  14. arXiv:2211.12143  [pdf

    cs.CY cs.AI cs.HC

    Automated, not Automatic: Needs and Practices in European Fact-checking Organizations as a basis for Designing Human-centered AI Systems

    Authors: Andrea Hrckova, Robert Moro, Ivan Srba, Jakub Simko, Maria Bielikova

    Abstract: To mitigate the negative effects of false information more effectively, the development of automated AI (artificial intelligence) tools assisting fact-checkers is needed. Despite the existing research, there is still a gap between the fact-checking practitioners' needs and pains and the current AI research. We aspire to bridge this gap by employing methods of information behavior research to ident… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 41 pages, 13 figures, 1 table, 2 annexes

  15. arXiv:2210.10085  [pdf, other

    cs.IR cs.LG cs.SI

    Auditing YouTube's Recommendation Algorithm for Misinformation Filter Bubbles

    Authors: Ivan Srba, Robert Moro, Matus Tomlein, Branislav Pecher, Jakub Simko, Elena Stefancova, Michal Kompan, Andrea Hrckova, Juraj Podrouzek, Adrian Gavornik, Maria Bielikova

    Abstract: In this paper, we present results of an auditing study performed over YouTube aimed at investigating how fast a user can get into a misinformation filter bubble, but also what it takes to "burst the bubble", i.e., revert the bubble enclosure. We employ a sock puppet audit methodology, in which pre-programmed agents (acting as YouTube users) delve into misinformation filter bubbles by watching misi… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: Just accepted to ACM Transactions on Recommender Systems (ACM TORS). arXiv admin note: substantial text overlap with arXiv:2203.13769

    Journal ref: ACM Transactions on Recommender Systems. 1, 1, Article 6 (March 2023), 33 pages

  16. arXiv:2204.12294  [pdf, other

    cs.CL cs.CY cs.IR cs.LG

    Monant Medical Misinformation Dataset: Map** Articles to Fact-Checked Claims

    Authors: Ivan Srba, Branislav Pecher, Matus Tomlein, Robert Moro, Elena Stefancova, Jakub Simko, Maria Bielikova

    Abstract: False information has a significant negative influence on individuals as well as on the whole society. Especially in the current COVID-19 era, we witness an unprecedented growth of medical misinformation. To help tackle this problem with machine learning approaches, we are publishing a feature-rich dataset of approx. 317k medical news articles/blogs and 3.5k fact-checked claims. It also contains 5… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 11 pages, 4 figures, SIGIR 2022 Resource paper track

    Journal ref: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022)

  17. An Audit of Misinformation Filter Bubbles on YouTube: Bubble Bursting and Recent Behavior Changes

    Authors: Matus Tomlein, Branislav Pecher, Jakub Simko, Ivan Srba, Robert Moro, Elena Stefancova, Michal Kompan, Andrea Hrckova, Juraj Podrouzek, Maria Bielikova

    Abstract: The negative effects of misinformation filter bubbles in adaptive systems have been known to researchers for some time. Several studies investigated, most prominently on YouTube, how fast a user can get into a misinformation filter bubble simply by selecting wrong choices from the items offered. Yet, no studies so far have investigated what it takes to burst the bubble, i.e., revert the bubble enc… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: RecSys '21: Fifteenth ACM Conference on Recommender System

    Journal ref: RecSys '21: Fifteenth ACM Conference on Recommender Systems, 2021

  18. arXiv:2203.06641  [pdf, other

    cs.IR cs.AI cs.LG

    Exploring Customer Price Preference and Product Profit Role in Recommender Systems

    Authors: Michal Kompan, Peter Gaspar, Jakub Macina, Matus Cimerman, Maria Bielikova

    Abstract: Most of the research in the recommender systems domain is focused on the optimization of the metrics based on historical data such as Mean Average Precision (MAP) or Recall. However, there is a gap between the research and industry since the leading Key Performance Indicators (KPIs) for businesses are revenue and profit. In this paper, we explore the impact of manipulating the profit awareness of… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: in IEEE Intelligent Systems

    Journal ref: IEEE Intelligent Systems, 2021

  19. arXiv:2109.12523  [pdf, other

    cs.HC cs.CY cs.LG cs.SI

    A Study of Fake News Reading and Annotating in Social Media Context

    Authors: Jakub Simko, Patrik Racsko, Matus Tomlein, Martin Hanakova, Robert Moro, Maria Bielikova

    Abstract: The online spreading of fake news is a major issue threatening entire societies. Much of this spreading is enabled by new media formats, namely social networks and online media sites. Researchers and practitioners have been trying to answer this by characterizing the fake news and devising automated methods for detecting them. The detection methods had so far only limited success, mostly due to th… ▽ More

    Submitted 26 April, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

    ACM Class: H.5.2; H.5.4; K.4.2; H.3.1

    Journal ref: New Review of Hypermedia and Multimedia. pages 1-31 (2021)

  20. arXiv:2106.00102  [pdf, other

    cs.IR

    The Cold-start Problem: Minimal Users' Activity Estimation

    Authors: Juraj Visnovsky, Ondrej Kassak, Michal Kompan, Maria Bielikova

    Abstract: Cold-start problem, which arises upon the new users arrival, is one of the fundamental problems in today's recommender approaches. Moreover, in some domains as TV or multime-dia-items take long time to experience by users, thus users usually do not provide rich preference information. In this paper we analyze the minimal amount of ratings needs to be done by a user over a set of items, in order to… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: 1st Workshop on Recommender Systems for Television and online Video (RecSysTV) in conjunction with 8th ACM Conference on Recommender Systems, 2014

    Journal ref: 1st Workshop on Recommender Systems for Television and online Video (RecSysTV) in conjunction with 8th ACM Conference on Recommender Systems, 2014

  21. arXiv:2012.08793  [pdf, other

    cs.IR

    Session-based k-NNs with Semantic Suggestions for Next-item Prediction

    Authors: Miroslav Rac, Michal Kompan, Maria Bielikova

    Abstract: One of the most critical problems in e-commerce domain is the information overload problem. Usually, an enormous number of products is offered to a user. The characteristics of this domain force researchers to opt for session-based recommendation methods, from which nearest-neighbors-based (SkNN) approaches have been shown to be competitive with and even outperform neural network-based models. Exi… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: 11 pages, 3 figures, 3 tables, submitted to and presented at RecSys20 CARS workshop

  22. arXiv:1904.02981  [pdf, other

    cs.CL

    NL-FIIT at SemEval-2019 Task 9: Neural Model Ensemble for Suggestion Mining

    Authors: Samuel Pecar, Marian Simko, Maria Bielikova

    Abstract: In this paper, we present neural model architecture submitted to the SemEval-2019 Task 9 competition: "Suggestion Mining from Online Reviews and Forums". We participated in both subtasks for domain specific and also cross-domain suggestion mining. We proposed a recurrent neural network architecture that employs Bi-LSTM layers and also self-attention mechanism. Our architecture tries to encode word… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted at the SemEval-2019 International Workshop on Semantic Evaluation

  23. arXiv:1809.06906  [pdf, other

    cs.CL

    Improving Moderation of Online Discussions via Interpretable Neural Models

    Authors: Andrej Švec, Matúš Pikuliak, Marián Šimko, Mária Bieliková

    Abstract: Growing amount of comments make online discussions difficult to moderate by human moderators only. Antisocial behavior is a common occurrence that often discourages other users from participating in discussion. We propose a neural network based method that partially automates the moderation process. It consists of two steps. First, we detect inappropriate comments for moderators to see. Second, we… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: ALW2

  24. Towards quantum-based privacy and voting

    Authors: Mark Hillery, Mario Ziman, Vladimir Buzek, Martina Bielikova

    Abstract: The privacy of communicating participants is often of paramount importance, but in some situations it is an essential condition. A typical example is a fair (secret) voting. We analyze in detail communication privacy based on quantum resources, and we propose new quantum protocols. Possible generalizations that would lead to voting schemes are discussed.

    Submitted 25 August, 2005; v1 submitted 6 May, 2005; originally announced May 2005.

    Comments: 5 pages, improved description of the protocol