Skip to main content

Showing 1–13 of 13 results for author: Vladika, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00637  [pdf, other

    cs.CL

    DP-MLM: Differentially Private Text Rewriting Using Masked Language Models

    Authors: Stephen Meisenbacher, Maulik Chevli, Juraj Vladika, Florian Matthes

    Abstract: The task of text privatization using Differential Privacy has recently taken the form of $\textit{text rewriting}$, in which an input text is obfuscated via the use of generative (large) language models. While these methods have shown promising results in the ability to preserve privacy, these methods rely on autoregressive models which lack a mechanism to contextualize the private rewriting proce… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures, 8 tables. Accepted to ACL 2024 (Findings)

  2. arXiv:2406.05845  [pdf, other

    cs.CL

    MedREQAL: Examining Medical Knowledge Recall of Large Language Models via Question Answering

    Authors: Juraj Vladika, Phillip Schneider, Florian Matthes

    Abstract: In recent years, Large Language Models (LLMs) have demonstrated an impressive ability to encode knowledge during pre-training on large text corpora. They can leverage this knowledge for downstream tasks like question answering (QA), even in complex areas involving health topics. Considering their high potential for facilitating clinical work in the future, understanding the quality of encoded medi… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 (Findings)

  3. arXiv:2404.18759  [pdf

    cs.CL cs.CY

    Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective

    Authors: Juraj Vladika, Stephen Meisenbacher, Martina Preis, Alexandra Klymenko, Florian Matthes

    Abstract: In recent years, the field of Legal Tech has risen in prevalence, as the Natural Language Processing (NLP) and legal disciplines have combined forces to digitalize legal processes. Amidst the steady flow of research solutions stemming from the NLP domain, the study of use cases has fallen behind, leading to a number of innovative technical methods without a place in practice. In this work, we aim… ▽ More

    Submitted 2 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 tables, 30th Americas Conference on Information Systems (AMCIS 2024)

  4. arXiv:2404.08359  [pdf, other

    cs.CL cs.AI cs.IR

    Improving Health Question Answering with Reliable and Time-Aware Evidence Retrieval

    Authors: Juraj Vladika, Florian Matthes

    Abstract: In today's digital world, seeking answers to health questions on the Internet is a common practice. However, existing question answering (QA) systems often rely on using pre-selected and annotated evidence documents, thus making them inadequate for addressing novel questions. Our study focuses on the open-domain QA setting, where the key challenge is to first uncover relevant evidence in large kno… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024 (Findings)

  5. arXiv:2404.01443  [pdf

    cs.CL

    Enterprise Use Cases Combining Knowledge Graphs and Natural Language Processing

    Authors: Phillip Schneider, Tim Schopf, Juraj Vladika, Florian Matthes

    Abstract: Knowledge management is a critical challenge for enterprises in today's digital world, as the volume and complexity of data being generated and collected continue to grow incessantly. Knowledge graphs (KG) emerged as a promising solution to this problem by providing a flexible, scalable, and semantically rich way to organize and make sense of data. This paper builds upon a recent survey of the res… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 16 pages

  6. arXiv:2402.02844  [pdf, other

    cs.CL cs.AI cs.IR

    Comparing Knowledge Sources for Open-Domain Scientific Claim Verification

    Authors: Juraj Vladika, Florian Matthes

    Abstract: The increasing rate at which scientific knowledge is discovered and health claims shared online has highlighted the importance of develo** efficient fact-checking systems for scientific claims. The usual setting for this task in the literature assumes that the documents containing the evidence for claims are already provided and annotated or contained in a limited corpus. This renders the system… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024

  7. arXiv:2312.13881  [pdf, other

    cs.CL

    Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs

    Authors: Juraj Vladika, Alexander Fichtl, Florian Matthes

    Abstract: Recent advances in natural language processing (NLP) owe their success to pre-training language models on large amounts of unstructured data. Still, there is an increasing effort to combine the unstructured nature of LMs with structured knowledge and reasoning. Particularly in the rapidly evolving field of biomedical NLP, knowledge-enhanced language models (KELMs) have emerged as promising tools t… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted as Full Paper to ICAART 2024

  8. arXiv:2309.08503  [pdf, other

    cs.CL cs.AI

    HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking

    Authors: Juraj Vladika, Phillip Schneider, Florian Matthes

    Abstract: In the digital age, seeking health advice on the Internet has become a common practice. At the same time, determining the trustworthiness of online medical content is increasingly challenging. Fact-checking has emerged as an approach to assess the veracity of factual claims using evidence from credible knowledge sources. To help advance automated Natural Language Processing (NLP) solutions for thi… ▽ More

    Submitted 25 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted to LREC-COLING 2024

  9. Challenges in Domain-Specific Abstractive Summarization and How to Overcome them

    Authors: Anum Afzal, Juraj Vladika, Daniel Braun, Florian Matthes

    Abstract: Large Language Models work quite well with general-purpose data and many tasks in Natural Language Processing. However, they show several limitations when used for a task such as domain-specific abstractive text summarization. This paper identifies three of those limitations as research problems in the context of abstractive text summarization: 1) Quadratic complexity of transformer-based models w… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  10. arXiv:2305.16859  [pdf, other

    cs.CL

    Scientific Fact-Checking: A Survey of Resources and Approaches

    Authors: Juraj Vladika, Florian Matthes

    Abstract: The task of fact-checking deals with assessing the veracity of factual claims based on credible evidence and background knowledge. In particular, scientific fact-checking is the variation of the task concerned with verifying claims rooted in scientific knowledge. This task has received significant attention due to the growing importance of scientific and health discussions on online platforms. Aut… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 9 pages, ACL 2023 (Findings)

  11. arXiv:2304.13180  [pdf, other

    cs.CL

    Sebis at SemEval-2023 Task 7: A Joint System for Natural Language Inference and Evidence Retrieval from Clinical Trial Reports

    Authors: Juraj Vladika, Florian Matthes

    Abstract: With the increasing number of clinical trial reports generated every day, it is becoming hard to keep up with novel discoveries that inform evidence-based healthcare recommendations. To help automate this process and assist medical experts, NLP solutions are being developed. This motivated the SemEval-2023 Task 7, where the goal was to develop an NLP system for two tasks: evidence retrieval and na… ▽ More

    Submitted 2 May, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 6 pages, SemEval 2023

  12. arXiv:2301.04098  [pdf, other

    cs.CL

    Investigating Conversational Search Behavior For Domain Exploration

    Authors: Phillip Schneider, Anum Afzal, Juraj Vladika, Daniel Braun, Florian Matthes

    Abstract: Conversational search has evolved as a new information retrieval paradigm, marking a shift from traditional search systems towards interactive dialogues with intelligent search agents. This change especially affects exploratory information-seeking contexts, where conversational search systems can guide the discovery of unfamiliar domains. In these scenarios, users find it often difficult to expres… ▽ More

    Submitted 27 February, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: Accepted to ECIR 2023

  13. arXiv:2210.00105  [pdf, other

    cs.CL cs.AI

    A Decade of Knowledge Graphs in Natural Language Processing: A Survey

    Authors: Phillip Schneider, Tim Schopf, Juraj Vladika, Mikhail Galkin, Elena Simperl, Florian Matthes

    Abstract: In pace with developments in the research field of artificial intelligence, knowledge graphs (KGs) have attracted a surge of interest from both academia and industry. As a representation of semantic relations between entities, KGs have proven to be particularly relevant for natural language processing (NLP), experiencing a rapid spread and wide adoption within recent years. Given the increasing am… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted to AACL-IJCNLP 2022