Skip to main content

Showing 1–14 of 14 results for author: Roller, R

.
  1. arXiv:2403.18336  [pdf, other

    cs.CL cs.LG

    A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages

    Authors: Lisa Raithel, Hui-Syuan Yeh, Shuntaro Yada, Cyril Grouin, Thomas Lavergne, Aurélie Névéol, Patrick Paroubek, Philippe Thomas, Tomohiro Nishiyama, Sebastian Möller, Eiji Aramaki, Yuji Matsumoto, Roland Roller, Pierre Zweigenbaum

    Abstract: User-generated data sources have gained significance in uncovering Adverse Drug Reactions (ADRs), with an increasing number of discussions occurring in the digital world. However, the existing clinical corpora predominantly revolve around scientific articles in English. This work presents a multilingual corpus of texts concerning ADRs gathered from diverse sources, including patient fora, social m… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  2. arXiv:2310.11275  [pdf, other

    cs.CL

    xMEN: A Modular Toolkit for Cross-Lingual Medical Entity Normalization

    Authors: Florian Borchert, Ignacio Llorca, Roland Roller, Bert Arnrich, Matthieu-P. Schapranow

    Abstract: Objective: To improve performance of medical entity normalization across many languages, especially when fewer language resources are available compared to English. Materials and Methods: We introduce xMEN, a modular system for cross-lingual medical entity normalization, which performs well in both low- and high-resource scenarios. When synonyms in the target language are scarce for a given term… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 16 pages, 3 figures

  3. arXiv:2308.08827  [pdf, other

    cs.CL

    Factuality Detection using Machine Translation -- a Use Case for German Clinical Text

    Authors: Mohammed Bin Sumait, Aleksandra Gabryszak, Leonhard Hennig, Roland Roller

    Abstract: Factuality can play an important role when automatically processing clinical text, as it makes a difference if particular symptoms are explicitly not present, possibly present, not mentioned, or affirmed. In most cases, a sufficient number of examples is necessary to handle such phenomena in a supervised machine learning setting. However, as clinical text might contain sensitive information, data… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted at KONVENS 2023

  4. arXiv:2301.00183  [pdf, other

    cs.SI cs.MA nlin.AO physics.soc-ph

    Modeling social resilience: Questions, answers, open problems

    Authors: Frank Schweitzer, Georges Andres, Giona Casiraghi, Christoph Gote, Ramona Roller, Ingo Scholtes, Giacomo Vaccario, Christian Zingg

    Abstract: Resilience denotes the capacity of a system to withstand shocks and its ability to recover from them. We develop a framework to quantify the resilience of highly volatile, non-equilibrium social organizations, such as collectives or collaborating teams. It consists of four steps: (i) \emph{delimitation}, i.e., narrowing down the target systems, (ii) \emph{conceptualization}, .e., identifying how t… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

  5. arXiv:2209.00262  [pdf, other

    cs.CL

    Which anonymization technique is best for which NLP task? -- It depends. A Systematic Study on Clinical Text Processing

    Authors: Iyadh Ben Cheikh Larbi, Aljoscha Burchardt, Roland Roller

    Abstract: Clinical text processing has gained more and more attention in recent years. The access to sensitive patient data, on the other hand, is still a big challenge, as text cannot be shared without legal hurdles and without removing personal information. There are many techniques to modify or remove patient related information, each with different strengths. This paper investigates the influence of dif… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

  6. arXiv:2208.02031  [pdf, other

    cs.CL cs.LG

    Cross-lingual Approaches for the Detection of Adverse Drug Reactions in German from a Patient's Perspective

    Authors: Lisa Raithel, Philippe Thomas, Roland Roller, Oliver Sapina, Sebastian Möller, Pierre Zweigenbaum

    Abstract: In this work, we present the first corpus for German Adverse Drug Reaction (ADR) detection in patient-generated content. The data consists of 4,169 binary annotated documents from a German patient forum, where users talk about health issues and get advice from medical doctors. As is common in social media data in this domain, the class labels of the corpus are very imbalanced. This and a high topi… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted at LREC 2022

  7. arXiv:2207.03885  [pdf, other

    cs.CL

    A Medical Information Extraction Workbench to Process German Clinical Text

    Authors: Roland Roller, Laura Seiffe, Ammer Ayach, Sebastian Möller, Oliver Marten, Michael Mikhailov, Christoph Alt, Danilo Schmidt, Fabian Halleck, Marcel Naik, Wiebke Duettmann, Klemens Budde

    Abstract: Background: In the information extraction and natural language processing domain, accessible datasets are crucial to reproduce and compare results. Publicly available implementations and tools can serve as benchmark and facilitate the development of more complex applications. However, in the context of clinical text processing the number of accessible datasets is scarce -- and so is the number of… ▽ More

    Submitted 15 August, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: Paper under review since 2021

  8. arXiv:2204.12810  [pdf, other

    cs.LG cs.CY

    When Performance is not Enough -- A Multidisciplinary View on Clinical Decision Support

    Authors: Roland Roller, Klemens Budde, Aljoscha Burchardt, Peter Dabrock, Sebastian Möller, Bilgin Osmanodja, Simon Ronicke, David Samhammer, Sven Schmeier

    Abstract: Scientific publications about machine learning in healthcare are often about implementing novel methods and boosting the performance - at least from a computer science perspective. However, beyond such often short-lived improvements, much more needs to be taken into consideration if we want to arrive at a sustainable progress in healthcare. What does it take to actually implement such a system, ma… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Paper currently under review

  9. arXiv:2005.11494  [pdf, ps, other

    cs.CL

    From Witch's Shot to Music Making Bones -- Resources for Medical Laymen to Technical Language and Vice Versa

    Authors: Laura Seiffe, Oliver Marten, Michael Mikhailov, Sven Schmeier, Sebastian Möller, Roland Roller

    Abstract: Many people share information in social media or forums, like food they eat, sports activities they do or events which have been visited. This also applies to information about a person's health status. Information we share online unveils directly or indirectly information about our lifestyle and health situation and thus provides a valuable data resource. If we can make advantage of that data, ap… ▽ More

    Submitted 23 May, 2020; originally announced May 2020.

    Comments: In Proceedings of LREC 2020

  10. SIA: A Scalable Interoperable Annotation Server for Biomedical Named Entities

    Authors: Johannes Kirschnick, Philippe Thomas, Roland Roller, Leonhard Hennig

    Abstract: Recent years showed a strong increase in biomedical sciences and an inherent increase in publication volume. Extraction of specific information from these sources requires highly sophisticated text mining and information extraction tools. However, the integration of freely available tools into customized workflows is often cumbersome and difficult. We describe SIA (Scalable Interoperable Annotatio… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: 11 pages, 2 figures, published in Journal of Cheminformatics

    Journal ref: J Cheminform 10, 63 (2018)

  11. arXiv:1811.03809  [pdf, other

    cs.SI

    Football and Beer - a Social Media Analysis on Twitter in Context of the FIFA Football World Cup 2018

    Authors: Roland Roller, Philippe Thomas, Sven Schmeier

    Abstract: In many societies alcohol is a legal and common recreational substance and socially accepted. Alcohol consumption often comes along with social events as it helps people to increase their sociability and to overcome their inhibitions. On the other hand we know that increased alcohol consumption can lead to serious health issues, such as cancer, cardiovascular diseases and diseases of the digestive… ▽ More

    Submitted 9 November, 2018; originally announced November 2018.

    Journal ref: In proceedings of Social Media Mining for Health Applications (SMM4H) @ EMNLP 2018

  12. arXiv:1805.01646  [pdf, ps, other

    cs.CL

    Cross-lingual Candidate Search for Biomedical Concept Normalization

    Authors: Roland Roller, Madeleine Kittner, Dirk Weissenborn, Ulf Leser

    Abstract: Biomedical concept normalization links concept mentions in texts to a semantically equivalent concept in a biomedical knowledge base. This task is challenging as concepts can have different expressions in natural languages, e.g. paraphrases, which are not necessarily all present in the knowledge base. Concept normalization of non-English biomedical text is even more challenging as non-English reso… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  13. arXiv:1710.11154  [pdf, other

    cs.CL

    Creation of an Annotated Corpus of Spanish Radiology Reports

    Authors: Viviana Cotik, Darío Filippo, Roland Roller, Hans Uszkoreit, Feiyu Xu

    Abstract: This paper presents a new annotated corpus of 513 anonymized radiology reports written in Spanish. Reports were manually annotated with entities, negation and uncertainty terms and relations. The corpus was conceived as an evaluation resource for named entity recognition and relation extraction algorithms, and as input for the use of supervised methods. Biomedical annotated resources are scarce du… ▽ More

    Submitted 30 October, 2017; originally announced October 2017.

    Comments: WiNLP Workshop ACL

  14. arXiv:1509.03739  [pdf, other

    cs.CL

    Improving distant supervision using inference learning

    Authors: Roland Roller, Eneko Agirre, Aitor Soroa, Mark Stevenson

    Abstract: Distant supervision is a widely applied approach to automatic training of relation extraction systems and has the advantage that it can generate large amounts of labelled data with minimal effort. However, this data may contain errors and consequently systems trained using distant supervision tend not to perform as well as those based on manually labelled data. This work proposes a novel method fo… ▽ More

    Submitted 12 September, 2015; originally announced September 2015.

    Comments: In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)