Showing 1–6 of 6 results for author: Kraft, B

  1. arXiv:2402.10675

    cs.CL

    German Text Simplification: Finetuning Large Language Models with Semi-Synthetic Data

    Authors: Lars Klöser, Mika Beele, Jan-Niklas Schagen, Bodo Kraft

    Abstract: This study pioneers the use of synthetically generated data for training generative models in document-level text simplification of German texts. We demonstrate the effectiveness of our approach with real-world online texts. Addressing the challenge of data scarcity in language simplification, we crawled professionally simplified German texts and synthesized a corpus using GPT-4. We finetune Large…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at Fourth Workshop on Language Technology for Equality, Diversity, Inclusion - EACL 2024

    ACM Class: I.2.7

  2. ALE: A Simulation-Based Active Learning Evaluation Framework for the Parameter-Driven Comparison of Query Strategies for NLP

    Authors: Philipp Kohl, Nils Freyer, Yoka Krämer, Henri Werth, Steffen Wolf, Bodo Kraft, Matthias Meinecke, Albert Zündorf

    Abstract: Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data points for annotators to annotate next instead of a subsequent or random sample. This method is supposed to save annotation effort while maintaining model perf…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: The Version of Record of this contribution is published in Deep Learning Theory and Applications 4th International Conference, DeLTA 2023 Proceedings, and is available online at https://doi.org/10.1007/978-3-031-39059-3_16

    Journal ref: Conte, D., Fred, A., Gusikhin, O., Sansone, C. (eds) Deep Learning Theory and Applications. DeLTA 2023. Communications in Computer and Information Science, vol 1875. Springer, Cham

  3. Explaining Relation Classification Models with Semantic Extents

    Authors: Lars Klöser, Andre Büsgen, Philipp Kohl, Bodo Kraft, Albert Zündorf

    Abstract: In recent years, the development of large pretrained language models, such as BERT and GPT, significantly improved information extraction systems on various tasks, including relation classification. State-of-the-art systems are highly accurate on scientific benchmarks. A lack of explainability is currently a complicating factor in many real-world applications. Comprehensible systems are necessary…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Accepted at DeLTA 2023: Deep Learning Theory and Applications conference

    ACM Class: I.2.7

  4. arXiv:2111.09035

    cs.CL cs.AI cs.IR cs.LG

    Multi-Attribute Relation Extraction (MARE) -- Simplifying the Application of Relation Extraction

    Authors: Lars Klöser, Philipp Kohl, Bodo Kraft, Albert Zündorf

    Abstract: Natural language understanding's relation extraction makes innovative and encouraging novel business concepts possible and facilitates new digitalized decision-making processes. Current approaches allow the extraction of relations with a fixed number of entities as attributes. Extracting relations with an arbitrary amount of attributes requires complex systems and costly relation-trigger annotatio…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Preprint of short paper for the 2nd International Conference on Deep Learning Theory and Applications (2021)

    Journal ref: Proceedings of the 2nd International Conference on Deep Learning Theory and Applications, Vol. 1, (2021), P. 148 - 156

  5. STAMP 4 NLP -- An Agile Framework for Rapid Quality-Driven NLP Applications Development

    Authors: Philipp Kohl, Oliver Schmidts, Lars Klöser, Henri Werth, Bodo Kraft, Albert Zündorf

    Abstract: The progress in natural language processing (NLP) research over the last years offers novel business opportunities for companies, such as automated user interaction or improved data analysis. Building sophisticated NLP applications requires dealing with modern machine learning (ML) technologies, which impedes enterprises from establishing successful NLP projects. Our experience in applied NLP research…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Preprint of short paper for QUATIC 2021 conference

    Journal ref: Quality of Information and Communications Technology, 2021, p. 156-166

  6. arXiv:1909.10296

    cs.CV cs.LG

    Predicting Landscapes from Environmental Conditions Using Generative Networks

    Authors: Christian Requena-Mesa, Markus Reichstein, Miguel Mahecha, Basil Kraft, Joachim Denzler

    Abstract: Landscapes are meaningful ecological units that strongly depend on the environmental conditions. Such dependencies between landscapes and the environment have been noted since the beginning of Earth sciences and cast into conceptual models describing the interdependencies of climate, geology, vegetation and geomorphology. Here, we ask whether landscapes, as seen from space, can be statistically pr…

    Submitted 23 September, 2019; originally announced September 2019.

    Comments: Accepted conference paper at GCPR2019