Showing 1–21 of 21 results for author: Leippold, M

  1. arXiv:2406.14162  [pdf, other]

    cs.IR cs.AI cs.CL

    DIRAS: Efficient LLM-Assisted Annotation of Document Relevance in Retrieval Augmented Generation

    Authors: Jingwei Ni, Tobias Schimanski, Meihong Lin, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: Retrieval Augmented Generation (RAG) is widely employed to ground responses to queries on domain-specific documents. But do RAG implementations leave out important information or excessively include irrelevant information? To allay these concerns, it is necessary to annotate domain-specific benchmarks to evaluate information retrieval (IR) performance, as relevance definitions vary across queries…

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.09818  [pdf, other]

    cs.IR

    ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures

    Authors: Tobias Schimanski, Jingwei Ni, Roberto Spacey, Nicola Ranger, Markus Leippold

    Abstract: To handle the vast amounts of qualitative data produced in corporate climate communication, stakeholders increasingly rely on Retrieval Augmented Generation (RAG) systems. However, a significant gap remains in evaluating domain-specific information retrieval - the basis for answer generation. To address this challenge, this work simulates the typical tasks of a sustainability analyst by examining…

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2402.11073  [pdf, other]

    cs.CL cs.AI

    AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

    Authors: Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: With the rise of generative AI, automated fact-checking methods to combat misinformation are becoming more and more important. However, factual claim detection, the first step in a fact-checking pipeline, suffers from two key issues that limit its scalability and generalizability: (1) inconsistency in definitions of the task and what a claim is, and (2) the high cost of manual annotation. To addre…

    Submitted 2 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL2024 Main Conference

  4. arXiv:2402.08277  [pdf, other]

    cs.CL cs.LG

    Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering

    Authors: Tobias Schimanski, Jingwei Ni, Mathias Kraus, Elliott Ash, Markus Leippold

    Abstract: Advances towards more faithful and traceable answers of Large Language Models (LLMs) are crucial for various research and practical endeavors. One avenue in reaching this goal is basing the answers on reliable sources. However, this Evidence-Based QA has proven to work insufficiently with LLMs in terms of citing the correct sources (source quality) and truthfully representing the information withi…

    Submitted 3 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  5. arXiv:2401.12566  [pdf, other]

    cs.CL

    Automated Fact-Checking of Climate Change Claims with Large Language Models

    Authors: Markus Leippold, Saeid Ashraf Vaghefi, Dominik Stammbach, Veruska Muccione, Julia Bingler, Jingwei Ni, Chiara Colesanti-Senni, Tobias Wekhof, Tobias Schimanski, Glen Gostlow, Tingyu Yu, Juerg Luterbacher, Christian Huggel

    Abstract: This paper presents Climinator, a novel AI-based tool designed to automate the fact-checking of climate change claims. Utilizing an array of Large Language Models (LLMs) informed by authoritative sources like the IPCC reports and peer-reviewed scientific literature, Climinator employs an innovative Mediator-Advocate framework. This design allows Climinator to effectively synthesize varying scienti…

    Submitted 23 January, 2024; originally announced January 2024.

  6. arXiv:2312.17337  [pdf, other]

    cs.CL econ.GN

    Exploring Nature: Datasets and Models for Analyzing Nature-Related Disclosures

    Authors: Tobias Schimanski, Chiara Colesanti Senni, Glen Gostlow, Jingwei Ni, Tingyu Yu, Markus Leippold

    Abstract: Nature is an amorphous concept. Yet, it is essential for the planet's well-being to understand how the economy interacts with it. To address the growing demand for information on corporate nature disclosure, we provide datasets and classifiers to detect nature communication by companies. We ground our approach in the guidelines of the Taskforce on Nature-related Financial Disclosures (TNFD). Parti…

    Submitted 28 December, 2023; originally announced December 2023.

  7. arXiv:2310.08096  [pdf, other]

    cs.LG

    ClimateBERT-NetZero: Detecting and Assessing Net Zero and Reduction Targets

    Authors: Tobias Schimanski, Julia Bingler, Camilla Hyslop, Mathias Kraus, Markus Leippold

    Abstract: Public and private actors struggle to assess the vast amounts of information about sustainability commitments made by various institutions. To address this problem, we create a novel tool for automatically detecting corporate, national, and regional net zero and reduction targets in three steps. First, we introduce an expert-annotated data set with 3.5K text samples. Second, we train and release C…

    Submitted 12 October, 2023; originally announced October 2023.

  8. arXiv:2310.02932  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Assessing Large Language Models on Climate Information

    Authors: Jannis Bulian, Mike S. Schäfer, Afra Amini, Heidi Lam, Massimiliano Ciaramita, Ben Gaiarin, Michelle Chen Hübscher, Christian Buck, Niels G. Mede, Markus Leippold, Nadine Strauß

    Abstract: As Large Language Models (LLMs) rise in popularity, it is necessary to assess their capability in critically relevant domains. We present a comprehensive evaluation framework, grounded in science communication research, to assess LLM responses to questions about climate change. Our framework emphasizes both presentational and epistemological adequacy, offering a fine-grained analysis of LLM genera…

    Submitted 28 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  9. arXiv:2307.15770  [pdf, other]

    cs.CL cs.AI

    CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools

    Authors: Jingwei Ni, Julia Bingler, Chiara Colesanti-Senni, Mathias Kraus, Glen Gostlow, Tobias Schimanski, Dominik Stammbach, Saeid Ashraf Vaghefi, Qian Wang, Nicolas Webersinke, Tobias Wekhof, Tingyu Yu, Markus Leippold

    Abstract: In the face of climate change, are companies really taking substantial steps toward more sustainable operations? A comprehensive answer lies in the dense, information-rich landscape of corporate sustainability reports. However, the sheer volume and complexity of these reports make human analysis very costly. Therefore, only a few entities worldwide have the resources to analyze these reports at sc…

    Submitted 11 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: 6 pages. arXiv admin note: text overlap with arXiv:2306.15518

  10. arXiv:2306.15518   

    cs.CL

    Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool

    Authors: Jingwei Ni, Julia Bingler, Chiara Colesanti-Senni, Mathias Kraus, Glen Gostlow, Tobias Schimanski, Dominik Stammbach, Saeid Ashraf Vaghefi, Qian Wang, Nicolas Webersinke, Tobias Wekhof, Tingyu Yu, Markus Leippold

    Abstract: This paper introduces a novel approach to enhance Large Language Models (LLMs) with expert knowledge to automate the analysis of corporate sustainability reports by benchmarking them against the Task Force for Climate-Related Financial Disclosures (TCFD) recommendations. Corporate sustainability reports are crucial in assessing organizations' environmental and social risks and impacts. However, an…

    Submitted 16 November, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: A new version of the ChatReport paper: arXiv:2307.15770

  11. arXiv:2305.14007  [pdf, other]

    cs.CL

    When Does Aggregating Multiple Skills with Multi-Task Learning Work? A Case Study in Financial NLP

    Authors: Jingwei Ni, Zhijing Jin, Qian Wang, Mrinmaya Sachan, Markus Leippold

    Abstract: Multi-task learning (MTL) aims at achieving a better model by leveraging data and knowledge from multiple tasks. However, MTL does not always work -- sometimes negative transfer occurs between tasks, especially when aggregating loosely related skills, leaving it an open question when MTL works. Previous studies show that MTL performance can be improved by algorithmic tricks. However, what tasks an…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  12. arXiv:2304.05510  [pdf, other]

    cs.CL

    chatClimate: Grounding Conversational AI in Climate Science

    Authors: Saeid Ashraf Vaghefi, Qian Wang, Veruska Muccione, Jingwei Ni, Mathias Kraus, Julia Bingler, Tobias Schimanski, Chiara Colesanti-Senni, Nicolas Webersinke, Christian Huggel, Markus Leippold

    Abstract: Large Language Models (LLMs) have made significant progress in recent years, achieving remarkable results in question-answering tasks (QA). However, they still face two major challenges: hallucination and outdated information after the training phase. These challenges take center stage in critical domains like climate change, where obtaining accurate and up-to-date information from reliable source…

    Submitted 28 April, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

  13. arXiv:2304.00116  [pdf, other]

    cs.CL cs.IR

    Enhancing Large Language Models with Climate Resources

    Authors: Mathias Kraus, Julia Anna Bingler, Markus Leippold, Tobias Schimanski, Chiara Colesanti Senni, Dominik Stammbach, Saeid Ashraf Vaghefi, Nicolas Webersinke

    Abstract: Large language models (LLMs) have significantly transformed the landscape of artificial intelligence by demonstrating their ability in generating human-like text across diverse topics. However, despite their impressive capabilities, LLMs lack recent information and often employ imprecise language, which can be detrimental in domains where accuracy is crucial, such as climate change. In this study,…

    Submitted 31 March, 2023; originally announced April 2023.

  14. arXiv:2209.00507  [pdf, other]

    cs.CL

    Environmental Claim Detection

    Authors: Dominik Stammbach, Nicolas Webersinke, Julia Anna Bingler, Mathias Kraus, Markus Leippold

    Abstract: To transition to a green economy, environmental claims made by companies must be reliable, comparable, and verifiable. To analyze such claims at scale, automated methods are needed to detect them in the first place. However, there exist no datasets or models for this. Thus, this paper introduces the task of environmental claim detection. To accompany the task, we release an expert-annotated datase…

    Submitted 26 May, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

  15. arXiv:2205.05071  [pdf, other]

    cs.CL cs.CY

    Towards Climate Awareness in NLP Research

    Authors: Daniel Hershcovich, Nicolas Webersinke, Mathias Kraus, Julia Anna Bingler, Markus Leippold

    Abstract: The climate impact of AI, and NLP research in particular, has become a serious issue given the enormous amount of energy that is increasingly being used for training and running computational models. Consequently, increasing focus is placed on efficient NLP. However, this important initiative lacks simple guidelines that would allow for systematic climate reporting of NLP research. We argue that t…

    Submitted 18 October, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: Accepted to EMNLP 2022

  16. arXiv:2110.12010  [pdf, other]

    cs.CL

    ClimateBert: A Pretrained Language Model for Climate-Related Text

    Authors: Nicolas Webersinke, Mathias Kraus, Julia Anna Bingler, Markus Leippold

    Abstract: Over the recent years, large pretrained language models (LM) have revolutionized the field of natural language processing (NLP). However, while pretraining on general language has been shown to work very well for common language, it has been observed that niche language poses problems. In particular, climate-related texts include specific language that common LMs can not represent accurately. We a…

    Submitted 17 December, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

  17. arXiv:2012.00614  [pdf, other]

    cs.CL cs.AI

    CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims

    Authors: Thomas Diggelmann, Jordan Boyd-Graber, Jannis Bulian, Massimiliano Ciaramita, Markus Leippold

    Abstract: We introduce CLIMATE-FEVER, a new publicly available dataset for verification of climate change-related claims. By providing a dataset for the research community, we aim to facilitate and encourage work on improving algorithms for retrieving evidential support for climate-specific claims, addressing the underlying language understanding challenges, and ultimately help alleviate the impact of misin…

    Submitted 2 January, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: Accepted for the Tackling Climate Change with Machine Learning Workshop at NeurIPS 2020

  18. arXiv:2012.00483  [pdf, other]

    cs.CL cs.AI

    ClimaText: A Dataset for Climate Change Topic Detection

    Authors: Francesco S. Varini, Jordan Boyd-Graber, Massimiliano Ciaramita, Markus Leippold

    Abstract: Climate change communication in the mass media and other textual sources may affect and shape public perception. Extracting climate change information from these sources is an important task, e.g., for filtering content and e-discovery, sentiment analysis, automatic summarization, question-answering, and fact-checking. However, automating this process is a challenge, as climate change is a complex…

    Submitted 2 January, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: Accepted for the Tackling Climate Change with Machine Learning Workshop at NeurIPS 2020

  19. arXiv:2010.08570  [pdf, other]

    cs.CL

    Generating Fact Checking Summaries for Web Claims

    Authors: Rahul Mishra, Dhruv Gupta, Markus Leippold

    Abstract: We present SUMO, a neural attention-based approach that learns to establish the correctness of textual claims based on evidence in the form of text documents (e.g., news articles or Web documents). SUMO further generates an extractive summary by presenting a diversified set of sentences from the documents that explain its decision on the correctness of the textual claim. Prior approaches to addres…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: Accepted paper; The 2020 Conference on Empirical Methods in Natural Language Processing EMNLP - WNUT

    MSC Class: 68T50 ACM Class: H.1.1; H.3.1; H.3.3

  20. arXiv:2010.03617  [pdf, other]

    cs.CL

    MuSeM: Detecting Incongruent News Headlines using Mutual Attentive Semantic Matching

    Authors: Rahul Mishra, Piyush Yadav, Remi Calizzano, Markus Leippold

    Abstract: Measuring the congruence between two texts has several useful applications, such as detecting the prevalent deceptive and misleading news headlines on the web. Many works have proposed machine learning based solutions such as text similarity between the headline and body text to detect the incongruence. Text similarity based methods fail to perform well due to different inherent challenges such as…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted paper; IEEE 2020 International Conference on Machine Learning and Applications (ICMLA)

    MSC Class: 68T50 ACM Class: H.1.1; H.3.1; H.3.3

  21. Quantile estimation with adaptive importance sampling

    Authors: Daniel Egloff, Markus Leippold

    Abstract: We introduce new quantile estimators with adaptive importance sampling. The adaptive estimators are based on weighted samples that are neither independent nor identically distributed. Using a new law of iterated logarithm for martingales, we prove the convergence of the adaptive quantile estimators for general distributions with nonunique quantiles thereby extending the work of Feldman and Tucke…

    Submitted 26 February, 2010; originally announced February 2010.

    Comments: Published at http://dx.doi.org/10.1214/09-AOS745 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS745 MSC Class: 62L20; 65C05 (Primary) 65C60 (Secondary)

    Journal ref: Annals of Statistics 2010, Vol. 38, No. 2, 1244-1278