Skip to main content

Showing 1–7 of 7 results for author: Nielsen, D S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13469  [pdf, other

    cs.CL cs.AI cs.LG

    Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks

    Authors: Dan Saattrup Nielsen, Kenneth Enevoldsen, Peter Schneider-Kamp

    Abstract: This paper explores the performance of encoder and decoder language models on multilingual Natural Language Understanding (NLU) tasks, with a broad focus on Germanic languages. Building upon the ScandEval benchmark, which initially was restricted to evaluating encoder models, we extend the evaluation framework to include decoder models. We introduce a method for evaluating decoder models on NLU ta… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 14 pages, 2 figures

    ACM Class: I.2.7

  2. arXiv:2311.09145  [pdf, other

    cs.LG stat.ML

    Model Agnostic Explainable Selective Regression via Uncertainty Estimation

    Authors: Andrea Pugnana, Carlos Mougan, Dan Saattrup Nielsen

    Abstract: With the wide adoption of machine learning techniques, requirements have evolved beyond sheer high performance, often requiring models to be trustworthy. A common approach to increase the trustworthiness of such systems is to allow them to refrain from predicting. Such a framework is known as selective prediction. While selective prediction for classification tasks has been widely analyzed, the pr… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  3. arXiv:2311.07264  [pdf, other

    cs.CL

    Danish Foundation Models

    Authors: Kenneth Enevoldsen, Lasse Hansen, Dan S. Nielsen, Rasmus A. F. Egebæk, Søren V. Holm, Martin C. Nielsen, Martin Bernstorff, Rasmus Larsen, Peter B. Jørgensen, Malte Højmark-Bertelsen, Peter B. Vahlstrup, Per Møldrup-Dalum, Kristoffer Nielbo

    Abstract: Large language models, sometimes referred to as foundation models, have transformed multiple fields of research. However, smaller languages risk falling behind due to high training costs and small incentives for large companies to train these models. To combat this, the Danish Foundation Models project seeks to provide and maintain open, well-documented, and high-quality foundation models for the… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 4 pages, 2 tables

  4. arXiv:2304.00906  [pdf, other

    cs.CL cs.LG

    ScandEval: A Benchmark for Scandinavian Natural Language Processing

    Authors: Dan Saattrup Nielsen

    Abstract: This paper introduces a Scandinavian benchmarking platform, ScandEval, which can benchmark any pretrained model on four different tasks in the Scandinavian languages. The datasets used in two of the tasks, linguistic acceptability and question answering, are new. We develop and release a Python package and command-line interface, scandeval, which can benchmark any model that has been uploaded to t… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 17 pages, 11 figures, camera-ready NoDaLiDa 2023 submission

  5. arXiv:2210.09014  [pdf

    cs.CY cs.AI cs.LG cs.SI

    Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda

    Authors: Andrés Domínguez Hernández, Richard Owen, Dan Saattrup Nielsen, Ryan McConville

    Abstract: Machine learning (ML) enabled classification models are becoming increasingly popular for tackling the sheer volume and speed of online misinformation and other content that could be identified as harmful. In building these models, data scientists need to take a stance on the legitimacy, authoritativeness and objectivity of the sources of ``truth" used for model training and testing. This has poli… ▽ More

    Submitted 13 April, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Andrés Domínguez Hernández, Richard Owen, Dan Saattrup Nielsen and Ryan McConville. 2023. Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda. Accepted in 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23), June 12-15, 2023, Chicago, United States of America. ACM, New York, NY, USA, 16 pages

  6. arXiv:2202.11684  [pdf, other

    cs.LG cs.CL cs.CY cs.IR cs.SI

    MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset

    Authors: Dan Saattrup Nielsen, Ryan McConville

    Abstract: Misinformation is becoming increasingly prevalent on social media and in news articles. It has become so widespread that we require algorithmic assistance utilising machine learning to detect such content. Training these machine learning models require datasets of sufficient scale, diversity and quality. However, datasets in the field of automatic misinformation detection are predominantly monolin… ▽ More

    Submitted 8 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: 9+3 pages

  7. arXiv:2201.11676  [pdf, other

    cs.LG stat.ML

    Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap

    Authors: Carlos Mougan, Dan Saattrup Nielsen

    Abstract: Monitoring machine learning models once they are deployed is challenging. It is even more challenging to decide when to retrain models in real-case scenarios when labeled data is beyond reach, and monitoring performance metrics becomes unfeasible. In this work, we use non-parametric bootstrapped uncertainty estimates and SHAP values to provide explainable uncertainty estimation as a technique that… ▽ More

    Submitted 22 November, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 7+6 pages. Accepted at AAAI'23 Safe and Robust AI track