Skip to main content

Showing 1–10 of 10 results for author: Nielsen, D S

.
  1. arXiv:2406.13469  [pdf, other

    cs.CL cs.AI cs.LG

    Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks

    Authors: Dan Saattrup Nielsen, Kenneth Enevoldsen, Peter Schneider-Kamp

    Abstract: This paper explores the performance of encoder and decoder language models on multilingual Natural Language Understanding (NLU) tasks, with a broad focus on Germanic languages. Building upon the ScandEval benchmark, which initially was restricted to evaluating encoder models, we extend the evaluation framework to include decoder models. We introduce a method for evaluating decoder models on NLU ta… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 14 pages, 2 figures

    ACM Class: I.2.7

  2. arXiv:2311.09145  [pdf, other

    cs.LG stat.ML

    Model Agnostic Explainable Selective Regression via Uncertainty Estimation

    Authors: Andrea Pugnana, Carlos Mougan, Dan Saattrup Nielsen

    Abstract: With the wide adoption of machine learning techniques, requirements have evolved beyond sheer high performance, often requiring models to be trustworthy. A common approach to increase the trustworthiness of such systems is to allow them to refrain from predicting. Such a framework is known as selective prediction. While selective prediction for classification tasks has been widely analyzed, the pr… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  3. arXiv:2311.07264  [pdf, other

    cs.CL

    Danish Foundation Models

    Authors: Kenneth Enevoldsen, Lasse Hansen, Dan S. Nielsen, Rasmus A. F. Egebæk, Søren V. Holm, Martin C. Nielsen, Martin Bernstorff, Rasmus Larsen, Peter B. Jørgensen, Malte Højmark-Bertelsen, Peter B. Vahlstrup, Per Møldrup-Dalum, Kristoffer Nielbo

    Abstract: Large language models, sometimes referred to as foundation models, have transformed multiple fields of research. However, smaller languages risk falling behind due to high training costs and small incentives for large companies to train these models. To combat this, the Danish Foundation Models project seeks to provide and maintain open, well-documented, and high-quality foundation models for the… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 4 pages, 2 tables

  4. arXiv:2304.00906  [pdf, other

    cs.CL cs.LG

    ScandEval: A Benchmark for Scandinavian Natural Language Processing

    Authors: Dan Saattrup Nielsen

    Abstract: This paper introduces a Scandinavian benchmarking platform, ScandEval, which can benchmark any pretrained model on four different tasks in the Scandinavian languages. The datasets used in two of the tasks, linguistic acceptability and question answering, are new. We develop and release a Python package and command-line interface, scandeval, which can benchmark any model that has been uploaded to t… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 17 pages, 11 figures, camera-ready NoDaLiDa 2023 submission

  5. arXiv:2210.09014  [pdf

    cs.CY cs.AI cs.LG cs.SI

    Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda

    Authors: Andrés Domínguez Hernández, Richard Owen, Dan Saattrup Nielsen, Ryan McConville

    Abstract: Machine learning (ML) enabled classification models are becoming increasingly popular for tackling the sheer volume and speed of online misinformation and other content that could be identified as harmful. In building these models, data scientists need to take a stance on the legitimacy, authoritativeness and objectivity of the sources of ``truth" used for model training and testing. This has poli… ▽ More

    Submitted 13 April, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Andrés Domínguez Hernández, Richard Owen, Dan Saattrup Nielsen and Ryan McConville. 2023. Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda. Accepted in 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23), June 12-15, 2023, Chicago, United States of America. ACM, New York, NY, USA, 16 pages

  6. arXiv:2202.11684  [pdf, other

    cs.LG cs.CL cs.CY cs.IR cs.SI

    MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset

    Authors: Dan Saattrup Nielsen, Ryan McConville

    Abstract: Misinformation is becoming increasingly prevalent on social media and in news articles. It has become so widespread that we require algorithmic assistance utilising machine learning to detect such content. Training these machine learning models require datasets of sufficient scale, diversity and quality. However, datasets in the field of automatic misinformation detection are predominantly monolin… ▽ More

    Submitted 8 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: 9+3 pages

  7. arXiv:2201.11676  [pdf, other

    cs.LG stat.ML

    Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap

    Authors: Carlos Mougan, Dan Saattrup Nielsen

    Abstract: Monitoring machine learning models once they are deployed is challenging. It is even more challenging to decide when to retrain models in real-case scenarios when labeled data is beyond reach, and monitoring performance metrics becomes unfeasible. In this work, we use non-parametric bootstrapped uncertainty estimates and SHAP values to provide explainable uncertainty estimation as a technique that… ▽ More

    Submitted 22 November, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 7+6 pages. Accepted at AAAI'23 Safe and Robust AI track

  8. arXiv:2109.06079  [pdf, other

    math.LO

    The Virtual Large Cardinal Hierarchy

    Authors: Stamatis Dimopoulos, Victoria Gitman, Dan Saattrup Nielsen

    Abstract: We continue the study of the virtual large cardinal hierarchy by analysing virtual versions of superstrong, Woodin, and Berkeley cardinals. Gitman and Schindler showed that virtualizations of strong and supercompact cardinals yield the same large cardinal notion. We provide various equivalent characterizations of virtually Woodin cardinals, including showing that On is virtually Woodin if and only… ▽ More

    Submitted 6 May, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Published version at Fundamenta Mathematicae. 29 pages, 3 figures

  9. arXiv:1804.10383  [pdf, other

    math.LO

    Games and Ramsey-like cardinals

    Authors: Dan Saattrup Nielsen, Philip Welch

    Abstract: We generalise the $α$-Ramsey cardinals introduced in Holy and Schlicht (2018) for cardinals $α$ to arbitrary ordinals $α$, and answer several questions posed in that paper. In particular, we show that $α$-Ramseys are downwards absolute to the core model $K$ for all $α$ of uncountable cofinality, that strategic $ω$-Ramsey cardinals are equiconsistent with remarkable cardinals and that strategic… ▽ More

    Submitted 30 October, 2018; v1 submitted 27 April, 2018; originally announced April 2018.

    Comments: 33 pages, 2 figures. Added Theorem 4.20 saying that strategic $(ω{+}1)$-Ramsey cardinals are equiconsistent with measurables, and fixed many typos. This version is forthcoming in the JSL

  10. Hot dense capsule implosion cores produced by z-pinch dynamic hohlraum radiation

    Authors: J. E. Bailey, G. A. Chandler, S. A. Slutz, I. Golovkin, P. W. Lake, J. J. MacFarlane, R. C. Mancini, T. J. Buris-Mog, G. Cooper, R. J. Leeper, T. A. Mehlhorn, T. C. Moore, T. J. Nash, D. S. Nielsen, C. L. Ruiz, D. G. Schroen, W. A. Varnum

    Abstract: Hot dense capsule implosions driven by z-pinch x-rays have been measured for the first time. A ~220 eV dynamic hohlraum imploded 1.7-2.1 mm diameter gas-filled CH capsules which absorbed up to ~20 kJ of x-rays. Argon tracer atom spectra were used to measure the Te~ 1keV electron temperature and the ne ~ 1-4 x10^23 cm-3 electron density. Spectra from multiple directions provide core symmetry esti… ▽ More

    Submitted 4 June, 2003; originally announced June 2003.

    Comments: submitted to Phys. Rev. Lett

    Report number: SAND2003-1722J