Skip to main content

Showing 1–10 of 10 results for author: Beyer, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14035  [pdf, other

    cs.CL cs.AI

    Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models

    Authors: Sherzod Hakimov, Yerkezhan Abdullayeva, Kushal Koshti, Antonia Schmidt, Yan Weiser, Anne Beyer, David Schlangen

    Abstract: While the situation has improved for text-only models, it again seems to be the case currently that multimodal (text and image) models develop faster than ways to evaluate them. In this paper, we bring a recently developed evaluation paradigm from text models to multimodal models, namely evaluation through the goal-oriented game (self) play, complementing reference-based and preference-based evalu… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: under review

  2. arXiv:2406.05561  [pdf, other

    cs.HC

    Learning Human Detected Differences in Directed Acyclic Graphs

    Authors: Kathrin Guckes, Alena Beyer, Prof. Margit Pohl, Prof. Tatiana von Landesberger

    Abstract: Prior research has shown that human perception of similarity differs from mathematical measures in visual comparison tasks, including those involving directed acyclic graphs. This divergence can lead to missed differences and skepticism about algorithmic results. To address this, we aim to learn the structural differences humans detect in graphs visually. We want to visualize these human-detected… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  3. arXiv:2405.20859  [pdf, other

    cs.CL cs.AI

    clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents

    Authors: Anne Beyer, Kranti Chalamalasetti, Sherzod Hakimov, Brielen Madureira, Philipp Sadler, David Schlangen

    Abstract: It has been established in recent work that Large Language Models (LLMs) can be prompted to "self-play" conversational games that probe certain capabilities (general instruction following, strategic goal orientation, language understanding abilities), where the resulting interactive game play can be automatically scored. In this paper, we take one of the proposed frameworks for setting up such gam… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: under review

  4. arXiv:2308.06095  [pdf, other

    cs.CL cs.AI cs.LG

    Neural Conversation Models and How to Rein Them in: A Survey of Failures and Fixes

    Authors: Fabian Galetzka, Anne Beyer, David Schlangen

    Abstract: Recent conditional language models are able to continue any kind of text source in an often seemingly fluent way. This fact encouraged research in the area of open-domain conversational systems that are based on powerful language models and aim to imitate an interlocutor by generating appropriate contributions to a written dialogue. From a linguistic perspective, however, the complexity of contrib… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Represents the state of the field in 2022; partially based on the first authors 2022 PhD thesis

  5. arXiv:2203.12111  [pdf

    cs.AI

    Muscle Vision: Real Time Keypoint Based Pose Classification of Physical Exercises

    Authors: Alex Moran, Bart Gebka, Joshua Goldshteyn, Autumn Beyer, Nathan Johnson, Alexander Neuwirth

    Abstract: Recent advances in machine learning technology have enabled highly portable and performant models for many common tasks, especially in image recognition. One emerging field, 3D human pose recognition extrapolated from video, has now advanced to the point of enabling real-time software applications with robust enough output to support downstream machine learning tasks. In this work we propose a new… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: Published in MICS 2022

  6. arXiv:2105.03495  [pdf, other

    cs.CL

    Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

    Authors: Anne Beyer, Sharid Loáiciga, David Schlangen

    Abstract: Coherent discourse is distinguished from a mere collection of utterances by the satisfaction of a diverse set of constraints, for example choice of expression, logical relation between denoted events, and implicit compatibility with world-knowledge. Do neural language models encode such constraints? We design an extendable set of test suites addressing different aspects of discourse and dialogue c… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted as long paper at NAACL 2021

  7. White Paper on Crowdsourced Network and QoE Measurements -- Definitions, Use Cases and Challenges

    Authors: Tobias Hoßfeld, Stefan Wunderer, André Beyer, Andrew Hall, Anika Schwind, Christian Gassner, Fabrice Guillemin, Florian Wamser, Krzysztof Wascinski, Matthias Hirth, Michael Seufert, Pedro Casas, Phuoc Tran-Gia, Werner Robitza, Wojciech Wascinski, Zied Ben Houidi

    Abstract: This white paper is the outcome of the Würzburg seminar on "Crowdsourced Network and QoE Measurements" which took place from 25-26 September 2019 in Würzburg, Germany. International experts were invited from industry and academia. They are well known in their communities, having different backgrounds in crowdsourcing, mobile networks, network measurements, network performance, Quality of Service (… ▽ More

    Submitted 25 May, 2020; originally announced June 2020.

  8. Learning to play the Chess Variant Crazyhouse above World Champion Level with Deep Neural Networks and Human Data

    Authors: Johannes Czech, Moritz Willig, Alena Beyer, Kristian Kersting, Johannes Fürnkranz

    Abstract: Deep neural networks have been successfully applied in learning the board games Go, chess and shogi without prior knowledge by making use of reinforcement learning. Although starting from zero knowledge has been shown to yield impressive results, it is associated with high computationally costs especially for complex games. With this paper, we present CrazyAra which is a neural network based engin… ▽ More

    Submitted 22 August, 2019; v1 submitted 19 August, 2019; originally announced August 2019.

    Comments: 35 pages, 19 figures, 14 tables

    Journal ref: Frontiers in Artificial Intelligence, Machine Learning and Artificial Intelligence, Volume 3 (2020)

  9. Ten simple rules for measuring the impact of workshops

    Authors: Shoaib Sufi, Beth Duckles, Iveta Simera, Terhi Nurmikko-Fuller, Louisa Bellis, Wadud Miah, Adriana Wilde, Aleksandra Nenadic, Raniere Silva, Jennifer A. de Beyer, Caroline Struthers, Iain Emsley, Olivier Philippe, Melissa Balzano, Sara Coelho, Heather Ford, Catherine Jones, Vanessa Higgins

    Abstract: Workshops are used to explore a specific topic, transfer knowledge, solve identified problems or create something new. In funded research projects and other research endeavours, workshops are the mechanism to gather the wider project, community or interested people together around a particular topic. However, natural questions arise: how do we measure the impact of these workshops? Do we know whet… ▽ More

    Submitted 9 May, 2018; originally announced May 2018.

  10. arXiv:1404.6583  [pdf, other

    cs.CG

    ILATO Project: Fusion of Optical Surface Models and Volumetric CT Data

    Authors: Andreas Beyer, Hubert Mara, Susanne Krömker

    Abstract: Project ILATO focuses on Improving Limited Angle computed Tomography by Optical data integration in order to enhance image quality and shorten acquisition times in X-ray based industrial quality inspection. Limited angle computed tomography is indicated whenever specimen dimensions exceed cone beam limits or the object is impenetrable from certain angles. Thus, acquiring only a subset of a full ci… ▽ More

    Submitted 25 April, 2014; originally announced April 2014.

    Comments: Part of the OAGM 2014 proceedings (arXiv:1404.3538)

    Report number: OAGM/2014/17