Skip to main content

Showing 1–5 of 5 results for author: Lindström, A D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18346  [pdf, ps, other

    cs.AI

    AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations

    Authors: Adam Dahlgren Lindström, Leila Methnani, Lea Krause, Petter Ericson, Íñigo Martínez de Rituerto de Troya, Dimitri Coelho Mollo, Roel Dobbe

    Abstract: This paper critically evaluates the attempts to align Artificial Intelligence (AI) systems, especially Large Language Models (LLMs), with human values and intentions through Reinforcement Learning from Feedback (RLxF) methods, involving either human feedback (RLHF) or AI feedback (RLAIF). Specifically, we show the shortcomings of the broadly pursued alignment goals of honesty, harmlessness, and he… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 12 pages, 1 table, to be submitted

  2. arXiv:2304.11217  [pdf, other

    cs.CY cs.AI

    ACROCPoLis: A Descriptive Framework for Making Sense of Fairness

    Authors: Andrea Aler Tubella, Dimitri Coelho Mollo, Adam Dahlgren Lindström, Hannah Devinney, Virginia Dignum, Petter Ericson, Anna Jonsson, Timotheus Kampik, Tom Lenaerts, Julian Alfredo Mendez, Juan Carlos Nieves

    Abstract: Fairness is central to the ethical and responsible development and use of AI systems, with a large number of frameworks and formal notions of algorithmic fairness being available. However, many of the fairness solutions proposed revolve around technical considerations and not the needs of and consequences for the most impacted communities. We therefore want to take the focus away from definitions… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: To appear in the proceedings of ACM FAccT 2023

  3. arXiv:2208.05358  [pdf, other

    cs.LG cs.CL cs.CV

    CLEVR-Math: A Dataset for Compositional Language, Visual and Mathematical Reasoning

    Authors: Adam Dahlgren Lindström, Savitha Sam Abraham

    Abstract: We introduce CLEVR-Math, a multi-modal math word problems dataset consisting of simple math word problems involving addition/subtraction, represented partly by a textual description and partly by an image illustrating the scenario. The text describes actions performed on the scene that is depicted in the image. Since the question posed may not be about the scene in the image, but about the state o… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: NeSy 2022, 16th International Workshop on Neural-Symbolic Learning and Reasoning, Cumberland Lodge, Windsor, UK

    ACM Class: I.2.7; I.2.10; I.2.6; I.4.8; I.1.4

  4. arXiv:2204.02813  [pdf, other

    cs.CL cs.FL cs.LO

    An Algebraic Approach to Learning and Grounding

    Authors: Johanna Björklund, Adam Dahlgren Lindström, Frank Drewes

    Abstract: We consider the problem of learning the semantics of composite algebraic expressions from examples. The outcome is a versatile framework for studying learning tasks that can be put into the following abstract form: The input is a partial algebra $\alg$ and a finite set of examples $(\varphi_1, O_1), (\varphi_2, O_2), \ldots$, each consisting of an algebraic term $\varphi_i$ and a set of objects~… ▽ More

    Submitted 4 July, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: Accepted to LearnAut 2022 at ICALP 2022

    ACM Class: I.2.4; I.2.7; I.2.10; I.1.3; I.2

  5. Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case

    Authors: Adam Dahlgren Lindström, Suna Bensch, Johanna Björklund, Frank Drewes

    Abstract: Semantic embeddings have advanced the state of the art for countless natural language processing tasks, and various extensions to multimodal domains, such as visual-semantic embeddings, have been proposed. While the power of visual-semantic embeddings comes from the distillation and enrichment of information through machine learning, their inner workings are poorly understood and there is a shorta… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Submitted July 1 2020, COLING 2020 main conference