Skip to main content

Showing 1–11 of 11 results for author: Martínez-Plumed, F

.
  1. arXiv:2310.06167  [pdf

    cs.AI

    Predictable Artificial Intelligence

    Authors: Lexin Zhou, Pablo A. Moreno-Casares, Fernando Martínez-Plumed, John Burden, Ryan Burnell, Lucy Cheke, Cèsar Ferri, Alexandru Marcoci, Behzad Mehrbakhsh, Yael Moros-Daval, Seán Ó hÉigeartaigh, Danaja Rutar, Wout Schellaert, Konstantinos Voudouris, José Hernández-Orallo

    Abstract: We introduce the fundamental ideas and challenges of Predictable AI, a nascent research area that explores the ways in which we can anticipate key indicators of present and future AI ecosystems. We argue that achieving predictability is crucial for fostering trust, liability, control, alignment and safety of AI ecosystems, and thus should be prioritised over performance. While distinctive from oth… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 11 pages excluding references, 4 figures, and 2 tables. Paper Under Review

    MSC Class: ACM-class: I.2

  2. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  3. Compute and Energy Consumption Trends in Deep Learning Inference

    Authors: Radosvet Desislavov, Fernando Martínez-Plumed, José Hernández-Orallo

    Abstract: The progress of some AI paradigms such as deep learning is said to be linked to an exponential growth in the number of parameters. There are many studies corroborating these trends, but does this translate into an exponential increase in energy consumption? In order to answer this question we focus on inference costs rather than training costs, as the former account for most of the computing effor… ▽ More

    Submitted 29 March, 2023; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: For a revised version and its published version refer to: Desislavov, Radosvet, Fernando Martínez-Plumed, and José Hernández-Orallo. Trends in AI inference energy consumption: Beyond the performance-vs-parameter laws of deep learning. Sustainable Computing: Informatics and Systems, Volume 38, April 2023. (https://doi.org/10.1016/j.suscom.2023.100857)

    Journal ref: "Trends in AI inference energy consumption: Beyond the performance-vs-parameter laws of deep learning" Sustainable Computing: Informatics and Systems (2023). Volume 38, April 2023, 100857

  4. arXiv:1905.12728  [pdf, other

    cs.LG cs.AI stat.ML

    Fairness and Missing Values

    Authors: Fernando Martínez-Plumed, Cèsar Ferri, David Nieves, José Hernández-Orallo

    Abstract: The causes underlying unfair decision making are complex, being internalised in different ways by decision makers, other actors dealing with data and models, and ultimately by the individuals being affected by these decisions. One frequent manifestation of all these latent causes arises in the form of missing values: protected groups are more reluctant to give information that could be used agains… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: Preprint submitted to Decision Support Systems Journal

  5. Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them

    Authors: Fernando Martínez-Plumed, José Hernández-Orallo

    Abstract: Item response theory (IRT) can be applied to the analysis of the evaluation of results from AI benchmarks. The two-parameter IRT model provides two indicators (difficulty and discrimination) on the side of the item (or AI problem) while only one indicator (ability) on the side of the respondent (or AI agent). In this paper we analyse how to make this set of indicators dual, by adding a fourth indi… ▽ More

    Submitted 22 March, 2019; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: This report is a preliminary version of a related paper with title "Dual Indicators to Analyse AI Benchmarks: Difficulty, Discrimination, Ability and Generality", accepted for publication at IEEE Transactions on Games. Please refer to and cite the journal paper (https://doi.org/10.1109/TG.2018.2883773)

    Journal ref: IEEE Transactions on Games, 2018

  6. arXiv:1809.10054  [pdf, other

    cs.AI cs.DB

    General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation

    Authors: Lidia Contreras-Ochando, César Ferri, José Hernández-Orallo, Fernando Martínez-Plumed, María José Ramírez-Quintana, Susumu Katayama

    Abstract: Given one or two examples, humans are good at understanding how to solve a problem independently of its domain, because they are able to detect what the problem is and to choose the appropriate background knowledge according to the context. For instance, presented with the string "8/17/2017" to be transformed to "17th of August of 2017", humans will process this in two steps: (1) they recognise th… ▽ More

    Submitted 26 September, 2018; originally announced September 2018.

    Comments: 24 pages

  7. arXiv:1807.02416  [pdf, other

    cs.AI cs.CY

    A multidisciplinary task-based perspective for evaluating the impact of AI autonomy and generality on the future of work

    Authors: Enrique Fernández-Macías, Emilia Gómez, José Hernández-Orallo, Bao Sheng Loe, Bertin Martens, Fernando Martínez-Plumed, Songül Tolan

    Abstract: This paper presents a multidisciplinary task approach for assessing the impact of artificial intelligence on the future of work. We provide definitions of a task from two main perspectives: socio-economic and computational. We propose to explore ways in which we can integrate or map these perspectives, and link them with the skills or capabilities required by them, for humans and AI systems. Final… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: AEGAP2018 Workshop at ICML 2018, 7 pages, 1 table

    MSC Class: 68T99

  8. arXiv:1806.00610  [pdf, other

    cs.AI

    Between Progress and Potential Impact of AI: the Neglected Dimensions

    Authors: Fernando Martínez-Plumed, Shahar Avin, Miles Brundage, Allan Dafoe, Sean Ó hÉigeartaigh, José Hernández-Orallo

    Abstract: We reframe the analysis of progress in AI by incorporating into an overall framework both the task performance of a system, and the time and resource costs incurred in the development and deployment of the system. These costs include: data, expert knowledge, human oversight, software resources, computing cycles, hardware and network facilities, and (what kind of) time. These costs are distributed… ▽ More

    Submitted 2 July, 2022; v1 submitted 2 June, 2018; originally announced June 2018.

  9. arXiv:1709.09003  [pdf, other

    cs.DB

    CASP-DM: Context Aware Standard Process for Data Mining

    Authors: Fernando Martínez-Plumed, Lidia Contreras-Ochando, Cèsar Ferri, Peter Flach, José Hernández-Orallo, Meelis Kull, Nicolas Lachiche, María José Ramírez-Quintana

    Abstract: We propose an extension of the Cross Industry Standard Process for Data Mining (CRISPDM) which addresses specific challenges of machine learning and data mining for context and model reuse handling. This new general context-aware process model is mapped with CRISP-DM reference model proposing some new or enhanced outputs.

    Submitted 19 September, 2017; originally announced September 2017.

  10. arXiv:1502.05615  [pdf, other

    cs.AI

    Forgetting and consolidation for incremental and cumulative knowledge acquisition systems

    Authors: Fernando Martínez-Plumed, Cèsar Ferri, José Hernández-Orallo, María José Ramírez-Quintana

    Abstract: The application of cognitive mechanisms to support knowledge acquisition is, from our point of view, crucial for making the resulting models coherent, efficient, credible, easy to use and understandable. In particular, there are two characteristic features of intelligence that are essential for knowledge development: forgetting and consolidation. Both plays an important role in knowledge bases and… ▽ More

    Submitted 19 February, 2015; originally announced February 2015.

  11. arXiv:1311.4235  [pdf, other

    cs.LG

    On the definition of a general learning system with user-defined operators

    Authors: Fernando Martínez-Plumed, Cèsar Ferri, José Hernández-Orallo, María-José Ramírez-Quintana

    Abstract: In this paper, we push forward the idea of machine learning systems whose operators can be modified and fine-tuned for each problem. This allows us to propose a learning paradigm where users can write (or adapt) their operators, according to the problem, data representation and the way the information should be navigated. To achieve this goal, data instances, background knowledge, rules, programs… ▽ More

    Submitted 17 November, 2013; originally announced November 2013.