Skip to main content

Showing 1–4 of 4 results for author: Segura, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2010.06076  [pdf, ps, other

    cs.LG cs.AI cs.IT stat.ML

    An Information-Theoretic Perspective on Overfitting and Underfitting

    Authors: Daniel Bashir, George D. Montanez, Sonia Sehra, Pedro Sandoval Segura, Julius Lauw

    Abstract: We present an information-theoretic framework for understanding overfitting and underfitting in machine learning and prove the formal undecidability of determining whether an arbitrary classification algorithm will overfit a dataset. Measuring algorithm capacity via the information transferred from datasets to models, we consider mismatches between algorithm capacities and datasets to provide a si… ▽ More

    Submitted 6 November, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: Accepted for presentation at The 33rd Australasian Joint Conference on Artificial Intelligence (AJCAI 2020), November 29-30, 2020

  2. arXiv:2005.03320  [pdf, other

    cs.SE

    Specification and Automated Analysis of Inter-Parameter Dependencies in Web APIs

    Authors: Alberto Martin-Lopez, Sergio Segura, Carlos Müller, Antonio Ruiz-Cortés

    Abstract: Web services often impose inter-parameter dependencies that restrict the way in which two or more input parameters can be combined to form valid calls to the service. Unfortunately, current specification languages for web services like the OpenAPI Specification (OAS) provide no support for the formal description of such dependencies, which makes it hardly possible to automatically discover and int… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    ACM Class: H.0; D.2

  3. The Labeling Distribution Matrix (LDM): A Tool for Estimating Machine Learning Algorithm Capacity

    Authors: Pedro Sandoval Segura, Julius Lauw, Daniel Bashir, Kinjal Shah, Sonia Sehra, Dominique Macias, George Montanez

    Abstract: Algorithm performance in supervised learning is a combination of memorization, generalization, and luck. By estimating how much information an algorithm can memorize from a dataset, we can set a lower bound on the amount of performance due to other factors such as generalization and luck. With this goal in mind, we introduce the Labeling Distribution Matrix (LDM) as a tool for estimating the capac… ▽ More

    Submitted 6 January, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

    Comments: Accepted to 12th International Conference on Agents and Artificial Intelligence (ICAART 2020), 7 pages including references

  4. arXiv:1804.11121  [pdf, other

    cs.SE

    Towards the Automation of Metamorphic Testing in Model Transformations

    Authors: Javier Troya, Sergio Segura, Antonio Ruiz-Cortés

    Abstract: Model transformations are the cornerstone of Model-Driven Engineering, and provide the essential mechanisms for manipulating and transforming models. Checking whether the output of a model transformation is correct is a manual and error-prone task, this is referred to as the oracle problem in the software testing literature. The correctness of the model transformation program is crucial for the pr… ▽ More

    Submitted 30 April, 2018; originally announced April 2018.

    Comments: Jornadas de Ingeniería del Software y Bases de Datos (JISBD) 2016