Skip to main content

Showing 1–7 of 7 results for author: Kadra, A

.
  1. arXiv:2402.03970  [pdf, other

    cs.LG cs.AI

    Tabular Data: Is Attention All You Need?

    Authors: Guri Zabërgja, Arlind Kadra, Josif Grabocka

    Abstract: Deep Learning has revolutionized the field of AI and led to remarkable achievements in applications involving image and text data. Unfortunately, there is inconclusive evidence on the merits of neural networks for structured tabular data. In this paper, we introduce a large-scale empirical study comparing neural networks against gradient-boosted decision trees on tabular data, but also transformer… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  2. arXiv:2306.03828  [pdf, other

    cs.LG

    Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How

    Authors: Sebastian Pineda Arango, Fabio Ferreira, Arlind Kadra, Frank Hutter, Josif Grabocka

    Abstract: With the ever-increasing number of pretrained models, machine learning practitioners are continuously faced with which pretrained model to use, and how to finetune it for a new dataset. In this paper, we propose a methodology that jointly searches for the optimal pretrained model and the hyperparameters for finetuning it. Our method transfers knowledge about the performance of many pretrained mode… ▽ More

    Submitted 22 February, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

  3. arXiv:2305.13072  [pdf, other

    cs.LG

    Breaking the Paradox of Explainable Deep Learning

    Authors: Arlind Kadra, Sebastian Pineda Arango, Josif Grabocka

    Abstract: Deep Learning has achieved tremendous results by pushing the frontier of automation in diverse domains. Unfortunately, current neural network architectures are not explainable by design. In this paper, we propose a novel method that trains deep hypernetworks to generate explainable linear models. Our models retain the accuracy of black-box deep networks while offering free lunch explainability by… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  4. arXiv:2302.00441  [pdf, other

    cs.LG

    Scaling Laws for Hyperparameter Optimization

    Authors: Arlind Kadra, Maciej Janowski, Martin Wistuba, Josif Grabocka

    Abstract: Hyperparameter optimization is an important subfield of machine learning that focuses on tuning the hyperparameters of a chosen algorithm to achieve peak performance. Recently, there has been a stream of methods that tackle the issue of hyperparameter optimization, however, most of the methods do not exploit the dominant power law nature of learning curves for Bayesian optimization. In this work,… ▽ More

    Submitted 25 October, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted at NeurIPS 2023

  5. arXiv:2202.09774  [pdf, other

    cs.LG cs.AI

    Supervising the Multi-Fidelity Race of Hyperparameter Configurations

    Authors: Martin Wistuba, Arlind Kadra, Josif Grabocka

    Abstract: Multi-fidelity (gray-box) hyperparameter optimization techniques (HPO) have recently emerged as a promising direction for tuning Deep Learning methods. However, existing methods suffer from a sub-optimal allocation of the HPO budget to the hyperparameter configurations. In this work, we introduce DyHPO, a Bayesian Optimization method that learns to decide which hyperparameter configuration to trai… ▽ More

    Submitted 1 June, 2023; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: Accepted at NeurIPS 2022

  6. arXiv:2106.11189  [pdf, other

    cs.LG

    Well-tuned Simple Nets Excel on Tabular Datasets

    Authors: Arlind Kadra, Marius Lindauer, Frank Hutter, Josif Grabocka

    Abstract: Tabular datasets are the last "unconquered castle" for deep learning, with traditional ML methods like Gradient-Boosted Decision Trees still performing strongly even against recent specialized neural architectures. In this paper, we hypothesize that the key to boosting the performance of neural networks lies in rethinking the joint and simultaneous application of a large set of modern regularizati… ▽ More

    Submitted 5 November, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

  7. arXiv:1911.02490  [pdf, other

    cs.LG stat.ML

    OpenML-Python: an extensible Python API for OpenML

    Authors: Matthias Feurer, Jan N. van Rijn, Arlind Kadra, Pieter Gijsbers, Neeratyoy Mallik, Sahithya Ravi, Andreas Müller, Joaquin Vanschoren, Frank Hutter

    Abstract: OpenML is an online platform for open science collaboration in machine learning, used to share datasets and results of machine learning experiments. In this paper we introduce OpenML-Python, a client API for Python, opening up the OpenML platform for a wide range of Python-based tools. It provides easy access to all datasets, tasks and experiments on OpenML from within Python. It also provides fun… ▽ More

    Submitted 23 June, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

    Journal ref: Journal of Machine Learning Research 22(100), 2021