
Showing 1–50 of 56 results for author: Vanschoren, J

  1. arXiv:2405.16124

    cs.LG

    Unsupervised Meta-Learning via In-Context Learning

    Authors: Anna Vettoruzzo, Lorenzo Braccaioli, Joaquin Vanschoren, Marlena Nowaczyk

    Abstract: Unsupervised meta-learning aims to learn feature representations from unsupervised datasets that can transfer to downstream tasks with limited labeled data. In this paper, we propose a novel approach to unsupervised meta-learning that leverages the generalization abilities of in-context learning observed in transformer architectures. Our method reframes meta-learning as a sequence modeling problem…

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2404.12241

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller, et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu…

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2403.19546

    cs.LG cs.AI cs.DB cs.IR

    Croissant: A Metadata Format for ML-Ready Datasets

    Authors: Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, Pieter Gijsbers, Joan Giner-Miguelez, Nitisha Jain, Michael Kuchnik, Quentin Lhoest, Pierre Marcenac, Manil Maskey, Peter Mattson, Luis Oala, Pierre Ruyssen, Rajat Shinde, Elena Simperl, Goeffry Thomas, Slava Tykhonov, Joaquin Vanschoren, Jos van der Velde, Steffen Vogler, Carole-Jean Wu

    Abstract: Data is a critical resource for Machine Learning (ML), yet working with data remains a key friction point. This paper introduces Croissant, a metadata format for datasets that simplifies how data is used by ML tools and frameworks. Croissant makes datasets more discoverable, portable and interoperable, thereby addressing significant challenges in ML data management and responsible AI. Croissant is…

    Submitted 30 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Published in Proceedings of ACM SIGMOD/PODS'24 Data Management for End-to-End Machine Learning (DEEM) Workshop https://dl.acm.org/doi/10.1145/3650203.3663326

  4. arXiv:2403.14684

    cs.CV cs.LG

    FOCIL: Finetune-and-Freeze for Online Class Incremental Learning by Training Randomly Pruned Sparse Experts

    Authors: Murat Onur Yildirim, Elif Ceren Gok Yildirim, Decebal Constantin Mocanu, Joaquin Vanschoren

    Abstract: Class incremental learning (CIL) in an online continual learning setting strives to acquire knowledge on a series of novel classes from a data stream, using each data point only once for training. This is more realistic compared to offline modes, where it is assumed that all data from novel class(es) is readily available. Current online CIL approaches store a subset of the previous data which crea…

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2402.03038

    cs.LG cs.AI cs.CL

    Automatic Combination of Sample Selection Strategies for Few-Shot Learning

    Authors: Branislav Pecher, Ivan Srba, Maria Bielikova, Joaquin Vanschoren

    Abstract: In few-shot learning, such as meta-learning, few-shot fine-tuning or in-context learning, the limited number of samples used to train a model have a significant impact on the overall success. Although a large number of sample selection strategies exist, their impact on the performance of few-shot learning is not extensively known, as most of them have been so far evaluated in typical supervised se…

    Submitted 5 February, 2024; originally announced February 2024.

  6. arXiv:2401.05561

    cs.CL

    TrustLLM: Trustworthiness in Large Language Models

    Authors: Lichao Sun, Yue Huang, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, et al. (45 additional authors not shown)

    Abstract: Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in…

    Submitted 17 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: This work is still under work and we welcome your contribution

  7. arXiv:2311.13028

    cs.LG cs.AI cs.DC eess.SP

    DMLR: Data-centric Machine Learning Research -- Past, Present and Future

    Authors: Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, et al. (13 additional authors not shown)

    Abstract: Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods tow…

    Submitted 1 June, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Published in the Journal of Data-centric Machine Learning Research (DMLR) at https://data.mlr.press/assets/pdf/v01-5.pdf

  8. arXiv:2311.11963

    cs.LG cs.CV

    What Can AutoML Do For Continual Learning?

    Authors: Mert Kilickaya, Joaquin Vanschoren

    Abstract: This position paper outlines the potential of AutoML for incremental (continual) learning to encourage more research in this direction. Incremental learning involves incorporating new data from a stream of tasks and distributions to learn enhanced deep representations and adapt better to new tasks. However, a significant limitation of incremental learners is that most current techniques freeze the…

    Submitted 20 November, 2023; originally announced November 2023.

  9. arXiv:2309.01561

    cs.CV

    Locality-Aware Hyperspectral Classification

    Authors: Fangqin Zhou, Mert Kilickaya, Joaquin Vanschoren

    Abstract: Hyperspectral image classification is gaining popularity for high-precision vision tasks in remote sensing, thanks to their ability to capture visual information available in a wide continuum of spectra. Researchers have been working on automating Hyperspectral image classification, with recent efforts leveraging Vision-Transformers. However, most research models only spectra information and lacks…

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: The paper is accepted at BMVC2023

  10. arXiv:2308.14831

    cs.LG cs.CV

    Continual Learning with Dynamic Sparse Training: Exploring Algorithms for Effective Model Updates

    Authors: Murat Onur Yildirim, Elif Ceren Gok Yildirim, Ghada Sokar, Decebal Constantin Mocanu, Joaquin Vanschoren

    Abstract: Continual learning (CL) refers to the ability of an intelligent system to sequentially acquire and retain knowledge from a stream of data with as little computational overhead as possible. To this end; regularization, replay, architecture, and parameter isolation approaches were introduced to the literature. Parameter isolation using a sparse network which enables to allocate distinct parts of the…

    Submitted 4 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  11. arXiv:2307.04722

    cs.LG

    Advances and Challenges in Meta-Learning: A Technical Review

    Authors: Anna Vettoruzzo, Mohamed-Rafik Bouguelia, Joaquin Vanschoren, Thorsteinn Rögnvaldsson, KC Santosh

    Abstract: Meta-learning empowers learning systems with the ability to acquire knowledge from multiple tasks, enabling faster adaptation and generalization to new tasks. This review provides a comprehensive technical overview of meta-learning, emphasizing its importance in real-world applications where data may be scarce or expensive to obtain. The paper covers the state-of-the-art meta-learning approaches a…

    Submitted 10 July, 2023; originally announced July 2023.

  12. arXiv:2307.03565

    cs.LG stat.ML

    MALIBO: Meta-learning for Likelihood-free Bayesian Optimization

    Authors: Jiarong Pan, Stefan Falkner, Felix Berkenkamp, Joaquin Vanschoren

    Abstract: Bayesian optimization (BO) is a popular method to optimize costly black-box functions. While traditional BO optimizes each new target task from scratch, meta-learning has emerged as a way to leverage knowledge from related tasks to optimize new tasks faster. However, existing meta-learning BO methods rely on surrogate models that suffer from scalability issues and are sensitive to observations wit…

    Submitted 28 June, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  13. arXiv:2304.08975

    cs.CV

    Neural Architecture Search for Visual Anomaly Segmentation

    Authors: Tommie Kerssies, Joaquin Vanschoren

    Abstract: This paper presents the first application of neural architecture search to the complex task of segmenting visual anomalies. Measurement of anomaly segmentation performance is challenging due to imbalanced anomaly pixels, varying region areas, and various types of anomalies. First, the region-weighted Average Precision (rwAP) metric is proposed as an alternative to existing metrics, which does not…

    Submitted 9 August, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: Main track paper for the International Conference on Automated Machine Learning (AutoML Conference), published in Proceedings of Machine Learning Research (PMLR), 2023

  14. arXiv:2303.13113

    cs.LG cs.CV

    AdaCL:Adaptive Continual Learning

    Authors: Elif Ceren Gok Yildirim, Murat Onur Yildirim, Mert Kilickaya, Joaquin Vanschoren

    Abstract: Class-Incremental Learning aims to update a deep classifier to learn new categories while maintaining or improving its accuracy on previously observed classes. Common methods to prevent forgetting previously learned classes include regularizing the neural network updates and storing exemplars in memory, which come with hyperparameters such as the learning rate, regularization strength, or the numb…

    Submitted 1 July, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Published in 1st ContinualAI Unconference

  15. Can Fairness be Automated? Guidelines and Opportunities for Fairness-aware AutoML

    Authors: Hilde Weerts, Florian Pfisterer, Matthias Feurer, Katharina Eggensperger, Edward Bergman, Noor Awad, Joaquin Vanschoren, Mykola Pechenizkiy, Bernd Bischl, Frank Hutter

    Abstract: The field of automated machine learning (AutoML) introduces techniques that automate parts of the development of machine learning (ML) systems, accelerating the process and reducing barriers for novices. However, decisions derived from ML models can reproduce, amplify, or even introduce unfairness in our societies, causing harm to (groups of) individuals. In response, researchers have started to p…

    Submitted 20 February, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Journal ref: Journal of Artificial Intelligence Research 79 (2024) 639-677

  16. arXiv:2302.08909

    cs.CV

    Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification

    Authors: Ihsan Ullah, Dustin Carrión-Ojeda, Sergio Escalera, Isabelle Guyon, Mike Huisman, Felix Mohr, Jan N van Rijn, Haozhe Sun, Joaquin Vanschoren, Phan Anh Vu

    Abstract: We introduce Meta-Album, an image classification meta-dataset designed to facilitate few-shot learning, transfer learning, meta-learning, among other tasks. It includes 40 open datasets, each having at least 20 classes with 40 examples per class, with verified licences. They stem from diverse domains, such as ecology (fauna and flora), manufacturing (textures, vehicles), human actions, and optical…

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks., NeurIPS, Nov 2022, New Orleans, United States

  17. arXiv:2301.11417

    cs.CV

    Are Labels Needed for Incremental Instance Learning?

    Authors: Mert Kilickaya, Joaquin Vanschoren

    Abstract: In this paper, we learn to classify visual object instances, incrementally and via self-supervision (self-incremental). Our learner observes a single instance at a time, which is then discarded from the dataset. Incremental instance learning is challenging, since longer learning sessions exacerbate forgetfulness, and labeling instances is cumbersome. We overcome these challenges via three contribu…

    Submitted 6 April, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Accepted at CVPRW on CLVISION (Oral)

  18. arXiv:2211.00376

    cs.LG cs.AI

    Automated Imbalanced Learning

    Authors: Prabhant Singh, Joaquin Vanschoren

    Abstract: Automated Machine Learning has grown very successful in automating the time-consuming, iterative tasks of machine learning model development. However, current methods struggle when the data is imbalanced. Since many real-world datasets are naturally imbalanced, and improper handling of this issue can lead to quite useless models, this issue should be handled carefully. This paper first introduces…

    Submitted 1 November, 2022; originally announced November 2022.

  19. arXiv:2211.00372

    cs.LG cs.AI

    Meta-Learning for Unsupervised Outlier Detection with Optimal Transport

    Authors: Prabhant Singh, Joaquin Vanschoren

    Abstract: Automated machine learning has been widely researched and adopted in the field of supervised classification and regression, but progress in unsupervised settings has been limited. We propose a novel approach to automate outlier detection based on meta-learning from previous datasets with outliers. Our premise is that the selection of the optimal outlier detection technique depends on the inherent…

    Submitted 1 November, 2022; originally announced November 2022.

  20. arXiv:2208.08767

    cs.CV cs.LG

    Evaluating Continual Test-Time Adaptation for Contextual and Semantic Domain Shifts

    Authors: Tommie Kerssies, Mert Kılıçkaya, Joaquin Vanschoren

    Abstract: In this paper, our goal is to adapt a pre-trained convolutional neural network to domain shifts at test time. We do so continually with the incoming stream of test batches, without labels. The existing literature mostly operates on artificial shifts obtained via adversarial perturbations of a test image. Motivated by this, we evaluate the state of the art on two realistic and challenging sources o…

    Submitted 12 March, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

  21. arXiv:2207.12560

    cs.LG stat.ML

    AMLB: an AutoML Benchmark

    Authors: Pieter Gijsbers, Marcos L. P. Bueno, Stefan Coors, Erin LeDell, Sébastien Poirier, Janek Thomas, Bernd Bischl, Joaquin Vanschoren

    Abstract: Comparing different AutoML frameworks is notoriously challenging and often done incorrectly. We introduce an open and extensible benchmark that follows best practices and avoids common mistakes when comparing AutoML frameworks. We conduct a thorough comparison of 9 well-known AutoML frameworks across 71 classification and 33 regression tasks. The differences between the AutoML frameworks are explo…

    Submitted 16 November, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: UNDER REVIEW: Revised submission to JMLR, with updated results from June 2023

  22. arXiv:2207.10062

    cs.LG

    DataPerf: Benchmarks for Data-Centric AI Development

    Authors: Mark Mazumder, Colby Banbury, Xiaozhe Yao, Bojan Karlaš, William Gaviria Rojas, Sudnya Diamos, Greg Diamos, Lynn He, Alicia Parrish, Hannah Rose Kirk, Jessica Quaye, Charvi Rastogi, Douwe Kiela, David Jurado, David Kanter, Rafael Mosquera, Juan Ciro, Lora Aroyo, Bilge Acun, Lingjiao Chen, Mehul Smriti Raje, Max Bartolo, Sabri Eyuboglu, Amirata Ghorbani, Emmett Goodman, et al. (20 additional authors not shown)

    Abstract: Machine learning research has long focused on models rather than datasets, and prominent datasets are used for common ML tasks without regard to the breadth, difficulty, and faithfulness of the underlying problems. Neglecting the fundamental importance of data has given rise to inaccuracy, bias, and fragility in real-world applications, and research is hindered by saturation across existing datase…

    Submitted 13 October, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2023 Datasets and Benchmarks Track

  23. arXiv:2206.06796

    cs.RO cs.AI

    Open-Ended Learning Strategies for Learning Complex Locomotion Skills

    Authors: Fangqin Zhou, Joaquin Vanschoren

    Abstract: Teaching robots to learn diverse locomotion skills under complex three-dimensional environmental settings via Reinforcement Learning (RL) is still challenging. It has been shown that training agents in simple settings before moving them on to complex settings improves the training process, but so far only in the context of relatively simple locomotion skills. In this work, we adapt the Enhanced Pa…

    Submitted 14 June, 2022; originally announced June 2022.

  24. arXiv:2206.05972

    cs.LG

    EmProx: Neural Network Performance Estimation For Neural Architecture Search

    Authors: G. G. H. Franken, P. Singh, J. Vanschoren

    Abstract: Common Neural Architecture Search methods generate large amounts of candidate architectures that need training in order to assess their performance and find an optimal architecture. To minimize the search time we use different performance estimation strategies. The effectiveness of such strategies varies in terms of accuracy and fit and query time. This study proposes a new method, EmProx Score (E…

    Submitted 13 June, 2022; originally announced June 2022.

  25. arXiv:2205.06355

    cs.LG

    Warm-starting DARTS using meta-learning

    Authors: Matej Grobelnik, Joaquin Vanschoren

    Abstract: Neural architecture search (NAS) has shown great promise in the field of automated machine learning (AutoML). NAS has outperformed hand-designed networks and made a significant step forward in the field of automating the design of deep neural networks, thus further reducing the need for human expertise. However, most research is done targeting a single specific task, leaving research of NAS method…

    Submitted 12 May, 2022; originally announced May 2022.

  26. arXiv:2202.01890

    cs.CV

    Advances in MetaDL: AAAI 2021 challenge and workshop

    Authors: Adrian El Baz, Isabelle Guyon, Zhengying Liu, Jan van Rijn, Sebastien Treguer, Joaquin Vanschoren

    Abstract: To stimulate advances in metalearning using deep learning techniques (MetaDL), we organized in 2021 a challenge and an associated workshop. This paper presents the design of the challenge and its results, and summarizes presentations made at the workshop. The challenge focused on few-shot learning classification tasks of small images. Participants' code submissions were run in a uniform manner, un…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Proceedings of Machine Learning Research, PMLR, 2021

  27. Online AutoML: An adaptive AutoML framework for online learning

    Authors: Bilge Celik, Prabhant Singh, Joaquin Vanschoren

    Abstract: Automated Machine Learning (AutoML) has been used successfully in settings where the learning task is assumed to be static. In many real-world scenarios, however, the data distribution will evolve over time, and it is yet to be shown whether AutoML techniques can effectively design online pipelines in dynamic environments. This study aims to automate pipeline design for online learning while conti…

    Submitted 7 December, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: 25 pages, 8 figures. Machine Learning S.I.: Automating Data Science (2022)

  28. arXiv:2201.05000

    cs.LG cs.AI

    Automated Reinforcement Learning: An Overview

    Authors: Reza Refaei Afshar, Yingqian Zhang, Joaquin Vanschoren, Uzay Kaymak

    Abstract: Reinforcement Learning and recently Deep Reinforcement Learning are popular methods for solving sequential decision making problems modeled as Markov Decision Processes. RL modeling of a problem and selecting algorithms and hyper-parameters require careful considerations as different configurations may entail completely different performances. These considerations are mainly the task of RL experts…

    Submitted 13 January, 2022; originally announced January 2022.

  29. arXiv:2111.03731

    cs.LG eess.SP

    Frugal Machine Learning

    Authors: Mikhail Evchenko, Joaquin Vanschoren, Holger H. Hoos, Marc Schoenauer, Michèle Sebag

    Abstract: Machine learning, already at the core of increasingly many systems and applications, is set to become even more ubiquitous with the rapid rise of wearable devices and the Internet of Things. In most machine learning applications, the main focus is on the quality of the results achieved (e.g., prediction accuracy), and hence vast amounts of data are being collected, requiring significant computatio…

    Submitted 5 November, 2021; originally announced November 2021.

  30. arXiv:2111.01868

    cs.LG cs.AI

    From Strings to Data Science: a Practical Framework for Automated String Handling

    Authors: John W. van Lith, Joaquin Vanschoren

    Abstract: Many machine learning libraries require that string features be converted to a numerical representation for the models to work as intended. Categorical string features can represent a wide variety of data (e.g., zip codes, names, marital status), and are notoriously difficult to preprocess automatically. In this paper, we propose a framework to do so based on best practices, domain knowledge, and…

    Submitted 4 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

  31. arXiv:2107.05940

    cs.CV

    Cats, not CAT scans: a study of dataset similarity in transfer learning for 2D medical image classification

    Authors: Irma van den Brandt, Floris Fok, Bas Mulders, Joaquin Vanschoren, Veronika Cheplygina

    Abstract: Transfer learning is a commonly used strategy for medical image classification, especially via pretraining on source data and fine-tuning on target data. There is currently no consensus on how to choose appropriate source data, and in the literature we can find both evidence of favoring large natural image datasets such as ImageNet, and evidence of favoring more specialized medical datasets. In th…

    Submitted 13 July, 2021; originally announced July 2021.

  32. Meta-Learning for Symbolic Hyperparameter Defaults

    Authors: Pieter Gijsbers, Florian Pfisterer, Jan N. van Rijn, Bernd Bischl, Joaquin Vanschoren

    Abstract: Hyperparameter optimization in machine learning (ML) deals with the problem of empirically learning an optimal algorithm configuration from data, usually formulated as a black-box optimization problem. In this work, we propose a zero-shot method to meta-learn symbolic default hyperparameter configurations that are expressed in terms of the properties of the dataset. This enables a much faster, but…

    Submitted 11 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Pieter Gijsbers and Florian Pfisterer contributed equally to the paper. V1: Two page GECCO poster paper accepted at GECCO 2021. V2: The original full length paper (8 pages) with appendix

  33. arXiv:2102.02147

    cs.CV cs.LG

    Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms

    Authors: Rishabh Goyal, Joaquin Vanschoren, Victor van Acht, Stephan Nijssen

    Abstract: Convolutional Neural Networks (CNNs) have proven to be a powerful state-of-the-art method for image classification tasks. One drawback however is the high computational complexity and high memory consumption of CNNs which makes them unfeasible for execution on embedded platforms which are constrained on physical resources needed to support CNNs. Quantization has often been used to efficiently opti…

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 39 Pages, 40 Figures, Appendix with Supplementary Figures

  34. arXiv:2101.02289

    cs.LG stat.ML

    Hyperboost: Hyperparameter Optimization by Gradient Boosting surrogate models

    Authors: Jeroen van Hoof, Joaquin Vanschoren

    Abstract: Bayesian Optimization is a popular tool for tuning algorithms in automatic machine learning (AutoML) systems. Current state-of-the-art methods leverage Random Forests or Gaussian processes to build a surrogate model that predicts algorithm performance given a certain set of hyperparameter settings. In this paper, we propose a new surrogate model based on gradient boosting, where we use quantile re…

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: ECMLPKDD 2019 Workshop on Automating Data Science

  35. arXiv:2101.01637

    cs.AI cs.HC cs.LG

    Theory-based Habit Modeling for Enhancing Behavior Prediction

    Authors: Chao Zhang, Joaquin Vanschoren, Arlette van Wissen, Daniel Lakens, Boris de Ruyter, Wijnand A. IJsselsteijn

    Abstract: Psychological theories of habit posit that when a strong habit is formed through behavioral repetition, it can trigger behavior automatically in the same environment. Given the reciprocal relationship between habit and behavior, changing lifestyle behaviors (e.g., toothbrushing) is largely a task of breaking old habits and creating new and healthy ones. Thus, representing users' habit strengths ca…

    Submitted 5 January, 2021; originally announced January 2021.

  36. arXiv:2012.02024

    cs.CV

    Aerial Imagery Pixel-level Segmentation

    Authors: Michael R. Heffels, Joaquin Vanschoren

    Abstract: Aerial imagery can be used for important work on a global scale. Nevertheless, the analysis of this data using neural network architectures lags behind the current state-of-the-art on popular datasets such as PASCAL VOC, CityScapes and Camvid. In this paper we bridge the performance-gap between these popular datasets and aerial imagery data. Little work is done on aerial imagery with state-of-the-…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 30 pages, 15 figures, 4 tables. Code available through GitHub repo at https://github.com/mrheffels/aerial-imagery-segmentation

    ACM Class: I.4.6

  37. arXiv:2007.07588

    cs.LG stat.ML

    Importance of Tuning Hyperparameters of Machine Learning Algorithms

    Authors: Hilde J. P. Weerts, Andreas C. Mueller, Joaquin Vanschoren

    Abstract: The performance of many machine learning algorithms depends on their hyperparameter settings. The goal of this study is to determine whether it is important to tune a hyperparameter or whether it can be safely set to a default value. We present a methodology to determine the importance of tuning a hyperparameter based on a non-inferiority test and tuning risk: the performance loss that is incurred…

    Submitted 15 July, 2020; originally announced July 2020.

  38. GAMA: a General Automated Machine learning Assistant

    Authors: Pieter Gijsbers, Joaquin Vanschoren

    Abstract: The General Automated Machine learning Assistant (GAMA) is a modular AutoML system developed to empower users to track and control how AutoML algorithms search for optimal machine learning pipelines, and facilitate AutoML research itself. In contrast to current, often black-box systems, GAMA allows users to plug in different AutoML and post-processing techniques, logs and visualizes the search pro…

    Submitted 7 October, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted at ECML-PKDD 2020 Demo Track

    Journal ref: Lecture Notes in Computer Science, vol 12461 (2021). p560-564

  39. Adaptation Strategies for Automated Machine Learning on Evolving Data

    Authors: Bilge Celik, Joaquin Vanschoren

    Abstract: Automated Machine Learning (AutoML) systems have been shown to efficiently build good models for new datasets. However, it is often not clear how well they can adapt when the data evolves over time. The main goal of this study is to understand the effect of data stream challenges such as concept drift on the performance of AutoML methods, and which adaptation strategies can be employed to make the…

    Submitted 10 May, 2022; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: 12 pages, 7 figures (14 counting subfigures), submitted to TPAMI - AutoML Special Issue

  40. arXiv:1911.03769

    cs.NE cs.LG

    Learning to reinforcement learn for Neural Architecture Search

    Authors: J. Gomez Robles, J. Vanschoren

    Abstract: Reinforcement learning (RL) is a goal-oriented learning solution that has proven to be successful for Neural Architecture Search (NAS) on the CIFAR and ImageNet datasets. However, a limitation of this approach is its high computational cost, making it unfeasible to replay it on other datasets. Through meta-learning, we could bring this cost down by adapting previously learned policies instead of l…

    Submitted 2 December, 2019; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: 32 pages, 21 figures, 9 tables

  41. arXiv:1911.02490

    cs.LG stat.ML

    OpenML-Python: an extensible Python API for OpenML

    Authors: Matthias Feurer, Jan N. van Rijn, Arlind Kadra, Pieter Gijsbers, Neeratyoy Mallik, Sahithya Ravi, Andreas Müller, Joaquin Vanschoren, Frank Hutter

    Abstract: OpenML is an online platform for open science collaboration in machine learning, used to share datasets and results of machine learning experiments. In this paper we introduce OpenML-Python, a client API for Python, opening up the OpenML platform for a wide range of Python-based tools. It provides easy access to all datasets, tasks and experiments on OpenML from within Python. It also provides fun…

    Submitted 23 June, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

    Journal ref: Journal of Machine Learning Research 22(100), 2021

  42. arXiv:1907.00909  [pdf, other]

    cs.LG stat.ML

    An Open Source AutoML Benchmark

    Authors: Pieter Gijsbers, Erin LeDell, Janek Thomas, Sébastien Poirier, Bernd Bischl, Joaquin Vanschoren

    Abstract: In recent years, an active field of research has developed around automated machine learning (AutoML). Unfortunately, comparing different AutoML systems is hard and often done incorrectly. We introduce an open, ongoing, and extensible benchmark framework which follows best practices and avoids common mistakes. The framework is open-source, uses public datasets and has a website with up-to-date res…

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: Accepted paper at the AutoML Workshop at ICML 2019. Code: https://github.com/openml/automlbenchmark/ Accompanying website: https://openml.github.io/automlbenchmark/

  43. A meta-learning recommender system for hyperparameter tuning: predicting when tuning improves SVM classifiers

    Authors: Rafael Gomes Mantovani, André Luis Debiaso Rossi, Edesio Alcobaça, Joaquin Vanschoren, André Carlos Ponce de Leon Ferreira de Carvalho

    Abstract: For many machine learning algorithms, predictive performance is critically affected by the hyperparameter values used to train them. However, tuning these hyperparameters can come at a high computational cost, especially on larger datasets, while the tuned settings do not always significantly outperform the default values. This paper proposes a recommender system based on meta-learning to identify…

    Submitted 11 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: 49 pages, 11 figures

    Journal ref: Information Sciences, Volume 501, 2019. Pages 193-221, ISSN 0020-0255

  44. arXiv:1904.03257  [pdf, ps, other]

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood, et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne…

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  45. Better Trees: An empirical study on hyperparameter tuning of classification decision tree induction algorithms

    Authors: Rafael Gomes Mantovani, Tomáš Horváth, André L. D. Rossi, Ricardo Cerri, Sylvio Barbon Junior, Joaquin Vanschoren, André Carlos Ponce de Leon Ferreira de Carvalho

    Abstract: Machine learning algorithms often contain many hyperparameters (HPs) whose values affect the predictive performance of the induced models in intricate ways. Due to the high number of possibilities for these HP configurations and their complex interactions, it is common to use optimization techniques to find settings that lead to high predictive performance. However, insights into efficiently explo…

    Submitted 21 December, 2023; v1 submitted 5 December, 2018; originally announced December 2018.

    Comments: 60 pages, 16 figures

  46. arXiv:1811.03392  [pdf, other]

    cs.LG stat.ML

    Transformative Machine Learning

    Authors: Ivan Olier, Oghenejokpeme I. Orhobor, Joaquin Vanschoren, Ross D. King

    Abstract: The key to success in machine learning (ML) is the use of effective data representations. Traditionally, data representations were hand-crafted. Recently it has been demonstrated that, given sufficient data, deep neural networks can learn effective implicit representations from simple input representations. However, for most scientific problems, the use of deep learning is not appropriate as the a…

    Submitted 8 November, 2018; originally announced November 2018.

  47. arXiv:1810.03548  [pdf, ps, other]

    cs.LG stat.ML

    Meta-Learning: A Survey

    Authors: Joaquin Vanschoren

    Abstract: Meta-learning, or learning to learn, is the science of systematically observing how different machine learning approaches perform on a wide range of learning tasks, and then learning from this experience, or meta-data, to learn new tasks much faster than otherwise possible. Not only does this dramatically speed up and improve the design of machine learning pipelines or neural architectures, it als…

    Submitted 8 October, 2018; originally announced October 2018.

  48. arXiv:1808.10406  [pdf, other]

    cs.LG stat.ML

    Characterizing classification datasets: a study of meta-features for meta-learning

    Authors: Adriano Rivolli, Luís P. F. Garcia, Carlos Soares, Joaquin Vanschoren, André C. P. L. F. de Carvalho

    Abstract: Meta-learning is increasingly used to support the recommendation of machine learning algorithms and their configurations. Such recommendations are made based on meta-data, consisting of performance evaluations of algorithms on prior datasets, as well as characterizations of these datasets. These characterizations, also called meta-features, describe properties of the data which are predictive for…

    Submitted 26 August, 2019; v1 submitted 30 August, 2018; originally announced August 2018.

  49. arXiv:1807.05351  [pdf, other]

    cs.LG cs.DB cs.IR stat.ML

    ML-Schema: Exposing the Semantics of Machine Learning with Schemas and Ontologies

    Authors: Gustavo Correa Publio, Diego Esteves, Agnieszka Ławrynowicz, Panče Panov, Larisa Soldatova, Tommaso Soru, Joaquin Vanschoren, Hamid Zafar

    Abstract: The ML-Schema, proposed by the W3C Machine Learning Schema Community Group, is a top-level ontology that provides a set of classes, properties, and restrictions for representing and interchanging information on machine learning algorithms, datasets, and experiments. It can be easily extended and specialized and it is also mapped to other more domain-specific ontologies developed in the area of mac…

    Submitted 14 July, 2018; originally announced July 2018.

    Comments: Poster, selected for the 2nd Reproducibility in Machine Learning Workshop at ICML 2018, Stockholm, Sweden

  50. arXiv:1801.06007  [pdf, ps, other]

    cs.NE cs.AI

    Layered TPOT: Speeding up Tree-based Pipeline Optimization

    Authors: Pieter Gijsbers, Joaquin Vanschoren, Randal S. Olson

    Abstract: With the demand for machine learning increasing, so does the demand for tools which make it easier to use. Automated machine learning (AutoML) tools have been developed to address this need, such as the Tree-Based Pipeline Optimization Tool (TPOT) which uses genetic programming to build optimal pipelines. We introduce Layered TPOT, a modification to TPOT which aims to create pipelines equally good…

    Submitted 12 March, 2018; v1 submitted 18 January, 2018; originally announced January 2018.

    Comments: Update to include a reference to Zutty et al. after it was brought to our attention