Skip to main content

Showing 1–20 of 20 results for author: Salinas, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.14657  [pdf, other

    cs.LG stat.ML

    Deep Non-Parametric Time Series Forecaster

    Authors: Syama Sundar Rangapuram, Jan Gasthaus, Lorenzo Stella, Valentin Flunkert, David Salinas, Yuyang Wang, Tim Januschowski

    Abstract: This paper presents non-parametric baseline models for time series forecasting. Unlike classical forecasting models, the proposed approach does not assume any parametric form for the predictive distribution and instead generates predictions by sampling from the empirical distribution according to a tunable strategy. By virtue of this, the model is always able to produce reasonable forecasts (i.e.,… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  2. arXiv:2311.02971  [pdf, other

    cs.LG cs.AI

    TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML Applications

    Authors: David Salinas, Nick Erickson

    Abstract: We introduce TabRepo, a new dataset of tabular model evaluations and predictions. TabRepo contains the predictions and metrics of 1310 models evaluated on 200 classification and regression datasets. We illustrate the benefit of our dataset in multiple ways. First, we show that it allows to perform analysis such as comparing Hyperparameter Optimization against current AutoML systems while also cons… ▽ More

    Submitted 19 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  3. arXiv:2306.16916  [pdf, other

    cs.LG

    Obeying the Order: Introducing Ordered Transfer Hyperparameter Optimisation

    Authors: Sigrid Passano Hellan, Huibin Shen, François-Xavier Aubet, David Salinas, Aaron Klein

    Abstract: We introduce ordered transfer hyperparameter optimisation (OTHPO), a version of transfer learning for hyperparameter optimisation (HPO) where the tasks follow a sequential order. Unlike for state-of-the-art transfer HPO, the assumption is that each task is most correlated to those immediately before it. This matches many deployed settings, where hyperparameters are retuned as more data is collecte… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: To be presented at the AutoML 2023 Workshop Track

  4. arXiv:2305.03623  [pdf, other

    cs.LG stat.ML

    Optimizing Hyperparameters with Conformal Quantile Regression

    Authors: David Salinas, Jacek Golebiowski, Aaron Klein, Matthias Seeger, Cedric Archambeau

    Abstract: Many state-of-the-art hyperparameter optimization (HPO) algorithms rely on model-based optimizers that learn surrogate models of the target function to guide the search. Gaussian processes are the de facto surrogate model due to their ability to capture uncertainty but they make strong assumptions about the observation noise, which might not be warranted in practice. In this work, we propose to le… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  5. arXiv:2212.03523  [pdf, other

    stat.ML cs.LG

    Criteria for Classifying Forecasting Methods

    Authors: Tim Januschowski, Jan Gasthaus, Yuyang Wang, David Salinas, Valentin Flunkert, Michael Bohlke-Schneider, Laurent Callot

    Abstract: Classifying forecasting methods as being either of a "machine learning" or "statistical" nature has become commonplace in parts of the forecasting literature and community, as exemplified by the M4 competition and the conclusion drawn by the organizers. We argue that this distinction does not stem from fundamental differences in the methods assigned to either class. Instead, this distinction is pr… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  6. arXiv:2202.08485  [pdf, other

    cs.LG

    Multi-Objective Model Selection for Time Series Forecasting

    Authors: Oliver Borchert, David Salinas, Valentin Flunkert, Tim Januschowski, Stephan Günnemann

    Abstract: Research on time series forecasting has predominantly focused on develo** methods that improve accuracy. However, other criteria such as training time or latency are critical in many real-world applications. We therefore address the question of how to choose an appropriate forecasting model for a given dataset among the plethora of available forecasting methods when accuracy is only one of many… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  7. arXiv:2111.03418  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-Forecasting by combining Global Deep Representations with Local Adaptation

    Authors: Riccardo Grazzi, Valentin Flunkert, David Salinas, Tim Januschowski, Matthias Seeger, Cedric Archambeau

    Abstract: While classical time series forecasting considers individual time series in isolation, recent advances based on deep learning showed that jointly learning from a large pool of related time series can boost the forecasting accuracy. However, the accuracy of these methods suffers greatly when modeling out-of-sample time series, significantly limiting their applicability compared to classical forecas… ▽ More

    Submitted 12 November, 2021; v1 submitted 5 November, 2021; originally announced November 2021.

  8. arXiv:2106.12639  [pdf, other

    stat.ML cs.LG

    Multi-objective Asynchronous Successive Halving

    Authors: Robin Schmucker, Michele Donini, Muhammad Bilal Zafar, David Salinas, Cédric Archambeau

    Abstract: Hyperparameter optimization (HPO) is increasingly used to automatically tune the predictive performance (e.g., accuracy) of machine learning models. However, in a plethora of real-world applications, accuracy is only one of the multiple -- often conflicting -- performance criteria, necessitating the adoption of a multi-objective (MO) perspective. While the literature on MO optimization is rich, fe… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  9. arXiv:2106.05680  [pdf, other

    cs.LG

    A multi-objective perspective on jointly tuning hardware and hyperparameters

    Authors: David Salinas, Valerio Perrone, Olivier Cruchant, Cedric Archambeau

    Abstract: In addition to the best model architecture and hyperparameters, a full AutoML solution requires selecting appropriate hardware automatically. This can be framed as a multi-objective optimization problem: there is not a single best hardware configuration but a set of optimal ones achieving different trade-offs between cost and runtime. In practice, some choices may be overly costly or take days to… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

  10. arXiv:2103.16111  [pdf, other

    cs.LG cs.AI

    A resource-efficient method for repeated HPO and NAS problems

    Authors: Giovanni Zappella, David Salinas, Cédric Archambeau

    Abstract: In this work we consider the problem of repeated hyperparameter and neural architecture search (HNAS). We propose an extension of Successive Halving that is able to leverage information gained in previous HNAS problems with the goal of saving computational resources. We empirically demonstrate that our solution is able to drastically decrease costs while maintaining accuracy and being robust to ne… ▽ More

    Submitted 13 July, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted at AutoML workshop @ ICML 2021

  11. arXiv:2011.05138  [pdf, other

    cs.LG cs.AI

    Relation-weighted Link Prediction for Disease Gene Identification

    Authors: Srivamshi Pittala, William Koehler, Jonathan Deans, Daniel Salinas, Martin Bringmann, Katharina Sophia Volz, Berk Kapicioglu

    Abstract: Identification of disease genes, which are a set of genes associated with a disease, plays an important role in understanding and curing diseases. In this paper, we present a biomedical knowledge graph designed specifically for this problem, propose a novel machine learning method that identifies disease genes on such graphs by leveraging recent advances in network biology and graph representation… ▽ More

    Submitted 13 November, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 4th Knowledge Representation and Reasoning Meets Machine Learning Workshop (KR2ML), NeurIPS 2020

  12. arXiv:2005.10111  [pdf, other

    cs.LG stat.ML

    The Effectiveness of Discretization in Forecasting: An Empirical Study on Neural Time Series Models

    Authors: Stephan Rabanser, Tim Januschowski, Valentin Flunkert, David Salinas, Jan Gasthaus

    Abstract: Time series modeling techniques based on deep learning have seen many advancements in recent years, especially in data-abundant settings and with the central aim of learning global models that can extract patterns across multiple time series. While the crucial importance of appropriate data pre-processing and scaling has often been noted in prior work, most studies focus on improving model archite… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

  13. arXiv:2004.10240  [pdf, other

    cs.LG stat.ML

    Deep Learning for Time Series Forecasting: Tutorial and Literature Survey

    Authors: Konstantinos Benidis, Syama Sundar Rangapuram, Valentin Flunkert, Yuyang Wang, Danielle Maddix, Caner Turkmen, Jan Gasthaus, Michael Bohlke-Schneider, David Salinas, Lorenzo Stella, Francois-Xavier Aubet, Laurent Callot, Tim Januschowski

    Abstract: Deep learning based forecasting methods have become the methods of choice in many applications of time series prediction or forecasting often outperforming other approaches. Consequently, over the last years, these methods are now ubiquitous in large-scale industrial forecasting applications and have consistently ranked among the best entries in forecasting competitions (e.g., M4 and M5). This pra… ▽ More

    Submitted 15 June, 2022; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: 33 pages, 6 figures

    ACM Class: A.1

    Journal ref: ACM Computing Surveys (2022)

  14. arXiv:1912.08913  [pdf, other

    cs.CG

    Reconstructing Embedded Graphs from Persistence Diagrams

    Authors: Robin Lynne Belton, Brittany Terese Fasy, Rostik Mertz, Samuel Micka, David L. Millman, Daniel Salinas, Anna Schenfisch, Jordan Schupbach, Lucia Williams

    Abstract: The persistence diagram (PD) is an increasingly popular topological descriptor. By encoding the size and prominence of topological features at varying scales, the PD provides important geometric and topological information about a space. Recent work has shown that well-chosen (finite) sets of PDs can differentiate between geometric simplicial complexes, providing a method for representing complex… ▽ More

    Submitted 18 June, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: 32 pages, 10 figures. This paper is an extended version of "Learning Simplicial Complexes from Persistence Diagrams" that appeared in the conference proceedings for the Canadian Conference on Computational Geometry (CCCG) 2018. This extended paper will appear in a special issue of the journal, Computational Geometry Theory and Applications (CGTA)

  15. arXiv:1910.03002  [pdf, other

    cs.LG stat.ML

    High-Dimensional Multivariate Forecasting with Low-Rank Gaussian Copula Processes

    Authors: David Salinas, Michael Bohlke-Schneider, Laurent Callot, Roberto Medico, Jan Gasthaus

    Abstract: Predicting the dependencies between observations from multiple time series is critical for applications such as anomaly detection, financial risk management, causal analysis, or demand forecasting. However, the computational and numerical difficulties of estimating time-varying and high-dimensional covariance matrices often limits existing methods to handling at most a few hundred dimensions or re… ▽ More

    Submitted 24 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

  16. arXiv:1909.13595  [pdf, other

    stat.ML cs.LG

    A Quantile-based Approach for Hyperparameter Transfer Learning

    Authors: David Salinas, Huibin Shen, Valerio Perrone

    Abstract: Bayesian optimization (BO) is a popular methodology to tune the hyperparameters of expensive black-box functions. Traditionally, BO focuses on a single task at a time and is not designed to leverage information from related functions, such as tuning performance objectives of the same algorithm across multiple datasets. In this work, we introduce a novel approach to achieve transfer learning across… ▽ More

    Submitted 19 April, 2021; v1 submitted 30 September, 2019; originally announced September 2019.

  17. arXiv:1906.05264  [pdf, other

    cs.LG stat.ML

    GluonTS: Probabilistic Time Series Models in Python

    Authors: Alexander Alexandrov, Konstantinos Benidis, Michael Bohlke-Schneider, Valentin Flunkert, Jan Gasthaus, Tim Januschowski, Danielle C. Maddix, Syama Rangapuram, David Salinas, Jasper Schulz, Lorenzo Stella, Ali Caner Türkmen, Yuyang Wang

    Abstract: We introduce Gluon Time Series (GluonTS, available at https://gluon-ts.mxnet.io), a library for deep-learning-based time series modeling. GluonTS simplifies the development of and experimentation with time series models for common tasks such as forecasting or anomaly detection. It provides all necessary components and tools that scientists need for quickly building new models, for efficiently runn… ▽ More

    Submitted 14 June, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: ICML Time Series Workshop 2019

  18. arXiv:1805.10716  [pdf, other

    cs.CG math.AT

    Learning Simplicial Complexes from Persistence Diagrams

    Authors: Robin Lynne Belton, Brittany Terese Fasy, Rostik Mertz, Samuel Micka, David L. Millman, Daniel Salinas, Anna Schenfisch, Jordan Schupbach, Lucia Williams

    Abstract: Topological Data Analysis (TDA) studies the shape of data. A common topological descriptor is the persistence diagram, which encodes topological features in a topological space at different scales. Turner, Mukeherjee, and Boyer showed that one can reconstruct a simplicial complex embedded in R^3 using persistence diagrams generated from all possible height filtrations (an uncountably infinite numb… ▽ More

    Submitted 31 July, 2018; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: Updated our document for clarity in response to comments by reviewers at CCCG. This paper will appear at CCCG 2018

  19. arXiv:1709.07638  [pdf, other

    stat.ML cs.LG

    Approximate Bayesian Inference in Linear State Space Models for Intermittent Demand Forecasting at Scale

    Authors: Matthias Seeger, Syama Rangapuram, Yuyang Wang, David Salinas, Jan Gasthaus, Tim Januschowski, Valentin Flunkert

    Abstract: We present a scalable and robust Bayesian inference method for linear state space models. The method is applied to demand forecasting in the context of a large e-commerce platform, paying special attention to intermittent and bursty target statistics. Inference is approximated by the Newton-Raphson algorithm, reduced to linear-time Kalman smoothing, which allows us to operate on several orders of… ▽ More

    Submitted 22 September, 2017; originally announced September 2017.

  20. arXiv:1704.04110  [pdf, other

    cs.AI cs.LG stat.ML

    DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks

    Authors: David Salinas, Valentin Flunkert, Jan Gasthaus

    Abstract: Probabilistic forecasting, i.e. estimating the probability distribution of a time series' future given its past, is a key enabler for optimizing business processes. In retail businesses, for example, forecasting demand is crucial for having the right inventory available at the right time at the right place. In this paper we propose DeepAR, a methodology for producing accurate probabilistic forecas… ▽ More

    Submitted 22 February, 2019; v1 submitted 13 April, 2017; originally announced April 2017.