Showing 1–2 of 2 results for author: Borbély, G

Search v0.5.6 released 2020-02-24

arXiv:1905.09139 [pdf, other]

cs.CL

Sentence Length

Authors: Gábor Borbély, András Kornai

Abstract: The distribution of sentence length in ordinary language is not well captured by the existing models. Here we survey previous models of sentence length and present our random walk model that offers both a better fit with the data and a better understanding of the distribution. We develop a generalization of KL divergence, discuss measuring the noise inherent in a corpus, and present a hyperparamet… ▽ More The distribution of sentence length in ordinary language is not well captured by the existing models. Here we survey previous models of sentence length and present our random walk model that offers both a better fit with the data and a better understanding of the distribution. We develop a generalization of KL divergence, discuss measuring the noise inherent in a corpus, and present a hyperparameter-free Bayesian model comparison method that has strong conceptual ties to Minimal Description Length modeling. The models we obtain require only a few dozen bits, orders of magnitude less than the naive nonparametric MDL models would. △ Less

Submitted 22 May, 2019; originally announced May 2019.
arXiv:1506.06972 [pdf, other]

stat.ML cs.CE cs.LG stat.AP

doi 10.1007/978-3-319-19857-6_7

GEFCOM 2014 - Probabilistic Electricity Price Forecasting

Authors: Gergo Barta, Gyula Borbely, Gabor Nagy, Sandor Kazi, Tamas Henk

Abstract: Energy price forecasting is a relevant yet hard task in the field of multi-step time series forecasting. In this paper we compare a well-known and established method, ARMA with exogenous variables with a relatively new technique Gradient Boosting Regression. The method was tested on data from Global Energy Forecasting Competition 2014 with a year long rolling window forecast. The results from the… ▽ More Energy price forecasting is a relevant yet hard task in the field of multi-step time series forecasting. In this paper we compare a well-known and established method, ARMA with exogenous variables with a relatively new technique Gradient Boosting Regression. The method was tested on data from Global Energy Forecasting Competition 2014 with a year long rolling window forecast. The results from the experiment reveal that a multi-model approach is significantly better performing in terms of error metrics. Gradient Boosting can deal with seasonality and auto-correlation out-of-the box and achieve lower rate of normalized mean absolute error on real-world data. △ Less

Submitted 23 June, 2015; originally announced June 2015.

Comments: 10 pages, 5 figures, KES-IDT 2015 conference. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-19857-6_7

Search v0.5.6 released 2020-02-24