-
ASA -- The Adaptive Scheduling Algorithm
Authors:
Abel Souza,
Kristiaan Pelckmans,
Devarshi Ghoshal,
Lavanya Ramakrishnan,
Johan Tordsson
Abstract:
In High Performance Computing (HPC) infrastructures, the control of resources by batch systems can lead to prolonged queue waiting times and adverse effects on the overall execution times of applications, particularly in data-intensive and low-latency workflows where efficient processing hinges on resource planning and timely allocation. Allocating the maximum capacity upfront ensures the fastest execution but results in spare and idle resources, extended queue waits, and costly usage. Conversely, dynamic allocation based on workflow stage requirements optimizes resource usage but may negatively impact the total workflow makespan. To address these issues, we introduce ASA, the Adaptive Scheduling Algorithm. ASA is a novel, convergence-proven scheduling technique that minimizes jobs' inter-stage waiting times by estimating the queue waiting times to proactively submit resource change requests ahead of time. It strikes a balance between exploration and exploitation, considering both learning (waiting times) and applying learnt insights. Real-world experiments at two supercomputer centers with scientific workflows demonstrate ASA's effectiveness, achieving near-optimal resource utilization and accuracy, with up to 10% and 2% reductions in average workflow queue waiting times and makespan, respectively.
Submitted 18 January, 2024;
originally announced January 2024.
-
A HPC Co-Scheduler with Reinforcement Learning
Authors:
Abel Souza,
Kristiaan Pelckmans,
Johan Tordsson
Abstract:
Although High Performance Computing (HPC) users understand basic resource requirements such as the number of CPUs and memory limits, internal infrastructural utilization data is exclusively leveraged by cluster operators, who use it to configure batch schedulers. This task is challenging and increasingly complex due to ever larger cluster scales and heterogeneity of modern scientific workflows. As a result, HPC systems achieve low utilization with long job completion times (makespans). To tackle these challenges, we propose a co-scheduling algorithm based on an adaptive reinforcement learning algorithm, where application profiling is combined with cluster monitoring. The resulting cluster scheduler matches resource utilization to application performance in a fine-grained manner (i.e., operating system level). As opposed to nominal allocations, we apply decision trees to model applications' actual resource usage, which are used to estimate how much resource capacity from one allocation can be co-allocated to additional applications. Our algorithm learns from incorrect co-scheduling decisions and adapts to changing environment conditions, and evaluates when such changes cause resource contention that impacts quality of service metrics such as job slowdowns. We integrate our algorithm in an HPC resource manager that combines Slurm and Mesos for job scheduling and co-allocation, respectively. Our experimental evaluation performed in a dedicated cluster executing a mix of four different real scientific workflows demonstrates improvements in cluster utilization of up to 51% even in high load scenarios, with 55% average queue makespan reductions under low loads.
Submitted 17 January, 2024;
originally announced January 2024.
-
Detecting Suspicious Events in Fast Information Flows
Authors:
Kristiaan Pelckmans,
Moustafa Aboushady,
Andreas Brosemyr
Abstract:
We describe a computationally feather-light and intuitive, yet provably efficient algorithm, named HALFADO. HALFADO is designed for detecting suspicious events in a high-frequency stream of complex entries, based on a relatively small number of examples of human judgement. Operating a sufficiently accurate detection system is vital for {\em assisting} teams of human experts in many different areas of the modern digital society. These systems intrinsically have a far-reaching normative effect, and public knowledge of the workings of such technology should be a human right.
On a conceptual level, the present approach extends one of the most classical learning algorithms for classification, inheriting its theoretical properties. It however works in a semi-supervised way integrating human and computational intelligence. On a practical level, this algorithm transcends existing approaches (expert systems) by managing and boosting their performance into a single global detector.
We illustrate HALFADO's efficacy on two challenging applications: (1) for detecting {\em hate speech} messages in a flow of text messages gathered from a social media platform, and (2) for a Transaction Monitoring System (TMS) in FinTech detecting fraudulent transactions in a stream of financial transactions.
This algorithm illustrates that - contrary to popular belief - advanced methods of machine learning need require neither advanced levels of computation power nor expensive annotation efforts.
Submitted 7 January, 2021;
originally announced January 2021.
-
Launching the VASCO citizen science project
Authors:
Beatriz Villarroel,
Kristiaan Pelckmans,
Enrique Solano,
Mikael Laaksoharju,
Abel Souza,
Onyeuwaoma Nnaemeka Dom,
Khaoula Laggoune,
Jamal Mimouni,
Hichem Guergouri,
Lars Mattsson,
Aurora Lago García,
Johan Soodla,
Diego Castillo,
Matthew E. Shultz,
Rubby Aworka,
Sébastien Comerón,
Stefan Geier,
Geoffrey Marcy,
Alok C. Gupta,
Josefine Bergstedt,
Rudolf E. Bär,
Bart Buelens,
Emilio Enriquez,
Christopher K. Mellon,
M. Almudena Prieto, et al. (3 additional authors not shown)
Abstract:
The Vanishing & Appearing Sources during a Century of Observations (VASCO) project investigates astronomical surveys spanning a time interval of 70 years, searching for unusual and exotic transients. We present herein the VASCO Citizen Science Project, which can identify unusual candidates driven by three different approaches: hypothesis, exploratory, and machine learning, which is particularly useful for SETI searches. To address the big data challenge, VASCO combines three methods: the Virtual Observatory, user-aided machine learning, and visual inspection through citizen science. Here we demonstrate the citizen science project and its improved candidate selection process, and we give a progress report. We also present the VASCO citizen science network led by amateur astronomy associations mainly located in Algeria, Cameroon, and Nigeria. At the moment of writing, the citizen science project has carefully examined 15,593 candidate image pairs in the data (ca. 10% of the candidates), and has so far identified 798 objects classified as "vanished". The most interesting candidates will be followed up with optical and infrared imaging, together with the observations by the most potent radio telescopes.
Submitted 26 December, 2022; v1 submitted 22 September, 2020;
originally announced September 2020.
-
Longitudinal Support Vector Machines for High Dimensional Time Series
Authors:
Kristiaan Pelckmans,
Hong-Li Zeng
Abstract:
We consider the problem of learning a classifier from observed functional data. Here, each data-point takes the form of a single time-series and contains numerous features. Assuming that each such series comes with a binary label, the problem of learning to predict the label of a new coming time-series is considered. Hereto, the notion of {\em margin} underlying the classical support vector machine is extended to the continuous version for such data. The longitudinal support vector machine is also a convex optimization problem and its dual form is derived as well. Empirical results for specified cases with significance tests indicate the efficacy of this innovative algorithm for analyzing such long-term multivariate data.
Submitted 22 February, 2020;
originally announced February 2020.
-
APTER: Aggregated Prognosis Through Exponential Reweighting
Authors:
Kristiaan Pelckmans,
Liu Yang
Abstract:
This paper considers the task of learning how to make a prognosis of a patient based on his/her micro-array expression levels. The method is an application of the aggregation method as recently proposed in the literature on theoretical machine learning, and excels in its computational convenience and capability to deal with high-dimensional data. A formal analysis of the method is given, yielding rates of convergence similar to what traditional techniques obtain, while it is shown to cope well with an exponentially large set of features. Those results are supported by numerical simulations on a range of publicly available survival-micro-array datasets. It is empirically found that the proposed technique combined with a recently proposed preprocessing technique gives excellent performance.
Submitted 20 February, 2020;
originally announced February 2020.
-
The Vanishing & Appearing Sources during a Century of Observations project: I. USNO objects missing in modern sky surveys and follow-up observations of a "missing star"
Authors:
Beatriz Villarroel,
Johan Soodla,
Sébastien Comerón,
Lars Mattsson,
Kristiaan Pelckmans,
Martín López-Corredoira,
Kevin Krisciunas,
Eduardo Guerras,
Oleg Kochukhov,
Josefine Bergstedt,
Bart Buelens,
Rudolf E. Bär,
Rubén Cubo,
J. Emilio Enriquez,
Alok C. Gupta,
Iñigo Imaz,
Torgny Karlsson,
M. Almudena Prieto,
Aleksey A. Shlyapnikov,
Rafael S. de Souza,
Irina B. Vavilova,
Martin J. Ward
Abstract:
In this paper we report the current status of a new research program. The primary goal of the "Vanishing & Appearing Sources during a Century of Observations" (VASCO) project is to search for vanishing and appearing sources using existing survey data to find examples of exceptional astrophysical transients. The implications of finding such objects extend from traditional astrophysics fields to the more exotic searches for evidence of technologically advanced civilizations. In this first paper we present new, deeper observations of the tentative candidate discovered by Villarroel et al. (2016). We then perform the first searches for vanishing objects throughout the sky by comparing 600 million objects from the US Naval Observatory Catalogue (USNO) B1.0 down to a limiting magnitude of $\sim 20 - 21$ with the recent Pan-STARRS Data Release-1 (DR1) with a limiting magnitude of $\sim$ 23.4. We find about 150,000 preliminary candidates that do not have any Pan-STARRS counterpart within a 30 arcsec radius. We show that these objects are redder and have larger proper motions than typical USNO objects. We visually examine the images for a subset of about 24,000 candidates, superseding the 2016 study with a sample ten times larger. We find about 100 point sources visible in only one epoch in the red band of the USNO which may be of interest in searches for strong M dwarf flares, high-redshift supernovae or other categories of unidentified red transients.
Submitted 21 November, 2019; v1 submitted 12 November, 2019;
originally announced November 2019.
-
Identifying reionization-epoch galaxies with extreme levels of Lyman continuum leakage in James Webb Space Telescope surveys
Authors:
Sambit K. Giri,
Erik Zackrisson,
Christian Binggeli,
Kristiaan Pelckmans,
Rubén Cubo
Abstract:
The James Webb Space Telescope (JWST) NIRSpec instrument will allow rest-frame ultraviolet/optical spectroscopy of galaxies in the epoch of reionization (EoR). Some galaxies may exhibit significant leakage of hydrogen-ionizing photons into the intergalactic medium, resulting in faint nebular emission lines. We present a machine learning framework for identifying cases of very high hydrogen-ionizing photon escape from galaxies based on the data quality expected from potential NIRSpec observations of EoR galaxies in lensed fields. We train our algorithm on mock samples of JWST/NIRSpec data for galaxies at redshifts $z=6$--10. To make the samples more realistic, we combine synthetic galaxy spectra based on cosmological galaxy simulations with observational noise relevant for $z\gtrsim 6$ objects of a brightness similar to EoR galaxy candidates uncovered in Frontier Fields observations of galaxy cluster Abell-2744 and MACS-J0416. We find that ionizing escape fractions ($f_\mathrm{esc}$) of galaxies brighter than $m_\mathrm{AB,1500} \approx 27$ mag may be retrieved with mean absolute error $Δf_\mathrm{esc}\approx$0.09(0.12) for 24h (1.5h) JWST/NIRSpec exposures at resolution R=100. For 24h exposure time, even fainter galaxies ($m_\mathrm{AB,1500} < 28.5$ mag) can be processed with $Δf_\mathrm{esc}\approx$0.14. This framework simultaneously estimates the redshift of these galaxies with a relative error less than 0.03 for both 24h ($m_\mathrm{AB,1500} < 28.5$ mag) and 1.5h ($m_\mathrm{AB,1500} < 27$ mag) exposure times. We also consider scenarios where just a minor fraction of galaxies attain high $f_\mathrm{esc}$ and present the conditions required for detecting a subpopulation of high $f_\mathrm{esc}$ galaxies within the dataset.
Submitted 9 December, 2019; v1 submitted 4 March, 2019;
originally announced March 2019.
-
Lyman continuum leakage versus quenching with the James Webb Space Telescope: The spectral signatures of quenched star formation activity in reionization-epoch galaxies
Authors:
C. Binggeli,
E. Zackrisson,
K. Pelckmans,
R. Cubo,
H. Jensen,
I. Shimizu
Abstract:
In this paper, we study the effects of a recent drop in star formation rate (SFR) on the spectra of epoch of reionization (EoR) galaxies, and the resulting degeneracy with the spectral features produced by extreme Lyman continuum leakage. In order to study these effects in the wavelength range relevant for the upcoming James Webb Space Telescope (JWST), we utilize synthetic spectra of simulated EoR galaxies from cosmological simulations together with synthetic spectra of partially quenched mock galaxies. We find that rapid declines in the SFR of EoR galaxies could seriously affect the applicability of methods that utilize the equivalent width of Balmer lines and the ultraviolet spectral slope to assess the escape fraction of EoR galaxies. In order to determine if the aforementioned degeneracy can be avoided by using the overall shape of the spectrum, we generate mock NIRCam observations and utilize a classification algorithm to identify galaxies that have undergone quenching. We find that while there are problematic cases, JWST/NIRCam or NIRSpec should be able to reliably identify galaxies with redshifts $z\sim 7$ that have experienced a significant decrease in the SFR (by a factor 10-100) in the past 50-100 Myr with a success rate $\gtrsim 85\%$. We also find that uncertainties in the dust-reddening effects on EoR galaxies significantly affect the performance of the results of the classification algorithm. We argue that studies that aim to characterize the dust extinction law most representative in the EoR would be extremely useful.
Submitted 24 July, 2018; v1 submitted 24 April, 2018;
originally announced April 2018.
-
FADO: A Deterministic Detection/Learning Algorithm
Authors:
Kristiaan Pelckmans
Abstract:
This paper proposes and studies a detection technique for adversarial scenarios (dubbed deterministic detection). This technique provides an alternative detection methodology in case the usual stochastic methods are not applicable: this can be because the studied phenomenon does not follow a stochastic sampling scheme, samples are high-dimensional and subsequent multiple-testing corrections render results overly conservative, sample sizes are too low for asymptotic results (as e.g. the central limit theorem) to kick in, or one cannot allow for the small probability of failure inherent to stochastic approaches. This paper instead designs a method based on insights from machine learning and online learning theory: this detection algorithm - named Online FAult Detection (FADO) - comes with theoretical guarantees of its detection capabilities. A version of the margin is found to regulate the detection performance of FADO. A precise expression is derived for bounding the performance, and experimental results are presented assessing the influence of involved quantities. A case study of scene detection is used to illustrate the approach. The technology is closely related to the linear perceptron rule, inherits its computational attractiveness and flexibility towards various extensions.
Submitted 7 November, 2017;
originally announced November 2017.
-
Worst-case Prediction Performance Analysis of the Kalman Filter
Authors:
Sholeh Yasini,
Kristiaan Pelckmans
Abstract:
In this paper, we study the prediction performance of the Kalman filter (KF) in a worst-case, minimax setting as studied in online machine learning, information theory, and game theory. The aim is to predict the sequence of observations almost as well as the best reference predictor (comparator) sequence in a comparison class. We prove worst-case bounds on the cumulative squared prediction errors using a priori knowledge about the complexity of the reference predictor sequence. In fact, the performance of the KF is derived as a function of the performance of the best reference predictor and the total amount of drift that occurs in the schedule of the best comparator.
Submitted 22 November, 2016; v1 submitted 7 November, 2016;
originally announced November 2016.
-
A machine-learning approach to measuring the escape of ionizing radiation from galaxies in the reionization epoch
Authors:
Hannes Jensen,
Erik Zackrisson,
Kristiaan Pelckmans,
Christian Binggeli,
Kristiina Ausmees,
Ulrika Lundholm
Abstract:
Recent observations of galaxies at $z \gtrsim 7$, along with the low value of the electron scattering optical depth measured by the Planck mission, make galaxies plausible as dominant sources of ionizing photons during the epoch of reionization. However, scenarios of galaxy-driven reionization hinge on the assumption that the average escape fraction of ionizing photons is significantly higher for galaxies in the reionization epoch than in the local Universe. The NIRSpec instrument on the James Webb Space Telescope (JWST) will enable spectroscopic observations of large samples of reionization-epoch galaxies. While the leakage of ionizing photons will not be directly measurable from these spectra, the leakage is predicted to have an indirect effect on the spectral slope and the strength of nebular emission lines in the rest-frame ultraviolet and optical. Here, we apply a machine learning technique known as lasso regression on mock JWST/NIRSpec observations of simulated $z=7$ galaxies in order to obtain a model that can predict the escape fraction from JWST/NIRSpec data. Barring systematic biases in the simulated spectra, our method is able to retrieve the escape fraction with a mean absolute error of $Δf_{\mathrm{esc}} \approx 0.12$ for spectra with $S/N\approx 5$ at a rest-frame wavelength of 1500 Å for our fiducial simulation. This prediction accuracy represents a significant improvement over previous similar approaches.
Submitted 1 July, 2016; v1 submitted 31 March, 2016;
originally announced March 2016.
-
An efficient method for sorting and selecting for social behaviour
Authors:
Alex Szorkovszky,
Alexander Kotrschal,
James E. Herbert Read,
David J. T. Sumpter,
Niclas Kolm,
Kristiaan Pelckmans
Abstract:
In this article we provide a systematic experimental method for sorting animals according to socially relevant traits, without assaying them or even tagging them individually. Instead, they are repeatedly subjected to behavioural assays in groups, between which the group memberships are rearranged, in order to test the effect of many different combinations of individuals on a group-level property or feature. We analyse this method using a general model for the group feature, and simulate a variety of specific cases to track how individuals are sorted in each case. We find that in the case where the members of a group contribute equally to the group feature, the sorting procedure increases the between-group behavioural variation well above what is expected for groups randomly sampled from a population. For a wide class of group feature models, the individual phenotypes are efficiently sorted across the groups and thus become available for further analysis on how individual properties affect group behaviour. We also show that the experimental data can be used to estimate the individual-level repeatability of the underlying traits.
Submitted 22 February, 2017; v1 submitted 18 February, 2016;
originally announced February 2016.
-
Sparse Estimation From Noisy Observations of an Overdetermined Linear System
Authors:
Liang Dai,
Kristiaan Pelckmans
Abstract:
This note studies a method for the efficient estimation of a finite number of unknown parameters from linear equations, which are perturbed by Gaussian noise.
In case the unknown parameters have only a few nonzero entries, the proposed estimator performs more efficiently than a traditional approach.
The method consists of three steps:
(1) a classical Least Squares Estimate (LSE),
(2) the support is recovered through a Linear Programming (LP) optimization problem which can be computed using a soft-thresholding step,
(3) a de-biasing step using an LSE on the estimated support set.
The main contribution of this note is a formal derivation of an associated ORACLE property of the final estimate.
That is, when the number of samples is large enough, the estimate is shown to equal the LSE based on the support of the {\em true} parameters.
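The three steps listed in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function name `sparse_three_step`, the threshold parameter `lam`, and the synthetic data are hypothetical, and the support-recovery step is written directly as a soft-threshold of the LSE rather than as the LP from the note.

```python
import numpy as np

def sparse_three_step(A, y, lam):
    """LSE -> soft-threshold -> de-biased LSE on the estimated support.

    `lam` is a hypothetical threshold; how the note chooses it is not
    stated in the abstract.
    """
    # Step 1: classical least squares estimate on all parameters.
    x_ls, *_ = np.linalg.lstsq(A, y, rcond=None)
    # Step 2: soft-threshold the LSE; surviving entries form the support.
    x_soft = np.sign(x_ls) * np.maximum(np.abs(x_ls) - lam, 0.0)
    support = np.flatnonzero(x_soft)
    # Step 3: de-bias by refitting least squares on the support only.
    x_hat = np.zeros(A.shape[1])
    if support.size:
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        x_hat[support] = coef
    return x_hat, support

# Synthetic overdetermined system with two nonzero parameters (illustrative).
rng = np.random.default_rng(0)
A = rng.standard_normal((200, 10))
x_true = np.zeros(10)
x_true[[1, 4]] = [3.0, -2.0]
y = A @ x_true + 0.1 * rng.standard_normal(200)

x_hat, support = sparse_three_step(A, y, lam=0.5)
```

When the support is recovered correctly, the final estimate coincides with the LSE restricted to the true support, which is the oracle behaviour the abstract refers to.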
Submitted 25 May, 2014; v1 submitted 12 February, 2014;
originally announced February 2014.
-
On the Randomized Kaczmarz Algorithm
Authors:
Liang Dai,
Mojtaba Soltanalian,
Kristiaan Pelckmans
Abstract:
The Randomized Kaczmarz Algorithm is a randomized method which aims at solving a consistent system of overdetermined linear equations. This note discusses how to find an optimized randomization scheme for this algorithm, which is related to the question raised by \cite{c2}. Illustrative experiments are conducted to support the findings.
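For orientation, a minimal sketch of the basic randomized Kaczmarz iteration is given below. It uses the common scheme of sampling each row with probability proportional to its squared Euclidean norm, which is an assumption made here for illustration; it is not the optimized randomization scheme the note studies. The function name, iteration count, and example system are hypothetical.

```python
import random

def randomized_kaczmarz(A, b, iters=2000, seed=0):
    """Approximate the solution of a consistent system A x = b.

    Rows are drawn with probability proportional to their squared norm
    (a standard choice, not the optimized scheme from the note).
    """
    rng = random.Random(seed)
    n = len(A[0])
    row_norms_sq = [sum(a * a for a in row) for row in A]
    x = [0.0] * n
    for _ in range(iters):
        # Pick one equation at random.
        i = rng.choices(range(len(A)), weights=row_norms_sq)[0]
        row = A[i]
        # Project the current iterate onto the hyperplane row . x = b[i].
        residual = b[i] - sum(a * xj for a, xj in zip(row, x))
        step = residual / row_norms_sq[i]
        x = [xj + step * a for xj, a in zip(x, row)]
    return x

# Small consistent overdetermined system (4 equations, 2 unknowns).
A = [[2.0, 1.0], [1.0, 3.0], [1.0, -1.0], [3.0, 2.0]]
x_true = [1.0, -2.0]
b = [sum(a * xt for a, xt in zip(row, x_true)) for row in A]

x_hat = randomized_kaczmarz(A, b)
```

Changing the `weights` passed to `rng.choices` is the single place where a different randomization scheme would enter.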
Submitted 12 February, 2014;
originally announced February 2014.
-
On the Nuclear Norm heuristic for a Hankel matrix Recovery Problem
Authors:
Liang Dai,
Kristiaan Pelckmans
Abstract:
This note addresses the question of whether and why the nuclear norm heuristic can recover an impulse response generated by a stable single-real-pole system when elements of the upper triangle of the associated Hankel matrix are given.
Since the setting is deterministic, theories based on stochastic assumptions for low-rank matrix recovery do not apply here. A 'certificate' which guarantees the completion is constructed by exploring the structural information of the hidden matrix. Experimental results and discussions regarding the nuclear norm heuristic applied to a more general setting are also given.
Submitted 23 April, 2014; v1 submitted 18 July, 2012;
originally announced July 2012.
-
MINLIP for the Identification of Monotone Wiener Systems
Authors:
Kristiaan Pelckmans
Abstract:
This paper studies the MINLIP estimator for the identification of Wiener systems consisting of a sequence of a linear FIR dynamical model, and a monotonically increasing (or decreasing) static function. Given $T$ observations, this algorithm boils down to solving a convex quadratic program with $O(T)$ variables and inequality constraints, implementing an inference technique which is based entirely on model complexity control. The resulting estimates of the linear submodel are found to be almost consistent when no noise is present in the data, under a condition of smoothness of the true nonlinearity and local Persistency of Excitation (local PE) of the data. This result is novel as it does not rely on classical tools such as a 'linearization' using a Taylor decomposition, nor does it exploit stochastic properties of the data. It is indicated how to extend the method to cope with noisy data, and empirical evidence contrasts performance of the estimator against other recently proposed techniques.
Submitted 24 June, 2010;
originally announced June 2010.
-
Support and Quantile Tubes
Authors:
Kristiaan Pelckmans,
Jos De Brabanter,
Johan A. K. Suykens,
Bart De Moor
Abstract:
This correspondence studies an estimator of the conditional support of a distribution underlying a set of i.i.d. observations. The relation with mutual information is shown via an extension of Fano's theorem in combination with a generalization bound based on a compression argument. Extensions to estimating the conditional quantile interval, and statistical guarantees on the minimal convex hull are given.
Submitted 12 March, 2007;
originally announced March 2007.
-
Componentwise Least Squares Support Vector Machines
Authors:
Kristiaan Pelckmans,
Ivan Goethals,
Jos De Brabanter,
Johan A. K. Suykens,
Bart De Moor
Abstract:
This chapter describes componentwise Least Squares Support Vector Machines (LS-SVMs) for the estimation of additive models consisting of a sum of nonlinear components. The primal-dual derivations characterizing LS-SVMs for the estimation of the additive model result in a single set of linear equations with size growing in the number of data-points. The derivation is elaborated for the classification as well as the regression case. Furthermore, different techniques are proposed to discover structure in the data by looking for sparse components in the model based on dedicated regularization schemes on the one hand and fusion of the componentwise LS-SVMs training with a validation criterion on the other hand. (keywords: LS-SVMs, additive models, regularization, structure detection)
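The "single set of linear equations" mentioned here can be illustrated with a rough sketch of the regression case. It assumes the standard LS-SVM dual system and an additive kernel formed as the sum of one RBF kernel per input component; the function names, the kernel choice, and the hyperparameters `gamma` and `sigma` are assumptions for illustration and are not taken from the chapter.

```python
import numpy as np

def rbf_kernel_1d(u, v, sigma):
    """RBF kernel matrix between two 1-D arrays of a single component."""
    d = u[:, None] - v[None, :]
    return np.exp(-d ** 2 / (2.0 * sigma ** 2))

def additive_kernel(X1, X2, sigma):
    """Sum of per-component kernels (assumed additive-model kernel)."""
    return sum(rbf_kernel_1d(X1[:, p], X2[:, p], sigma)
               for p in range(X1.shape[1]))

def fit_componentwise_lssvm(X, y, gamma=100.0, sigma=1.0):
    """Solve the (n+1) x (n+1) LS-SVM linear system for (b, alpha)."""
    n = X.shape[0]
    K = additive_kernel(X, X, sigma)
    M = np.zeros((n + 1, n + 1))
    M[0, 1:] = 1.0                      # constraint: sum(alpha) = 0
    M[1:, 0] = 1.0                      # bias term b
    M[1:, 1:] = K + np.eye(n) / gamma   # kernel plus ridge term
    rhs = np.concatenate(([0.0], y))
    sol = np.linalg.solve(M, rhs)
    return sol[0], sol[1:]

def predict(X_train, b, alpha, X_new, sigma=1.0):
    return additive_kernel(X_new, X_train, sigma) @ alpha + b

# Synthetic additive target (illustrative data only).
rng = np.random.default_rng(0)
X = rng.uniform(-2.0, 2.0, size=(50, 2))
y = np.sin(X[:, 0]) + X[:, 1] ** 2
b, alpha = fit_componentwise_lssvm(X, y)
y_fit = predict(X, b, alpha, X)
```

The system has one unknown per data point plus the bias, so its size grows with the number of data points, as the abstract notes.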
Submitted 19 April, 2005;
originally announced April 2005.