Search | arXiv e-print repository

Sequential Multivariate Change Detection with Calibrated and Memoryless False Detection Rates

Authors: Oliver Cobb, Arnaud Van Looveren, Janis Klaise

Abstract: Responding appropriately to the detections of a sequential change detector requires knowledge of the rate at which false positives occur in the absence of change. Setting detection thresholds to achieve a desired false positive rate is challenging. Existing works resort to setting time-invariant thresholds that focus on the expected runtime of the detector in the absence of change, either bounding it loosely from below or targeting it directly but with asymptotic arguments that we show cause significant miscalibration in practice. We present a simulation-based approach to setting time-varying thresholds that allows a desired expected runtime to be accurately targeted whilst additionally keeping the false positive rate constant across time steps. Whilst the approach to threshold setting is metric agnostic, we show how the cost of using the popular quadratic time MMD estimator can be reduced from $O(N^2B)$ to $O(N^2+NB)$ during configuration and from $O(N^2)$ to $O(N)$ during operation, where $N$ and $B$ are the numbers of reference and bootstrap samples respectively.

Submitted 17 February, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

Comments: 20 pages, 5 figures, open source implementation at http://github.com/SeldonIO/alibi-detect

arXiv:2106.02597 [pdf, other]

Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning

Authors: Robert-Florian Samoilescu, Arnaud Van Looveren, Janis Klaise

Abstract: Counterfactual instances are a powerful tool to obtain valuable insights into automated decision processes, describing the necessary minimal changes in the input space to alter the prediction towards a desired target. Most previous approaches require a separate, computationally expensive optimization procedure per instance, making them impractical for both large amounts of data and high-dimensional data. Moreover, these methods are often restricted to certain subclasses of machine learning models (e.g. differentiable or tree-based models). In this work, we propose a deep reinforcement learning approach that transforms the optimization procedure into an end-to-end learnable process, allowing us to generate batches of counterfactual instances in a single forward pass. Our experiments on real-world data show that our method i) is model-agnostic (does not assume differentiability), relying only on feedback from model predictions; ii) allows for generating target-conditional counterfactual instances; iii) allows for flexible feature range constraints for numerical and categorical attributes, including the immutability of protected features (e.g. gender, race); iv) is easily extended to other data modalities such as images.

Submitted 4 June, 2021; originally announced June 2021.

Comments: 18 pages

arXiv:2101.10123 [pdf, other]

Conditional Generative Models for Counterfactual Explanations

Authors: Arnaud Van Looveren, Janis Klaise, Giovanni Vacanti, Oliver Cobb

Abstract: Counterfactual instances offer human-interpretable insight into the local behaviour of machine learning models. We propose a general framework to generate sparse, in-distribution counterfactual model explanations which match a desired target prediction with a conditional generative model, allowing batches of counterfactual instances to be generated with a single forward pass. The method is flexible with respect to the type of generative model used as well as the task of the underlying predictive model. This allows straightforward application of the framework to different modalities such as images, time series or tabular data as well as generative model paradigms such as GANs or autoencoders and predictive tasks like classification or regression. We illustrate the effectiveness of our method on image (CelebA), time series (ECG) and mixed-type tabular (Adult Census) data.

Submitted 25 January, 2021; originally announced January 2021.

Comments: 12 pages

arXiv:2007.06299 [pdf, other]

Monitoring and explainability of models in production

Authors: Janis Klaise, Arnaud Van Looveren, Clive Cox, Giovanni Vacanti, Alexandru Coca

Abstract: The machine learning lifecycle extends beyond the deployment stage. Monitoring deployed models is crucial for continued provision of high quality machine learning enabled services. Key areas include model performance and data monitoring, detecting outliers and data drift using statistical techniques, and providing explanations of historic predictions. We discuss the challenges to successful implementation of solutions in each of these areas with some recent examples of production ready solutions using open source tools.

Submitted 13 July, 2020; originally announced July 2020.

Comments: Workshop on Challenges in Deploying and Monitoring Machine Learning Systems (ICML 2020)

arXiv:2002.09996 [pdf, other]

Practical Bayesian Optimization of Objectives with Conditioning Variables

Authors: Michael Pearce, Janis Klaise, Matthew Groves

Abstract: Bayesian optimization is a class of data efficient model based algorithms typically focused on global optimization. We consider the more general case where a user is faced with multiple problems that each need to be optimized conditional on a state variable, for example given a range of cities with different patient distributions, we optimize the ambulance locations conditioned on patient distribution. Given partitions of CIFAR-10, we optimize CNN hyperparameters for each partition. Similarity across objectives boosts optimization of each objective in two ways: in modelling by data sharing across objectives, and also in acquisition by quantifying how a single point on one objective can provide benefit to all objectives. For this we propose a framework for conditional optimization: ConBO. This can be built on top of a range of acquisition functions and we propose a new Hybrid Knowledge Gradient acquisition function. The resulting method is intuitive and theoretically grounded, performs either similar to or significantly better than recently published works on a range of problems, and is easily parallelized to collect a batch of points.

Submitted 2 November, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

Comments: 22 pages

arXiv:1907.02584 [pdf, other]

Interpretable Counterfactual Explanations Guided by Prototypes

Authors: Arnaud Van Looveren, Janis Klaise

Abstract: We propose a fast, model agnostic method for finding interpretable counterfactual explanations of classifier predictions by using class prototypes. We show that class prototypes, obtained using either an encoder or through class specific k-d trees, significantly speed up the search for counterfactual instances and result in more interpretable explanations. We introduce two novel metrics to quantitatively evaluate local interpretability at the instance level. We use these metrics to illustrate the effectiveness of our method on an image and tabular dataset, respectively MNIST and Breast Cancer Wisconsin (Diagnostic). The method also eliminates the computational bottleneck that arises because of numerical gradient evaluation for $\textit{black box}$ models.

Submitted 18 February, 2020; v1 submitted 3 July, 2019; originally announced July 2019.

Comments: 17 pages, 13 figures. For an open source implementation of the algorithm, see https://github.com/SeldonIO/alibi

arXiv:1708.07739 [pdf, other]

doi 10.1103/PhysRevE.97.012302

Relaxation dynamics of maximally clustered networks

Authors: Janis Klaise, Samuel Johnson

Abstract: We study the relaxation dynamics of fully clustered networks (maximal number of triangles) to an unclustered state under two different edge dynamics---the double-edge swap, corresponding to degree-preserving randomization of the configuration model, and single edge replacement, corresponding to full randomization of the Erdős--Rényi random graph. We derive expressions for the time evolution of the degree distribution, edge multiplicity distribution and clustering coefficient. We show that under both dynamics networks undergo a continuous phase transition in which a giant connected component is formed. We calculate the position of the phase transition analytically using the Erdős--Rényi phenomenology.

Submitted 25 August, 2017; originally announced August 2017.

Journal ref: Phys. Rev. E 97, 012302 (2018)

arXiv:1609.04318 [pdf, other]

The Origin of Motif Families in Food Webs

Authors: Janis Klaise, Samuel Johnson

Abstract: Food webs have been found to exhibit remarkable motif profiles, patterns in the relative prevalences of all possible three-species sub-graphs, and this has been related to ecosystem properties such as stability and robustness. Analysing 46 food webs of various kinds, we find that most food webs fall into one of two distinct motif families. The separation between the families is well predicted by a global measure of hierarchical order in directed networks - trophic coherence. We find that trophic coherence is also a good predictor for the extent of omnivory, defined as the tendency of species to feed on multiple trophic levels. We compare our results to a network assembly model that admits tunable trophic coherence via a single free parameter. The model is able to generate food webs in either of the two families by varying this parameter, and correctly classifies almost all the food webs in our database. This establishes a link between global order and local preying patterns in food webs.

Submitted 14 September, 2016; originally announced September 2016.

Comments: 19 pages, 7 figures, 1 table. Submitted to Proc. R. Soc. B

arXiv:1603.00670 [pdf, ps, other]

doi 10.1063/1.4953160

From neurons to epidemics: How trophic coherence affects spreading processes

Authors: Janis Klaise, Samuel Johnson

Abstract: Trophic coherence, a measure of the extent to which the nodes of a directed network are organised in levels, has recently been shown to be closely related to many structural and dynamical aspects of complex systems, including graph eigenspectra, the prevalence or absence of feed-back cycles, and linear stability. Furthermore, non-trivial trophic structures have been observed in networks of neurons, species, genes, metabolites, cellular signalling, concatenated words, P2P users, and world trade. Here we consider two simple yet apparently quite different dynamical models -- one a Susceptible-Infected-Susceptible (SIS) epidemic model adapted to include complex contagion, the other an Amari-Hopfield neural network -- and show that in both cases the related spreading processes are modulated in similar ways by the trophic coherence of the underlying networks. To do this, we propose a network assembly model which can generate structures with tunable trophic coherence, limiting in either perfectly stratified networks or random graphs. We find that trophic coherence can exert a qualitative change in spreading behaviour, determining whether a pulse of activity will percolate through the entire network or remain confined to a subset of nodes, and whether such activity will quickly die out or endure indefinitely. These results could be important for our understanding of phenomena such as epidemics, rumours, shocks to ecosystems, neuronal avalanches, and many other spreading processes.

Submitted 21 July, 2016; v1 submitted 2 March, 2016; originally announced March 2016.

Journal ref: Chaos 26, 065310 (2016)

Showing 1–9 of 9 results for author: Klaise, J