-
The Role of Foundation Models in Neuro-Symbolic Learning and Reasoning
Authors:
Daniel Cunnington,
Mark Law,
Jorge Lobo,
Alessandra Russo
Abstract:
Neuro-Symbolic AI (NeSy) holds promise to ensure the safe deployment of AI systems, as interpretable symbolic techniques provide formal behaviour guarantees. The challenge is how to effectively integrate neural and symbolic computation, to enable learning and reasoning from raw data. Existing pipelines that train the neural and symbolic components sequentially require extensive labelling, whereas…
▽ More
Neuro-Symbolic AI (NeSy) holds promise to ensure the safe deployment of AI systems, as interpretable symbolic techniques provide formal behaviour guarantees. The challenge is how to effectively integrate neural and symbolic computation, to enable learning and reasoning from raw data. Existing pipelines that train the neural and symbolic components sequentially require extensive labelling, whereas end-to-end approaches are limited in terms of scalability, due to the combinatorial explosion in the symbol grounding problem. In this paper, we leverage the implicit knowledge within foundation models to enhance the performance in NeSy tasks, whilst reducing the amount of data labelling and manual engineering. We introduce a new architecture, called NeSyGPT, which fine-tunes a vision-language foundation model to extract symbolic features from raw data, before learning a highly expressive answer set program to solve a downstream task. Our comprehensive evaluation demonstrates that NeSyGPT has superior accuracy over various baselines, and can scale to complex NeSy tasks. Finally, we highlight the effective use of a large language model to generate the programmatic interface between the neural and symbolic components, significantly reducing the amount of manual engineering required.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Can we Constrain Concept Bottleneck Models to Learn Semantically Meaningful Input Features?
Authors:
Jack Furby,
Daniel Cunnington,
Dave Braines,
Alun Preece
Abstract:
Concept Bottleneck Models (CBMs) are considered inherently interpretable because they first predict a set of human-defined concepts before using these concepts to predict the output of a downstream task. For inherent interpretability to be fully realised, and ensure trust in a model's output, we need to guarantee concepts are predicted based on semantically mapped input features. For example, one…
▽ More
Concept Bottleneck Models (CBMs) are considered inherently interpretable because they first predict a set of human-defined concepts before using these concepts to predict the output of a downstream task. For inherent interpretability to be fully realised, and ensure trust in a model's output, we need to guarantee concepts are predicted based on semantically mapped input features. For example, one might expect the pixels representing a broken bone in an image to be used for the prediction of a fracture. However, current literature indicates this is not the case, as concept predictions are often mapped to irrelevant input features. We hypothesise that this occurs when concept annotations are inaccurate or how input features should relate to concepts is unclear. In general, the effect of dataset labelling on concept representations in CBMs remains an understudied area. Therefore, in this paper, we examine how CBMs learn concepts from datasets with fine-grained concept annotations. We demonstrate that CBMs can learn concept representations with semantic map** to input features by removing problematic concept correlations, such as two concepts always appearing together. To support our evaluation, we introduce a new synthetic image dataset based on a playing cards domain, which we hope will serve as a benchmark for future CBM research. For validation, we provide empirical evidence on a real-world dataset of chest X-rays, to demonstrate semantically meaningful concepts can be learned in real-world applications.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Cybersecurity in Motion: A Survey of Challenges and Requirements for Future Test Facilities of CAVs
Authors:
Ioannis Mavromatis,
Theodoros Spyridopoulos,
Pietro Carnelli,
Woon Hau Chin,
Ahmed Khalil,
Jennifer Chakravarty,
Lucia Cipolina Kun,
Robert J. Piechocki,
Colin Robbins,
Daniel Cunnington,
Leigh Chase,
Lamogha Chiazor,
Chris Preston,
Rahul,
Aftab Khan
Abstract:
The way we travel is changing rapidly, and Cooperative Intelligent Transportation Systems (C-ITSs) are at the forefront of this evolution. However, the adoption of C-ITSs introduces new risks and challenges, making cybersecurity a top priority for ensuring safety and reliability. Building on this premise, this paper presents an envisaged Cybersecurity Centre of Excellence (CSCE) designed to bolste…
▽ More
The way we travel is changing rapidly, and Cooperative Intelligent Transportation Systems (C-ITSs) are at the forefront of this evolution. However, the adoption of C-ITSs introduces new risks and challenges, making cybersecurity a top priority for ensuring safety and reliability. Building on this premise, this paper presents an envisaged Cybersecurity Centre of Excellence (CSCE) designed to bolster research, testing, and evaluation of the cybersecurity of C-ITSs. We explore the design, functionality, and challenges of CSCE's testing facilities, outlining the technological, security, and societal requirements. Through a thorough survey and analysis, we assess the effectiveness of these systems in detecting and mitigating potential threats, highlighting their flexibility to adapt to future C-ITSs. Finally, we identify current unresolved challenges in various C-ITS domains, with the aim of motivating further research into the cybersecurity of C-ITSs.
△ Less
Submitted 22 December, 2023;
originally announced December 2023.
-
Symbolic Learning for Material Discovery
Authors:
Daniel Cunnington,
Flaviu Cipcigan,
Rodrigo Neumann Barros Ferreira,
Jonathan Booth
Abstract:
Discovering new materials is essential to solve challenges in climate change, sustainability and healthcare. A typical task in materials discovery is to search for a material in a database which maximises the value of a function. That function is often expensive to evaluate, and can rely upon a simulation or an experiment. Here, we introduce SyMDis, a sample efficient optimisation method based on…
▽ More
Discovering new materials is essential to solve challenges in climate change, sustainability and healthcare. A typical task in materials discovery is to search for a material in a database which maximises the value of a function. That function is often expensive to evaluate, and can rely upon a simulation or an experiment. Here, we introduce SyMDis, a sample efficient optimisation method based on symbolic learning, that discovers near-optimal materials in a large database. SyMDis performs comparably to a state-of-the-art optimiser, whilst learning interpretable rules to aid physical and chemical verification. Furthermore, the rules learned by SyMDis generalise to unseen datasets and return high performing candidates in a zero-shot evaluation, which is difficult to achieve with other approaches.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
Towards a Deeper Understanding of Concept Bottleneck Models Through End-to-End Explanation
Authors:
Jack Furby,
Daniel Cunnington,
Dave Braines,
Alun Preece
Abstract:
Concept Bottleneck Models (CBMs) first map raw input(s) to a vector of human-defined concepts, before using this vector to predict a final classification. We might therefore expect CBMs capable of predicting concepts based on distinct regions of an input. In doing so, this would support human interpretation when generating explanations of the model's outputs to visualise input features correspondi…
▽ More
Concept Bottleneck Models (CBMs) first map raw input(s) to a vector of human-defined concepts, before using this vector to predict a final classification. We might therefore expect CBMs capable of predicting concepts based on distinct regions of an input. In doing so, this would support human interpretation when generating explanations of the model's outputs to visualise input features corresponding to concepts. The contribution of this paper is threefold: Firstly, we expand on existing literature by looking at relevance both from the input to the concept vector, confirming that relevance is distributed among the input features, and from the concept vector to the final classification where, for the most part, the final classification is made using concepts predicted as present. Secondly, we report a quantitative evaluation to measure the distance between the maximum input feature relevance and the ground truth location; we perform this with the techniques, Layer-wise Relevance Propagation (LRP), Integrated Gradients (IG) and a baseline gradient approach, finding LRP has a lower average distance than IG. Thirdly, we propose using the proportion of relevance as a measurement for explaining concept importance.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Neuro-Symbolic Learning of Answer Set Programs from Raw Data
Authors:
Daniel Cunnington,
Mark Law,
Jorge Lobo,
Alessandra Russo
Abstract:
One of the ultimate goals of Artificial Intelligence is to assist humans in complex decision making. A promising direction for achieving this goal is Neuro-Symbolic AI, which aims to combine the interpretability of symbolic techniques with the ability of deep learning to learn from raw data. However, most current approaches require manually engineered symbolic knowledge, and where end-to-end train…
▽ More
One of the ultimate goals of Artificial Intelligence is to assist humans in complex decision making. A promising direction for achieving this goal is Neuro-Symbolic AI, which aims to combine the interpretability of symbolic techniques with the ability of deep learning to learn from raw data. However, most current approaches require manually engineered symbolic knowledge, and where end-to-end training is considered, such approaches are either restricted to learning definite programs, or are restricted to training binary neural networks. In this paper, we introduce Neuro-Symbolic Inductive Learner (NSIL), an approach that trains a general neural network to extract latent concepts from raw data, whilst learning symbolic knowledge that maps latent concepts to target labels. The novelty of our approach is a method for biasing the learning of symbolic knowledge, based on the in-training performance of both neural and symbolic components. We evaluate NSIL on three problem domains of different complexity, including an NP-complete problem. Our results demonstrate that NSIL learns expressive knowledge, solves computationally complex problems, and achieves state-of-the-art performance in terms of accuracy and data efficiency. Code and technical appendix: https://github.com/DanCunnington/NSIL
△ Less
Submitted 2 February, 2024; v1 submitted 25 May, 2022;
originally announced May 2022.
-
FF-NSL: Feed-Forward Neural-Symbolic Learner
Authors:
Daniel Cunnington,
Mark Law,
Alessandra Russo,
Jorge Lobo
Abstract:
Logic-based machine learning aims to learn general, interpretable knowledge in a data-efficient manner. However, labelled data must be specified in a structured logical form. To address this limitation, we propose a neural-symbolic learning framework, called Feed-Forward Neural-Symbolic Learner (FFNSL), that integrates a logic-based machine learning system capable of learning from noisy examples,…
▽ More
Logic-based machine learning aims to learn general, interpretable knowledge in a data-efficient manner. However, labelled data must be specified in a structured logical form. To address this limitation, we propose a neural-symbolic learning framework, called Feed-Forward Neural-Symbolic Learner (FFNSL), that integrates a logic-based machine learning system capable of learning from noisy examples, with neural networks, in order to learn interpretable knowledge from labelled unstructured data. We demonstrate the generality of FFNSL on four neural-symbolic classification problems, where different pre-trained neural network models and logic-based machine learning systems are integrated to learn interpretable knowledge from sequences of images. We evaluate the robustness of our framework by using images subject to distributional shifts, for which the pre-trained neural networks may predict incorrectly and with high confidence. We analyse the impact that these shifts have on the accuracy of the learned knowledge and run-time performance, comparing FFNSL to tree-based and pure neural approaches. Our experimental results show that FFNSL outperforms the baselines by learning more accurate and interpretable knowledge with fewer examples.
△ Less
Submitted 5 January, 2023; v1 submitted 24 June, 2021;
originally announced June 2021.
-
NSL: Hybrid Interpretable Learning From Noisy Raw Data
Authors:
Daniel Cunnington,
Alessandra Russo,
Mark Law,
Jorge Lobo,
Lance Kaplan
Abstract:
Inductive Logic Programming (ILP) systems learn generalised, interpretable rules in a data-efficient manner utilising existing background knowledge. However, current ILP systems require training examples to be specified in a structured logical format. Neural networks learn from unstructured data, although their learned models may be difficult to interpret and are vulnerable to data perturbations a…
▽ More
Inductive Logic Programming (ILP) systems learn generalised, interpretable rules in a data-efficient manner utilising existing background knowledge. However, current ILP systems require training examples to be specified in a structured logical format. Neural networks learn from unstructured data, although their learned models may be difficult to interpret and are vulnerable to data perturbations at run-time. This paper introduces a hybrid neural-symbolic learning framework, called NSL, that learns interpretable rules from labelled unstructured data. NSL combines pre-trained neural networks for feature extraction with FastLAS, a state-of-the-art ILP system for rule learning under the answer set semantics. Features extracted by the neural components define the structured context of labelled examples and the confidence of the neural predictions determines the level of noise of the examples. Using the scoring function of FastLAS, NSL searches for short, interpretable rules that generalise over such noisy examples. We evaluate our framework on propositional and first-order classification tasks using the MNIST dataset as raw data. Specifically, we demonstrate that NSL is able to learn robust rules from perturbed MNIST data and achieve comparable or superior accuracy when compared to neural network and random forest baselines whilst being more general and interpretable.
△ Less
Submitted 25 June, 2021; v1 submitted 9 December, 2020;
originally announced December 2020.
-
Synthetic Ground Truth Generation for Evaluating Generative Policy Models
Authors:
Daniel Cunnington,
Graham White,
Geeth de Mel
Abstract:
Generative Policy-based Models aim to enable a coalition of systems, be they devices or services to adapt according to contextual changes such as environmental factors, user preferences and different tasks whilst adhering to various constraints and regulations as directed by a managing party or the collective vision of the coalition. Recent developments have proposed new architectures to realize t…
▽ More
Generative Policy-based Models aim to enable a coalition of systems, be they devices or services to adapt according to contextual changes such as environmental factors, user preferences and different tasks whilst adhering to various constraints and regulations as directed by a managing party or the collective vision of the coalition. Recent developments have proposed new architectures to realize the potential of GPMs but as the complexity of systems and their associated requirements increases, there is an emerging requirement to have scenarios and associated datasets to realistically evaluate GPMs with respect to the properties of the operating environment, be it the future battlespace or an autonomous organization. In order to address this requirement, in this paper, we present a method of applying an agile knowledge representation framework to model requirements, both individualistic and collective that enables synthetic generation of ground truth data such that advanced GPMs can be evaluated robustly in complex environments. We also release conceptual models, annotated datasets, as well as means to extend the data generation approach so that similar datasets can be developed for varying complexities and different situations.
△ Less
Submitted 26 April, 2019;
originally announced April 2019.
-
Observational selection biases in time-delay strong lensing and their impact on cosmography
Authors:
Thomas E. Collett,
Steven D. Cunnington
Abstract:
Inferring cosmological parameters from time-delay strong lenses requires a significant investment of telescope time; it is therefore tempting to focus on the systems with the brightest sources, the highest image multiplicities and the widest image separations. We investigate if this selection bias can influence the properties of the lenses studied and the cosmological parameters that are inferred.…
▽ More
Inferring cosmological parameters from time-delay strong lenses requires a significant investment of telescope time; it is therefore tempting to focus on the systems with the brightest sources, the highest image multiplicities and the widest image separations. We investigate if this selection bias can influence the properties of the lenses studied and the cosmological parameters that are inferred. Using a population of lenses with ellipsoidal powerlaw density profiles, we build a sample of double and quadruple image systems. Assuming reasonable thresholds on image separation and flux, based on current lens monitoring campaigns, we find that the typical density profile slopes of monitorable lenses are significantly shallower than the input ensemble. From a sample of quadruple image lenses we find that this selection function can introduce a 3.5% bias on the inferred time-delay distances if the ensemble of deflector properties is used as a prior for a cosmographical analysis. This bias remains at the 2.4% level when high resolution imaging of the quasar host is used to precisely infer the density profiles of individual lenses. We also investigate if the lines-of-sight for monitorable strong lenses are biased. After adding external convergence, $κ$, and shear to our lens population we find that the expectation value for $κ$ is increased by 0.004 and 0.009 for doubles and quads respectively. $κ$ is degenerate with the value of $H_0$ inferred from time delays; fortunately the shift in $κ$ only induces a 0.9 (0.4) percent bias on $H_0$ for quads (doubles). We therefore conclude that whilst the properties of typical quasar lenses and their lines-of-sight do deviate from the global population, the total magnitude of this effect is likely a subdominant effect for current analyses, but has the potential to be a major systematic for samples of $\sim$25 or more lenses.
△ Less
Submitted 26 May, 2016;
originally announced May 2016.