Skip to main content

Showing 1–16 of 16 results for author: Cristianini, N

.
  1. arXiv:2311.11085  [pdf, other

    cs.LG

    Compositional Fusion of Signals in Data Embedding

    Authors: Zhi** Guo, Zhaozhen Xu, Martha Lewis, Nello Cristianini

    Abstract: Embeddings in AI convert symbolic structures into fixed-dimensional vectors, effectively fusing multiple signals. However, the nature of this fusion in real-world data is often unclear. To address this, we introduce two methods: (1) Correlation-based Fusion Detection, measuring correlation between known attributes and embeddings, and (2) Additive Fusion Detection, viewing embeddings as sums of ind… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  2. arXiv:2311.00115  [pdf, other

    cs.LG cs.CY

    EXTRACT: Explainable Transparent Control of Bias in Embeddings

    Authors: Zhi** Guo, Zhaozhen Xu, Martha Lewis, Nello Cristianini

    Abstract: Knowledge Graphs are a widely used method to represent relations between entities in various AI applications, and Graph Embedding has rapidly become a standard technique to represent Knowledge Graphs in such a way as to facilitate inferences and decisions. As this representation is obtained from behavioural data, and is not in a form readable by humans, there is a concern that it might incorporate… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: Aequitas 2023: Workshop on Fairness and Bias in AI | co-located with ECAI 2023, Kraków, Poland

  3. arXiv:2212.01967  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    QBERT: Generalist Model for Processing Questions

    Authors: Zhaozhen Xu, Nello Cristianini

    Abstract: Using a single model across various tasks is beneficial for training and applying deep neural sequence models. We address the problem of develo** generalist representations of text that can be used to perform a range of different tasks rather than being specialised to a single application. We focus on processing short questions and develo** an embedding for these questions that is useful on a… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  4. arXiv:2110.15409  [pdf, other

    cs.CL cs.AI cs.LG

    What makes us curious? analysis of a corpus of open-domain questions

    Authors: Zhaozhen Xu, Amelia Howarth, Nicole Briggs, Nello Cristianini

    Abstract: Every day people ask short questions through smart devices or online forums to seek answers to all kinds of queries. With the increasing number of questions collected it becomes difficult to provide answers to each of them, which is one of the reasons behind the growing interest in automated question answering. Some questions are similar to existing ones that have already been answered, while othe… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  5. arXiv:2006.09896  [pdf, ps, other

    cs.CL cs.AI cs.LG

    On the Learnability of Concepts: With Applications to Comparing Word Embedding Algorithms

    Authors: Adam Sutton, Nello Cristianini

    Abstract: Word Embeddings are used widely in multiple Natural Language Processing (NLP) applications. They are coordinates associated with each word in a dictionary, inferred from statistical properties of these words in a large corpus. In this paper we introduce the notion of "concept" as a list of words that have shared semantic content. We use this notion to analyse the learnability of certain concepts,… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 7 Pages. AIAI 2020. 5 equations 6 tables

    Journal ref: Artificial Intelligence Applications and Innovations. AIAI 2020. IFIP Advances in Information and Communication Technology, vol 584

  6. arXiv:1904.13316  [pdf

    cs.CY cs.AI

    On Social Machines for Algorithmic Regulation

    Authors: Nello Cristianini, Teresa Scantamburlo

    Abstract: Autonomous mechanisms have been proposed to regulate certain aspects of society and are already being used to regulate business organisations. We take seriously recent proposals for algorithmic regulation of society, and we identify the existing technologies that can be used to implement them, most of them originally introduced in business contexts. We build on the notion of 'social machine' and w… ▽ More

    Submitted 30 April, 2019; originally announced April 2019.

    Comments: Preprint. Under review

  7. arXiv:1811.06747  [pdf

    cs.CY cs.AI

    Machine Decisions and Human Consequences

    Authors: Teresa Scantamburlo, Andrew Charlesworth, Nello Cristianini

    Abstract: As we increasingly delegate decision-making to algorithms, whether directly or indirectly, important questions emerge in circumstances where those decisions have direct consequences for individual rights and personal opportunities, as well as for the collective good. A key problem for policymakers is that the social implications of these new methods can only be grasped if there is an adequate comp… ▽ More

    Submitted 30 April, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

  8. arXiv:1806.06301  [pdf, ps, other

    cs.CL cs.AI stat.ML

    Biased Embeddings from Wild Data: Measuring, Understanding and Removing

    Authors: Adam Sutton, Thomas Lansdall-Welfare, Nello Cristianini

    Abstract: Many modern Artificial Intelligence (AI) systems make use of data embeddings, particularly in the domain of Natural Language Processing (NLP). These embeddings are learnt from data that has been gathered "from the wild" and have been found to contain unwanted biases. In this paper we make three contributions towards measuring, understanding and removing this problem. We present a rigorous way to m… ▽ More

    Submitted 16 June, 2018; originally announced June 2018.

    Comments: Author's original version

  9. arXiv:1806.06296  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Right for the Right Reason: Training Agnostic Networks

    Authors: Sen Jia, Thomas Lansdall-Welfare, Nello Cristianini

    Abstract: We consider the problem of a neural network being requested to classify images (or other inputs) without making implicit use of a "protected concept", that is a concept that should not play any role in the decision of the network. Typically these concepts include information such as gender or race, or other contextual information such as image backgrounds that might be implicitly reflected in unkn… ▽ More

    Submitted 16 June, 2018; originally announced June 2018.

    Comments: Author's original version

  10. arXiv:1806.01185  [pdf

    cs.CL

    History Playground: A Tool for Discovering Temporal Trends in Massive Textual Corpora

    Authors: Thomas Lansdall-Welfare, Nello Cristianini

    Abstract: Recent studies have shown that macroscopic patterns of continuity and change over the course of centuries can be detected through the analysis of time series extracted from massive textual corpora. Similar data-driven approaches have already revolutionised the natural sciences, and are widely believed to hold similar potential for the humanities and social sciences, driven by the mass-digitisation… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.

  11. Efficient Classification of Multi-Labelled Text Streams by Clashing

    Authors: Ricardo Ñanculef, Ilias Flaounas, Nello Cristianini

    Abstract: We present a method for the classification of multi-labelled text documents explicitly designed for data stream applications that require to process a virtually infinite sequence of data using constant memory and constant processing time. Our method is composed of an online procedure used to efficiently map text into a low-dimensional feature space and a partition of this space into a set of regio… ▽ More

    Submitted 11 April, 2016; originally announced April 2016.

  12. arXiv:1402.6208  [pdf

    cs.MA cs.AI cs.DC

    The Anatomy of a Modular System for Media Content Analysis

    Authors: Ilias Flaounas, Thomas Lansdall-Welfare, Panagiota Antonakaki, Nello Cristianini

    Abstract: Intelligent systems for the annotation of media content are increasingly being used for the automation of parts of social science research. In this domain the problem of integrating various Artificial Intelligence (AI) algorithms into a single intelligent system arises spontaneously. As part of our ongoing effort in automating media content analysis for the social sciences, we have built a modular… ▽ More

    Submitted 4 June, 2018; v1 submitted 25 February, 2014; originally announced February 2014.

    Comments: Updated to include previously missing figures

  13. arXiv:1309.6869  [pdf

    cs.LG stat.ML

    Finite-Time Analysis of Kernelised Contextual Bandits

    Authors: Michal Valko, Nathaniel Korda, Remi Munos, Ilias Flaounas, Nelo Cristianini

    Abstract: We tackle the problem of online reward maximisation over a large finite set of actions described by their contexts. We focus on the case when the number of actions is too big to sample all of them even once. However we assume that we have access to the similarities between actions' contexts and that the expected reward is an arbitrary linear function of the contexts' images in the related reproduc… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-654-663

  14. arXiv:1304.5507  [pdf, other

    cs.SI physics.soc-ph

    Analysing Mood Patterns in the United Kingdom through Twitter Content

    Authors: Vasileios Lampos, Thomas Lansdall-Welfare, Ricardo Araya, Nello Cristianini

    Abstract: Social Media offer a vast amount of geo-located and time-stamped textual content directly generated by people. This information can be analysed to obtain insights about the general state of a large population of users and to address scientific questions from a diversity of disciplines. In this work, we estimate temporal patterns of mood variation through the use of emotionally loaded words contain… ▽ More

    Submitted 19 April, 2013; originally announced April 2013.

  15. arXiv:1111.2111  [pdf, other

    cs.DS cs.LG

    Generic Multiplicative Methods for Implementing Machine Learning Algorithms on MapReduce

    Authors: Song Liu, Peter Flach, Nello Cristianini

    Abstract: In this paper we introduce a generic model for multiplicative algorithms which is suitable for the MapReduce parallel programming paradigm. We implement three typical machine learning algorithms to demonstrate how similarity comparison, gradient descent, power method and other classic learning techniques fit this model well. Two versions of large-scale matrix multiplication are discussed in this p… ▽ More

    Submitted 1 December, 2011; v1 submitted 9 November, 2011; originally announced November 2011.

    ACM Class: D.1; F.1

  16. arXiv:1006.0849  [pdf, ps, other

    cs.DS stat.ML

    Reconstruction of Causal Networks by Set Covering

    Authors: Nick Fyson, Tijl De Bie, Nello Cristianini

    Abstract: We present a method for the reconstruction of networks, based on the order of nodes visited by a stochastic branching process. Our algorithm reconstructs a network of minimal size that ensures consistency with the data. Crucially, we show that global consistency with the data can be achieved through purely local considerations, inferring the neighbourhood of each node in turn. The optimisation pro… ▽ More

    Submitted 4 June, 2010; originally announced June 2010.

    Comments: Under consideration for the ECML PKDD 2010 conference