-
Quantum Natural Language Processing
Authors:
Dominic Widdows,
Willie Aboumrad,
Dohun Kim,
Sayonee Ray,
Jonathan Mei
Abstract:
Language processing is at the heart of current developments in artificial intelligence, and quantum computers are becoming available at the same time. This has led to great interest in quantum natural language processing, and several early proposals and experiments.
This paper surveys the state of this area, showing how NLP-related techniques have been used in quantum language processing. We exa…
▽ More
Language processing is at the heart of current developments in artificial intelligence, and quantum computers are becoming available at the same time. This has led to great interest in quantum natural language processing, and several early proposals and experiments.
This paper surveys the state of this area, showing how NLP-related techniques have been used in quantum language processing. We examine the art of word embeddings and sequential models, proposing some avenues for future investigation and discussing the tradeoffs present in these directions. We also highlight some recent methods to compute attention in transformer models, and perform grammatical parsing. We also introduce a new quantum design for the basic task of text encoding (representing a string of characters in memory), which has not been addressed in detail before.
Quantum theory has contributed toward quantifying uncertainty and explaining "What is intelligence?" In this context, we argue that "hallucinations" in modern artificial intelligence systems are a misunderstanding of the way facts are conceptualized: language can express many plausible hypotheses, of which only a few become actual.
△ Less
Submitted 26 April, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
Spatial Entity Resolution between Restaurant Locations and Transportation Destinations in Southeast Asia
Authors:
Emily Gao,
Dominic Widdows
Abstract:
As a tech company, Grab has expanded from transportation to food delivery, aiming to serve Southeast Asia with hyperlocalized applications. Information about places as transportation destinations can help to improve our knowledge about places as restaurants, so long as the spatial entity resolution problem between these datasets can be solved. In this project, we attempted to recognize identical p…
▽ More
As a tech company, Grab has expanded from transportation to food delivery, aiming to serve Southeast Asia with hyperlocalized applications. Information about places as transportation destinations can help to improve our knowledge about places as restaurants, so long as the spatial entity resolution problem between these datasets can be solved. In this project, we attempted to recognize identical place entities from databases of Points-of-Interest (POI) and GrabFood restaurants, using their spatial and textual attributes, i.e., latitude, longitude, place name, and street address.
Distance metrics were calculated for these attributes and fed to tree-based classifiers. POI-restaurant matching was conducted separately for Singapore, Philippines, Indonesia, and Malaysia. Experimental estimates demonstrate that a matching POI can be found for over 35% of restaurants in these countries. As part of these estimates, test datasets were manually created, and RandomForest, AdaBoost, Gradient Boosting, and XGBoost perform well, with most accuracy, precision, and recall scores close to or higher than 90% for matched vs. unmatched classification. To the authors' knowledge, there are no previous published scientific papers devoted to matching of spatial entities for the Southeast Asia region.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Mod2VQLS: a Variational Quantum Algorithm for Solving Systems of Linear Equations Modulo 2
Authors:
Willie Aboumrad,
Dominic Widdows
Abstract:
This paper presents a system for solving binary-valued linear equations using quantum computers. The system is called Mod2VQLS, which stands for Modulo2 Variational Quantum Linear Solver. As far as we know, this is the first such proposal. The design is a classical-quantum hybrid. The quantum components are a new circuit design for implementing matrix multiplication modulo 2, and a variational cir…
▽ More
This paper presents a system for solving binary-valued linear equations using quantum computers. The system is called Mod2VQLS, which stands for Modulo2 Variational Quantum Linear Solver. As far as we know, this is the first such proposal. The design is a classical-quantum hybrid. The quantum components are a new circuit design for implementing matrix multiplication modulo 2, and a variational circuit to be optimized. The classical components are the optimizer which measures the cost function and updates the quantum parameters for each iteration, and the controller that runs the quantum job and classical optimizer iterations. We propose two alternative ansatze or templates for the variational circuit, and present results showing that the rotation ansatz designed specifically for this problem provides the most direct path to a valid solution. Numerical experiments in low dimensions indicate that Mod2VQLS, using the custom rotations ansatz, is on-a-par with the block Wiedemann algorithm, the best-known to date for this problem.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
Quantum Financial Modeling on Noisy Intermediate-Scale Quantum Hardware: Random Walks using Approximate Quantum Counting
Authors:
Dominic Widdows,
Amit Bhattacharyya
Abstract:
Quantum computers are expected to contribute more efficient and accurate ways of modeling economic processes. Quantum hardware is currently available at a relatively small scale, but effective algorithms are limited by the number of logic gates that can be used, before noise from gate inaccuracies tends to dominate results. Some theoretical algorithms that have been proposed and studied for years…
▽ More
Quantum computers are expected to contribute more efficient and accurate ways of modeling economic processes. Quantum hardware is currently available at a relatively small scale, but effective algorithms are limited by the number of logic gates that can be used, before noise from gate inaccuracies tends to dominate results. Some theoretical algorithms that have been proposed and studied for years do not perform well yet on quantum hardware in practice. This encourages the development of suitable alternative algorithms that play similar roles in limited contexts.
This paper implements this strategy in the case of quantum counting, which is used as a component for kee** track of position in a quantum walk, which is used as a model for simulating asset prices over time. We introduce quantum approximate counting circuits that use far fewer 2-qubit entangling gates than traditional quantum counting that relies on binary positional encoding. The robustness of these circuits to noise is demonstrated.
We compare the results to price change distributions from stock indices, and compare the behavior of quantum circuits with and without mid-measurement to trends in the housing market. The housing data shows that low liquidity brings price volatility, as expected with the quantum models.
△ Less
Submitted 14 December, 2023; v1 submitted 17 October, 2023;
originally announced October 2023.
-
Quantum and Classical Combinatorial Optimizations Applied to Lattice-Based Factorization
Authors:
Willie Aboumrad,
Dominic Widdows,
Ananth Kaushik
Abstract:
The availability of working quantum computers has led to several proposals and claims of quantum advantage. In 2023, this has included claims that quantum computers can successfully factor large integers, by optimizing the search for nearby integers whose prime factors are all small.
This paper demonstrates that the hope of factoring numbers of commercial significance using these methods is unfo…
▽ More
The availability of working quantum computers has led to several proposals and claims of quantum advantage. In 2023, this has included claims that quantum computers can successfully factor large integers, by optimizing the search for nearby integers whose prime factors are all small.
This paper demonstrates that the hope of factoring numbers of commercial significance using these methods is unfounded. Mathematically, this is because the density of smooth numbers (numbers all of whose prime factors are small) decays exponentially as n grows. Our experimental reproductions and analysis show that lattice-based factoring does not scale successfully to larger numbers, that the proposed quantum enhancements do not alter this conclusion, and that other simpler classical optimization heuristics perform much better for lattice-based factoring.
However, many topics in this area have interesting applications and mathematical challenges, independently of factoring itself. We consider particular cases of the CVP, and opportunities for applying quantum techniques to other parts of the factorization pipeline, including the solution of linear equations modulo 2. Though the goal of factoring 1000-bit numbers is still out-of-reach, the combinatoric landscape is promising, and warrants further research with more circumspect objectives.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
Quantum Circuit Components for Cognitive Decision-Making
Authors:
Dominic Widdows,
Jyoti Rani,
Emmanuel Pothos
Abstract:
This paper demonstrates that some non-classical models of human decision-making can be run successfully as circuits on quantum computers. Since the 1960s, many observed cognitive behaviors have been shown to violate rules based on classical probability and set theory. For example, the order in which questions are posed in a survey affects whether participants answer 'yes' or 'no', so the populatio…
▽ More
This paper demonstrates that some non-classical models of human decision-making can be run successfully as circuits on quantum computers. Since the 1960s, many observed cognitive behaviors have been shown to violate rules based on classical probability and set theory. For example, the order in which questions are posed in a survey affects whether participants answer 'yes' or 'no', so the population that answers 'yes' to both questions cannot be modeled as the intersection of two fixed sets. It can, however, be modeled as a sequence of projections carried out in different orders. This and other examples have been described successfully using quantum probability, which relies on comparing angles between subspaces rather than volumes between subsets.
Now in the early 2020s, quantum computers have reached the point where some of these quantum cognitive models can be implemented and investigated on quantum hardware, by representing the mental states in qubit registers, and the cognitive operations and decisions using different gates and measurements. This paper develops such quantum circuit representations for quantum cognitive models, focusing particularly on modeling order effects and decision-making under uncertainty. The claim is not that the human brain uses qubits and quantum circuits explicitly (just like the use of Boolean set theory does not require the brain to be using classical bits), but that the mathematics shared between quantum cognition and quantum computing motivates the exploration of quantum computers for cognition modeling. Key quantum properties include superposition, entanglement, and collapse, as these mathematical elements provide a common language between cognitive models, quantum hardware, and circuit implementations.
△ Less
Submitted 23 March, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Quantum Text Encoding for Classification Tasks
Authors:
Aaranya Alexander,
Dominic Widdows
Abstract:
This paper explores text classification on quantum computers. Previous results have achieved perfect accuracy on an artificial dataset of 100 short sentences, but at the unscalable cost of using a qubit for each word. This paper demonstrates that an amplitude encoded feature map combined with a quantum support vector machine can achieve 62% average accuracy predicting sentiment using a dataset of…
▽ More
This paper explores text classification on quantum computers. Previous results have achieved perfect accuracy on an artificial dataset of 100 short sentences, but at the unscalable cost of using a qubit for each word. This paper demonstrates that an amplitude encoded feature map combined with a quantum support vector machine can achieve 62% average accuracy predicting sentiment using a dataset of 50 actual movie reviews. This is still small, but considerably larger than previously-reported results in quantum NLP.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
Near-Term Advances in Quantum Natural Language Processing
Authors:
Dominic Widdows,
Aaranya Alexander,
Daiwei Zhu,
Chase Zimmerman,
Arunava Majumder
Abstract:
This paper describes experiments showing that some tasks in natural language processing (NLP) can already be performed using quantum computers, though so far only with small datasets.
We demonstrate various approaches to topic classification. The first uses an explicit word-based approach, in which word-topic scoring weights are implemented as fractional rotations of individual qubit, and a new…
▽ More
This paper describes experiments showing that some tasks in natural language processing (NLP) can already be performed using quantum computers, though so far only with small datasets.
We demonstrate various approaches to topic classification. The first uses an explicit word-based approach, in which word-topic scoring weights are implemented as fractional rotations of individual qubit, and a new phrase is classified based on the accumulation of these weights in a scoring qubit using entangling controlled-NOT gates. This is compared with more scalable quantum encodings of word embedding vectors, which are used in the computation of kernel values in a quantum support vector machine: this approach achieved an average of 62% accuracy on classification tasks involving over 10000 words, which is the largest such quantum computing experiment to date.
We describe a quantum probability approach to bigram modeling that can be applied to sequences of words and formal concepts, investigating a generative approximation to these distributions using a quantum circuit Born machine, and an approach to ambiguity resolution in verb-noun composition using single-qubit rotations for simple nouns and 2-qubit controlled-NOT gates for simple verbs.
The smaller systems described have been run successfully on physical quantum computers, and the larger ones have been simulated. We show that statistically meaningful results can be obtained using real datasets, but this is much more difficult to predict than with easier artificial language examples used previously in develo** quantum NLP systems.
Other approaches to quantum NLP are compared, partly with respect to contemporary issues including informal language, fluency, and truthfulness.
△ Less
Submitted 15 April, 2024; v1 submitted 5 June, 2022;
originally announced June 2022.
-
Nonlinear Addition of Qubit States Using Entangled Quaternionic Powers of Single-Qubit Gates
Authors:
Dominic Widdows
Abstract:
This paper presents a novel way to use the algebra of unit quaternions to express arbitrary roots or fractional powers of single-qubit gates, and to use such fractional powers as generators for algebras that combine these fractional input signals, behaving as a kind of nonlinear addition. The method works by connecting several well-known equivalences. The group of all single-qubit gates is $U(2)$,…
▽ More
This paper presents a novel way to use the algebra of unit quaternions to express arbitrary roots or fractional powers of single-qubit gates, and to use such fractional powers as generators for algebras that combine these fractional input signals, behaving as a kind of nonlinear addition. The method works by connecting several well-known equivalences. The group of all single-qubit gates is $U(2)$, the unitary transformations of $C^2$. Using an appropriate phase multiplier, every element of $U(2)$ can be mapped to a corresponding element of $SU(2)$ with unit determinant, whose quantum mechanical behavior is identical. The group $SU(2)$ is isomorphic to the group of unit quaternions. Powers and roots of unit quaternions can be constructed by extending de Moivre's theorem for roots of complex numbers to the quaternions by selecting a preferred square root of -1. Using this chain of equivalences, for any single-qubit gate $A$ and real exponent $k$, a gate $B$ can be predictably constructed so that $B^k = A$. Different fractions generated in this way can be combined by connecting the individually rotated qubits to a common 'sum' qubit using 2-qubit CNOT gates. Examples of such algebras are explored including those generated by roots of the quaternion $k$ (which corresponds to and $X$-rotation of the Bloch sphere), the quaternion $\frac{\sqrt{2}}{2}(i + k)$ (which corresponds to the Hadamard gate), and a mixture of these. One of the goals of this research is to develop quantum versions of classical components such as the classifier ensembles and activation functions used in machine learning and artificial intelligence. An example application for text classification is presented, which uses fractional rotation gates to represent classifier weights, and classifies new input by using CNOT gates to collect the appropriate classifier weights in a topic-scoring qubit.
△ Less
Submitted 28 April, 2022;
originally announced April 2022.
-
Actionable Conversational Quality Indicators for Improving Task-Oriented Dialog Systems
Authors:
Michael Higgins,
Dominic Widdows,
Chris Brew,
Gwen Christian,
Andrew Maurer,
Matthew Dunn,
Sujit Mathi,
Akshay Hazare,
George Bonev,
Beth Ann Hockey,
Kristen Howell,
Joe Bradley
Abstract:
Automatic dialog systems have become a mainstream part of online customer service. Many such systems are built, maintained, and improved by customer service specialists, rather than dialog systems engineers and computer programmers. As conversations between people and machines become commonplace, it is critical to understand what is working, what is not, and what actions can be taken to reduce the…
▽ More
Automatic dialog systems have become a mainstream part of online customer service. Many such systems are built, maintained, and improved by customer service specialists, rather than dialog systems engineers and computer programmers. As conversations between people and machines become commonplace, it is critical to understand what is working, what is not, and what actions can be taken to reduce the frequency of inappropriate system responses. These analyses and recommendations need to be presented in terms that directly reflect the user experience rather than the internal dialog processing.
This paper introduces and explains the use of Actionable Conversational Quality Indicators (ACQIs), which are used both to recognize parts of dialogs that can be improved, and to recommend how to improve them. This combines benefits of previous approaches, some of which have focused on producing dialog quality scoring while others have sought to categorize the types of errors the dialog system is making.
We demonstrate the effectiveness of using ACQIs on LivePerson internal dialog systems used in commercial customer service applications, and on the publicly available CMU LEGOv2 conversational dataset (Raux et al. 2005). We report on the annotation and analysis of conversational datasets showing which ACQIs are important to fix in various situations.
The annotated datasets are then used to build a predictive model which uses a turn-based vector embedding of the message texts and achieves an 79% weighted average f1-measure at the task of finding the correct ACQI for a given conversation. We predict that if such a model worked perfectly, the range of potential improvement actions a bot-builder must consider at each turn could be reduced by an average of 81%.
△ Less
Submitted 22 September, 2021;
originally announced September 2021.
-
Language Identification with a Reciprocal Rank Classifier
Authors:
Dominic Widdows,
Chris Brew
Abstract:
Language identification is a critical component of language processing pipelines (Jauhiainen et al.,2019) and is not a solved problem in real-world settings. We present a lightweight and effective language identifier that is robust to changes of domain and to the absence of copious training data.
The key idea for classification is that the reciprocal of the rank in a frequency table makes an eff…
▽ More
Language identification is a critical component of language processing pipelines (Jauhiainen et al.,2019) and is not a solved problem in real-world settings. We present a lightweight and effective language identifier that is robust to changes of domain and to the absence of copious training data.
The key idea for classification is that the reciprocal of the rank in a frequency table makes an effective additive feature score, hence the term Reciprocal Rank Classifier (RRC). The key finding for language classification is that ranked lists of words and frequencies of characters form a sufficient and robust representation of the regularities of key languages and their orthographies.
We test this on two 22-language data sets and demonstrate zero-effort domain adaptation from a Wikipedia training set to a Twitter test set. When trained on Wikipedia but applied to Twitter the macro-averaged F1-score of a conventionally trained SVM classifier drops from 90.9% to 77.7%. By contrast, the macro F1-score of RRC drops only from 93.1% to 90.6%. These classifiers are compared with those from fastText and langid. The RRC performs better than these established systems in most experiments, especially on short Wikipedia texts and Twitter.
The RRC classifier can be improved for particular domains and conversational situations by adding words to the ranked lists. Using new terms learned from such conversations, we demonstrate a further 7.9% increase in accuracy of sample message classification, and 1.7% increase for conversation classification. Surprisingly, this made results on Twitter data slightly worse.
The RRC classifier is available as an open source Python package (https://github.com/LivePersonInc/lplangid).
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
Should Semantic Vector Composition be Explicit? Can it be Linear?
Authors:
Dominic Widdows,
Kristen Howell,
Trevor Cohen
Abstract:
Vector representations have become a central element in semantic language modelling, leading to mathematical overlaps with many fields including quantum theory. Compositionality is a core goal for such representations: given representations for 'wet' and 'fish', how should the concept 'wet fish' be represented?
This position paper surveys this question from two points of view. The first consider…
▽ More
Vector representations have become a central element in semantic language modelling, leading to mathematical overlaps with many fields including quantum theory. Compositionality is a core goal for such representations: given representations for 'wet' and 'fish', how should the concept 'wet fish' be represented?
This position paper surveys this question from two points of view. The first considers the question of whether an explicit mathematical representation can be successful using only tools from within linear algebra, or whether other mathematical tools are needed. The second considers whether semantic vector composition should be explicitly described mathematically, or whether it can be a model-internal side-effect of training a neural network.
A third and newer question is whether a compositional model can be implemented on a quantum computer. Given the fundamentally linear nature of quantum mechanics, we propose that these questions are related, and that this survey may help to highlight candidate operations for future quantum implementation.
△ Less
Submitted 10 May, 2021; v1 submitted 13 April, 2021;
originally announced April 2021.
-
Quantum Mathematics in Artificial Intelligence
Authors:
Dominic Widdows,
Kirsty Kitto,
Trevor Cohen
Abstract:
In the decade since 2010, successes in artificial intelligence have been at the forefront of computer science and technology, and vector space models have solidified a position at the forefront of artificial intelligence. At the same time, quantum computers have become much more powerful, and announcements of major advances are frequently in the news.
The mathematical techniques underlying both…
▽ More
In the decade since 2010, successes in artificial intelligence have been at the forefront of computer science and technology, and vector space models have solidified a position at the forefront of artificial intelligence. At the same time, quantum computers have become much more powerful, and announcements of major advances are frequently in the news.
The mathematical techniques underlying both these areas have more in common than is sometimes realized. Vector spaces took a position at the axiomatic heart of quantum mechanics in the 1930s, and this adoption was a key motivation for the derivation of logic and probability from the linear geometry of vector spaces. Quantum interactions between particles are modelled using the tensor product, which is also used to express objects and operations in artificial neural networks.
This paper describes some of these common mathematical areas, including examples of how they are used in artificial intelligence (AI), particularly in automated reasoning and natural language processing (NLP). Techniques discussed include vector spaces, scalar products, subspaces and implication, orthogonal projection and negation, dual vectors, density matrices, positive operators, and tensor products. Application areas include information retrieval, categorization and implication, modelling word-senses and disambiguation, inference in knowledge bases, and semantic composition.
Some of these approaches can potentially be implemented on quantum hardware. Many of the practical steps in this implementation are in early stages, and some are already realized. Explaining some of the common mathematical tools can help researchers in both AI and quantum computing further exploit these overlaps, recognizing and exploring new directions along the way.
△ Less
Submitted 16 December, 2021; v1 submitted 11 January, 2021;
originally announced January 2021.
-
A Quantum Logic of Down Below
Authors:
P. D. Bruza,
D. Widdows,
John Woods
Abstract:
This chapter is offered as a contribution to the logic of down below. We attempt to demonstrate that the nature of human agency necessitates that there actually be such a logic. The ensuing sections develop the suggestion that cognition down below has a structure strikingly similar to the physical structure of quantum states. In its general form, this is not an idea that originates with the pres…
▽ More
This chapter is offered as a contribution to the logic of down below. We attempt to demonstrate that the nature of human agency necessitates that there actually be such a logic. The ensuing sections develop the suggestion that cognition down below has a structure strikingly similar to the physical structure of quantum states. In its general form, this is not an idea that originates with the present authors. It is known that there exist mathematical models from the cognitive science of cognition down below that have certain formal similarities to quantum mechanics. We want to take this idea seriously. We will propose that the subspaces of von Neumann-Birkhoff lattices are too crisp for modelling requisite cognitive aspects in relation to subsymbolic logic. Instead, we adopt an approach which relies on projections into nonorthogonal density states. The projection operator is motivated from cues which probe human memory.
△ Less
Submitted 6 December, 2006;
originally announced December 2006.
-
Using Curvature and Markov Clustering in Graphs for Lexical Acquisition and Word Sense Discrimination
Authors:
Beate Dorow,
Dominic Widdows,
Katarina Ling,
Jean-Pierre Eckmann,
Danilo Sergi,
Elisha Moses
Abstract:
We introduce two different approaches for clustering semantically similar words. We accommodate ambiguity by allowing a word to belong to several clusters.
Both methods use a graph-theoretic representation of words and their paradigmatic relationships. The first approach is based on the concept of curvature and divides the word graph into classes of similar words by removing words of low curva…
▽ More
We introduce two different approaches for clustering semantically similar words. We accommodate ambiguity by allowing a word to belong to several clusters.
Both methods use a graph-theoretic representation of words and their paradigmatic relationships. The first approach is based on the concept of curvature and divides the word graph into classes of similar words by removing words of low curvature which connect several dispersed clusters.
The second method, instead of clustering the nodes, clusters the links in our graph. These contain more specific contextual information than nodes representing just words. In so doing, we naturally accommodate ambiguity by allowing multiple class membership.
Both methods are evaluated on a lexical acquisition task, using clustering to add nouns to the WordNet taxonomy. The most effective method is link clustering.
△ Less
Submitted 29 March, 2004;
originally announced March 2004.
-
A Dolbeault-type Double Complex on Quaternionic Manifolds
Authors:
Dominic Widdows
Abstract:
It has long been known that differential forms on complex manifolds can be decomposed under the action of the complex structure to give the Dolbeault complex.
This paper presents an analogous double complex for quaternionic manifolds using the fact that the cotangent space is isomorphic to a quaternionic vector space. This defines an action of the group Sp(1) of unit quaternions on the cotange…
▽ More
It has long been known that differential forms on complex manifolds can be decomposed under the action of the complex structure to give the Dolbeault complex.
This paper presents an analogous double complex for quaternionic manifolds using the fact that the cotangent space is isomorphic to a quaternionic vector space. This defines an action of the group Sp(1) of unit quaternions on the cotangent space, which induces an action of Sp(1) on the space of k-forms. A double complex is obtained by decomposing the k-forms into irreducible representations of Sp(1), resulting in new 'quaternionic Dolbeault' operators and cohomology groups.
Links with previous work in quaternionic geometry, particularly the differential complex of Salamon and the q-holomorphic functions of Joyce, are demonstrated.
△ Less
Submitted 8 December, 2000;
originally announced December 2000.