Skip to main content

Showing 1–25 of 25 results for author: Kartsaklis, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.15696  [pdf, other

    quant-ph cs.AI cs.LG

    Peptide Binding Classification on Quantum Computers

    Authors: Charles London, Douglas Brown, Wenduan Xu, Sezen Vatansever, Christopher James Langmead, Dimitri Kartsaklis, Stephen Clark, Konstantinos Meichanetzidis

    Abstract: We conduct an extensive study on using near-term quantum computers for a task in the domain of computational biology. By constructing quantum models based on parameterised quantum circuits we perform sequence classification on a task relevant to the design of therapeutic proteins, and find competitive performance with classical baselines of similar scale. To study the effect of noise, we run some… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  2. arXiv:2110.04236  [pdf, other

    cs.CL cs.AI quant-ph

    lambeq: An Efficient High-Level Python Library for Quantum NLP

    Authors: Dimitri Kartsaklis, Ian Fan, Richie Yeung, Anna Pearson, Robin Lorenz, Alexis Toumi, Giovanni de Felice, Konstantinos Meichanetzidis, Stephen Clark, Bob Coecke

    Abstract: We present lambeq, the first high-level Python library for Quantum Natural Language Processing (QNLP). The open-source toolkit offers a detailed hierarchy of modules and classes implementing all stages of a pipeline for converting sentences to string diagrams, tensor networks, and quantum circuits ready to be used on a quantum computer. lambeq supports syntactic parsing, rewriting and simplificati… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  3. arXiv:2105.07720  [pdf, other

    cs.CL math.CT

    A CCG-Based Version of the DisCoCat Framework

    Authors: Richie Yeung, Dimitri Kartsaklis

    Abstract: While the DisCoCat model (Coecke et al., 2010) has been proved a valuable tool for studying compositional aspects of language at the level of semantics, its strong dependency on pregroup grammars poses important restrictions: first, it prevents large-scale experimentation due to the absence of a pregroup parser; and second, it limits the expressibility of the model to context-free grammars. In thi… ▽ More

    Submitted 24 May, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: SemSpace 2021: Semantic Spaces at the Intersection of NLP, Physics, and Cognitive Science

  4. arXiv:2102.12846  [pdf, other

    cs.CL cs.AI cs.LG quant-ph

    QNLP in Practice: Running Compositional Models of Meaning on a Quantum Computer

    Authors: Robin Lorenz, Anna Pearson, Konstantinos Meichanetzidis, Dimitri Kartsaklis, Bob Coecke

    Abstract: Quantum Natural Language Processing (QNLP) deals with the design and implementation of NLP models intended to be run on quantum hardware. In this paper, we present results on the first NLP experiments conducted on Noisy Intermediate-Scale Quantum (NISQ) computers for datasets of size greater than 100 sentences. Exploiting the formal similarity of the compositional model of meaning by Coecke, Sadrz… ▽ More

    Submitted 4 May, 2023; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: 38 pages

    Journal ref: Journal of Artificial Intelligence Research Vol. 76 (2023), 1305-1342

  5. arXiv:2010.12770  [pdf, other

    cs.CL

    Conversational Semantic Parsing for Dialog State Tracking

    Authors: Jianpeng Cheng, Devang Agrawal, Hector Martinez Alonso, Shruti Bhargava, Joris Driesen, Federico Flego, Shaona Ghosh, Dain Kaplan, Dimitri Kartsaklis, Lin Li, Dhivya Piraviperumal, Jason D Williams, Hong Yu, Diarmuid O Seaghdha, Anders Johannsen

    Abstract: We consider a new perspective on dialog state tracking (DST), the task of estimating a user's goal through the course of a dialog. By formulating DST as a semantic parsing task over hierarchical representations, we can incorporate semantic compositionality, cross-domain knowledge sharing and co-reference. We present TreeDST, a dataset of 27k conversations annotated with tree-structured dialog stat… ▽ More

    Submitted 13 May, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: Publish as a conference paper at EMNLP 2020

  6. arXiv:1811.04983  [pdf, other

    cs.CL cs.AI cs.LG

    Unseen Word Representation by Aligning Heterogeneous Lexical Semantic Spaces

    Authors: Victor Prokhorov, Mohammad Taher Pilehvar, Dimitri Kartsaklis, Pietro Lio, Nigel Collier

    Abstract: Word embedding techniques heavily rely on the abundance of training data for individual words. Given the Zipfian distribution of words in natural language texts, a large number of words do not usually appear frequently or at all in the training data. In this paper we put forward a technique that exploits the knowledge encoded in lexical resources, such as WordNet, to induce embeddings for unseen w… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: Accepted for presentation at AAAI 2019

  7. arXiv:1811.02701   

    cs.CL cs.AI cs.GT

    Proceedings of the 2018 Workshop on Compositional Approaches in Physics, NLP, and Social Sciences

    Authors: Martha Lewis, Bob Coecke, Jules Hedges, Dimitri Kartsaklis, Dan Marsden

    Abstract: The ability to compose parts to form a more complex whole, and to analyze a whole as a combination of elements, is desirable across disciplines. This workshop bring together researchers applying compositional approaches to physics, NLP, cognitive science, and game theory. Within NLP, a long-standing aim is to represent how words can combine to form phrases and sentences. Within the framework of di… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Journal ref: EPTCS 283, 2018

  8. arXiv:1808.09308  [pdf, other

    cs.CL

    Card-660: Cambridge Rare Word Dataset - a Reliable Benchmark for Infrequent Word Representation Models

    Authors: Mohammad Taher Pilehvar, Dimitri Kartsaklis, Victor Prokhorov, Nigel Collier

    Abstract: Rare word representation has recently enjoyed a surge of interest, owing to the crucial role that effective handling of infrequent words can play in accurate semantic understanding. However, there is a paucity of reliable benchmarks for evaluation and comparison of these techniques. We show in this paper that the only existing benchmark (the Stanford Rare Word dataset) suffers from low-confidence… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018

  9. arXiv:1808.07724  [pdf, other

    cs.CL cs.AI cs.LG

    Map** Text to Knowledge Graph Entities using Multi-Sense LSTMs

    Authors: Dimitri Kartsaklis, Mohammad Taher Pilehvar, Nigel Collier

    Abstract: This paper addresses the problem of map** natural language text to knowledge base entities. The map** process is approached as a composition of a phrase or a sentence into a point in a multi-dimensional entity space obtained from a knowledge graph. The compositional model is an LSTM equipped with a dynamic disambiguation mechanism on the input word embeddings (a Multi-Sense LSTM), addressing p… ▽ More

    Submitted 23 August, 2018; originally announced August 2018.

    Comments: Accepted for presentation at EMNLP 2018 (main conference)

  10. arXiv:1707.07554  [pdf, other

    cs.CL cs.AI

    Learning Rare Word Representations using Semantic Bridging

    Authors: Victor Prokhorov, Mohammad Taher Pilehvar, Dimitri Kartsaklis, Pietro Lió, Nigel Collier

    Abstract: We propose a methodology that adapts graph embedding techniques (DeepWalk (Perozzi et al., 2014) and node2vec (Grover and Leskovec, 2016)) as well as cross-lingual vector space map** approaches (Least Squares and Canonical Correlation Analysis) in order to merge the corpus and ontological sources of lexical knowledge. We also perform comparative analysis of the used algorithms in order to identi… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

  11. arXiv:1703.10252  [pdf, other

    cs.CL hep-th math.CO

    Linguistic Matrix Theory

    Authors: Dimitrios Kartsaklis, Sanjaye Ramgoolam, Mehrnoosh Sadrzadeh

    Abstract: Recent research in computational linguistics has developed algorithms which associate matrices with adjectives and verbs, based on the distribution of words in a corpus of text. These matrices are linear operators on a vector space of context words. They are used to construct the meaning of composite expressions from that of the elementary constituents, forming part of a compositional distribution… ▽ More

    Submitted 28 March, 2017; originally announced March 2017.

    Comments: 32 pages, 3 figures

    Report number: QMUL-PH-17-03

  12. arXiv:1610.04416  [pdf, ps, other

    cs.CL cs.AI

    Distributional Inclusion Hypothesis for Tensor-based Composition

    Authors: Dimitri Kartsaklis, Mehrnoosh Sadrzadeh

    Abstract: According to the distributional inclusion hypothesis, entailment between words can be measured via the feature inclusions of their distributional vectors. In recent work, we showed how this hypothesis can be extended from words to phrases and sentences in the setting of compositional distributional semantics. This paper focuses on inclusion properties of tensors; its main contribution is a theoret… ▽ More

    Submitted 14 October, 2016; originally announced October 2016.

    Comments: To appear in COLING 2016

  13. arXiv:1608.01018   

    cs.CL cs.AI math.CT quant-ph

    Proceedings of the 2016 Workshop on Semantic Spaces at the Intersection of NLP, Physics and Cognitive Science

    Authors: Dimitrios Kartsaklis, Martha Lewis, Laura Rimell

    Abstract: This volume contains the Proceedings of the 2016 Workshop on Semantic Spaces at the Intersection of NLP, Physics and Cognitive Science (SLPCS 2016), which was held on the 11th of June at the University of Strathclyde, Glasgow, and was co-located with Quantum Physics and Logic (QPL 2016). Exploiting the common ground provided by the concept of a vector space, the workshop brought together researche… ▽ More

    Submitted 2 August, 2016; originally announced August 2016.

    Journal ref: EPTCS 221, 2016

  14. arXiv:1606.01515  [pdf, ps, other

    cs.CL cs.AI math.CT

    Coordination in Categorical Compositional Distributional Semantics

    Authors: Dimitri Kartsaklis

    Abstract: An open problem with categorical compositional distributional semantics is the representation of words that are considered semantically vacuous from a distributional perspective, such as determiners, prepositions, relative pronouns or coordinators. This paper deals with the topic of coordination between identical syntactic types, which accounts for the majority of coordination cases in language. B… ▽ More

    Submitted 3 August, 2016; v1 submitted 5 June, 2016; originally announced June 2016.

    Comments: In Proceedings SLPCS 2016, arXiv:1608.01018

    Journal ref: EPTCS 221, 2016, pp. 29-38

  15. arXiv:1512.04419  [pdf, other

    cs.CL cs.AI math.CT

    Sentence Entailment in Compositional Distributional Semantics

    Authors: Esma Balkir, Dimitri Kartsaklis, Mehrnoosh Sadrzadeh

    Abstract: Distributional semantic models provide vector representations for words by gathering co-occurrence frequencies from corpora of text. Compositional distributional models extend these from words to phrases and sentences. In categorical compositional distributional semantics, phrase and sentence representations are functions of their grammatical structure and representations of the words therein. In… ▽ More

    Submitted 9 October, 2018; v1 submitted 14 December, 2015; originally announced December 2015.

    Comments: 8 pages, 1 figure, 2 tables, short version presented in the International Symposium on Artificial Intelligence and Mathematics (ISAIM), 2016

    MSC Class: 03B65 ACM Class: I.2.7

    Journal ref: Ann Math Artif Intell (2018) 82: 189. https://doi.org/10.1007/s10472-017-9570-x

  16. arXiv:1508.02354  [pdf, other

    cs.CL cs.AI cs.NE

    Syntax-Aware Multi-Sense Word Embeddings for Deep Compositional Models of Meaning

    Authors: Jianpeng Cheng, Dimitri Kartsaklis

    Abstract: Deep compositional models of meaning acting on distributional representations of words in order to produce vectors of larger text constituents are evolving to a popular area of NLP research. We detail a compositional distributional framework based on a rich form of word embeddings that aims at facilitating the interactions between words in the context of a sentence. Embeddings and composition laye… ▽ More

    Submitted 13 August, 2015; v1 submitted 10 August, 2015; originally announced August 2015.

    Comments: Accepted for presentation at EMNLP 2015

  17. arXiv:1505.06294  [pdf, ps, other

    cs.CL cs.AI math.CT math.RA

    A Frobenius Model of Information Structure in Categorical Compositional Distributional Semantics

    Authors: Dimitri Kartsaklis, Mehrnoosh Sadrzadeh

    Abstract: The categorical compositional distributional model of Coecke, Sadrzadeh and Clark provides a linguistically motivated procedure for computing the meaning of a sentence as a function of the distributional meaning of the words therein. The theoretical framework allows for reasoning about compositional aspects of language and offers structural ways of studying the underlying relationships. While the… ▽ More

    Submitted 23 May, 2015; originally announced May 2015.

    Comments: Accepted for presentation in the 14th Meeting on Mathematics of Language (2015)

  18. arXiv:1505.00138  [pdf, other

    cs.CL cs.AI math.CT math.QA quant-ph

    Compositional Distributional Semantics with Compact Closed Categories and Frobenius Algebras

    Authors: Dimitri Kartsaklis

    Abstract: This thesis contributes to ongoing research related to the categorical compositional model for natural language of Coecke, Sadrzadeh and Clark in three ways: Firstly, I propose a concrete instantiation of the abstract framework based on Frobenius algebras (joint work with Sadrzadeh). The theory improves shortcomings of previous proposals, extends the coverage of the language, and is supported by e… ▽ More

    Submitted 1 May, 2015; originally announced May 2015.

    Comments: Ph.D. Dissertation, University of Oxford

  19. arXiv:1502.00831  [pdf, ps, other

    cs.CL cs.LO math.CT math.QA

    Open System Categorical Quantum Semantics in Natural Language Processing

    Authors: Robin Piedeleu, Dimitri Kartsaklis, Bob Coecke, Mehrnoosh Sadrzadeh

    Abstract: Originally inspired by categorical quantum mechanics (Abramsky and Coecke, LiCS'04), the categorical compositional distributional model of natural language meaning of Coecke, Sadrzadeh and Clark provides a conceptually motivated procedure to compute the meaning of a sentence, given its grammatical structure within a Lambek pregroup and a vectorial representation of the meaning of its parts. The pr… ▽ More

    Submitted 4 February, 2015; v1 submitted 3 February, 2015; originally announced February 2015.

  20. arXiv:1411.4116  [pdf, ps, other

    cs.CL cs.LG cs.NE

    Investigating the Role of Prior Disambiguation in Deep-learning Compositional Models of Meaning

    Authors: Jianpeng Cheng, Dimitri Kartsaklis, Edward Grefenstette

    Abstract: This paper aims to explore the effect of prior disambiguation on neural network- based compositional models, with the hope that better semantic representations for text compounds can be produced. We disambiguate the input word vectors before they are fed into a compositional deep net. A series of evaluations shows the positive effect of prior disambiguation for such deep models.

    Submitted 15 November, 2014; originally announced November 2014.

    Comments: NIPS 2014

  21. arXiv:1408.6181  [pdf, ps, other

    cs.CL

    Resolving Lexical Ambiguity in Tensor Regression Models of Meaning

    Authors: Dimitri Kartsaklis, Nal Kalchbrenner, Mehrnoosh Sadrzadeh

    Abstract: This paper provides a method for improving tensor-based compositional distributional models of meaning by the addition of an explicit disambiguation step prior to composition. In contrast with previous research where this hypothesis has been successfully tested against relatively simple compositional models, in our work we use a robust model trained with linear regression. The results we get in tw… ▽ More

    Submitted 26 August, 2014; originally announced August 2014.

    Journal ref: Proceedings of ACL 2014, Vol. 2:Short Papers, pp:212-217

  22. arXiv:1408.6179  [pdf, ps, other

    cs.CL

    Evaluating Neural Word Representations in Tensor-Based Compositional Settings

    Authors: Dmitrijs Milajevs, Dimitri Kartsaklis, Mehrnoosh Sadrzadeh, Matthew Purver

    Abstract: We provide a comparative study between neural word representations and traditional vector spaces based on co-occurrence counts, in a number of compositional tasks. We use three different semantic spaces and implement seven tensor-based compositional models, which we then test (together with simpler additive and multiplicative approaches) in tasks involving verb disambiguation and sentence similari… ▽ More

    Submitted 26 August, 2014; originally announced August 2014.

    Comments: To be published in EMNLP 2014

  23. arXiv:1405.2874  [pdf, other

    cs.CL cs.AI math.CT quant-ph

    A Study of Entanglement in a Categorical Framework of Natural Language

    Authors: Dimitri Kartsaklis, Mehrnoosh Sadrzadeh

    Abstract: In both quantum mechanics and corpus linguistics based on vector spaces, the notion of entanglement provides a means for the various subsystems to communicate with each other. In this paper we examine a number of implementations of the categorical framework of Coecke, Sadrzadeh and Clark (2010) for natural language, from an entanglement perspective. Specifically, our goal is to better understand i… ▽ More

    Submitted 29 December, 2014; v1 submitted 12 May, 2014; originally announced May 2014.

    Comments: In Proceedings QPL 2014, arXiv:1412.8102

    Journal ref: EPTCS 172, 2014, pp. 249-261

  24. arXiv:1401.5980  [pdf, ps, other

    cs.CL cs.AI math.CT

    Reasoning about Meaning in Natural Language with Compact Closed Categories and Frobenius Algebras

    Authors: Dimitri Kartsaklis, Mehrnoosh Sadrzadeh, Stephen Pulman, Bob Coecke

    Abstract: Compact closed categories have found applications in modeling quantum information protocols by Abramsky-Coecke. They also provide semantics for Lambek's pregroup algebras, applied to formalizing the grammatical structure of natural language, and are implicit in a distributional model of word meaning based on vector spaces. Specifically, in previous work Coecke-Clark-Sadrzadeh used the product cate… ▽ More

    Submitted 23 January, 2014; originally announced January 2014.

  25. arXiv:1401.5327  [pdf, other

    cs.CL cs.AI math.CT

    Compositional Operators in Distributional Semantics

    Authors: Dimitri Kartsaklis

    Abstract: This survey presents in some detail the main advances that have been recently taking place in Computational Linguistics towards the unification of the two prominent semantic paradigms: the compositional formal semantics view and the distributional models of meaning based on vector spaces. After an introduction to these two approaches, I review the most important models that aim to provide composit… ▽ More

    Submitted 21 January, 2014; originally announced January 2014.