Skip to main content

Showing 1–14 of 14 results for author: Wijnholds, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.14917  [pdf, other

    cs.CL cs.AI

    Structural Ambiguity and its Disambiguation in Language Model Based Parsers: the Case of Dutch Clause Relativization

    Authors: Gijs Wijnholds, Michael Moortgat

    Abstract: This paper addresses structural ambiguity in Dutch relative clauses. By investigating the task of disambiguation by grounding, we study how the presence of a prior sentence can resolve relative clause ambiguities. We apply this method to two parsing architectures in an attempt to demystify the parsing and language model components of two present-day neural parsers. Results show that a neurosymboli… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  2. Proceedings End-to-End Compositional Models of Vector-Based Semantics

    Authors: Michael Moortgat, Gijs Wijnholds

    Abstract: The workshop End-to-End Compositional Models of Vector-Based Semantics was held at NUI Galway on 15 and 16 August 2022 as part of the 33rd European Summer School in Logic, Language and Information (ESSLLI 2022). The workshop was sponsored by the research project 'A composition calculus for vector-based semantic modelling with a localization for Dutch' (Dutch Research Council 360-89-070, 2017-202… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Journal ref: EPTCS 366, 2022

  3. arXiv:2203.01063  [pdf, other

    cs.CL cs.LG

    Discontinuous Constituency and BERT: A Case Study of Dutch

    Authors: Konstantinos Kogkalidis, Gijs Wijnholds

    Abstract: In this paper, we set out to quantify the syntactic capacity of BERT in the evaluation regime of non-context free patterns, as occurring in Dutch. We devise a test suite based on a mildly context-sensitive formalism, from which we derive grammars that capture the linguistic phenomena of control verb nesting and verb raising. The grammars, paired with a small lexicon, provide us with a large collec… ▽ More

    Submitted 8 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: 8 pages plus references. To appear in Findings of the Association for Computational Linguistics 2022

  4. arXiv:2110.10641  [pdf, ps, other

    cs.LO

    Anaphora and Ellipsis in Lambek Calculus with a Relevant Modality: Syntax and Semantics

    Authors: Lachlan McPheat, Gijs Wijnholds, Mehrnoosh Sadrzadeh, Adriana Correia, Alexis Toumi

    Abstract: Lambek calculus with a relevant modality $!\mathbf{L^*}$ of arXiv:1601.06303 syntactically resolves parasitic gaps in natural language. It resembles the Lambek calculus with anaphora $\mathbf{LA}$ of (Jäger, 1998) and the Lambek calculus with controlled contraction, $\mathbf{L}_{\Diamond}$, of arXiv:1905.01647v1 which deal with anaphora and ellipsis. What all these calculi add to Lambek calculus i… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

  5. Fuzzy Generalised Quantifiers for Natural Language in Categorical Compositional Distributional Semantics

    Authors: Matej Dostal, Mehrnoosh Sadrzadeh, Gijs Wijnholds

    Abstract: Recent work on compositional distributional models shows that bialgebras over finite dimensional vector spaces can be applied to treat generalised quantifiers for natural language. That technique requires one to construct the vector space over powersets, and therefore is computationally costly. In this paper, we overcome this problem by considering fuzzy versions of quantifiers along the lines of… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: https://link.springer.com/chapter/10.1007/978-3-030-53654-1_6

    ACM Class: I.2.7

    Journal ref: In: Mojtahedi M., Rahman S., Zarepour M.S. (eds) Mathematics, Logic, and their Philosophies. Logic, Epistemology, and the Unity of Science, vol 49, pp 135-160 Springer, 2021

  6. arXiv:2104.10516  [pdf, other

    cs.CL cs.LG

    Improving BERT Pretraining with Syntactic Supervision

    Authors: Giorgos Tziafas, Konstantinos Kogkalidis, Gijs Wijnholds, Michael Moortgat

    Abstract: Bidirectional masked Transformers have become the core theme in the current NLP landscape. Despite their impressive benchmarks, a recurring theme in recent research has been to question such models' capacity for syntactic generalization. In this work, we seek to address this question by adding a supervised, token-level supertagging objective to standard unsupervised pretraining, enabling the expli… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: 4 pages, rejected by IWCS due to "not fitting the conference theme"

  7. Categorical Vector Space Semantics for Lambek Calculus with a Relevant Modality (Extended Abstract)

    Authors: Lachlan McPheat, Mehrnoosh Sadrzadeh, Hadi Wazni, Gijs Wijnholds

    Abstract: We develop a categorical compositional distributional semantics for Lambek Calculus with a Relevant Modality, which has a limited version of the contraction and permutation rules. The categorical part of the semantics is a monoidal biclosed category with a coalgebra modality as defined on Differential Categories. We instantiate this category to finite dimensional vector spaces and linear maps via… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: In Proceedings ACT 2020, arXiv:2101.07888. arXiv admin note: substantial text overlap with arXiv:2005.03074

    Journal ref: EPTCS 333, 2021, pp. 168-182

  8. arXiv:2101.05716  [pdf, other

    cs.CL

    SICKNL: A Dataset for Dutch Natural Language Inference

    Authors: Gijs Wijnholds, Michael Moortgat

    Abstract: We present SICK-NL (read: signal), a dataset targeting Natural Language Inference in Dutch. SICK-NL is obtained by translating the SICK dataset of Marelli et al. (2014)from English into Dutch. Having a parallel inference dataset allows us to compare both monolingual and multilingual NLP models for English and Dutch on the two tasks. In the paper, we motivate and detail the translation process, per… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

    Comments: To appear at EACL 2021

  9. arXiv:2005.05639  [pdf, ps, other

    cs.CL

    A Frobenius Algebraic Analysis for Parasitic Gaps

    Authors: Michael Moortgat, Mehrnoosh Sadrzadeh, Gijs Wijnholds

    Abstract: The interpretation of parasitic gaps is an ostensible case of non-linearity in natural language composition. Existing categorial analyses, both in the typelogical and in the combinatory traditions, rely on explicit forms of syntactic copying. We identify two types of parasitic gap** where the duplication of semantic content can be confined to the lexicon. Parasitic gaps in adjuncts are analysed… ▽ More

    Submitted 7 July, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: SemSpace 2019, to appear in Journal of Applied Logics

  10. Categorical Vector Space Semantics for Lambek Calculus with a Relevant Modality

    Authors: Lachlan McPheat, Mehrnoosh Sadrzadeh, Hadi Wazni, Gijs Wijnholds

    Abstract: We develop a categorical compositional distributional semantics for Lambek Calculus with a Relevant Modality !L*, which has a limited edition of the contraction and permutation rules. The categorical part of the semantics is a monoidal biclosed category with a coalgebra modality, very similar to the structure of a Differential Category. We instantiate this category to finite dimensional vector spa… ▽ More

    Submitted 11 May, 2023; v1 submitted 6 May, 2020; originally announced May 2020.

    Journal ref: Compositionality 5, 2 (2023)

  11. arXiv:1905.01647  [pdf, ps, other

    cs.CL cs.AI cs.LO math.LO

    A Typedriven Vector Semantics for Ellipsis with Anaphora using Lambek Calculus with Limited Contraction

    Authors: Gijs Wijnholds, Mehrnoosh Sadrzadeh

    Abstract: We develop a vector space semantics for verb phrase ellipsis with anaphora using type-driven compositional distributional semantics based on the Lambek calculus with limited contraction (LCC) of Jäger (2006). Distributional semantics has a lot to say about the statistical collocation-based meanings of content words, but provides little guidance on how to treat function words. Formal semantics on t… ▽ More

    Submitted 5 May, 2019; originally announced May 2019.

    Comments: Forthcoming in: Journal of Logic, Language and Information

  12. arXiv:1811.03276  [pdf, ps, other

    cs.CL cs.AI cs.LO

    Classical Copying versus Quantum Entanglement in Natural Language: The Case of VP-ellipsis

    Authors: Gijs Wijnholds, Mehrnoosh Sadrzadeh

    Abstract: This paper compares classical copying and quantum entanglement in natural language by considering the case of verb phrase (VP) ellipsis. VP ellipsis is a non-linear linguistic phenomenon that requires the reuse of resources, making it the ideal test case for a comparative study of different copying behaviours in compositional models of natural language. Following the line of research in compositio… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: In Proceedings CAPNS 2018, arXiv:1811.02701

    Journal ref: EPTCS 283, 2018, pp. 103-119

  13. arXiv:1810.10297  [pdf, ps, other

    cs.CL cs.LO

    A Proof-Theoretic Approach to Scope Ambiguity in Compositional Vector Space Models

    Authors: Gijs Jasper Wijnholds

    Abstract: We investigate the extent to which compositional vector space models can be used to account for scope ambiguity in quantified sentences (of the form "Every man loves some woman"). Such sentences containing two quantifiers introduce two readings, a direct scope reading and an inverse scope reading. This ambiguity has been treated in a vector space model using bialgebras by (Hedges and Sadrzadeh, 20… ▽ More

    Submitted 25 October, 2018; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: This is a preprint of a paper to appear in: Journal of Language Modelling, 2018

  14. arXiv:1711.11513  [pdf, ps, other

    cs.CL

    Lexical and Derivational Meaning in Vector-Based Models of Relativisation

    Authors: Michael Moortgat, Gijs Wijnholds

    Abstract: Sadrzadeh et al (2013) present a compositional distributional analysis of relative clauses in English in terms of the Frobenius algebraic structure of finite dimensional vector spaces. The analysis relies on distinct type assignments and lexical recipes for subject vs object relativisation. The situation for Dutch is different: because of the verb final nature of Dutch, relative clauses are ambigu… ▽ More

    Submitted 1 December, 2017; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: 10 page version to appear in Proceedings Amsterdam Colloquium, updated with appendix