Showing 1–13 of 13 results for author: Kogkalidis, K

  1. arXiv:2402.02104  [pdf, other]

    cs.LG cs.PL

    Learning Structure-Aware Representations of Dependent Types

    Authors: Konstantinos Kogkalidis, Orestis Melkonian, Jean-Philippe Bernardy

    Abstract: Agda is a dependently-typed programming language and a proof assistant, pivotal in proof formalization and programming language theory. This paper extends the Agda ecosystem into machine learning territory, and, vice versa, makes Agda-related resources available to machine learning practitioners. We introduce and release a novel dataset of Agda program-proofs that is elaborate and extensive enough…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 15 pages, submitted to ICML2024

  2. arXiv:2312.16045  [pdf, other]

    cs.LG cs.AI

    Algebraic Positional Encodings

    Authors: Konstantinos Kogkalidis, Jean-Philippe Bernardy, Vikas Garg

    Abstract: We introduce a novel positional encoding strategy for Transformer-style models, addressing the shortcomings of existing, often ad hoc, approaches. Our framework provides a flexible mapping from the algebraic specification of a domain to an interpretation as orthogonal operators. This design preserves the algebraic characteristics of the source domain, ensuring that the model upholds the desired st…

    Submitted 26 December, 2023; originally announced December 2023.

  3. arXiv:2309.07009  [pdf, other]

    cs.CL

    OYXOY: A Modern NLP Test Suite for Modern Greek

    Authors: Konstantinos Kogkalidis, Stergios Chatzikyriakidis, Eirini Chrysovalantou Giannikouri, Vassiliki Katsouli, Christina Klironomou, Christina Koula, Dimitris Papadakis, Thelka Pasparaki, Erofili Psaltaki, Efthymia Sakellariou, Hara Soupiona

    Abstract: This paper serves as a foundational step towards the development of a linguistically motivated and technically relevant evaluation suite for Greek NLP. We initiate this endeavor by introducing four expert-verified evaluation tasks, specifically targeted at natural language inference, word sense disambiguation (through example comparison or sense selection) and metaphor detection. More than languag…

    Submitted 26 January, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: EACL 2023 (Findings)

  4. arXiv:2302.12050  [pdf, other]

    cs.CL cs.AI cs.LG cs.LO

    SPINDLE: Spinning Raw Text into Lambda Terms with Graph Attention

    Authors: Konstantinos Kogkalidis, Michael Moortgat, Richard Moot

    Abstract: This paper describes SPINDLE - an open source Python module implementing an efficient and accurate parser for written Dutch that transforms raw text input to programs for meaning composition, expressed as λ terms. The parser integrates a number of breakthrough advances made in recent years. Its output consists of hi-res derivations of a multimodal type-logical grammar, capturing two orthogonal axe…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: EACL23 System Demonstrations

  5. arXiv:2203.12235  [pdf, other]

    cs.CL cs.LG

    Geometry-Aware Supertagging with Heterogeneous Dynamic Convolutions

    Authors: Konstantinos Kogkalidis, Michael Moortgat

    Abstract: The syntactic categories of categorial grammar formalisms are structured units made of smaller, indivisible primitives, bound together by the underlying grammar's category formation rules. In the trending approach of constructive supertagging, neural models are increasingly made aware of the internal category structure, which in turn enables them to more reliably predict rare and out-of-vocabulary…

    Submitted 23 January, 2023; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: 8 pages plus references, unpublished preprint. v2: fixed small typos, added appendix with a visualization of the decoding process; v3: improved presentation, improved the decoding figure

  6. arXiv:2203.01063  [pdf, other]

    cs.CL cs.LG

    Discontinuous Constituency and BERT: A Case Study of Dutch

    Authors: Konstantinos Kogkalidis, Gijs Wijnholds

    Abstract: In this paper, we set out to quantify the syntactic capacity of BERT in the evaluation regime of non-context free patterns, as occurring in Dutch. We devise a test suite based on a mildly context-sensitive formalism, from which we derive grammars that capture the linguistic phenomena of control verb nesting and verb raising. The grammars, paired with a small lexicon, provide us with a large collec…

    Submitted 8 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: 8 pages plus references. To appear in Findings of the Association for Computational Linguistics 2022

  7. arXiv:2110.03323  [pdf, ps, other]

    cs.CL

    A Logic-Based Framework for Natural Language Inference in Dutch

    Authors: Lasha Abzianidze, Konstantinos Kogkalidis

    Abstract: We present a framework for deriving inference relations between Dutch sentence pairs. The proposed framework relies on logic-based reasoning to produce inspectable proofs leading up to inference labels; its judgements are therefore transparent and formally verifiable. At its core, the system is powered by two $λ$-calculi, used as syntactic and semantic theories, respectively. Sentences are first c…

    Submitted 14 January, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 20 pages plus references. Presented in Natural Logic Meets Machine Learning (NaLoMa II) workshop at the 14th International Conference on Computational Semantics (IWCS 2021). Presented in the 31st Meeting of Computational Linguistics in The Netherlands (CLIN31). Accepted for publication in Volume 11 of the CLIN Journal. Code available at http://github.com/kovvalsky/prove_SICK_NL

    ACM Class: F.4.1; I.2.7

  8. arXiv:2104.10516  [pdf, other]

    cs.CL cs.LG

    Improving BERT Pretraining with Syntactic Supervision

    Authors: Giorgos Tziafas, Konstantinos Kogkalidis, Gijs Wijnholds, Michael Moortgat

    Abstract: Bidirectional masked Transformers have become the core theme in the current NLP landscape. Despite their impressive benchmarks, a recurring theme in recent research has been to question such models' capacity for syntactic generalization. In this work, we seek to address this question by adding a supervised, token-level supertagging objective to standard unsupervised pretraining, enabling the expli…

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: 4 pages, rejected by IWCS due to "not fitting the conference theme"

  9. arXiv:2104.05745  [pdf, other]

    cs.CL

    Fighting the COVID-19 Infodemic with a Holistic BERT Ensemble

    Authors: Giorgos Tziafas, Konstantinos Kogkalidis, Tommaso Caselli

    Abstract: This paper describes the TOKOFOU system, an ensemble model for misinformation detection tasks based on six different transformer-based pre-trained encoders, implemented in the context of the COVID-19 Infodemic Shared Task for English. We fine-tune each model on each of the task's questions and aggregate their prediction scores using a majority voting approach. TOKOFOU obtains an overall F1 score o…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 4 pages, NLP4IF 2021

  10. Neural Proof Nets

    Authors: Konstantinos Kogkalidis, Michael Moortgat, Richard Moot

    Abstract: Linear logic and the linear λ-calculus have a long-standing tradition in the study of natural language form and meaning. Among the proof calculi of linear logic, proof nets are of particular interest, offering an attractive geometric representation of derivations that is unburdened by the bureaucratic complications of conventional proof-theoretic formats. Building on recent advances in set-theoreti…

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: 14 pages, CoNLL2020

    Journal ref: Proceedings of the 24th Conference on Computational Natural Language Learning (2020)

  11. arXiv:1912.12635  [pdf, other]

    cs.CL

    ÆTHEL: Automatically Extracted Typelogical Derivations for Dutch

    Authors: Konstantinos Kogkalidis, Michael Moortgat, Richard Moot

    Abstract: We present ÆTHEL, a semantic compositionality dataset for written Dutch. ÆTHEL consists of two parts. First, it contains a lexicon of supertags for about 900 000 words in context. The supertags correspond to types of the simply typed linear lambda-calculus, enhanced with dependency decorations that capture grammatical roles supplementary to function-argument structures. On the basis of these types…

    Submitted 6 March, 2020; v1 submitted 29 December, 2019; originally announced December 2019.

    Comments: 8 pages plus abstract, LREC 2020

    Journal ref: Proceedings of The 12th Language Resources and Evaluation Conference (2020)

  12. arXiv:1909.02955  [pdf, other]

    cs.CL

    Extracting and Learning a Dependency-Enhanced Type Lexicon for Dutch

    Authors: Konstantinos Kogkalidis

    Abstract: This thesis is concerned with type-logical grammars and their practical applicability as tools of reasoning about sentence syntax and semantics. The focal point is narrowed to Dutch, a language exhibiting a large degree of word order variability. In order to overcome difficulties arising as a result of that variability, the thesis explores and expands upon a type grammar based on Multiplicative In…

    Submitted 10 September, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: MSc Thesis, 94 pages

  13. Constructive Type-Logical Supertagging with Self-Attention Networks

    Authors: Konstantinos Kogkalidis, Michael Moortgat, Tejaswini Deoskar

    Abstract: We propose a novel application of self-attention networks towards grammar induction. We present an attention-based supertagger for a refined type-logical grammar, trained on constructing types inductively. In addition to achieving a high overall type accuracy, our model is able to learn the syntax of the grammar's type system along with its denotational semantics. This lifts the closed world assum…

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: REPL4NLP 4, ACL 2019

    Journal ref: Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)