Skip to main content

Showing 1–4 of 4 results for author: Theodoropoulos, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.14423  [pdf, other

    cs.CL

    GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction

    Authors: Andrei C. Coman, Christos Theodoropoulos, Marie-Francine Moens, James Henderson

    Abstract: Document-level relation extraction typically relies on text-based encoders and hand-coded pooling heuristics to aggregate information learned by the encoder. In this paper, we leverage the intrinsic graph processing capabilities of the Transformer model and propose replacing hand-coded pooling methods with new tokens in the input, which are designed to aggregate information via explicit graph rela… ▽ More

    Submitted 18 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted to KnowledgeNLP workshop at ACL 2024

  2. arXiv:2305.05640  [pdf, other

    cs.AI cs.CL cs.LG

    Representation Learning for Person or Entity-centric Knowledge Graphs: An Application in Healthcare

    Authors: Christos Theodoropoulos, Natasha Mulligan, Thaddeus Stappenbeck, Joao Bettencourt-Silva

    Abstract: Knowledge graphs (KGs) are a popular way to organise information based on ontologies or schemas and have been used across a variety of scenarios from search to recommendation. Despite advances in KGs, representing knowledge remains a non-trivial task across industries and it is especially challenging in the biomedical and healthcare domains due to complex interdependent relations between entities,… ▽ More

    Submitted 9 October, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted into the Twelfth International Conference on Knowledge Capture (K-CAP 2023)

  3. An Information Extraction Study: Take In Mind the Tokenization!

    Authors: Christos Theodoropoulos, Marie-Francine Moens

    Abstract: Current research on the advantages and trade-offs of using characters, instead of tokenized text, as input for deep learning models, has evolved substantially. New token-free models remove the traditional tokenization step; however, their efficiency remains unclear. Moreover, the effect of tokenization is relatively unexplored in sequence tagging tasks. To this end, we investigate the impact of to… ▽ More

    Submitted 1 April, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: Submitted Manuscript/Preprint (accepted at EUSFLAT 2023, to be published in Lecture Notes in Computer Science (LNCS))

    Journal ref: Conference: 2023 13th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT)

  4. Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning

    Authors: Christos Theodoropoulos, James Henderson, Andrei C. Coman, Marie-Francine Moens

    Abstract: Though language model text embeddings have revolutionized NLP research, their ability to capture high-level semantic information, such as relations between entities in text, is limited. In this paper, we propose a novel contrastive learning framework that trains sentence embeddings to encode the relations in a graph structure. Given a sentence (unstructured text) and its graph, we use contrastive… ▽ More

    Submitted 4 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: To be presented at CoNLL 2021

    Journal ref: Conference: 2021 Proceedings of the 25th Conference on Computational Natural Language Learning