
Showing 1–5 of 5 results for author: Coman, A C

  1. arXiv:2310.17936

    cs.CL cs.AI cs.LG

    Transformers as Graph-to-Graph Models

    Authors: James Henderson, Alireza Mohammadshahi, Andrei C. Coman, Lesly Miculicich

    Abstract: We argue that Transformers are essentially graph-to-graph models, with sequences just being a special case. Attention weights are functionally equivalent to graph edges. Our Graph-to-Graph Transformer architecture makes this ability explicit, by inputting graph edges into the attention weight computations and predicting graph edges with attention-like functions, thereby integrating explicit graphs…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to Big Picture workshop at EMNLP 2023

  2. arXiv:2310.14708

    cs.CL

    Strong and Efficient Baselines for Open Domain Conversational Question Answering

    Authors: Andrei C. Coman, Gianni Barlacchi, Adrià de Gispert

    Abstract: Unlike the Open Domain Question Answering (ODQA) setting, the conversational (ODConvQA) domain has received limited attention when it comes to reevaluating baselines for both efficiency and effectiveness. In this paper, we study the State-of-the-Art (SotA) Dense Passage Retrieval (DPR) retriever and Fusion-in-Decoder (FiD) reader pipeline, and show that it significantly underperforms when applied…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Findings

  3. arXiv:2308.14423

    cs.CL

    GADePo: Graph-Assisted Declarative Pooling Transformers for Document-Level Relation Extraction

    Authors: Andrei C. Coman, Christos Theodoropoulos, Marie-Francine Moens, James Henderson

    Abstract: Document-level relation extraction typically relies on text-based encoders and hand-coded pooling heuristics to aggregate information learned by the encoder. In this paper, we leverage the intrinsic graph processing capabilities of the Transformer model and propose replacing hand-coded pooling methods with new tokens in the input, which are designed to aggregate information via explicit graph rela…

    Submitted 18 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted to KnowledgeNLP workshop at ACL 2024

  4. Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning

    Authors: Christos Theodoropoulos, James Henderson, Andrei C. Coman, Marie-Francine Moens

    Abstract: Though language model text embeddings have revolutionized NLP research, their ability to capture high-level semantic information, such as relations between entities in text, is limited. In this paper, we propose a novel contrastive learning framework that trains sentence embeddings to encode the relations in a graph structure. Given a sentence (unstructured text) and its graph, we use contrastive…

    Submitted 4 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: To be presented at CoNLL 2021

    Journal ref: Conference: 2021 Proceedings of the 25th Conference on Computational Natural Language Learning

  5. arXiv:1905.11806

    cs.CL

    An Incremental Turn-Taking Model For Task-Oriented Dialog Systems

    Authors: Andrei C. Coman, Koichiro Yoshino, Yukitoshi Murase, Satoshi Nakamura, Giuseppe Riccardi

    Abstract: In a human-machine dialog scenario, deciding the appropriate time for the machine to take the turn is an open research problem. In contrast, humans engaged in conversations are able to timely decide when to interrupt the speaker for competitive or non-competitive reasons. In state-of-the-art turn-by-turn dialog systems the decision on the next dialog action is taken at the end of the utterance. In…

    Submitted 11 July, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted to INTERSPEECH 2019