Skip to main content

Showing 1–11 of 11 results for author: Charnois, T

.
  1. arXiv:2404.12493  [pdf, other

    cs.CL cs.AI

    EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction

    Authors: Urchade Zaratiana, Nadi Tomeh, Yann Dauxais, Pierre Holat, Thierry Charnois

    Abstract: Joint entity and relation extraction plays a pivotal role in various applications, notably in the construction of knowledge graphs. Despite recent progress, existing approaches often fall short in two key aspects: richness of representation and coherence in output structure. These models often rely on handcrafted heuristics for computing entity and relation representations, potentially leading to… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Work in progress

  2. arXiv:2404.12491  [pdf, other

    cs.CL cs.AI

    GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction

    Authors: Urchade Zaratiana, Nadi Tomeh, Niama El Khbir, Pierre Holat, Thierry Charnois

    Abstract: Information extraction (IE) is an important task in Natural Language Processing (NLP), involving the extraction of named entities and their relationships from unstructured text. In this paper, we propose a novel approach to this task by formulating it as graph structure learning (GSL). By formulating IE as GSL, we enhance the model's ability to dynamically refine and optimize the graph structure d… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Work in progress

  3. arXiv:2401.01326  [pdf, other

    cs.CL cs.AI cs.LG

    An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction

    Authors: Urchade Zaratiana, Nadi Tomeh, Pierre Holat, Thierry Charnois

    Abstract: In this paper, we propose a novel method for joint entity and relation extraction from unstructured text by framing it as a conditional sequence generation problem. In contrast to conventional generative information extraction models that are left-to-right token-level generators, our approach is \textit{span-based}. It generates a linearized graph where nodes represent text spans and edges represe… ▽ More

    Submitted 15 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: AAAI 2024 (camera ready version)

  4. arXiv:2311.18028  [pdf, other

    cs.CL cs.AI cs.LG

    Filtered Semi-Markov CRF

    Authors: Urchade Zaratiana, Nadi Tomeh, Niama El Khbir, Pierre Holat, Thierry Charnois

    Abstract: Semi-Markov CRF has been proposed as an alternative to the traditional Linear Chain CRF for text segmentation tasks such as Named Entity Recognition (NER). Unlike CRF, which treats text segmentation as token-level prediction, Semi-CRF considers segments as the basic unit, making it more expressive. However, Semi-CRF suffers from two major drawbacks: (1) quadratic complexity over sequence length, a… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 (Findings)

  5. arXiv:2311.08526  [pdf, other

    cs.CL cs.AI cs.LG

    GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer

    Authors: Urchade Zaratiana, Nadi Tomeh, Pierre Holat, Thierry Charnois

    Abstract: Named Entity Recognition (NER) is essential in various Natural Language Processing (NLP) applications. Traditional NER models are effective but limited to a set of predefined entity types. In contrast, Large Language Models (LLMs) can extract arbitrary entities through natural language instructions, offering greater flexibility. However, their size and cost, particularly for those accessed via API… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Work in progress

  6. arXiv:2210.15048  [pdf, other

    cs.CL cs.AI

    DyREx: Dynamic Query Representation for Extractive Question Answering

    Authors: Urchade Zaratiana, Niama El Khbir, Dennis Núñez, Pierre Holat, Nadi Tomeh, Thierry Charnois

    Abstract: Extractive question answering (ExQA) is an essential task for Natural Language Processing. The dominant approach to ExQA is one that represents the input sequence tokens (question and passage) with a pre-trained transformer, then uses two learned query vectors to compute distributions over the start and end answer span positions. These query vectors lack the context of the inputs, which can be a b… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted at "2nd Workshop on Efficient Natural Language and Speech Processing (ENLSP-II)" @ NeurIPS 2022

  7. arXiv:2203.14710  [pdf, other

    cs.CL cs.LG

    Hierarchical Transformer Model for Scientific Named Entity Recognition

    Authors: Urchade Zaratiana, Pierre Holat, Nadi Tomeh, Thierry Charnois

    Abstract: The task of Named Entity Recognition (NER) is an important component of many natural language processing systems, such as relation extraction and knowledge graph construction. In this work, we present a simple and effective approach for Named Entity Recognition. The main idea of our approach is to encode the input subword sequence with a pre-trained transformer such as BERT, and then, instead of d… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  8. arXiv:2110.03567  [pdf, other

    cs.CL cs.IR

    GeSERA: General-domain Summary Evaluation by Relevance Analysis

    Authors: Jessica López Espejel, Gaël de Chalendar, Jorge Garcia Flores, Thierry Charnois, Ivan Vladimir Meza Ruiz

    Abstract: We present GeSERA, an open-source improved version of SERA for evaluating automatic extractive and abstractive summaries from the general domain. SERA is based on a search engine that compares candidate and reference summaries (called queries) against an information retrieval document base (called index). SERA was originally designed for the biomedical domain only, where it showed a better correla… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted in RANLP 2021 conference

  9. arXiv:1511.08350  [pdf, other

    cs.AI

    A global Constraint for mining Sequential Patterns with GAP constraint

    Authors: Amina Kemmar, Samir Loudni, Yahia Lebbah, Patrice Boizumault, Thierry Charnois

    Abstract: Sequential pattern mining (SPM) under gap constraint is a challenging task. Many efficient specialized methods have been developed but they are all suffering from a lack of genericity. The Constraint Programming (CP) approaches are not so effective because of the size of their encodings. In[7], we have proposed the global constraint Prefix-Projection for SPM which remedies to this drawback. Howeve… ▽ More

    Submitted 26 November, 2015; originally announced November 2015.

  10. arXiv:1504.07877  [pdf, other

    cs.AI

    Prefix-Projection Global Constraint for Sequential Pattern Mining

    Authors: Amina Kemmar, Samir Loudni, Yahia Lebbah, Patrice Boizumault, Thierry Charnois

    Abstract: Sequential pattern mining under constraints is a challenging data mining task. Many efficient ad hoc methods have been developed for mining sequential patterns, but they are all suffering from a lack of genericity. Recent works have investigated Constraint Programming (CP) methods, but they are not still effective because of their encoding. In this paper, we propose a global constraint based on th… ▽ More

    Submitted 23 June, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

  11. arXiv:1311.6907  [pdf, ps, other

    cs.AI cs.DB

    A Constraint Programming Approach for Mining Sequential Patterns in a Sequence Database

    Authors: Jean-Philippe Métivier, Samir Loudni, Thierry Charnois

    Abstract: Constraint-based pattern discovery is at the core of numerous data mining tasks. Patterns are extracted with respect to a given set of constraints (frequency, closedness, size, etc). In the context of sequential pattern mining, a large number of devoted techniques have been developed for solving particular classes of constraints. The aim of this paper is to investigate the use of Constraint Progra… ▽ More

    Submitted 27 November, 2013; originally announced November 2013.