Showing 1–8 of 8 results for author: Cardoso, H L

Searching in archive cs.
  1. arXiv:2406.04233  [pdf, other]

    cs.CL cs.AI

    FairytaleQA Translated: Enabling Educational Question and Answer Generation in Less-Resourced Languages

    Authors: Bernardo Leite, Tomás Freitas Osório, Henrique Lopes Cardoso

    Abstract: Question Answering (QA) datasets are crucial in assessing reading comprehension skills for both machines and humans. While numerous datasets have been developed in English for this purpose, a noticeable void exists in less-resourced languages. To alleviate this gap, our paper introduces machine-translated versions of FairytaleQA, a renowned QA dataset designed to assess and enhance narrative compr…

    Submitted 24 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint - Accepted for publication at ECTEL 2024

  2. arXiv:2404.05333  [pdf, ps, other]

    cs.CL

    PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese

    Authors: Tomás Osório, Bernardo Leite, Henrique Lopes Cardoso, Luís Gomes, João Rodrigues, Rodrigo Santos, António Branco

    Abstract: Leveraging research on the neural modelling of Portuguese, we contribute a collection of datasets for an array of language processing tasks and a corresponding collection of fine-tuned neural language models on these downstream tasks. To align with mainstream benchmarks in the literature, originally developed in English, and to kick start their Portuguese counterparts, the datasets were machine-tr…

    Submitted 8 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Preprint - Paper accepted for BUCC 2024

  3. arXiv:2404.02800  [pdf, other]

    cs.CL cs.AI

    On Few-Shot Prompting for Controllable Question-Answer Generation in Narrative Comprehension

    Authors: Bernardo Leite, Henrique Lopes Cardoso

    Abstract: Question Generation aims to automatically generate questions based on a given input provided as context. A controllable question generation scheme focuses on generating questions with specific attributes, allowing better control. In this study, we propose a few-shot prompting strategy for controlling the generation of question-answer pairs from children's narrative texts. We aim to control two att…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Preprint - Accepted for publication at CSEDU 2024

  4. arXiv:2403.01897  [pdf, other]

    cs.CL

    Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina PT* Family

    Authors: Rodrigo Santos, João Rodrigues, Luís Gomes, João Silva, António Branco, Henrique Lopes Cardoso, Tomás Freitas Osório, Bernardo Leite

    Abstract: To foster the neural encoding of Portuguese, this paper contributes foundation encoder models that represent an expansion of the still very scarce ecosystem of large language models specifically developed for this language that are fully open, in the sense that they are open source and openly distributed for free under an open license for any purpose, thus including research and commercial usages.…

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  5. arXiv:2306.14917  [pdf]

    cs.CL cs.AI cs.CY cs.LG

    Towards Enriched Controllability for Educational Question Generation

    Authors: Bernardo Leite, Henrique Lopes Cardoso

    Abstract: Question Generation (QG) is a task within Natural Language Processing (NLP) that involves automatically generating questions given an input, typically composed of a text and a target answer. Recent work on QG aims to control the type of generated questions so that they meet educational needs. A remarkable example of controllability in educational QG is the generation of questions underlying certai…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: This is a preprint of an article to be published at the Int. Conf. on Artificial Intelligence in Education (AIED, 2023)

  6. arXiv:2306.04314  [pdf, other]

    cs.CL

    Cross-Genre Argument Mining: Can Language Models Automatically Fill in Missing Discourse Markers?

    Authors: Gil Rocha, Henrique Lopes Cardoso, Jonas Belouadi, Steffen Eger

    Abstract: Available corpora for Argument Mining differ along several axes, and one of the key differences is the presence (or absence) of discourse markers to signal argumentative content. Exploring effective ways to use discourse markers has received wide attention in various discourse parsing tasks, from which it is well-known that discourse markers are strong indicators of discourse relations. To improve…

    Submitted 7 June, 2023; originally announced June 2023.

  7. Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*

    Authors: João Rodrigues, Luís Gomes, João Silva, António Branco, Rodrigo Santos, Henrique Lopes Cardoso, Tomás Osório

    Abstract: To advance the neural encoding of Portuguese (PT), and a fortiori the technological preparation of this language for the digital age, we developed a Transformer-based foundation model that sets a new state of the art in this respect for two of its variants, namely European Portuguese from Portugal (PT-PT) and American Portuguese from Brazil (PT-BR). To develop this encoder, which we named Albert…

    Submitted 20 June, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  8. arXiv:1301.5946  [pdf]

    cs.AI

    Computer Poker Research at LIACC

    Authors: Luís Filipe Teófilo, Luís Paulo Reis, Henrique Lopes Cardoso, Dinis Félix, Rui Sêca, João Ferreira, Pedro Mendes, Nuno Cruz, Vitor Pereira, Nuno Passos

    Abstract: Computer Poker's unique characteristics present a well-suited challenge for research in artificial intelligence. For that reason, and due to the Poker's market increase in popularity in Portugal since 2008, several members of LIACC have researched in this field. Several works were published as papers and master theses and more recently a member of LIACC engaged on a research in this area as a Ph.D…

    Submitted 24 January, 2013; originally announced January 2013.