Showing 1–11 of 11 results for author: Fierro, C

  1. arXiv:2404.15206

    cs.CL

    Does Instruction Tuning Make LLMs More Consistent?

    Authors: Constanza Fierro, Jiaang Li, Anders Søgaard

    Abstract: The purpose of instruction tuning is enabling zero-shot performance, but instruction tuning has also been shown to improve chain-of-thought reasoning and value alignment (Si et al., 2023). Here we consider the impact on $\textit{consistency}$, i.e., the sensitivity of language models to small perturbations in the input. We compare 10 instruction-tuned LLaMA models to the original LLaMA-7b model an…

    Submitted 30 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  2. arXiv:2404.03381

    cs.CL

    Learning to Plan and Generate Text with Citations

    Authors: Constanza Fierro, Reinald Kim Amplayo, Fantine Huot, Nicola De Cao, Joshua Maynez, Shashi Narayan, Mirella Lapata

    Abstract: The increasing demand for the deployment of LLMs in information-seeking scenarios has spurred efforts in creating verifiable systems, which generate responses to queries along with supporting evidence. In this paper, we explore the attribution capabilities of plan-based models which have been recently shown to improve the faithfulness, grounding, and controllability of generated text. We conceptua…

    Submitted 13 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2404.03036

    cs.CL

    MuLan: A Study of Fact Mutability in Language Models

    Authors: Constanza Fierro, Nicolas Garneau, Emanuele Bugliarello, Yova Kementchedjhieva, Anders Søgaard

    Abstract: Facts are subject to contingencies and can be true or false in different circumstances. One such contingency is time, wherein some facts mutate over a given period, e.g., the president of a country or the winner of a championship. Trustworthy language models ideally identify mutable facts as such and process them accordingly. We create MuLan, a benchmark for evaluating the ability of English langu…

    Submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2305.14205

    cs.CL

    $μ$PLAN: Summarizing using a Content Plan as Cross-Lingual Bridge

    Authors: Fantine Huot, Joshua Maynez, Chris Alberti, Reinald Kim Amplayo, Priyanka Agrawal, Constanza Fierro, Shashi Narayan, Mirella Lapata

    Abstract: Cross-lingual summarization consists of generating a summary in one language given an input document in a different language, allowing for the dissemination of relevant content across speakers of other languages. The task is challenging mainly due to the paucity of cross-lingual datasets and the compounded difficulty of summarizing and translating. This work presents $μ$PLAN, an approach to cross-…

    Submitted 31 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EACL 2024

  5. arXiv:2302.06555

    cs.CL cs.AI cs.CV cs.LG

    Do Vision and Language Models Share Concepts? A Vector Space Alignment Study

    Authors: Jiaang Li, Yova Kementchedjhieva, Constanza Fierro, Anders Søgaard

    Abstract: Large-scale pretrained language models (LMs) are said to "lack the ability to connect utterances to the world" (Bender and Koller, 2020), because they do not have "mental models of the world" (Mitchell and Krakauer, 2023). If so, one would expect LM representations to be unrelated to representations induced by vision models. We present an empirical evaluation across four families of LMs (BERT,…

    Submitted 6 July, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 12 pages, long paper accepted by TACL

  6. Factual Consistency of Multilingual Pretrained Language Models

    Authors: Constanza Fierro, Anders Søgaard

    Abstract: Pretrained language models can be queried for factual knowledge, with potential applications in knowledge base acquisition and tasks that require inference. However, for that, we need to know how reliable this knowledge is, and recent work has shown that monolingual English language models lack consistency when predicting factual knowledge, that is, they fill-in-the-blank differently for paraphras…

    Submitted 22 March, 2022; originally announced March 2022.

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2022, pages 3046-3052, Dublin, Ireland. Association for Computational Linguistics

  7. arXiv:2203.10020

    cs.CL

    Challenges and Strategies in Cross-Cultural NLP

    Authors: Daniel Hershcovich, Stella Frank, Heather Lent, Miryam de Lhoneux, Mostafa Abdou, Stephanie Brandl, Emanuele Bugliarello, Laura Cabello Piqueras, Ilias Chalkidis, Ruixiang Cui, Constanza Fierro, Katerina Margatina, Phillip Rust, Anders Søgaard

    Abstract: Various efforts in the Natural Language Processing (NLP) community have been made to accommodate linguistic diversity and serve speakers of many different languages. However, it is important to acknowledge that speakers and the content they produce and require, vary not just by language, but also by culture. Although language and culture are tightly linked, there are important differences. Analogo…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: ACL 2022 - Theme track

  8. arXiv:2003.11622

    cs.CL cs.LG stat.ML

    Predicting Unplanned Readmissions with Highly Unstructured Data

    Authors: Constanza Fierro, Jorge Pérez, Javier Mora

    Abstract: Deep learning techniques have been successfully applied to predict unplanned readmissions of patients in medical centers. The training data for these models is usually based on historical medical records that contain a significant amount of free-text from admission reports, referrals, exam notes, etc. Most of the models proposed so far are tailored to English text data and assume that electronic m…

    Submitted 5 April, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

  9. arXiv:1406.7051

    astro-ph.GA astro-ph.SR

    New galactic star clusters discovered in the VVV survey. Candidates projected on the inner disk and bulge

    Authors: J. Borissova, A. -N. Chené, S. Ramírez Alegría, Saurabh Sharma, J. R. A. Clarke, R. Kurtev, I. Negueruela, A. Marco, P. Amigo, D. Minniti, E. Bica, C. Bonatto, M. Catelan, C. Fierro, D. Geisler, M. Gromadzki, M. Hempel, M. M. Hanson, V. D. Ivanov, P. Lucas, D. Majaess, C. Moni Bidin, B. Popescu, R. K. Saito

    Abstract: VISTA Variables in the Vía Láctea (VVV) is one of six ESO Public Surveys using the 4 meter Visible and Infrared Survey Telescope for Astronomy (VISTA). The VVV survey covers the Milky Way bulge and an adjacent section of the disk, and one of the principal objectives is to search for new star clusters within previously unreachable obscured parts of the Galaxy. The primary motivation behind this w…

    Submitted 26 June, 2014; originally announced June 2014.

    Comments: 23 pages, 20 figures and 3 tables

    Journal ref: A&A 569, A24 (2014)

  10. Massive open star clusters using the VVV survey III: A young massive cluster at the far edge of the Galactic bar

    Authors: S. Ramírez Alegría, J. Borissova, A. N. Chené, E. O'Leary, P. Amigo, D. Minniti, R. K. Saito, D. Geisler, R. Kurtev, M. Hempel, M. Gromadzki, J. R. A. Clarke, I. Negueruela, A. Marco, C. Fierro, C. Bonatto, M. Catelan

    Abstract: Context: Young massive clusters are key to map the Milky Way's structure, and near-IR large area sky surveys have contributed strongly to the discovery of new obscured massive stellar clusters. Aims: We present the third article in a series of papers focused on young and massive clusters discovered in the VVV survey. This article is dedicated to the physical characterization of VVV CL086, using…

    Submitted 13 March, 2014; originally announced March 2014.

    Comments: Accepted for publication as a Letter in A&A

  11. arXiv:0707.0567

    q-bio.GN q-bio.QM

    Exploring nervous system transcriptomes during embryogenesis and metamorphosis in Xenopus tropicalis using EST analysis

    Authors: Ana C Fierro, Raphaël Thuret, Laurent Coen, Muriel Perron, Barbara A Demeneix, Maurice Wegnez, Gabor Gyapay, Jean Weissenbach, Patrick Wincker, André Mazabraud, Nicolas Pollet

    Abstract: Xenopus tropicalis is an anuran amphibian species used as model in vertebrate comparative genomics. It provides the same advantages as Xenopus laevis but is diploid and has a smaller genome of 1.7 Gbp. Therefore X. tropicalis is more amenable to systematic transcriptome surveys. We initiated a large-scale partial cDNA sequencing project to provide a functional genomics resource on genes expresse…

    Submitted 4 July, 2007; originally announced July 2007.

    Journal ref: BMC Genomics 8 (2007) 118