Skip to main content

Showing 1–6 of 6 results for author: Flores, L J Y

.
  1. arXiv:2403.05788  [pdf, other

    cs.CL cs.AI

    On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in Summarization

    Authors: Lorenzo Jaime Yu Flores, Arman Cohan

    Abstract: Text summarization and simplification are among the most widely used applications of AI. However, models developed for such tasks are often prone to hallucination, which can result from training on unaligned data. One efficient approach to address this issue is Loss Truncation (LT) (Kang and Hashimoto, 2020), an approach to modify the standard log loss to adaptively remove noisy examples during tr… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: EACL 2024

  2. arXiv:2310.11191  [pdf, other

    cs.CL cs.AI

    Medical Text Simplification: Optimizing for Readability with Unlikelihood Training and Reranked Beam Search Decoding

    Authors: Lorenzo Jaime Yu Flores, Heyuan Huang, Kejian Shi, Sophie Chheang, Arman Cohan

    Abstract: Text simplification has emerged as an increasingly useful application of AI for bridging the communication gap in specialized fields such as medicine, where the lexicon is often dominated by technical jargon and complex constructs. Despite notable progress, methods in medical simplification sometimes result in the generated text having lower quality and diversity. In this work, we explore ways to… ▽ More

    Submitted 25 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  3. arXiv:2302.02962  [pdf, other

    cs.CL

    LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control

    Authors: Yilun Zhao, Zhenting Qi, Linyong Nan, Lorenzo Jaime Yu Flores, Dragomir Radev

    Abstract: Logical Table-to-Text (LT2T) generation is tasked with generating logically faithful sentences from tables. There currently exists two challenges in the field: 1) Faithfulness: how to generate sentences that are factually correct given the table content; 2) Diversity: how to generate multiple sentences that offer different perspectives on the table. This work proposes LoFT, which utilizes logic fo… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at EACL 2023 as a short paper

  4. arXiv:2210.02675  [pdf, other

    cs.CL

    Look Ma, Only 400 Samples! Revisiting the Effectiveness of Automatic N-Gram Rule Generation for Spelling Normalization in Filipino

    Authors: Lorenzo Jaime Yu Flores, Dragomir Radev

    Abstract: With 84.75 million Filipinos online, the ability for models to process online text is crucial for develo** Filipino NLP applications. To this end, spelling correction is a crucial preprocessing step for downstream processing. However, the lack of data prevents the use of language models for this task. In this paper, we propose an N-Gram + Damerau Levenshtein distance model with automatic rule ex… ▽ More

    Submitted 5 November, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: 4 pages, 1 figure, Presented at EMNLP 2022 Third Workshop on Simple and Efficient Natural Language Processing

  5. arXiv:2205.12467  [pdf, other

    cs.CL

    R2D2: Robust Data-to-Text with Replacement Detection

    Authors: Linyong Nan, Lorenzo Jaime Yu Flores, Yilun Zhao, Yixin Liu, Luke Benson, Wei** Zou, Dragomir Radev

    Abstract: Unfaithful text generation is a common problem for text generation systems. In the case of Data-to-Text (D2T) systems, the factuality of the generated text is particularly crucial for any real-world applications. We introduce R2D2, a training framework that addresses unfaithful Data-to-Text generation by training a system both as a generator and a faithfulness discriminator with additional replace… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  6. arXiv:2201.00912  [pdf, other

    cs.CL

    An Adversarial Benchmark for Fake News Detection Models

    Authors: Lorenzo Jaime Yu Flores, Yiding Hao

    Abstract: With the proliferation of online misinformation, fake news detection has gained importance in the artificial intelligence community. In this paper, we propose an adversarial benchmark that tests the ability of fake news detectors to reason about real-world facts. We formulate adversarial attacks that target three aspects of "understanding": compositional semantics, lexical relations, and sensitivi… ▽ More

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: 6 pages, 2 figures, Presented at AAAI 2022, Workshop on Adversarial Machine Learning and Beyond

    ACM Class: I.2.7