Skip to main content

Showing 1–1 of 1 results for author: Lapastora, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15750  [pdf, other

    cs.CL cs.AI cs.LG

    Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence

    Authors: Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld

    Abstract: This paper introduces Filtered Corpus Training, a method that trains language models (LMs) on corpora with certain linguistic constructions filtered out from the training data, and uses it to measure the ability of LMs to perform linguistic generalization on the basis of indirect evidence. We apply the method to both LSTM and Transformer LMs (of roughly comparable size), develo** filtered corpor… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see http://github.com/CLMBRs/corpus-filtering