Skip to main content

Showing 1–24 of 24 results for author: Mayfield, J

.
  1. On the Evaluation of Machine-Generated Reports

    Authors: James Mayfield, Eugene Yang, Dawn Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Kayi, Kate Sanders, Marc Mason, Noah Hibbler

    Abstract: Large Language Models (LLMs) have enabled new ways to satisfy information needs. Although great strides have been made in applying them to settings like document ranking and short-form text generation, they still struggle to compose complete, accurate, and verifiable long-form reports. Reports with these qualities are necessary to satisfy the complex, nuanced, or multi-faceted information needs of… ▽ More

    Submitted 9 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures, accepted at SIGIR 2024 as perspective paper

  2. Language Fairness in Multilingual Information Retrieval

    Authors: Eugene Yang, Thomas Jänich, James Mayfield, Dawn Lawrie

    Abstract: Multilingual information retrieval (MLIR) considers the problem of ranking documents in several languages for a query expressed in a language that may differ from any of those languages. Recent work has observed that approaches such as combining ranked lists representing a single document language each or using multilingual pretrained language models demonstrate a preference for one language over… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper

  3. Distillation for Multilingual Information Retrieval

    Authors: Eugene Yang, Dawn Lawrie, James Mayfield

    Abstract: Recent work in cross-language information retrieval (CLIR), where queries and documents are in different languages, has shown the benefit of the Translate-Distill framework that trains a cross-language neural dual-encoder model using translation and distillation. However, Translate-Distill only supports a single document language. Multilingual information retrieval (MLIR), which ranks a multilingu… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 6 pages, 1 figure, accepted at SIGIR 2024 as short paper

  4. PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval

    Authors: Dawn Lawrie, Efsun Kayi, Eugene Yang, James Mayfield, Douglas W. Oard

    Abstract: PLAID, an efficient implementation of the ColBERT late interaction bi-encoder using pretrained language models for ranking, consistently achieves state-of-the-art performance in monolingual, cross-language, and multilingual retrieval. PLAID differs from ColBERT by assigning terms to clusters and representing those terms as cluster centroids plus compressed residual vectors. While PLAID is effectiv… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper

  5. arXiv:2404.18797  [pdf, other

    cs.IR

    Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval

    Authors: Eugene Yang, Suraj Nair, Dawn Lawrie, James Mayfield, Douglas W. Oard, Kevin Duh

    Abstract: Probabilistic Structured Queries (PSQ) is a cross-language information retrieval (CLIR) method that uses translation probabilities statistically derived from aligned corpora. PSQ is a strong baseline for efficient CLIR using sparse indexing. It is, therefore, useful as the first stage in a cascaded neural CLIR system whose second stage is more effective but too inefficient to be used on its own to… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 11 pages, 5 figures

  6. arXiv:2404.08134  [pdf, other

    cs.IR cs.CL

    Extending Translate-Train for ColBERT-X to African Language CLIR

    Authors: Eugene Yang, Dawn J. Lawrie, Paul McNamee, James Mayfield

    Abstract: This paper describes the submission runs from the HLTCOE team at the CIRAL CLIR tasks for African languages at FIRE 2023. Our submissions use machine translation models to translate the documents and the training passages, and ColBERT-X as the retrieval model. Additionally, we present a set of unofficial runs that use an alternative training procedure with a similar training setting.

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 10 pages, 2 figures. System description paper for HLTCOE's participation in CIRAL@FIRE 2023

  7. arXiv:2404.08118  [pdf, ps, other

    cs.CL cs.IR

    HLTCOE at TREC 2023 NeuCLIR Track

    Authors: Eugene Yang, Dawn Lawrie, James Mayfield

    Abstract: The HLTCOE team applied PLAID, an mT5 reranker, and document translation to the TREC 2023 NeuCLIR track. For PLAID we included a variety of models and training techniques -- the English model released with ColBERT v2, translate-train~(TT), Translate Distill~(TD) and multilingual translate-train~(MTT). TT trains a ColBERT model with English queries and passages automatically translated into the doc… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 6 pages. Part of TREC 2023 Proceedings

  8. arXiv:2404.08071  [pdf, other

    cs.IR

    Overview of the TREC 2023 NeuCLIR Track

    Authors: Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang

    Abstract: The principal goal of the TREC Neural Cross-Language Information Retrieval (NeuCLIR) track is to study the impact of neural approaches to cross-language information retrieval. The track has created four collections, large collections of Chinese, Persian, and Russian newswire and a smaller collection of Chinese scientific abstracts. The principal tasks are ranked retrieval of news in one of the thr… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 27 pages, 17 figures. Part of the TREC 2023 Proceedings

  9. arXiv:2401.12132  [pdf

    cs.LG cs.AI cs.ET eess.IV

    Evaluation of QCNN-LSTM for Disability Forecasting in Multiple Sclerosis Using Sequential Multisequence MRI

    Authors: John D. Mayfield, Issam El Naqa

    Abstract: Introduction Quantum Convolutional Neural Network (QCNN)-Long Short-Term Memory (LSTM) models were studied to provide sequential relationships for each timepoint in MRIs of patients with Multiple Sclerosis (MS). In this pilot study, we compared three QCNN-LSTM models for binary classification of MS disability benchmarked against classical neural network architectures. Our hypothesis is that quantu… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    ACM Class: I.2.0; I.2.6

  10. arXiv:2401.04810  [pdf, other

    cs.IR cs.CL

    Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation

    Authors: Eugene Yang, Dawn Lawrie, James Mayfield, Douglas W. Oard, Scott Miller

    Abstract: Prior work on English monolingual retrieval has shown that a cross-encoder trained using a large number of relevance judgments for query-document pairs can be used as a teacher to train more efficient, but similarly effective, dual-encoder student models. Applying a similar knowledge distillation approach to training an efficient dual-encoder model for Cross-Language Information Retrieval (CLIR),… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 17 pages, 1 figure, accepted at ECIR 2024

  11. arXiv:2305.00331  [pdf, other

    cs.IR

    Synthetic Cross-language Information Retrieval Training Data

    Authors: James Mayfield, Eugene Yang, Dawn Lawrie, Samuel Barham, Orion Weller, Marc Mason, Suraj Nair, Scott Miller

    Abstract: A key stumbling block for neural cross-language information retrieval (CLIR) systems has been the paucity of training data. The appearance of the MS MARCO monolingual training set led to significant advances in the state of the art in neural monolingual retrieval. By translating the MS MARCO documents into other languages using machine translation, this resource has been made useful to the CLIR co… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: 11 pages, 4 figures

  12. arXiv:2304.12367  [pdf, other

    cs.IR

    Overview of the TREC 2022 NeuCLIR Track

    Authors: Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang

    Abstract: This is the first year of the TREC Neural CLIR (NeuCLIR) track, which aims to study the impact of neural approaches to cross-language information retrieval. The main task in this year's track was ad hoc ranked retrieval of Chinese, Persian, or Russian newswire documents using queries expressed in English. Topics were developed using standard TREC processes, except that topics developed by an annot… ▽ More

    Submitted 24 September, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 22 pages, 13 figures, 10 tables. Part of the Thirty-First Text REtrieval Conference (TREC 2022) Proceedings. Replace the misplaced Russian result table

  13. arXiv:2212.10448  [pdf, other

    cs.IR cs.CL

    Parameter-efficient Zero-shot Transfer for Cross-Language Dense Retrieval with Adapters

    Authors: Eugene Yang, Suraj Nair, Dawn Lawrie, James Mayfield, Douglas W. Oard

    Abstract: A popular approach to creating a zero-shot cross-language retrieval model is to substitute a monolingual pretrained language model in the retrieval model with a multilingual pretrained language model such as Multilingual BERT. This multilingual model is fined-tuned to the retrieval task with monolingual data such as English MS MARCO using the same training recipe as the monolingual retrieval model… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 15 pages, 1 figure

  14. arXiv:2211.10029  [pdf, other

    stat.AP

    Being Bayesian in the 2020s: opportunities and challenges in the practice of modern applied Bayesian statistics

    Authors: Joshua J. Bon, Adam Bretherton, Katie Buchhorn, Susanna Cramb, Christopher Drovandi, Conor Hassan, Adrianne L. Jenner, Helen J. Mayfield, James M. McGree, Kerrie Mengersen, Aiden Price, Robert Salomone, Edgar Santos-Fernandez, Julie Vercelloni, Xiaoyu Wang

    Abstract: Building on a strong foundation of philosophy, theory, methods and computation over the past three decades, Bayesian approaches are now an integral part of the toolkit for most statisticians and data scientists. Whether they are dedicated Bayesians or opportunistic users, applied professionals can now reap many of the benefits afforded by the Bayesian paradigm. In this paper, we touch on six moder… ▽ More

    Submitted 17 January, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 27 pages, 8 figures

  15. arXiv:2209.01335  [pdf, other

    cs.IR cs.CL

    Neural Approaches to Multilingual Information Retrieval

    Authors: Dawn Lawrie, Eugene Yang, Douglas W. Oard, James Mayfield

    Abstract: Providing access to information across languages has been a goal of Information Retrieval (IR) for decades. While progress has been made on Cross Language IR (CLIR) where queries are expressed in one language and documents in another, the multilingual (MLIR) task to create a single ranked list of documents across many languages is considerably more challenging. This paper investigates whether adva… ▽ More

    Submitted 9 February, 2023; v1 submitted 3 September, 2022; originally announced September 2022.

    Comments: 17 pages, 3 figures, accepted at ECIR 2023

  16. arXiv:2201.09996  [pdf, ps, other

    cs.IR

    Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments

    Authors: Cash Costello, Eugene Yang, Dawn Lawrie, James Mayfield

    Abstract: While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework. This framework specifically addresses the complexity that comes with running experiments in multiple languages. Patapsco is designed to be extensible to many langu… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 5 pages, accepted at ECIR 2022 as a demo paper

  17. arXiv:2201.09992  [pdf, other

    cs.IR cs.CL

    HC4: A New Suite of Test Collections for Ad Hoc CLIR

    Authors: Dawn Lawrie, James Mayfield, Douglas Oard, Eugene Yang

    Abstract: HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval (CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in English and in the document languages, and graded relevance judgments. New test collections are needed because existing CLIR test collections built using pooling of traditional CLIR runs have systematic gaps in their relevance j… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 16 pages, 2 figures, accepted at ECIR 2022

  18. arXiv:2201.08471  [pdf, other

    cs.IR cs.CL

    Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models

    Authors: Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee, Kenton Murray, James Mayfield, Douglas W. Oard

    Abstract: The advent of transformer-based models such as BERT has led to the rise of neural ranking models. These models have improved the effectiveness of retrieval systems well beyond that of lexical term matching models such as BM25. While monolingual retrieval tasks have benefited from large-scale training collections such as MS MARCO and advances in neural architectures, cross-language retrieval tasks… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted at ECIR 2022 (Full paper)

  19. arXiv:2104.08082  [pdf, other

    cs.CL

    Improving Zero-Shot Multi-Lingual Entity Linking

    Authors: Elliot Schumacher, James Mayfield, Mark Dredze

    Abstract: Entity linking -- the task of identifying references in free text to relevant knowledge base representations -- often focuses on single languages. We consider multilingual entity linking, where a single model is trained to link references to same-language knowledge bases in several languages. We propose a neural ranker architecture, which leverages multilingual transformer representations of text… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  20. arXiv:2010.09828  [pdf, other

    cs.CL

    Cross-Lingual Transfer in Zero-Shot Cross-Language Entity Linking

    Authors: Elliot Schumacher, James Mayfield, Mark Dredze

    Abstract: Cross-language entity linking grounds mentions in multiple languages to a single-language knowledge base. We propose a neural ranking architecture for this task that uses multilingual BERT representations of the mention and the context in a neural network. We find that the multilingual ability of BERT leads to robust performance in monolingual and multilingual settings. Furthermore, we explore zer… ▽ More

    Submitted 7 July, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted in the Findings of ACL 2021

  21. arXiv:2006.01739  [pdf, other

    math.NA math.OC

    Accelerating the Distributed Kaczmarz Algorithm by Strong Over-relaxation

    Authors: Riley Borgard, Steven N. Harding, Haley Duba, Chloe Makdad, Jay Mayfield, Randal Tuggle, Eric Weber

    Abstract: The distributed Kaczmarz algorithm is an adaptation of the standard Kaczmarz algorithm to the situation in which data is distributed throughout a network represented by a tree. We isolate substructures of the network and study convergence of the distributed Kazmarz algorithm for relatively large relaxation parameters associated to these substructures. If the system is consistent, then the algorith… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    MSC Class: 15A06; 15A24

  22. arXiv:2003.03072  [pdf, other

    cs.CL cs.LG

    Improving Neural Named Entity Recognition with Gazetteers

    Authors: Chan Hee Song, Dawn Lawrie, Tim Finin, James Mayfield

    Abstract: The goal of this work is to improve the performance of a neural named entity recognition system by adding input features that indicate a word is part of a name included in a gazetteer. This article describes how to generate gazetteers from the Wikidata knowledge graph as well as how to integrate the information into a neural NER system. Experiments reveal that the approach yields performance gains… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: Short version accepted to the 33rd FLAIRS conference

  23. arXiv:1506.00301  [pdf, ps, other

    cs.AI cs.CL

    Interactive Knowledge Base Population

    Authors: Travis Wolfe, Mark Dredze, James Mayfield, Paul McNamee, Craig Harman, Tim Finin, Benjamin Van Durme

    Abstract: Most work on building knowledge bases has focused on collecting entities and facts from as large a collection of documents as possible. We argue for and describe a new paradigm where the focus is on a high-recall extraction over a small collection of documents under the supervision of a human expert, that we call Interactive Knowledge Base Population (IKBP).

    Submitted 31 May, 2015; originally announced June 2015.

  24. arXiv:1308.3748  [pdf, ps, other

    nucl-th cond-mat.stat-mech nucl-ex

    Work distribution of an expanding gas and transverse energy production in relativistic heavy ion collisions

    Authors: Bin Zhang, Jay P. Mayfield

    Abstract: The work distribution of an expanding extreme relativistic gas is shown to be a gamma distribution with a different shape parameter as compared with its non-relativistic counterpart. This implies that the shape of the transverse energy distribution in relativistic heavy ion collisions depends on the particle contents during the evolution of the hot and dense matter. Therefore, transverse energy fl… ▽ More

    Submitted 18 January, 2014; v1 submitted 16 August, 2013; originally announced August 2013.

    Comments: Added discussions in Section 3, improved introductions to Eqs. (4) and (15), corrected typos, updated references