Skip to main content

Showing 1–20 of 20 results for author: Pradeep, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16828  [pdf, other

    cs.IR cs.AI cs.CL

    Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track

    Authors: Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam, Eric Zhang, Ryan Nguyen, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Did you try out the new Bing Search? Or maybe you fiddled around with Google AI~Overviews? These might sound familiar because the modern-day search stack has recently evolved to include retrieval-augmented generation (RAG) systems. They allow searching and incorporating real-time data into large language models (LLMs) to provide a well-informed, attributed, concise summary in contrast to the tradi… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.11706  [pdf, other

    cs.IR cs.CL cs.LG

    Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels

    Authors: Jasper Xian, Saron Samuel, Faraz Khoubsirat, Ronak Pradeep, Md Arafat Sultan, Radu Florian, Salim Roukos, Avirup Sil, Christopher Potts, Omar Khattab

    Abstract: We develop a method for training small-scale (under 100M parameter) neural information retrieval models with as few as 10 gold relevance labels. The method depends on generating synthetic queries for documents using a language model (LM), and the key step is that we automatically optimize the LM prompt that is used to generate these queries based on training quality. In experiments with the BIRCO… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.06519  [pdf, other

    cs.IR

    UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor

    Authors: Shivani Upadhyay, Ronak Pradeep, Nandan Thakur, Nick Craswell, Jimmy Lin

    Abstract: Copious amounts of relevance judgments are necessary for the effective training and accurate evaluation of retrieval systems. Conventionally, these judgments are made by human assessors, rendering this process expensive and laborious. A recent study by Thomas et al. from Microsoft Bing suggested that large language models (LLMs) can accurately perform the relevance assessment task and provide huma… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures

  4. arXiv:2404.01626  [pdf, other

    cs.CL cs.IR

    Entity Disambiguation via Fusion Entity Decoding

    Authors: Junxiong Wang, Ali Mousavi, Omar Attia, Ronak Pradeep, Saloni Potdar, Alexander M. Rush, Umar Farooq Minhas, Yunyao Li

    Abstract: Entity disambiguation (ED), which links the mentions of ambiguous entities to their referent entities in a knowledge base, serves as a core component in entity linking (EL). Existing generative approaches demonstrate improved accuracy compared to classification approaches under the standardized ZELDA benchmark. Nevertheless, generative approaches suffer from the need for large-scale pre-training a… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL'24 main

  5. arXiv:2312.16159  [pdf, other

    cs.IR cs.CL

    Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages

    Authors: Mofetoluwa Adeyemi, Akintunde Oladipo, Ronak Pradeep, Jimmy Lin

    Abstract: Large language models (LLMs) have shown impressive zero-shot capabilities in various document reranking tasks. Despite their successful implementations, there is still a gap in existing literature on their effectiveness in low-resource languages. To address this gap, we investigate how LLMs function as rerankers in cross-lingual information retrieval (CLIR) systems for African languages. Our imple… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  6. arXiv:2312.16098  [pdf, other

    cs.IR

    Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking with Seq2seq Encoder-Decoder Models

    Authors: Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin

    Abstract: Recent work in zero-shot listwise reranking using LLMs has achieved state-of-the-art results. However, these methods are not without drawbacks. The proposed methods rely on large LLMs with billions of parameters and limited context sizes. This paper introduces LiT5-Distill and LiT5-Score, two methods for efficient zero-shot listwise reranking, leveraging T5 sequence-to-sequence encoder-decoder mod… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  7. arXiv:2312.02724  [pdf, other

    cs.IR

    RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze!

    Authors: Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin

    Abstract: In information retrieval, proprietary large language models (LLMs) such as GPT-4 and open-source counterparts such as LLaMA and Vicuna have played a vital role in reranking. However, the gap between open-source and closed models persists, with reliance on proprietary, non-transparent models constraining reproducibility. Addressing this gap, we introduce RankZephyr, a state-of-the-art, open-source… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  8. arXiv:2309.15088  [pdf, other

    cs.IR cs.CL

    RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models

    Authors: Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin

    Abstract: Researchers have successfully applied large language models (LLMs) such as ChatGPT to reranking in an information retrieval context, but to date, such work has mostly been built on proprietary models hidden behind opaque API endpoints. This approach yields experimental results that are not reproducible and non-deterministic, threatening the veracity of outcomes that build on such shaky foundations… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  9. arXiv:2308.14963  [pdf, other

    cs.IR

    Vector Search with OpenAI Embeddings: Lucene Is All You Need

    Authors: Jimmy Lin, Ronak Pradeep, Tommaso Teofili, Jasper Xian

    Abstract: We provide a reproducible, end-to-end demonstration of vector search with OpenAI embeddings using Lucene on the popular MS MARCO passage ranking test collection. The main goal of our work is to challenge the prevailing narrative that a dedicated vector store is necessary to take advantage of recent advances in deep neural networks as applied to search. Quite the contrary, we show that hierarchical… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  10. arXiv:2306.07875  [pdf, other

    cs.IR cs.AI cs.CL cs.HC

    ReadProbe: A Demo of Retrieval-Enhanced Large Language Models to Support Lateral Reading

    Authors: Dake Zhang, Ronak Pradeep

    Abstract: With the rapid growth and spread of online misinformation, people need tools to help them evaluate the credibility and accuracy of online information. Lateral reading, a strategy that involves cross-referencing information with multiple sources, may be an effective approach to achieving this goal. In this paper, we present ReadProbe, a tool to support lateral reading, powered by generative large l… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  11. arXiv:2305.11841  [pdf, other

    cs.IR cs.CL

    How Does Generative Retrieval Scale to Millions of Passages?

    Authors: Ronak Pradeep, Kai Hui, Jai Gupta, Adam D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran

    Abstract: Popularized by the Differentiable Search Index, the emerging paradigm of generative retrieval re-frames the classic information retrieval problem into a sequence-to-sequence modeling task, forgoing external indices and encoding an entire document corpus within a single Transformer. Although many different approaches have been proposed to improve the effectiveness of generative retrieval, they have… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  12. arXiv:2305.02156  [pdf, other

    cs.IR cs.CL

    Zero-Shot Listwise Document Reranking with a Large Language Model

    Authors: Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin

    Abstract: Supervised ranking methods based on bi-encoder or cross-encoder architectures have shown success in multi-stage text ranking tasks, but they require large amounts of relevance judgments as training data. In this work, we propose Listwise Reranker with a Large Language Model (LRL), which achieves strong reranking effectiveness without using any task-specific training data. Different from the existi… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  13. arXiv:2304.03141  [pdf, other

    cs.DC

    For-Each Operations in Collaborative Apps

    Authors: Matthew Weidner, Ria Pradeep, Benito Geordie, Heather Miller

    Abstract: Conflict-free Replicated Data Types (CRDTs) allow collaborative access to an app's data. We describe a novel CRDT operation, for-each on the list of CRDTs, and demonstrate its use in collaborative apps. Our for-each operation applies a given mutation to each element of a list, including elements inserted concurrently. This often preserves user intention in a way that would otherwise require custom… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 figures, to appear at PaPoC 2023

  14. arXiv:2212.02618  [pdf, other

    cs.DC cs.DS

    Collabs: A Flexible and Performant CRDT Collaboration Framework

    Authors: Matthew Weidner, Huairui Qi, Maxime Kjaer, Ria Pradeep, Benito Geordie, Yicheng Zhang, Gregory Schare, Xuan Tang, Sicheng Xing, Heather Miller

    Abstract: A collaboration framework is a distributed system that serves as the data layer for a collaborative app. Conflict-free Replicated Data Types (CRDTs) are a promising theoretical technique for implementing collaboration frameworks. However, existing frameworks are inflexible: they are often one-off implementations of research papers or only permit a restricted set of CRDT semantics, and they do not… ▽ More

    Submitted 13 October, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 18 pages, 19 figures

  15. arXiv:2104.05740  [pdf, ps, other

    cs.CL cs.IR

    A Replication Study of Dense Passage Retriever

    Authors: Xueguang Ma, Kai Sun, Ronak Pradeep, Jimmy Lin

    Abstract: Text retrieval using learned dense representations has recently emerged as a promising alternative to "traditional" text retrieval using sparse bag-of-words representations. One recent work that has garnered much attention is the dense passage retriever (DPR) technique proposed by Karpukhin et al. (2020) for end-to-end open-domain question answering. We present a replication study of this work, st… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

  16. arXiv:2102.10073  [pdf, ps, other

    cs.IR

    Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense Representations

    Authors: Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Nogueira

    Abstract: Pyserini is an easy-to-use Python toolkit that supports replicable IR research by providing effective first-stage retrieval in a multi-stage ranking architecture. Our toolkit is self-contained as a standard Python package and comes with queries, relevance judgments, pre-built indexes, and evaluation scripts for many commonly used IR test collections. We aim to support, out of the box, the entire r… ▽ More

    Submitted 19 February, 2021; originally announced February 2021.

  17. arXiv:2101.05667  [pdf, other

    cs.IR cs.CL

    The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models

    Authors: Ronak Pradeep, Rodrigo Nogueira, Jimmy Lin

    Abstract: We propose a design pattern for tackling text ranking problems, dubbed "Expando-Mono-Duo", that has been empirically validated for a number of ad hoc retrieval tasks in different domains. At the core, our design relies on pretrained sequence-to-sequence models within a standard multi-stage ranking architecture. "Expando" refers to the use of document expansion techniques to enrich keyword represen… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

  18. arXiv:2010.11930  [pdf, other

    cs.CL cs.IR

    Scientific Claim Verification with VERT5ERINI

    Authors: Ronak Pradeep, Xueguang Ma, Rodrigo Nogueira, Jimmy Lin

    Abstract: This work describes the adaptation of a pretrained sequence-to-sequence model to the task of scientific claim verification in the biomedical domain. We propose VERT5ERINI that exploits T5 for abstract retrieval, sentence selection and label prediction, which are three critical sub-tasks of claim verification. We evaluate our pipeline on SCIFACT, a newly curated dataset that requires models to not… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  19. arXiv:2007.07846  [pdf, other

    cs.IR cs.CL

    Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset

    Authors: Edwin Zhang, Nikhil Gupta, Raphael Tang, Xiao Han, Ronak Pradeep, Kuang Lu, Yue Zhang, Rodrigo Nogueira, Kyunghyun Cho, Hui Fang, Jimmy Lin

    Abstract: We present Covidex, a search engine that exploits the latest neural ranking models to provide information access to the COVID-19 Open Research Dataset curated by the Allen Institute for AI. Our system has been online and serving users since late March 2020. The Covidex is the user application component of our three-pronged strategy to develop technologies for hel** domain experts tackle the ongo… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: text overlap with arXiv:2004.05125

  20. arXiv:0710.4824  [pdf

    cs.AR

    FPGA based Agile Algorithm-On-Demand Co-Processor

    Authors: R. Pradeep, S. Vinay, Sanjay Burman, V. Kamakoti

    Abstract: With growing computational needs of many real-world applications, frequently changing specifications of standards, and the high design and NRE costs of ASICs, an algorithm-agile FPGA based co-processor has become a viable alternative. In this article, we report about the general design of an algorith-agile co-processor and the proof-of-concept implementation.

    Submitted 25 October, 2007; originally announced October 2007.

    Comments: Submitted on behalf of EDAA (http://www.edaa.com/)

    Journal ref: Dans Design, Automation and Test in Europe | Designers'Forum - DATE'05, Munich : Allemagne (2005)