
Showing 1–5 of 5 results for author: Deguchi, J

Searching in archive cs.
  1. arXiv:2404.06004  [pdf, other]

    cs.IR cs.CL cs.DS

    AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval

    Authors: Kento Tatsuno, Daisuke Miyashita, Taiga Ikeda, Kiyoshi Ishiyama, Kazunari Sumiyoshi, Jun Deguchi

    Abstract: In approximate nearest neighbor search (ANNS) methods based on approximate proximity graphs, DiskANN achieves good recall-speed balance for large-scale datasets using both of RAM and storage. Despite it claims to save memory usage by loading compressed vectors by product quantization (PQ), its memory usage increases in proportion to the scale of datasets. In this paper, we propose All-in-Storage A…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 5 pages, 6 figures and 4 tables

  2. arXiv:2308.10633  [pdf, other]

    cs.CL cs.AI

    RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models

    Authors: Yasuto Hoshi, Daisuke Miyashita, Youyang Ng, Kento Tatsuno, Yasuhiro Morioka, Osamu Torii, Jun Deguchi

    Abstract: Retrieval-augmented large language models (R-LLMs) combine pre-trained large language models (LLMs) with information retrieval systems to improve the accuracy of factual question-answering. However, current libraries for building R-LLMs provide high-level abstractions without sufficient transparency for evaluating and optimizing prompts within specific inference processes such as retrieval and gen…

    Submitted 16 October, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: 18 pages, 2 figures, see https://youtu.be/JYbm75qnfTg for the demonstration screencast, accepted by EMNLP 2023 System Demonstrations

  3. arXiv:2308.03983  [pdf, other]

    cs.CL cs.AI

    SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool

    Authors: Youyang Ng, Daisuke Miyashita, Yasuto Hoshi, Yasuhiro Morioka, Osamu Torii, Tomoya Kodama, Jun Deguchi

    Abstract: Large Language Model (LLM) based Generative AI systems have seen significant progress in recent years. Integrating a knowledge retrieval architecture allows for seamless integration of private data into publicly available Generative AI systems using pre-trained LLM without requiring additional model fine-tuning. Moreover, Retrieval-Centric Generation (RCG) approach, a promising future research dir…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 12 pages, 6 figures

  4. arXiv:2303.05153  [pdf, other]

    cs.CL cs.AI

    Can a Frozen Pretrained Language Model be used for Zero-shot Neural Retrieval on Entity-centric Questions?

    Authors: Yasuto Hoshi, Daisuke Miyashita, Yasuhiro Morioka, Youyang Ng, Osamu Torii, Jun Deguchi

    Abstract: Neural document retrievers, including dense passage retrieval (DPR), have outperformed classical lexical-matching retrievers, such as BM25, when fine-tuned and tested on specific question-answering datasets. However, it has been shown that the existing dense retrievers do not generalize well not only out of domain but even in domain such as Wikipedia, especially when a named entity in a question i…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted to Workshop on Knowledge Augmented Methods for Natural Language Processing, in conjunction with AAAI 2023

  5. arXiv:2204.01186  [pdf, ps, other]

    cs.CV

    Revisiting a kNN-based Image Classification System with High-capacity Storage

    Authors: Kengo Nakata, Youyang Ng, Daisuke Miyashita, Asuka Maki, Yu-Chieh Lin, Jun Deguchi

    Abstract: In existing image classification systems that use deep neural networks, the knowledge needed for image classification is implicitly stored in model parameters. If users want to update this knowledge, then they need to fine-tune the model parameters. Moreover, users cannot verify the validity of inference results or evaluate the contribution of knowledge to the results. In this paper, we investigat…

    Submitted 28 July, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Accepted to ECCV 2022 (Oral)