Retrieval Augmented Generation for Domain-specific Question Answering
Authors:
Sanat Sharma,
David Seunghyun Yoon,
Franck Dernoncourt,
Dewang Sultania,
Karishma Bagga,
Mengjiao Zhang,
Trung Bui,
Varun Kotte
Abstract:
Question answering (QA) has become an important application in the advanced development of large language models. General pre-trained large language models for question-answering are not trained to properly understand the knowledge or terminology for a specific domain, such as finance, healthcare, education, and customer service for a product. To better cater to domain-specific understanding, we b…
▽ More
Question answering (QA) has become an important application in the advanced development of large language models. General pre-trained large language models for question-answering are not trained to properly understand the knowledge or terminology for a specific domain, such as finance, healthcare, education, and customer service for a product. To better cater to domain-specific understanding, we build an in-house question-answering system for Adobe products. We propose a novel framework to compile a large question-answer database and develop the approach for retrieval-aware finetuning of a Large Language model. We showcase that fine-tuning the retriever leads to major improvements in the final generation. Our overall approach reduces hallucinations during generation while kee** in context the latest retrieval information for contextual grounding.
△ Less
Submitted 29 May, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.