Skip to main content

Showing 1–6 of 6 results for author: Aksitov, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.10003  [pdf, other

    cs.CL

    ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

    Authors: Renat Aksitov, Sobhan Miryoosefi, Zonglin Li, Daliang Li, Sheila Babayan, Kavya Kopparapu, Zachary Fisher, Ruiqi Guo, Sushant Prakash, Pranesh Srinivasan, Manzil Zaheer, Felix Yu, Sanjiv Kumar

    Abstract: Answering complex natural language questions often necessitates multi-step reasoning and integrating external information. Several systems have combined knowledge retrieval with a large language model (LLM) to answer such questions. These systems, however, suffer from various failure cases, and we cannot directly train them end-to-end to fix such failures, as interaction with external knowledge is… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 19 pages, 4 figures, 4 tables, 8 listings

  2. arXiv:2311.17311  [pdf, other

    cs.CL cs.AI

    Universal Self-Consistency for Large Language Model Generation

    Authors: Xinyun Chen, Renat Aksitov, Uri Alon, Jie Ren, Kefan Xiao, Pengcheng Yin, Sushant Prakash, Charles Sutton, Xuezhi Wang, Denny Zhou

    Abstract: Self-consistency with chain-of-thought prompting (CoT) has demonstrated remarkable performance gains on various challenging tasks, by utilizing multiple reasoning paths sampled from large language models (LLMs). However, self-consistency relies on the answer extraction process to aggregate multiple solutions, which is not applicable to free-form answers. In this work, we propose Universal Self-Con… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  3. arXiv:2311.07424  [pdf, other

    cs.CL cs.AI

    Hallucination Augmented Recitations for Language Models

    Authors: Abdullatif Köksal, Renat Aksitov, Chung-Ching Chang

    Abstract: Attribution is a key concept in large language models (LLMs) as it enables control over information sources and enhances the factuality of LLMs. While existing approaches utilize open book question answering to improve attribution, factual datasets may reward language models to recall facts that they already know from their pretraining data, not attribution. In contrast, counterfactual open book Q… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  4. arXiv:2306.01286  [pdf, other

    cs.CL cs.AI

    KL-Divergence Guided Temperature Sampling

    Authors: Chung-Ching Chang, David Reitter, Renat Aksitov, Yun-Hsuan Sung

    Abstract: Temperature sampling is a conventional approach to diversify large language model predictions. As temperature increases, the prediction becomes diverse but also vulnerable to hallucinations -- generating tokens that are sensible but not factual. One common approach to mitigate hallucinations is to provide source/grounding documents and the model is trained to produce predictions that bind to and a… ▽ More

    Submitted 29 November, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  5. arXiv:2305.13725  [pdf, other

    cs.CL cs.IR

    Conversational Recommendation as Retrieval: A Simple, Strong Baseline

    Authors: Raghav Gupta, Renat Aksitov, Samrat Phatale, Simral Chaudhary, Harrison Lee, Abhinav Rastogi

    Abstract: Conversational recommendation systems (CRS) aim to recommend suitable items to users through natural language conversation. However, most CRS approaches do not effectively utilize the signal provided by these conversations. They rely heavily on explicit external knowledge e.g., knowledge graphs to augment the models' understanding of the items and attributes, which is quite hard to scale. To allev… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: To appear at the 5th NLP4ConvAI workshop

  6. arXiv:2302.05578  [pdf, ps, other

    cs.CL cs.AI

    Characterizing Attribution and Fluency Tradeoffs for Retrieval-Augmented Large Language Models

    Authors: Renat Aksitov, Chung-Ching Chang, David Reitter, Siamak Shakeri, Yunhsuan Sung

    Abstract: Despite recent progress, it has been difficult to prevent semantic hallucinations in generative Large Language Models. One common solution to this is augmenting LLMs with a retrieval system and making sure that the generated output is attributable to the retrieved information. Given this new added constraint, it is plausible to expect that the overall quality of the output will be affected, for ex… ▽ More

    Submitted 14 February, 2023; v1 submitted 10 February, 2023; originally announced February 2023.