Skip to main content

Showing 1–3 of 3 results for author: Dhakal, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.09654  [pdf, other

    cs.AI cs.CL cs.HC cs.MA stat.ML

    GPT-4's assessment of its performance in a USMLE-based case study

    Authors: Uttam Dhakal, Aniket Kumar Singh, Suman Devkota, Yogesh Sapkota, Bishal Lamichhane, Suprinsa Paudyal, Chandra Dhakal

    Abstract: This study investigates GPT-4's assessment of its performance in healthcare applications. A simple prompting technique was used to prompt the LLM with questions taken from the United States Medical Licensing Examination (USMLE) questionnaire and it was tasked to evaluate its confidence score before posing the question and after asking the question. The questionnaire was categorized into two groups… ▽ More

    Submitted 26 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  2. arXiv:2309.16145  [pdf, other

    cs.CL cs.CY cs.HC

    The Confidence-Competence Gap in Large Language Models: A Cognitive Study

    Authors: Aniket Kumar Singh, Suman Devkota, Bishal Lamichhane, Uttam Dhakal, Chandra Dhakal

    Abstract: Large Language Models (LLMs) have acquired ubiquitous attention for their performances across diverse domains. Our study here searches through LLMs' cognitive abilities and confidence dynamics. We dive deep into understanding the alignment between their self-assessed confidence and actual performance. We exploit these models with diverse sets of questionnaires and real-world scenarios and extract… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 19 pages, 8 Figures, to be published in a journal (Journal TBD), All Authors contributed equally and were Supervised by Chandra Dhakal

    MSC Class: ACM-class: I.2.0

  3. arXiv:2306.11892  [pdf, other

    cs.CL

    Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications

    Authors: Saed Rezayi, Zhengliang Liu, Zihao Wu, Chandra Dhakal, Bao Ge, Haixing Dai, Gengchen Mai, Ninghao Liu, Chen Zhen, Tianming Liu, Sheng Li

    Abstract: This paper explores new frontiers in agricultural natural language processing by investigating the effectiveness of using food-related text corpora for pretraining transformer-based language models. In particular, we focus on the task of semantic matching, which involves establishing map**s between food descriptions and nutrition data. To accomplish this, we fine-tune a pre-trained transformer-b… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.