Skip to main content

Showing 1–14 of 14 results for author: Shah, R S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01047  [pdf, other

    cs.CL

    Development of Cognitive Intelligence in Pre-trained Language Models

    Authors: Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma

    Abstract: Recent studies show evidence for emergent cognitive abilities in Large Pre-trained Language Models (PLMs). The increasing cognitive alignment of these models has made them candidates for cognitive science theories. Prior research into the emergent cognitive abilities of PLMs has largely been path-independent to model training, i.e., has focused on the final model weights and not the intermediate s… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.16253  [pdf, other

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo , et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th… ▽ More

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.11106  [pdf, other

    cs.CL cs.AI

    From Intentions to Techniques: A Comprehensive Taxonomy and Challenges in Text Watermarking for Large Language Models

    Authors: Harsh Nishant Lalai, Aashish Anantha Ramakrishnan, Raj Sanjay Shah, Dongwon Lee

    Abstract: With the rapid growth of Large Language Models (LLMs), safeguarding textual content against unauthorized use is crucial. Text watermarking offers a vital solution, protecting both - LLM-generated and plain text sources. This paper presents a unified overview of different perspectives behind designing watermarking techniques, through a comprehensive survey of the research literature. Our work has t… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  4. arXiv:2405.16128  [pdf, other

    cs.AI cs.CL

    How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect

    Authors: Siddhartha K. Vemuri, Raj Sanjay Shah, Sashank Varma

    Abstract: How well do representations learned by ML models align with those of humans? Here, we consider concept representations learned by deep learning models and evaluate whether they show a fundamental behavioral signature of human concepts, the typicality effect. This is the finding that people judge some instances (e.g., robin) of a category (e.g., Bird) to be more typical than others (e.g., penguin).… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: To appear at CogSci 2024

  5. arXiv:2405.16042  [pdf, other

    cs.CL

    Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention

    Authors: Andrew Li, Xianle Feng, Siddhant Narang, Austin Peng, Tianle Cai, Raj Sanjay Shah, Sashank Varma

    Abstract: When reading temporarily ambiguous garden-path sentences, misinterpretations sometimes linger past the point of disambiguation. This phenomenon has traditionally been studied in psycholinguistic experiments using online measures such as reading times and offline measures such as comprehension questions. Here, we investigate the processing of garden-path sentences and the fate of lingering misinter… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by CogSci-24

  6. arXiv:2403.15482  [pdf, other

    cs.CL cs.HC cs.LG

    Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors

    Authors: Alicja Chaszczewicz, Raj Sanjay Shah, Ryan Louie, Bruce A Arnow, Robert Kraut, Diyi Yang

    Abstract: Realistic practice and tailored feedback are key processes for training peer counselors with clinical skills. However, existing mechanisms of providing feedback largely rely on human supervision. Peer counselors often lack mechanisms to receive detailed feedback from experienced mentors, making it difficult for them to support the large number of people with mental health issues who use peer couns… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  7. arXiv:2401.10393  [pdf, other

    cs.LG cs.AI

    Catastrophic Interference is Mitigated in Naturalistic Power-Law Learning Environments

    Authors: Atith Gandhi, Raj Sanjay Shah, Vijay Marupudi, Sashank Varma

    Abstract: Neural networks often suffer from catastrophic interference (CI): performance on previously learned tasks drops off significantly when learning a new task. This contrasts strongly with humans, who can sequentially learn new tasks without appreciably forgetting previous tasks. Prior work has explored various techniques for mitigating CI such as regularization, rehearsal, generative replay, and dist… ▽ More

    Submitted 22 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  8. arXiv:2312.10775  [pdf, other

    cs.HC

    What Makes Digital Support Effective? How Therapeutic Skills Affect Clinical Well-Being

    Authors: Anna Fang, Wenjie Yang, Raj Sanjay Shah, Yash Mathur, Diyi Yang, Haiyi Zhu, Robert Kraut

    Abstract: Online mental health support communities have grown in recent years for providing accessible mental and emotional health support through volunteer counselors. Despite millions of people participating in chat support on these platforms, the clinical effectiveness of these communities on mental health symptoms remains unknown. Furthermore, although volunteers receive some training based on establish… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

  9. arXiv:2311.04666  [pdf, other

    cs.CL cs.AI

    Pre-training LLMs using human-like development data corpus

    Authors: Khushi Bhardwaj, Raj Sanjay Shah, Sashank Varma

    Abstract: Pre-trained Large Language Models (LLMs) have shown success in a diverse set of language inference and understanding tasks. The pre-training stage of LLMs looks at a large corpus of raw textual data. The BabyLM shared task compares LLM pre-training to human language acquisition, where the number of tokens seen by 13-year-old kids is magnitudes smaller than the number of tokens seen by LLMs. In thi… ▽ More

    Submitted 10 January, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning

  10. arXiv:2305.10782  [pdf, other

    cs.AI

    Human Behavioral Benchmarking: Numeric Magnitude Comparison Effects in Large Language Models

    Authors: Raj Sanjay Shah, Vijay Marupudi, Reba Koenen, Khushi Bhardwaj, Sashank Varma

    Abstract: Large Language Models (LLMs) do not differentially represent numbers, which are pervasive in text. In contrast, neuroscience research has identified distinct neural representations for numbers and words. In this work, we investigate how well popular LLMs capture the magnitudes of numbers (e.g., that $4 < 5$) from a behavioral lens. Prior research on the representational capabilities of LLMs evalua… ▽ More

    Submitted 8 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: ACL findings 2023

  11. arXiv:2305.08982  [pdf, other

    cs.HC cs.CL

    Hel** the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback

    Authors: Shang-Ling Hsu, Raj Sanjay Shah, Prathik Senthil, Zahra Ashktorab, Casey Dugan, Werner Geyer, Diyi Yang

    Abstract: Millions of users come to online peer counseling platforms to seek support on diverse topics ranging from relationship stress to anxiety. However, studies show that online peer support groups are not always as effective as expected largely due to users' negative experiences with unhelpful counselors. Peer counselors are key to the success of online peer counseling platforms, but most of them often… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  12. arXiv:2211.05182  [pdf, ps, other

    cs.HC cs.AI

    Modeling Motivational Interviewing Strategies On An Online Peer-to-Peer Counseling Platform

    Authors: Raj Sanjay Shah, Faye Holt, Shirley Anugrah Hayati, Aastha Agarwal, Yi-Chia Wang, Robert E. Kraut, Diyi Yang

    Abstract: Millions of people participate in online peer-to-peer support sessions, yet there has been little prior research on systematic psychology-based evaluations of fine-grained peer-counselor behavior in relation to client satisfaction. This paper seeks to bridge this gap by map** peer-counselor chat-messages to motivational interviewing (MI) techniques. We annotate 14,797 utterances from 734 chat co… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted at CSCW 2022

  13. arXiv:2211.00083  [pdf, other

    cs.CL cs.AI cs.LG

    WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain

    Authors: Raj Sanjay Shah, Kunal Chawla, Dheeraj Eidnani, Agam Shah, Wendi Du, Sudheer Chava, Natraj Raman, Charese Smiley, Jiaao Chen, Diyi Yang

    Abstract: Pre-trained language models have shown impressive performance on a variety of tasks and domains. Previous research on financial language models usually employs a generic training scheme to train standard model architectures, without completely leveraging the richness of the financial data. We propose a novel domain specific Financial LANGuage model (FLANG) which uses financial keywords and phrases… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

  14. arXiv:2106.05728  [pdf

    cs.CV eess.IV

    Face mask detection using convolution neural network

    Authors: Riya Shah Rutva Shah

    Abstract: In the recent times, the Coronaviruses that are a big family of different viruses have become very common, contagious and dangerous to the whole human kind. It spreads human to human by exhaling the infection breath, which leaves droplets of the virus on different surface which is then inhaled by other person and catches the infection too. So it has become very important to protect ourselves and t… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 4 PAGES, 3 FIGURES, 1 TABLE