Skip to main content

Showing 1–10 of 10 results for author: Das, S S S

.
  1. arXiv:2404.03602  [pdf, other

    cs.CL

    Evaluating LLMs at Detecting Errors in LLM Responses

    Authors: Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang

    Abstract: With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Benchmark and code: https://github.com/psunlpgroup/ReaLMistake

  2. arXiv:2311.03748  [pdf, other

    cs.CL

    Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning

    Authors: Sarkar Snigdha Sarathi Das, Ranran Haoran Zhang, Peng Shi, Wenpeng Yin, Rui Zhang

    Abstract: Unified Sequence Labeling that articulates different sequence labeling problems such as Named Entity Recognition, Relation Extraction, Semantic Role Labeling, etc. in a generalized sequence-to-sequence format opens up the opportunity to make the maximum utilization of large language model knowledge toward structured prediction. Unfortunately, this requires formatting them into specialized augmente… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted by EMNLP 2023

  3. arXiv:2310.04381  [pdf, other

    cs.CR cs.AI cs.CL

    Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications

    Authors: Abdullah Al Ishtiaq, Sarkar Snigdha Sarathi Das, Syed Md Mukit Rashid, Ali Ranjbar, Kai Tu, Tianwei Wu, Zhezheng Song, Weixuan Wang, Mujtahid Akon, Rui Zhang, Syed Rafiul Hussain

    Abstract: In this paper, we present Hermes, an end-to-end framework to automatically generate formal representations from natural language cellular specifications. We first develop a neural constituency parser, NEUTREX, to process transition-relevant texts and extract transition components (i.e., states, conditions, and actions). We also design a domain-specific language to translate these transition compon… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted at USENIX Security 24

  4. arXiv:2309.13063  [pdf, other

    cs.IR cs.AI cs.CL

    Using Large Language Models to Generate, Validate, and Apply User Intent Taxonomies

    Authors: Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Scott Counts, Sarkar Snigdha Sarathi Das, Ali Montazer, Sathish Manivannan, Jennifer Neville, Xiaochuan Ni, Nagu Rangan, Tara Safavi, Siddharth Suri, Mengting Wan, Leijie Wang, Longqi Yang

    Abstract: Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics.… ▽ More

    Submitted 9 May, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Report number: MSR-TR-2023-32

  5. arXiv:2309.08827  [pdf, other

    cs.CL cs.AI

    S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs

    Authors: Sarkar Snigdha Sarathi Das, Chirag Shah, Mengting Wan, Jennifer Neville, Longqi Yang, Reid Andersen, Georg Buscher, Tara Safavi

    Abstract: The traditional Dialogue State Tracking (DST) problem aims to track user preferences and intents in user-agent conversations. While sufficient for task-oriented dialogue systems supporting narrow domain applications, the advent of Large Language Model (LLM)-based chat systems has introduced many real-world intricacies in open-domain dialogues. These intricacies manifest in the form of increased co… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  6. arXiv:2109.07589  [pdf, other

    cs.CL

    CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning

    Authors: Sarkar Snigdha Sarathi Das, Arzoo Katiyar, Rebecca J. Passonneau, Rui Zhang

    Abstract: Named Entity Recognition (NER) in Few-Shot setting is imperative for entity tagging in low resource domains. Existing approaches only learn class-specific semantic features and intermediate representations from source domains. This affects generalizability to unseen target domains, resulting in suboptimal performances. To this end, we present CONTaiNER, a novel contrastive learning technique that… ▽ More

    Submitted 28 March, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted by ACL 2022 (Main Conference, Long Paper)

  7. arXiv:2011.10187  [pdf, ps, other

    cs.IR cs.LG

    A Survey on Deep Learning Based Point-Of-Interest (POI) Recommendations

    Authors: Md. Ashraful Islam, Mir Mahathir Mohammad, Sarkar Snigdha Sarathi Das, Mohammed Eunus Ali

    Abstract: Location-based Social Networks (LBSNs) enable users to socialize with friends and acquaintances by sharing their check-ins, opinions, photos, and reviews. Huge volume of data generated from LBSNs opens up a new avenue of research that gives birth to a new sub-field of recommendation systems, known as Point-of-Interest (POI) recommendation. A POI recommendation technique essentially exploits users'… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: 21 pages, 5 figures

  8. arXiv:2011.00753  [pdf, other

    cs.LG eess.SP

    BayesBeat: Reliable Atrial Fibrillation Detection from Noisy Photoplethysmography Data

    Authors: Sarkar Snigdha Sarathi Das, Subangkar Karmaker Shanto, Masum Rahman, Md. Saiful Islam, Atif Rahman, Mohammad Mehedy Masud, Mohammed Eunus Ali

    Abstract: Smartwatches or fitness trackers have garnered a lot of popularity as potential health tracking devices due to their affordable and longitudinal monitoring capabilities. To further widen their health tracking capabilities, in recent years researchers have started to look into the possibility of Atrial Fibrillation (AF) detection in real-time leveraging photoplethysmography (PPG) data, an inexpensi… ▽ More

    Submitted 16 September, 2022; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: IMWUT March 2022, Vol 6 Article 8 (UbiComp 2022)

    Journal ref: Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 6, 1, Article 8 (March 2022), 21 pages

  9. arXiv:2009.00254  [pdf, ps, other

    cs.LG stat.ML

    Boosting House Price Predictions using Geo-Spatial Network Embedding

    Authors: Sarkar Snigdha Sarathi Das, Mohammed Eunus Ali, Yuan-Fang Li, Yong-Bin Kang, Timos Sellis

    Abstract: Real estate contributes significantly to all major economies around the world. In particular, house prices have a direct impact on stakeholders, ranging from house buyers to financing companies. Thus, a plethora of techniques have been developed for real estate price prediction. Most of the existing techniques rely on different house features to build a variety of prediction models to predict hous… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: 23 pages, 5 figures, 5 tables

  10. arXiv:1912.05765  [pdf, other

    cs.CV

    CCCNet: An Attention Based Deep Learning Framework for Categorized Crowd Counting

    Authors: Sarkar Snigdha Sarathi Das, Syed Md. Mukit Rashid, Mohammed Eunus Ali

    Abstract: Crowd counting problem that counts the number of people in an image has been extensively studied in recent years. In this paper, we introduce a new variant of crowd counting problem, namely "Categorized Crowd Counting", that counts the number of people sitting and standing in a given image. Categorized crowd counting has many real-world applications such as crowd monitoring, customer service, and… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: 12 pages, 5 figures