
Showing 1–15 of 15 results for author: Semnani, S J

  1. arXiv:2406.17952  [pdf, other]

    cs.LG cs.CG

    LINSCAN -- A Linearity Based Clustering Algorithm

    Authors: Andrew Dennehy, Xiaoyu Zou, Shabnam J. Semnani, Yuri Fialko, Alexander Cloninger

    Abstract: DBSCAN and OPTICS are powerful algorithms for identifying clusters of points in domains where few assumptions can be made about the structure of the data. In this paper, we leverage these strengths and introduce a new algorithm, LINSCAN, designed to seek lineated clusters that are difficult to find and isolate with existing methods. In particular, by embedding points as normal distributions approx…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.00562  [pdf, other]

    cs.CL

    SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic Parsing

    Authors: Heidi C. Zhang, Sina J. Semnani, Farhad Ghassemi, Jialiang Xu, Shicheng Liu, Monica S. Lam

    Abstract: We introduce SPAGHETTI: Semantic Parsing Augmented Generation for Hybrid English information from Text Tables and Infoboxes, a hybrid question-answering (QA) pipeline that utilizes information from heterogeneous knowledge sources, including knowledge base, text, tables, and infoboxes. Our LLM-augmented approach achieves state-of-the-art performance on the Compmix dataset, the most comprehensive he…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  3. arXiv:2405.17840  [pdf, other]

    cs.CL

    Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents

    Authors: Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gaël de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam

    Abstract: Creating multilingual task-oriented dialogue (TOD) agents is challenging due to the high cost of training data acquisition. Following the research trend of improving training data efficiency, we show for the first time, that in-context learning is sufficient to tackle multilingual TOD. To handle the challenging dialogue state tracking (DST) subtask, we break it down to simpler steps that are mor…

    Submitted 16 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2311.17336  [pdf, other]

    cs.CE

    Incremental Neural Controlled Differential Equations for Modeling of Path-dependent Material Behavior

    Authors: Yangzi He, Shabnam J. Semnani

    Abstract: Data-driven surrogate modeling has emerged as a promising approach for reducing computational expenses of multiscale simulations. Recurrent Neural Network (RNN) is a common choice for modeling of path-dependent behavior. However, previous studies have shown that RNNs fail to make predictions that are consistent with perturbation in the input strain, leading to potential oscillations and lack of co…

    Submitted 28 December, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

  5. arXiv:2311.09818  [pdf, other]

    cs.CL cs.PL

    SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models

    Authors: Shicheng Liu, Jialiang Xu, Wesley Tjangnaka, Sina J. Semnani, Chen Jie Yu, Monica S. Lam

    Abstract: While most conversational agents are grounded on either free-text or structured knowledge, many knowledge corpora consist of hybrid sources. This paper presents the first conversational agent that supports the full generality of hybrid data access for large knowledge corpora, through a language we developed called SUQL (Structured and Unstructured Query Language). Specifically, SUQL extends SQL wi…

    Submitted 13 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  6. arXiv:2306.17674  [pdf, other]

    cs.CL

    X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents

    Authors: Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam

    Abstract: Task-oriented dialogue research has mainly focused on a few popular languages like English and Chinese, due to the high dataset creation cost for a new language. To reduce the cost, we apply manual editing to automatically translated data. We create a new multilingual benchmark, X-RiSAWOZ, by translating the Chinese RiSAWOZ to 4 languages: English, French, Hindi, Korean; and a code-mixed English-H…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 Findings

  7. WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia

    Authors: Sina J. Semnani, Violet Z. Yao, Heidi C. Zhang, Monica S. Lam

    Abstract: This paper presents the first few-shot LLM-based chatbot that almost never hallucinates and has high conversationality and low latency. WikiChat is grounded on the English Wikipedia, the largest curated free-text corpus. WikiChat generates a response from an LLM, retains only the grounded facts, and combines them with additional information it retrieves from the corpus to form factual and engagi…

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  8. arXiv:2305.14202  [pdf, other]

    cs.CL

    Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata

    Authors: Silei Xu, Shicheng Liu, Theo Culhane, Elizaveta Pertseva, Meng-Hsi Wu, Sina J. Semnani, Monica S. Lam

    Abstract: While large language models (LLMs) can answer many questions correctly, they can also hallucinate and give wrong answers. Wikidata, with its over 12 billion facts, can be used to ground LLMs to improve their factuality. This paper presents WikiWebQuestions, a high-quality question answering benchmark for Wikidata. Ported over from WebQuestions for Freebase, it consists of real-world data with SPAR…

    Submitted 5 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Main

  9. arXiv:2302.09424  [pdf, other]

    cs.CL

    Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation

    Authors: Mehrad Moradshahi, Sina J. Semnani, Monica S. Lam

    Abstract: Task-oriented Dialogue (ToD) agents are mostly limited to a few widely-spoken languages, mainly due to the high cost of acquiring training data for each language. Existing low-cost approaches that rely on cross-lingual embeddings or naive machine translation sacrifice a lot of accuracy for data efficiency, and largely fail in creating a usable dialogue agent. We propose automatic methods that use…

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: Published in EACL 2023

  10. arXiv:2203.12751  [pdf, other]

    cs.PL cs.CL

    ThingTalk: An Extensible, Executable Representation Language for Task-Oriented Dialogues

    Authors: Monica S. Lam, Giovanni Campagna, Mehrad Moradshahi, Sina J. Semnani, Silei Xu

    Abstract: Task-oriented conversational agents rely on semantic parsers to translate natural language to formal representations. In this paper, we propose the design and rationale of the ThingTalk formal representation, and how the design improves the development of transactional task-oriented agents. ThingTalk is built on four core principles: (1) representing user requests directly as executable statemen…

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: 8 pages, 3 figures

  11. arXiv:2105.06060  [pdf, other]

    cs.LG stat.ML

    House Price Prediction using Satellite Imagery

    Authors: Sina Jandaghi Semnani, Hoormazd Rezaei

    Abstract: In this paper we show how using satellite images can improve the accuracy of housing price estimation models. Using Los Angeles County's property assessment dataset, by transferring learning from an Inception-v3 model pretrained on ImageNet, we could achieve an improvement of ~10% in R-squared score compared to two baseline models that only use non-image features of the house.

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: Stanford CS230 Deep Learning, Winter 2018

  12. arXiv:2010.05106  [pdf, other]

    cs.CL cs.LG

    Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine Translation

    Authors: Mehrad Moradshahi, Giovanni Campagna, Sina J. Semnani, Silei Xu, Monica S. Lam

    Abstract: We propose Semantic Parser Localizer (SPL), a toolkit that leverages Neural Machine Translation (NMT) systems to localize a semantic parser for a new language. Our methodology is to (1) generate training data automatically in the target language by augmenting machine-translated datasets with local entities scraped from public websites, (2) add a few-shot boost of human-translated sentences and tra…

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: Published in EMNLP 2020

  13. arXiv:2010.04806  [pdf, other]

    cs.CL

    AutoQA: From Databases To QA Semantic Parsers With Only Synthetic Training Data

    Authors: Silei Xu, Sina J. Semnani, Giovanni Campagna, Monica S. Lam

    Abstract: We propose AutoQA, a methodology and toolkit to generate semantic parsers that answer questions on databases, with no manual effort. Given a database schema and its data, AutoQA automatically generates a large set of high-quality questions for training that covers different database operations. It uses automatic paraphrasing combined with template-based parsing to find alternative expressions of a…

    Submitted 7 June, 2021; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: To appear in EMNLP 2020

  14. arXiv:2009.07968  [pdf, other]

    cs.CL

    A Few-Shot Semantic Parser for Wizard-of-Oz Dialogues with the Precise ThingTalk Representation

    Authors: Giovanni Campagna, Sina J. Semnani, Ryan Kearns, Lucas Jun Koba Sato, Silei Xu, Monica S. Lam

    Abstract: Previous attempts to build effective semantic parsers for Wizard-of-Oz (WOZ) conversations suffer from the difficulty in acquiring a high-quality, manually annotated training set. Approaches based only on dialogue synthesis are insufficient, as dialogues generated from state-machine based models are poor approximations of real-life conversations. Furthermore, previously proposed dialogue state rep…

    Submitted 7 April, 2022; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: Published in Findings of ACL 2022, 9 pages

  15. arXiv:2009.00914  [pdf, other]

    cs.CL cs.AI

    Revisiting the Open-Domain Question Answering Pipeline

    Authors: Sina J. Semnani, Manish Pandey

    Abstract: Open-domain question answering (QA) is the task of identifying answers to natural questions from a large corpus of documents. The typical open-domain QA system starts with information retrieval to select a subset of documents from the corpus, which are then processed by a machine reader to select the answer spans. This paper describes Mindstone, an open-domain QA system that consists of a new mult…

    Submitted 2 September, 2020; originally announced September 2020.