Skip to main content

Showing 1–14 of 14 results for author: Sheth, D

.
  1. arXiv:2406.19545  [pdf, other

    cs.CL cs.AI

    Leveraging Machine-Generated Rationales to Facilitate Social Meaning Detection in Conversations

    Authors: Ritam Dutt, Zhen Wu, Kelly Shi, Divyanshu Sheth, Prakhar Gupta, Carolyn Penstein Rose

    Abstract: We present a generalizable classification approach that leverages Large Language Models (LLMs) to facilitate the detection of implicitly encoded social meaning in conversations. We design a multi-faceted prompt to extract a textual explanation of the reasoning that connects visible cues to underlying social meanings. These extracted explanations or rationales serve as augmentations to the conversa… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: To appear at The Proceedings of the Association for Computational Linguistics, 2024

  2. arXiv:2404.00566  [pdf, other

    cs.SE cs.CL

    CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks

    Authors: Yiqing Xie, Alex Xie, Divyanshu Sheth, Pengfei Liu, Daniel Fried, Carolyn Rose

    Abstract: To facilitate evaluation of code generation systems across diverse scenarios, we present CodeBenchGen, a framework to create scalable execution-based benchmarks that only requires light guidance from humans. Specifically, we leverage a large language model (LLM) to convert an arbitrary piece of code into an evaluation example, including test cases for execution-based evaluation. We illustrate the… ▽ More

    Submitted 7 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  3. arXiv:2310.08383  [pdf, other

    cs.CL cond-mat.mtrl-sci

    Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction

    Authors: Kausik Hira, Mohd Zaki, Dhruvil Sheth, Mausam, N M Anoop Krishnan

    Abstract: The discovery of new materials has a documented history of propelling human progress for centuries and more. The behaviour of a material is a function of its composition, structure, and properties, which further depend on its processing and testing conditions. Recent developments in deep learning and natural language processing have enabled information extraction at scale from published literature… ▽ More

    Submitted 26 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Journal ref: Digital Discovery, 2024, Advance Article

  4. arXiv:2211.17046  [pdf, other

    cs.CL cs.CY

    Rationale-Guided Few-Shot Classification to Detect Abusive Language

    Authors: Punyajoy Saha, Divyanshu Sheth, Kushal Kedia, Binny Mathew, Animesh Mukherjee

    Abstract: Abusive language is a concerning problem in online social media. Past research on detecting abusive language covers different platforms, languages, demographies, etc. However, models trained using these datasets do not perform well in cross-domain evaluation settings. To overcome this, a common strategy is to use a few samples from the target domain to train models to get better performance in tha… ▽ More

    Submitted 27 July, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: 11 pages, 14 tables, 3 figures, The code repository is https://github.com/punyajoy/RGFS_ECAI

  5. arXiv:2210.13055  [pdf, other

    cs.CL

    A Unified Framework for Pun Generation with Humor Principles

    Authors: Yufei Tian, Divyanshu Sheth, Nanyun Peng

    Abstract: We propose a unified framework to generate both homophonic and homographic puns to resolve the split-up in existing works. Specifically, we incorporate three linguistic attributes of puns to the language models: ambiguity, distinctiveness, and surprise. Our framework consists of three parts: 1) a context words/phrases selector to promote the aforementioned attributes, 2) a generation model trained… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  6. arXiv:2112.13237  [pdf, other

    cs.CL cs.AI cs.IR

    CABACE: Injecting Character Sequence Information and Domain Knowledge for Enhanced Acronym and Long-Form Extraction

    Authors: Nithish Kannen, Divyanshu Sheth, Abhranil Chandra, Shubhraneel Pal

    Abstract: Acronyms and long-forms are commonly found in research documents, more so in documents from scientific and legal domains. Many acronyms used in such documents are domain-specific and are very rarely found in normal text corpora. Owing to this, transformer-based NLP models often detect OOV (Out of Vocabulary) for acronym tokens, especially for non-English languages, and their performance suffers wh… ▽ More

    Submitted 25 December, 2021; originally announced December 2021.

  7. arXiv:2109.05771  [pdf, other

    cs.CL

    Perturbation CheckLists for Evaluating NLG Evaluation Metrics

    Authors: Ananya B. Sai, Tanay Dixit, Dev Yashpal Sheth, Sreyas Mohan, Mitesh M. Khapra

    Abstract: Natural Language Generation (NLG) evaluation is a multifaceted task requiring assessment of multiple desirable criteria, e.g., fluency, coherency, coverage, relevance, adequacy, overall quality, etc. Across existing datasets for 6 NLG tasks, we observe that the human evaluation scores on these multiple criteria are often not correlated. For example, there is a very low correlation between human sc… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021. See https://iitmnlp.github.io/EvalEval/ for our templates and code

  8. arXiv:2101.07770  [pdf

    cond-mat.mtrl-sci eess.IV

    Develo** and Evaluating Deep Neural Network-based Denoising for Nanoparticle TEM Images with Ultra-low Signal-to-Noise

    Authors: Joshua L. Vincent, Ramon Manzorro, Sreyas Mohan, Binh Tang, Dev Y. Sheth, Eero P. Simoncelli, David S. Matteson, Carlos Fernandez-Granda, Peter A. Crozier

    Abstract: A deep convolutional neural network has been developed to denoise atomic-resolution TEM image datasets of nanoparticles acquired using direct electron counting detectors, for applications where the image signal is severely limited by shot noise. The network was applied to a model system of CeO2-supported Pt nanoparticles. We leverage multislice image simulations to generate a large and flexible da… ▽ More

    Submitted 17 March, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

  9. arXiv:2011.15045  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Unsupervised Deep Video Denoising

    Authors: Dev Yashpal Sheth, Sreyas Mohan, Joshua L. Vincent, Ramon Manzorro, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli, Carlos Fernandez-Granda

    Abstract: Deep convolutional neural networks (CNNs) for video denoising are typically trained with supervision, assuming the availability of clean videos. However, in many applications, such as microscopy, noiseless videos are not available. To address this, we propose an Unsupervised Deep Video Denoiser (UDVD), a CNN architecture designed to be trained exclusively with noisy data. The performance of UDVD i… ▽ More

    Submitted 19 August, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: Dev and Sreyas contributed equally. To appear at 2021 IEEE/CVF International Conference on Computer Vision (ICCV). See https://sreyas-mohan.github.io/udvd/ for code and more results

  10. arXiv:2010.12970  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Denoising For Scientific Discovery: A Case Study In Electron Microscopy

    Authors: Sreyas Mohan, Ramon Manzorro, Joshua L. Vincent, Binh Tang, Dev Yashpal Sheth, Eero P. Simoncelli, David S. Matteson, Peter A. Crozier, Carlos Fernandez-Granda

    Abstract: Denoising is a fundamental challenge in scientific imaging. Deep convolutional neural networks (CNNs) provide the current state of the art in denoising natural images, where they produce impressive results. However, their potential has barely been explored in the context of scientific imaging. Denoising CNNs are typically trained on real natural images artificially corrupted with simulated noise.… ▽ More

    Submitted 13 July, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: The dataset and the code used to train and evaluate and our models are available at https://sreyas-mohan.github.io/electron-microscopy-denoising/

  11. arXiv:2003.10882  [pdf, other

    gr-qc

    Halo Orbits around $L_1$ and $L_2$ in the Photogravitational Sun-Earth System with Oblateness

    Authors: Dhwani Sheth, Thomas V. O

    Abstract: The Photogravitational Restricted Three Body Problem with oblateness has been studied to obtain halo orbits around the Lagrangian points $L_1$ and $L_2$ of the Sun-Earth system in which the Sun is taken as radiating and the Earth as an oblate spheroid. The halo orbits corresponding to fourth and fifth order approximations around $L_1$ and $L_2$ for actual oblateness of the Earth and for different… ▽ More

    Submitted 21 March, 2020; originally announced March 2020.

    Comments: 18 pages, 18 figures

    MSC Class: 70F07

  12. arXiv:1904.11882  [pdf

    cs.OH cs.LG eess.SP

    Smart Laptop Bag with Machine Learning for Activity Recognition

    Authors: Dwij Sukeshkumar Sheth, Shantanu Singh, Prakhar S Mathur, Vydeki D

    Abstract: In todays world of smart living, the smart laptop bag, presented in this paper, provides a better solution to keep track of our precious possessions and monitoring them in real time. As the world moves towards a much tech-savvy direction, the novel laptop bag discussed here facilitates the user to perform location tracking, ambiance monitoring, user-state monitoring etc. in one device. The innovat… ▽ More

    Submitted 14 April, 2019; originally announced April 2019.

  13. arXiv:1711.03432  [pdf, ps, other

    math.CO

    Galois coverings of Schreier graphs of groups generated by bounded automata

    Authors: Asif Shaikh, Daniele D'Angeli, Hemant Bhate, Dilip Sheth

    Abstract: We give a characterization of the covering Schreier graphs of groups generated by bounded automata to be Galois. We also investigate the zeta and $L$ functions of Schreier graphs of few groups namely the Grigorchuk group, Gupta-Sidki $p$ group, Gupta-Fabrykowski group and BSV torsion-free group.

    Submitted 9 November, 2017; originally announced November 2017.

    Comments: 32 pages, 13 figures, 6 tables

    MSC Class: 05C25; 05C31; 20E08

  14. arXiv:1701.07490  [pdf

    cs.SI q-bio.OT

    What Are People Tweeting about Zika? An Exploratory Study Concerning Symptoms, Treatment, Transmission, and Prevention

    Authors: Michele Miller, Dr. Tanvi Banerjee, RoopTeja Muppalla, Dr. William Romine, Dr. Amit Sheth

    Abstract: The purpose of this study was to do a dataset distribution analysis, a classification performance analysis, and a topical analysis concerning what people are tweeting about four disease characteristics: symptoms, transmission, prevention, and treatment. A combination of natural language processing and machine learning techniques were used to determine what people are tweeting about Zika. Specifica… ▽ More

    Submitted 17 January, 2017; originally announced January 2017.