Showing 1–13 of 13 results for author: Joshi, T

Searching in archive cs.
  1. arXiv:2310.03714  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

    Authors: Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts

    Abstract: The ML community is rapidly exploring techniques for prompting language models (LMs) and for stacking them into pipelines that solve complex tasks. Unfortunately, existing LM pipelines are typically implemented using hard-coded "prompt templates", i.e. lengthy strings discovered via trial and error. Toward a more systematic approach for developing and optimizing LM pipelines, we introduce DSPy, a…

    Submitted 5 October, 2023; originally announced October 2023.

  2. arXiv:2308.09341  [pdf]

    cs.CL cs.LG

    Document Automation Architectures: Updated Survey in Light of Large Language Models

    Authors: Mohammad Ahmadi Achachlouei, Omkar Patil, Tarun Joshi, Vijayan N. Nair

    Abstract: This paper surveys the current state of the art in document automation (DA). The objective of DA is to reduce the manual effort during the generation of documents by automatically creating and integrating input from different sources and assembling documents conforming to defined templates. There have been reviews of commercial solutions of DA, particularly in the legal domain, but to date there h…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: The current paper is the updated version of an earlier survey on document automation [Ahmadi Achachlouei et al. 2021]. Updates in the current paper are as follows: We shortened almost all sections to reduce the size of the main paper (without references) from 28 pages to 10 pages, added a review of selected papers on large language models, removed certain sections and most of diagrams. arXiv admin note: substantial text overlap with arXiv:2109.11603

    MSC Class: 68T50 ACM Class: I.7.0; I.2.7; I.2.4

  3. arXiv:2205.13119  [pdf, other]

    cs.CL cs.LG

    Understanding Metrics for Paraphrasing

    Authors: Omkar Patil, Rahul Singh, Tarun Joshi

    Abstract: Paraphrase generation is a difficult problem. This is not only because of the limitations in text generation capabilities but also due to the lack of a proper definition of what qualifies as a paraphrase and corresponding metrics to measure how good it is. Metrics for evaluation of paraphrasing quality are an ongoing research problem. Most of the existing metrics in use having been borrowed f…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: 19 pages, 7 figures, 12 Tables

  4. arXiv:2204.10685  [pdf, other]

    cs.LG eess.SY

    TASAC: a twin-actor reinforcement learning framework with stochastic policy for batch process control

    Authors: Tanuja Joshi, Hariprasad Kodamana, Harikumar Kandath, Niket Kaisare

    Abstract: Due to their complex nonlinear dynamics and batch-to-batch variability, batch processes pose a challenge for process control. Due to the absence of accurate models and resulting plant-model mismatch, these problems become harder to address for advanced model-based control strategies. Reinforcement Learning (RL), wherein an agent learns the policy by directly interacting with the environment, offer…

    Submitted 2 May, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 11 pages

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary)

  5. arXiv:2110.15225  [pdf]

    cs.CL cs.LG

    Pruning Attention Heads of Transformer Models Using A* Search: A Novel Approach to Compress Big NLP Architectures

    Authors: Archit Parnami, Rahul Singh, Tarun Joshi

    Abstract: Recent years have seen a growing adoption of Transformer models such as BERT in Natural Language Processing and even in Computer Vision. However, due to their size, there has been limited adoption of such models within resource-constrained computing environments. This paper proposes a novel pruning algorithm to compress transformer models by eliminating redundant Attention Heads. We apply the A* sea…

    Submitted 17 November, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: 23 Pages, 18 figures, 3 tables

  6. arXiv:2109.11603  [pdf]

    cs.CL cs.LG

    Document Automation Architectures and Technologies: A Survey

    Authors: Mohammad Ahmadi Achachlouei, Omkar Patil, Tarun Joshi, Vijayan N. Nair

    Abstract: This paper surveys the current state of the art in document automation (DA). The objective of DA is to reduce the manual effort during the generation of documents by automatically integrating input from different sources and assembling documents conforming to defined templates. There have been reviews of commercial solutions of DA, particularly in the legal domain, but to date there has been no co…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 34 pages, 11 figures, 5 tables

    MSC Class: 68T50 ACM Class: I.7.0; I.2.7; I.2.4

  7. arXiv:2105.13025  [pdf]

    cs.SI cs.CL physics.soc-ph

    Finding top performers through email patterns analysis

    Authors: Q. Wen, P. A. Gloor, A. Fronzetti Colladon, P. Tickoo, T. Joshi

    Abstract: In the information economy, individuals' work performance is closely associated with their digital communication strategies. This study combines social network and semantic analysis to develop a method to identify top performers based on email communication. By reviewing existing literature, we identified the indicators that quantify email communication into measurable dimensions. To empirically e…

    Submitted 27 May, 2021; originally announced May 2021.

    ACM Class: J.4; I.2.7; H.4.0

    Journal ref: Journal of Information Science 46(4), 508-527 (2020)

  8. arXiv:2105.08589  [pdf]

    cs.CL

    Self-interpretable Convolutional Neural Networks for Text Classification

    Authors: Wei Zhao, Rahul Singh, Tarun Joshi, Agus Sudjianto, Vijayan N. Nair

    Abstract: Deep learning models for natural language processing (NLP) are inherently complex and often viewed as black box in nature. This paper develops an approach for interpreting convolutional neural networks for text classification problems by exploiting the local-linear models inherent in ReLU-DNNs. The CNN model combines the word embedding through convolutional layers, filters them using max-pooling,…

    Submitted 8 July, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

  9. arXiv:2104.09978  [pdf]

    cs.CL cs.AI cs.LG

    Robustness Tests of NLP Machine Learning Models: Search and Semantically Replace

    Authors: Rahul Singh, Karan Jindal, Yufei Yu, Hanyu Yang, Tarun Joshi, Matthew A. Campbell, Wayne B. Shoumaker

    Abstract: This paper proposes a strategy to assess the robustness of different machine learning models that involve natural language processing (NLP). The overall approach relies upon a Search and Semantically Replace strategy that consists of two steps: (1) Search, which identifies important parts in the text; (2) Semantically Replace, which finds replacements for the important parts, and constrains the re…

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: 18 pages, 2 figures, 18 tables

  10. arXiv:2102.09866  [pdf]

    cs.CL cs.LG

    KBCNMUJAL@HASOC-Dravidian-CodeMix-FIRE2020: Using Machine Learning for Detection of Hate Speech and Offensive Code-Mixed Social Media text

    Authors: Varsha Pathak, Manish Joshi, Prasad Joshi, Monica Mundada, Tanmay Joshi

    Abstract: This paper describes the system submitted by our team, KBCNMUJAL, for Task 2 of the shared task Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC), at Forum for Information Retrieval Evaluation, December 16-20, 2020, Hyderabad, India. Datasets for two Dravidian languages, viz. Malayalam and Tamil, of 4000 observations each, were shared by the HASOC organizers…

    Submitted 19 February, 2021; originally announced February 2021.

  11. arXiv:2009.02358  [pdf]

    cs.CL cs.AI

    Recent Trends in the Use of Deep Learning Models for Grammar Error Handling

    Authors: Mina Naghshnejad, Tarun Joshi, Vijayan N. Nair

    Abstract: Grammar error handling (GEH) is an important topic in natural language processing (NLP). GEH includes both grammar error detection and grammar error correction. Recent advances in computation systems have promoted the use of deep learning (DL) models for NLP problems such as GEH. In this survey we focus on two main DL approaches for GEH: neural machine translation models and editor models. We desc…

    Submitted 4 September, 2020; originally announced September 2020.

  12. arXiv:2008.11825  [pdf]

    cs.CL cs.LG

    SHAP values for Explaining CNN-based Text Classification Models

    Authors: Wei Zhao, Tarun Joshi, Vijayan N. Nair, Agus Sudjianto

    Abstract: Deep neural networks are increasingly used in natural language processing (NLP) models. However, the need to interpret and explain the results from complex algorithms is limiting their widespread adoption in regulated industries such as banking. There has been recent work on interpretability of machine learning algorithms with structured data. But there are only limited techniques for NLP applica…

    Submitted 8 July, 2021; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: 17 pages, 5 figures

  13. arXiv:2008.05536  [pdf]

    cs.CL cs.LG

    Model Robustness with Text Classification: Semantic-preserving adversarial attacks

    Authors: Rahul Singh, Tarun Joshi, Vijayan N. Nair, Agus Sudjianto

    Abstract: We propose algorithms to create adversarial attacks to assess model robustness in text classification problems. They can be used to create white box attacks and black box attacks while at the same time preserving the semantics and syntax of the original text. The attacks cause a significant number of flips in the white-box setting, and the same rule-based attacks can be used in the black-box setting. In a black-box sett…

    Submitted 13 August, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: 12 Pages, 3 Figures, 10 Tables