Showing 1–7 of 7 results for author: Rajpoot, K

Searching in archive cs.
  1. arXiv:2403.02247  [pdf, ps, other]

    cs.CL

    Birbal: An efficient 7B instruct-model fine-tuned with curated datasets

    Authors: Ashvini Kumar Jindal, Pawan Kumar Rajpoot, Ankur Parikh

    Abstract: LLMOps incur significant costs due to hardware requirements, hindering their widespread accessibility. Additionally, a lack of transparency in model training methods and data contributes to the majority of models being non-reproducible. To tackle these challenges, the LLM Efficiency Challenge was introduced at NeurIPS Workshop, aiming to adapt foundation models on a diverse set of tasks via fine-t…

    Submitted 4 March, 2024; originally announced March 2024.

  2. arXiv:2310.17714  [pdf, other]

    cs.CL cs.CE

    Nearest Neighbor Search over Vectorized Lexico-Syntactic Patterns for Relation Extraction from Financial Documents

    Authors: Pawan Kumar Rajpoot, Ankur Parikh

    Abstract: Relation extraction (RE) has achieved remarkable progress with the help of pre-trained language models. However, existing RE models are usually incapable of handling two situations: implicit expressions and long-tail relation classes, caused by language complexity and data sparsity. Further, these approaches and models are largely inaccessible to users who don't have direct access to large languag…

    Submitted 26 October, 2023; originally announced October 2023.

  3. arXiv:2306.17519  [pdf, other]

    cs.CL

    GPT-FinRE: In-context Learning for Financial Relation Extraction using Large Language Models

    Authors: Pawan Kumar Rajpoot, Ankur Parikh

    Abstract: Relation extraction (RE) is a crucial task in natural language processing (NLP) that aims to identify and classify relationships between entities mentioned in text. In the financial domain, relation extraction plays a vital role in extracting valuable information from financial documents, such as news articles, earnings reports, and company filings. This paper describes our solution to relation ex…

    Submitted 21 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.02105 by other authors

  4. arXiv:2302.01823  [pdf]

    cs.CL

    Lexical Simplification using multi level and modular approach

    Authors: Nikita Katyal, Pawan Kumar Rajpoot

    Abstract: Text Simplification is an ongoing problem in Natural Language Processing, solution to which has varied implications. In conjunction with the TSAR-2022 Workshop @EMNLP2022 Lexical Simplification is the process of reducing the lexical complexity of a text by replacing difficult words with easier to read (or understand) expressions while preserving the original information and meaning. This paper exp…

    Submitted 3 February, 2023; originally announced February 2023.

  5. arXiv:2202.07001  [pdf, other]

    eess.IV cs.CV

    Handcrafted Histological Transformer (H2T): Unsupervised Representation of Whole Slide Images

    Authors: Quoc Dang Vu, Kashif Rajpoot, Shan E Ahmed Raza, Nasir Rajpoot

    Abstract: Diagnostic, prognostic and therapeutic decision-making of cancer in pathology clinics can now be carried out based on analysis of multi-gigapixel tissue images, also known as whole-slide images (WSIs). Recently, deep convolutional neural networks (CNNs) have been proposed to derive unsupervised WSI representations; these are attractive as they rely less on expert annotation which is cumbersome. Ho…

    Submitted 6 September, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

  6. arXiv:2112.09496  [pdf]

    eess.IV cs.CV cs.LG

    Towards Launching AI Algorithms for Cellular Pathology into Clinical & Pharmaceutical Orbits

    Authors: Amina Asif, Kashif Rajpoot, David Snead, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Computational Pathology (CPath) is an emerging field concerned with the study of tissue pathology via computational algorithms for the processing and analysis of digitized high-resolution images of tissue slides. Recent deep learning based developments in CPath have successfully leveraged sheer volume of raw pixel data in histology images for predicting target parameters in the domains of diagnost…

    Submitted 17 December, 2021; originally announced December 2021.

  7. arXiv:2112.02721  [pdf, other]

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split…

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter