Skip to main content

Showing 1–5 of 5 results for author: Talukdar, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10295  [pdf

    cs.CL cs.IR

    Robustness of Structured Data Extraction from In-plane Rotated Documents using Multi-Modal Large Language Models (LLM)

    Authors: Anjanava Biswas, Wrick Talukdar

    Abstract: Multi-modal large language models (LLMs) have shown remarkable performance in various natural language processing tasks, including data extraction from documents. However, the accuracy of these models can be significantly affected by document in-plane rotation, also known as skew, a common issue in real-world scenarios for scanned documents. This study investigates the impact of document skew on t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 20 pages, 6 figures

    Journal ref: Journal of Artificial Intelligence Research: Vol. 4 (2024): No. 1, 176-195

  2. Enhancing Clinical Documentation with Synthetic Data: Leveraging Generative Models for Improved Accuracy

    Authors: Anjanava Biswas, Wrick Talukdar

    Abstract: Accurate and comprehensive clinical documentation is crucial for delivering high-quality healthcare, facilitating effective communication among providers, and ensuring compliance with regulatory requirements. However, manual transcription and data entry processes can be time-consuming, error-prone, and susceptible to inconsistencies, leading to incomplete or inaccurate medical records. This paper… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Journal ref: International Journal of Innovative Science and Research Technology: Vol. 9 (2024): No. 5, 1553-1566

  3. FinEmbedDiff: A Cost-Effective Approach of Classifying Financial Documents with Vector Sampling using Multi-modal Embedding Models

    Authors: Anjanava Biswas, Wrick Talukdar

    Abstract: Accurate classification of multi-modal financial documents, containing text, tables, charts, and images, is crucial but challenging. Traditional text-based approaches often fail to capture the complex multi-modal nature of these documents. We propose FinEmbedDiff, a cost-effective vector sampling method that leverages pre-trained multi-modal embedding models to classify financial documents. Our ap… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

    Comments: 10 pages, 3 figures

    Journal ref: International Research Journal of Modernization in Engineering Technology and Science: Vol. 06 (2024): No. 5, 6142-6152

  4. Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling

    Authors: Wrick Talukdar, Anjanava Biswas

    Abstract: While supervised learning models have shown remarkable performance in various natural language processing (NLP) tasks, their success heavily relies on the availability of large-scale labeled datasets, which can be costly and time-consuming to obtain. Conversely, unsupervised learning techniques can leverage abundant unlabeled text data to learn rich representations, but they do not directly optimi… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Journal ref: International Journal of Innovative Science and Research Technology: Vol. 9 (2024): No. 5, 1499-1508

  5. Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation

    Authors: Anjanava Biswas, Wrick Talukdar

    Abstract: Comprehensive clinical documentation is crucial for effective healthcare delivery, yet it poses a significant burden on healthcare professionals, leading to burnout, increased medical errors, and compromised patient safety. This paper explores the potential of generative AI (Artificial Intelligence) to streamline the clinical documentation process, specifically focusing on generating SOAP (Subject… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 15 pages, 7 figures

    Journal ref: International Journal of Innovative Science and Research Technology: Vol. 9 (2024): No. 5, 994-1008