Showing 1–17 of 17 results for author: Karn, S

  1. arXiv:2404.16192  [pdf, other]

    cs.CL cs.CV

    Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering

    Authors: Cuong Nhat Ha, Shima Asaadi, Sanjeev Kumar Karn, Oladimeji Farri, Tobias Heimann, Thomas Runkler

    Abstract: Vision-language models, while effective in general domains and showing strong performance in diverse multi-modal applications like visual question-answering (VQA), struggle to maintain the same level of effectiveness in more specialized domains, e.g., medical. We propose a medical vision-language model that integrates large vision and language models adapted for the medical domain. This model goes…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Clinical NLP @ NAACL 2024

  2. arXiv:2402.01758  [pdf, other]

    cs.CY cs.AI cs.CL

    Aalap: AI Assistant for Legal & Paralegal Functions in India

    Authors: Aman Tiwari, Prathamesh Kalamkar, Atreyo Banerjee, Saurabh Karn, Varun Hemachandran, Smita Gupta

    Abstract: Using proprietary Large Language Models on legal tasks poses challenges due to data privacy issues, domain data heterogeneity, domain knowledge sophistication, and domain objectives uniqueness. We created Aalap, a fine-tuned Mistral 7B model on instructions data related to specific Indian legal tasks. The performance of Aalap is better than gpt-3.5-turbo in 31% of our test data and obtains an eq…

    Submitted 30 January, 2024; originally announced February 2024.

  3. arXiv:2311.17213  [pdf]

    cs.CL eess.IV

    General-Purpose vs. Domain-Adapted Large Language Models for Extraction of Structured Data from Chest Radiology Reports

    Authors: Ali H. Dhanaliwala, Rikhiya Ghosh, Sanjeev Kumar Karn, Poikavila Ullaskrishnan, Oladimeji Farri, Dorin Comaniciu, Charles E. Kahn

    Abstract: Radiologists produce unstructured data that can be valuable for clinical care when consumed by information systems. However, variability in style limits usage. This study compares a system using a domain-adapted language model (RadLing) and a general-purpose LLM (GPT-4) in extracting relevant features from chest radiology reports and standardizing them to common data elements (CDEs). Three radiologists annot…

    Submitted 9 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  4. arXiv:2306.10448  [pdf, other]

    cs.CV cs.CL

    Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge

    Authors: Manuela Daniela Danu, George Marica, Sanjeev Kumar Karn, Bogdan Georgescu, Awais Mansoor, Florin Ghesu, Lucian Mihai Itu, Constantin Suciu, Sasa Grbic, Oladimeji Farri, Dorin Comaniciu

    Abstract: Among all the sub-sections in a typical radiology report, the Clinical Indications, Findings, and Impression often reflect important details about the health status of a patient. The information included in Impression is also often covered in Findings. While Findings and Impression can be deduced by inspecting the image, Clinical Indications often require additional context. The cognitive task of…

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: Information Technology and Quantitative Management (ITQM 2023)

    Journal ref: Information Technology and Quantitative Management (ITQM 2023)

  5. arXiv:2306.03264  [pdf, other]

    cs.CL

    shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation

    Authors: Sanjeev Kumar Karn, Rikhiya Ghosh, Kusuma P, Oladimeji Farri

    Abstract: Instruction-tuned generative large language models (LLMs) like ChatGPT and Bloomz possess excellent generalization abilities, but they face limitations in understanding radiology reports, particularly in the task of generating the IMPRESSIONS section from the FINDINGS section. They tend to generate either verbose or incomplete IMPRESSIONS, mainly due to insufficient exposure to medical text data d…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 1st Place in Task 1B: Radiology Report Summarization at BioNLP 2023

    Journal ref: BioNLP 2023, Co-located with ACL 2023

  6. arXiv:2306.02492  [pdf, other]

    cs.CL

    RadLing: Towards Efficient Radiology Report Understanding

    Authors: Rikhiya Ghosh, Sanjeev Kumar Karn, Manuela Daniela Danu, Larisa Micu, Ramya Vunikili, Oladimeji Farri

    Abstract: Most natural language tasks in the radiology domain use language models pre-trained on biomedical corpora. There are few pretrained language models trained specifically for radiology, and fewer still that have been trained in a low data setting and gone on to produce comparable results in fine-tuning tasks. We present RadLing, a continuously pretrained language model using Electra-small (Clark et a…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Association for Computational Linguistics (ACL), 2023

    Journal ref: 61st Annual Meeting of the Association for Computational Linguistics (ACL), July 9-14, 2023, Toronto, Canada

  7. arXiv:2304.09548  [pdf, other]

    cs.CL cs.AI cs.LG

    SemEval 2023 Task 6: LegalEval - Understanding Legal Texts

    Authors: Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing NLP-based techniques for processing and automatically understanding legal documents. To promote research in the area of Legal NLP we organized the shared task LegalEval - Understanding Legal Texts at SemEval 2023. The LegalEval task has three sub-tasks: Task-A (Rhetorical Roles Labeling) is about…

    Submitted 1 May, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 13 Pages (9 Pages + References), Accepted at SemEval 2023 at ACL 2023

  8. arXiv:2211.03442  [pdf, other]

    cs.CL cs.AI

    Named Entity Recognition in Indian court judgments

    Authors: Prathamesh Kalamkar, Astha Agarwal, Aman Tiwari, Smita Gupta, Saurabh Karn, Vivek Raghavan

    Abstract: Identification of named entities from legal texts is an essential building block for developing other legal Artificial Intelligence applications. Named Entities in legal texts are slightly different and more fine-grained than commonly used named entities like Person, Organization, Location etc. In this paper, we introduce a new corpus of 46545 annotated legal named entities mapped to 14 legal enti…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: to be published in NLLP 2022 Workshop at EMNLP

  9. arXiv:2203.08257  [pdf, other]

    cs.CL

    Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

    Authors: Sanjeev Kumar Karn, Ning Liu, Hinrich Schuetze, Oladimeji Farri

    Abstract: The IMPRESSIONS section of a radiology report about an imaging study is a summary of the radiologist's reasoning and conclusions, and it also aids the referring physician in confirming or excluding certain diagnoses. A cascade of tasks is required to automatically generate an abstractive summary of the typical information-rich radiology report. These tasks include acquisition of salient content f…

    Submitted 29 April, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Accepted at 60th Annual Meeting of the Association for Computational Linguistics 2022 Main Conference

    Journal ref: 60th Annual Meeting of the Association for Computational Linguistics, Dublin, Ireland, 2022

  10. arXiv:2201.13125  [pdf, other]

    cs.CL cs.AI cs.LG

    Corpus for Automatic Structuring of Legal Documents

    Authors: Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing techniques for processing and organizing legal documents. In this paper, we introduce a new corpus for structuring legal documents. In particular, we introduce a corpus of legal judgment documents in English that are segmented into topical and coherent parts. Each of these parts is annotated…

    Submitted 19 September, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Accepted at LREC 2022, 10 Pages (8 page main paper + 2 page references)

  11. arXiv:2103.05131  [pdf, other]

    cs.CL

    Few-Shot Learning of an Interleaved Text Summarization Model by Pretraining with Synthetic Data

    Authors: Sanjeev Kumar Karn, Francine Chen, Yan-Ying Chen, Ulli Waltinger, Hinrich Schuetze

    Abstract: Interleaved texts, where posts belonging to different threads occur in a sequence, commonly occur in online chat posts, so that it can be time-consuming to quickly obtain an overview of the discussions. Existing systems first disentangle the posts by threads and then extract summaries from those threads. A major issue with such systems is error propagation from the disentanglement component. While…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Adapt-NLP: The Second Workshop on Domain Adaptation for NLP

  12. A theoretical approach to study J/$Ψ$ suppression in relativistic heavy ion collisions

    Authors: Santosh K. Karn

    Abstract: With a view to understanding J/$Ψ$ suppression in relativistic heavy ion collisions, we compute the suppression rate within the framework of a hydrodynamical evolution model. For this, we consider an ellipsoidal flow and use an ansatz for the temperature profile function which accounts for time and the three-dimensional space evolution of the quark-gluon plasma. We have calculated the survival probabili…

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: 06 pages

    Report number: 2002.02412

    Journal ref: Open Journal of Microphysics, 2020, 10, 1-7

  13. arXiv:1906.01973  [pdf, other]

    cs.CL

    A Hierarchical Decoder with Three-level Hierarchical Attention to Generate Abstractive Summaries of Interleaved Texts

    Authors: Sanjeev Kumar Karn, Francine Chen, Yan-Ying Chen, Ulli Waltinger, Hinrich Schütze

    Abstract: Interleaved texts, where posts belonging to different threads occur in one sequence, are a common occurrence, e.g., online chat conversations. To quickly obtain an overview of such texts, existing systems first disentangle the posts by threads and then extract summaries from those threads. The major issues with such systems are error propagation and non-fluent summaries. To address those, we propose…

    Submitted 9 April, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  14. arXiv:1807.11535  [pdf, other]

    cs.CL

    News Article Teaser Tweets and How to Generate Them

    Authors: Sanjeev Kumar Karn, Mark Buckley, Ulli Waltinger, Hinrich Schütze

    Abstract: In this work, we define the task of teaser generation and provide an evaluation benchmark and baseline systems for the process of generating teasers. A teaser is a short reading suggestion for an article that is illustrative and includes curiosity-arousing elements to entice potential readers to read particular news items. Teasers are one of the main vehicles for transmitting news to social media…

    Submitted 18 April, 2019; v1 submitted 30 July, 2018; originally announced July 2018.

    Journal ref: 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)

  15. Neural Architectures for Open-Type Relation Argument Extraction

    Authors: Benjamin Roth, Costanza Conforti, Nina Poerner, Sanjeev Karn, Hinrich Schütze

    Abstract: In this work, we introduce the task of Open-Type Relation Argument Extraction (ORAE): Given a corpus, a query entity Q and a knowledge base relation (e.g., "Q authored notable work with title X"), the model has to extract an argument of non-standard entity type (entities that cannot be extracted by a standard named entity tagger, e.g. X: the title of a book or a work of art) from the corpus. A dist…

    Submitted 30 September, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Journal ref: Nat. Lang. Eng. 25 (2019) 219-238

  16. arXiv:hep-ph/0511140  [pdf]

    hep-ph

    On the Equation of State for Diquark Systems and their importance in Astrophysics and Cosmology

    Authors: Santosh K. Karn

    Abstract: With a view to studying diquark formation in QCD phase transition and its consequences in high energy density physics, the energy per quark for the extended scalar diquarks (ESD) as a function of density is calculated, within the framework of an effective phi-four theory, for several values of the effective interaction parameter (lambda). Various equations of state for the ESD systems correspondi…

    Submitted 20 November, 2005; v1 submitted 11 November, 2005; originally announced November 2005.

    Comments: Talk given at the Seventh International Workshop on Quantum Field Theory under the Influence of External Conditions (QFEXT'05), Barcelona, Spain, Sept. 5-9, 2005

  17. arXiv:hep-ph/0510239  [pdf]

    hep-ph

    A Review On Diquark Physics in QCD Phase Transition

    Authors: Santosh K. Karn

    Abstract: The importance of diquarks has been noticed in the context of several elementary particle processes. In recent years the study of diquarks has become of considerable interest in highlighting its role in high density physics from the point of view of astrophysical situations and cosmological conditions. In the present paper an attempt is made to briefly review the role of diquarks in conventional…

    Submitted 18 October, 2005; originally announced October 2005.