Skip to main content

Showing 1–7 of 7 results for author: Terdalkar, H

.
  1. arXiv:2406.18276  [pdf

    cs.CL cs.SE

    Sanskrit Knowledge-based Systems: Annotation and Computational Tools

    Authors: Hrishikesh Terdalkar

    Abstract: We address the challenges and opportunities in the development of knowledge systems for Sanskrit, with a focus on question answering. By proposing a framework for the automated construction of knowledge graphs, introducing annotation tools for ontology-driven and general-purpose tasks, and offering a diverse collection of web-interfaces, tools, and software libraries, we have made significant cont… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: PhD Thesis. 204 pages, 6 publications

  2. arXiv:2310.07848  [pdf

    cs.CL

    Framework for Question-Answering in Sanskrit through Automated Construction of Knowledge Graphs

    Authors: Hrishikesh Terdalkar, Arnab Bhattacharya

    Abstract: Sanskrit (sa\d{m}sk\d{r}ta) enjoys one of the largest and most varied literature in the whole world. Extracting the knowledge from it, however, is a challenging task due to multiple reasons including complexity of the language and paucity of standard natural language processing tools. In this paper, we target the problem of building knowledge graphs for particular types of relationships from sa\d{… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted at 6th International Sanskrit Computational Linguistics Symposium (ISCLS) 2019

    Journal ref: In Proceedings of the 6th International Sanskrit Computational Linguistics Symposium, 2019, pages 97--116, IIT Kharagpur, India. Association for Computational Linguistics

  3. arXiv:2310.07826  [pdf, other

    cs.CL

    Antarlekhaka: A Comprehensive Tool for Multi-task Natural Language Annotation

    Authors: Hrishikesh Terdalkar, Arnab Bhattacharya

    Abstract: One of the primary obstacles in the advancement of Natural Language Processing (NLP) technologies for low-resource languages is the lack of annotated datasets for training and testing machine learning models. In this paper, we present Antarlekhaka, a tool for manual annotation of a comprehensive set of tasks relevant to NLP. The tool is Unicode-compatible, language-agnostic, Web-deployable and sup… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted: 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS) @ EMNLP 2023

  4. arXiv:2209.14924  [pdf

    cs.SE cs.CL

    Chandojnanam: A Sanskrit Meter Identification and Utilization System

    Authors: Hrishikesh Terdalkar, Arnab Bhattacharya

    Abstract: We present Chandojñānam, a web-based Sanskrit meter (Chanda) identification and utilization system. In addition to the core functionality of identifying meters, it sports a friendly user interface to display the scansion, which is a graphical representation of the metrical pattern. The system supports identification of meters from uploaded images by using optical character recognition (OCR) engine… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: to be published in "18th World Sanskrit Conference (WSC 2023)"

    Journal ref: In Proceedings of the Computational Sanskrit & Digital Humanities: Selected papers presented at the 18th World Sanskrit Conference, 2023, pages 113--127, Canberra, Australia (Online mode). Association for Computational Linguistics

  5. arXiv:2208.10310  [pdf, other

    cs.CL

    A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit

    Authors: Jivnesh Sandhan, Ashish Gupta, Hrishikesh Terdalkar, Tushar Sandhan, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal

    Abstract: The phenomenon of compounding is ubiquitous in Sanskrit. It serves for achieving brevity in expressing thoughts, while simultaneously enriching the lexical and structural formation of the language. In this work, we focus on the Sanskrit Compound Type Identification (SaCTI) task, where we consider the problem of identifying semantic relations between the components of a compound word. Earlier appro… ▽ More

    Submitted 11 September, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: The work is accepted at COLING22, Gyeongju, Republic of Korea

  6. arXiv:2202.00216  [pdf

    cs.IR cs.CL

    Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text

    Authors: Hrishikesh Terdalkar, Arnab Bhattacharya, Madhulika Dubey, Ramamurthy S, Bhavna Naneria Singh

    Abstract: Knowledge bases (KB) are an important resource in a number of natural language processing (NLP) and information retrieval (IR) tasks, such as semantic search, automated question-answering etc. They are also useful for researchers trying to gain information from a text. Unfortunately, however, the state-of-the-art in Sanskrit NLP does not yet allow automated construction of knowledge bases due to u… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 19 pages including appendix

    Journal ref: n Proceedings of the Computational Sanskrit & Digital Humanities: Selected papers presented at the 18th World Sanskrit Conference, 2023, pages 155--173, Canberra, Australia (Online mode). Association for Computational Linguistics

  7. Sangrahaka: A Tool for Annotating and Querying Knowledge Graphs

    Authors: Hrishikesh Terdalkar, Arnab Bhattacharya

    Abstract: In this work, we present a web-based annotation and querying tool Sangrahaka. It annotates entities and relationships from text corpora and constructs a knowledge graph (KG). The KG is queried using templatized natural language queries. The application is language and corpus agnostic, but can be tuned for special needs of a specific language or a corpus. A customized version of the framework has b… ▽ More

    Submitted 23 August, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Journal ref: ESEC/FSE 2021: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, August 2021, Pages 1520--1524