
Showing 1–17 of 17 results for author: Ganesan, B

  1. arXiv:2403.10944  [pdf, other]

    cs.HC cs.AI

    Human Centered AI for Indian Legal Text Analytics

    Authors: Sudipto Ghosh, Devanshu Verma, Balaji Ganesan, Purnima Bindal, Vikas Kumar, Vasudha Bhatnagar

    Abstract: Legal research is a crucial task in the practice of law. It requires intense human effort and intellectual prudence to research a legal case and prepare arguments. Recent boom in generative AI has not translated to proportionate rise in impactful legal applications, because of low trustworthiness and the scarcity of specialized datasets for training Large Language Models (LLMs). This position…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 7 pages, 7 figures

  2. arXiv:2403.09806  [pdf, other]

    cs.AI

    xLP: Explainable Link Prediction for Master Data Management

    Authors: Balaji Ganesan, Matheen Ahmed Pasha, Srinivasa Parkala, Neeraj R Singh, Gayatri Mishra, Sumit Bhatia, Hima Patel, Somashekar Naganna, Sameep Mehta

    Abstract: Explaining neural model predictions to users requires creativity. Especially in enterprise applications, where there are costs associated with users' time, and their trust in the model predictions is critical for adoption. For link prediction in master data management, we have built a number of explainability solutions drawing from research in interpretability, fact verification, path ranking, neu…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures, NeurIPS 2020 Competition and Demonstration Track. arXiv admin note: text overlap with arXiv:2012.05516

  3. arXiv:2403.01481  [pdf, other]

    cs.CL

    Infusing Knowledge into Large Language Models with Contextual Prompts

    Authors: Kinshuk Vasisht, Balaji Ganesan, Vikas Kumar, Vasudha Bhatnagar

    Abstract: Knowledge infusion is a promising method for enhancing Large Language Models for domain-specific NLP tasks rather than pre-training models over large data from scratch. These augmented LLMs typically depend on additional pre-training or knowledge prompts from an existing knowledge graph, which is impractical in many applications. In contrast, knowledge infusion directly from relevant documents is…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 5 pages, 1 figure, In Proceedings of ICON 2023

  4. arXiv:2402.01602  [pdf, other]

    cs.AI

    Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning

    Authors: Debarun Bhattacharjya, Junkyu Lee, Don Joven Agravante, Balaji Ganesan, Radu Marinescu

    Abstract: Foundation models (FMs) such as large language models have revolutionized the field of AI by showing remarkable performance in various tasks. However, they exhibit numerous limitations that prevent their broader adoption in many real-world systems, which often require a higher bar for trustworthiness and usability. Since FMs are trained using loss functions aimed at reconstructing the training cor…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 9 pages

  5. arXiv:2401.08688  [pdf, other]

    cs.CL cs.IR

    Automated Answer Validation using Text Similarity

    Authors: Balaji Ganesan, Arjun Ravikumar, Lakshay Piplani, Rini Bhaumik, Dhivya Padmanaban, Shwetha Narasimhamurthy, Chetan Adhikary, Subhash Deshapogu

    Abstract: Automated answer validation can help improve learning outcomes by providing appropriate feedback to learners, and by making question answering systems and online learning solutions more widely available. There have been some works in science question answering which show that information retrieval methods outperform neural methods, especially in the multiple choice version of this problem. We impl…

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 8 pages, 4 figures, International Conference on Natural Language Processing (ICON) 2023

  6. arXiv:2212.00342  [pdf, other]

    cs.AI

    xEM: Explainable Entity Matching in Customer 360

    Authors: Sukriti Jaitly, Deepa Mariam George, Balaji Ganesan, Muhammad Ameen, Srinivas Pusapati

    Abstract: Entity matching in Customer 360 is the task of determining if multiple records represent the same real world entity. Entities are typically people, organizations, locations, and events represented as attributed nodes in a graph, though they can also be represented as records in relational data. While probabilistic matching engines and artificial neural network models exist for this task, explainin…

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 4 pages, 5 figures. CODS-COMAD 2023 Demo

  7. arXiv:2107.04771  [pdf, other]

    cs.AI

    Similar Cases Recommendation using Legal Knowledge Graphs

    Authors: Jaspreet Singh Dhani, Ruchika Bhatt, Balaji Ganesan, Parikshet Sirohi, Vasudha Bhatnagar

    Abstract: A legal knowledge graph constructed from court cases, judgments, laws and other legal documents can enable a number of applications like question answering, document similarity, and search. While the use of knowledge graphs for distant supervision in NLP tasks is well researched, using knowledge graphs for applications like case similarity presents challenges. In this work, we describe our solutio…

    Submitted 2 March, 2024; v1 submitted 10 July, 2021; originally announced July 2021.

    Comments: 10 pages. 6 figures. 3rd Symposium on Artificial Intelligence and Law. SAIL 2023

  8. arXiv:2106.12665  [pdf, other]

    cs.LG cs.AI

    Reimagining GNN Explanations with ideas from Tabular Data

    Authors: Anjali Singh, Shamanth R Nayak K, Balaji Ganesan

    Abstract: Explainability techniques for Graph Neural Networks still have a long way to go compared to explanations available for both neural and decision tree-based models trained on tabular data. Using a task that straddles both graphs and tabular data, namely Entity Matching, we comment on key aspects of explainability that are missing in GNN model explanations.

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: 4 pages, 8 figures, XAI Workshop at ICML 2021

  9. arXiv:2106.11864  [pdf, other]

    cs.AI cs.LG

    Towards Automated Evaluation of Explanations in Graph Neural Networks

    Authors: Vanya BK, Balaji Ganesan, Aniket Saxena, Devbrat Sharma, Arvind Agarwal

    Abstract: Explaining Graph Neural Networks predictions to end users of AI applications in easily understandable terms remains an unsolved problem. In particular, we do not have well developed methods for automatically evaluating explanations, in ways that are closer to how users consume those explanations. Based on recent application trends and our own experiences in real world problems, we propose automati…

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: 5 pages, 4 figures, XAI Workshop at ICML 2021

  10. arXiv:2104.12950  [pdf, other]

    cs.AI cs.CL

    Document Structure aware Relational Graph Convolutional Networks for Ontology Population

    Authors: Abhay M Shalghar, Ayush Kumar, Balaji Ganesan, Aswin Kannan, Akshay Parekh, Shobha G

    Abstract: Ontologies comprising concepts, their attributes, and relationships are used in many knowledge based AI systems. While there have been efforts towards populating domain specific ontologies, we examine the role of document structure in learning ontological relationships between concepts in any document corpus. Inspired by ideas from hypernym discovery and explainability, our method performs abou…

    Submitted 12 April, 2022; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: 8 pages single column, 5 figures. DLG4NLP Workshop at ICLR 2022

  11. arXiv:2012.05516  [pdf, other]

    cs.CR cs.AI cs.LG cs.SI

    Explainable Link Prediction for Privacy-Preserving Contact Tracing

    Authors: Balaji Ganesan, Hima Patel, Sameep Mehta

    Abstract: Contact Tracing has been used to identify people who were in close proximity to those infected with SARS-Cov2 coronavirus. A number of digital contact tracing applications have been introduced to facilitate or complement physical contact tracing. However, there are a number of privacy issues in the implementation of contact tracing applications, which make people reluctant to install or update t…

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: 8 pages, 7 figures, SpicyFL 2020 Workshop at NeurIPS 2020

  12. arXiv:2003.04732  [pdf, other]

    cs.SI cs.AI

    Link Prediction using Graph Neural Networks for Master Data Management

    Authors: Balaji Ganesan, Srinivas Parkala, Neeraj R Singh, Sumit Bhatia, Gayatri Mishra, Matheen Ahmed Pasha, Hima Patel, Somashekar Naganna

    Abstract: Learning graph representations of n-ary relational data has a number of real world applications like anti-money laundering, fraud detection, and customer due diligence. Contact tracing of COVID19 positive persons could also be posed as a Link Prediction problem. Predicting links between people using Graph Neural Networks requires more careful ethical and privacy considerations than in domains where GNN…

    Submitted 28 August, 2020; v1 submitted 7 March, 2020; originally announced March 2020.

    Comments: 10 pages, 11 figures

  13. arXiv:2002.10943  [pdf, other]

    cs.IR cs.AI cs.CL

    Data Augmentation for Personal Knowledge Base Population

    Authors: Lingraj S Vannur, Balaji Ganesan, Lokesh Nagalapatti, Hima Patel, MN Thippeswamy

    Abstract: Cold start knowledge base population (KBP) is the problem of populating a knowledge base from unstructured documents. While artificial neural networks have led to significant improvements in the different tasks that are part of KBP, the overall F1 of the end-to-end system remains quite low. This problem is more acute in personal knowledge bases, which present additional challenges with regard to d…

    Submitted 18 August, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: 8 pages, 9 figures, 6 tables. under review. arXiv admin note: text overlap with arXiv:2001.08013

  14. arXiv:2001.08013  [pdf, other]

    cs.AI cs.CL cs.IR

    A Neural Architecture for Person Ontology population

    Authors: Balaji Ganesan, Riddhiman Dasgupta, Akshay Parekh, Hima Patel, Berthold Reinwald

    Abstract: A person ontology comprising concepts, attributes and relationships of people has a number of applications in data protection, de-identification, population of knowledge graphs for business intelligence and fraud prevention. While artificial neural networks have led to improvements in Entity Recognition, Entity Classification, and Relation Extraction, creating an ontology largely remains a manual pr…

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: 6 pages, 10 figures. arXiv admin note: substantial text overlap with arXiv:1811.09368

  15. arXiv:1811.12728  [pdf, ps, other]

    cs.CL

    Document Structure Measure for Hypernym discovery

    Authors: Aswin Kannan, Shanmukha C Guttula, Balaji Ganesan, Hima P Karanam, Arun Kumar

    Abstract: Hypernym discovery is the problem of finding terms that have is-a relationship with a given term. We introduce a new context type, and a relatedness measure to differentiate hypernyms from other types of semantic relationships. Our Document Structure measure is based on hierarchical position of terms in a document, and their presence or otherwise in definition text. This measure quantifies the doc…

    Submitted 30 November, 2018; originally announced November 2018.

  16. arXiv:1811.09368  [pdf, other]

    cs.CL cs.IR

    Fine Grained Classification of Personal Data Entities

    Authors: Riddhiman Dasgupta, Balaji Ganesan, Aswin Kannan, Berthold Reinwald, Arun Kumar

    Abstract: Entity Type Classification can be defined as the task of assigning category labels to entity mentions in documents. While neural networks have recently improved the classification of general entity mentions, pattern matching and other systems continue to be used for classifying personal data entities (e.g. classifying an organization as a media company or a government institution for GDPR, and HIP…

    Submitted 23 November, 2018; originally announced November 2018.

  17. arXiv:1810.08782  [pdf, other]

    cs.CL cs.AI

    Collective Learning From Diverse Datasets for Entity Typing in the Wild

    Authors: Abhishek Abhishek, Amar Prakash Azad, Balaji Ganesan, Ashish Anand, Amit Awekar

    Abstract: Entity typing (ET) is the problem of assigning labels to given entity mentions in a sentence. Existing works for ET require knowledge about the domain and target label set for a given test instance. ET in the absence of such knowledge is a novel problem that we address as ET in the wild. We hypothesize that the solution to this problem is to build supervised models that generalize better on the ET…

    Submitted 16 September, 2019; v1 submitted 20 October, 2018; originally announced October 2018.

    Comments: Accepted at EYRE'19 Workshop, CIKM 2019