-
Human Centered AI for Indian Legal Text Analytics
Authors:
Sudipto Ghosh,
Devanshu Verma,
Balaji Ganesan,
Purnima Bindal,
Vikas Kumar,
Vasudha Bhatnagar
Abstract:
Legal research is a crucial task in the practice of law. It requires intense human effort and intellectual prudence to research a legal case and prepare arguments. Recent boom in generative AI has not translated to proportionate rise in impactful legal applications, because of low trustworthiness and and the scarcity of specialized datasets for training Large Language Models (LLMs). This position…
▽ More
Legal research is a crucial task in the practice of law. It requires intense human effort and intellectual prudence to research a legal case and prepare arguments. Recent boom in generative AI has not translated to proportionate rise in impactful legal applications, because of low trustworthiness and and the scarcity of specialized datasets for training Large Language Models (LLMs). This position paper explores the potential of LLMs within Legal Text Analytics (LTA), highlighting specific areas where the integration of human expertise can significantly enhance their performance to match that of experts. We introduce a novel dataset and describe a human centered, compound AI system that principally incorporates human inputs for performing LTA tasks with LLMs.
△ Less
Submitted 16 March, 2024;
originally announced March 2024.
-
xLP: Explainable Link Prediction for Master Data Management
Authors:
Balaji Ganesan,
Matheen Ahmed Pasha,
Srinivasa Parkala,
Neeraj R Singh,
Gayatri Mishra,
Sumit Bhatia,
Hima Patel,
Somashekar Naganna,
Sameep Mehta
Abstract:
Explaining neural model predictions to users requires creativity. Especially in enterprise applications, where there are costs associated with users' time, and their trust in the model predictions is critical for adoption. For link prediction in master data management, we have built a number of explainability solutions drawing from research in interpretability, fact verification, path ranking, neu…
▽ More
Explaining neural model predictions to users requires creativity. Especially in enterprise applications, where there are costs associated with users' time, and their trust in the model predictions is critical for adoption. For link prediction in master data management, we have built a number of explainability solutions drawing from research in interpretability, fact verification, path ranking, neuro-symbolic reasoning and self-explaining AI. In this demo, we present explanations for link prediction in a creative way, to allow users to choose explanations they are more comfortable with.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
Infusing Knowledge into Large Language Models with Contextual Prompts
Authors:
Kinshuk Vasisht,
Balaji Ganesan,
Vikas Kumar,
Vasudha Bhatnagar
Abstract:
Knowledge infusion is a promising method for enhancing Large Language Models for domain-specific NLP tasks rather than pre-training models over large data from scratch. These augmented LLMs typically depend on additional pre-training or knowledge prompts from an existing knowledge graph, which is impractical in many applications. In contrast, knowledge infusion directly from relevant documents is…
▽ More
Knowledge infusion is a promising method for enhancing Large Language Models for domain-specific NLP tasks rather than pre-training models over large data from scratch. These augmented LLMs typically depend on additional pre-training or knowledge prompts from an existing knowledge graph, which is impractical in many applications. In contrast, knowledge infusion directly from relevant documents is more generalisable and alleviates the need for structured knowledge graphs while also being useful for entities that are usually not found in any knowledge graph. With this motivation, we propose a simple yet generalisable approach for knowledge infusion by generating prompts from the context in the input text. Our experiments show the effectiveness of our approach which we evaluate by probing the fine-tuned LLMs.
△ Less
Submitted 3 March, 2024;
originally announced March 2024.
-
Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning
Authors:
Debarun Bhattacharjya,
Junkyu Lee,
Don Joven Agravante,
Balaji Ganesan,
Radu Marinescu
Abstract:
Foundation models (FMs) such as large language models have revolutionized the field of AI by showing remarkable performance in various tasks. However, they exhibit numerous limitations that prevent their broader adoption in many real-world systems, which often require a higher bar for trustworthiness and usability. Since FMs are trained using loss functions aimed at reconstructing the training cor…
▽ More
Foundation models (FMs) such as large language models have revolutionized the field of AI by showing remarkable performance in various tasks. However, they exhibit numerous limitations that prevent their broader adoption in many real-world systems, which often require a higher bar for trustworthiness and usability. Since FMs are trained using loss functions aimed at reconstructing the training corpus in a self-supervised manner, there is no guarantee that the model's output aligns with users' preferences for a specific task at hand. In this survey paper, we propose a conceptual framework that encapsulates different modes by which agents could interact with FMs and guide them suitably for a set of tasks, particularly through knowledge augmentation and reasoning. Our framework elucidates agent role categories such as updating the underlying FM, assisting with prompting the FM, and evaluating the FM output. We also categorize several state-of-the-art approaches into agent interaction protocols, highlighting the nature and extent of involvement of the various agent roles. The proposed framework provides guidance for future directions to further realize the power of FMs in practical AI systems.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Automated Answer Validation using Text Similarity
Authors:
Balaji Ganesan,
Arjun Ravikumar,
Lakshay Piplani,
Rini Bhaumik,
Dhivya Padmanaban,
Shwetha Narasimhamurthy,
Chetan Adhikary,
Subhash Deshapogu
Abstract:
Automated answer validation can help improve learning outcomes by providing appropriate feedback to learners, and by making question answering systems and online learning solutions more widely available. There have been some works in science question answering which show that information retrieval methods outperform neural methods, especially in the multiple choice version of this problem. We impl…
▽ More
Automated answer validation can help improve learning outcomes by providing appropriate feedback to learners, and by making question answering systems and online learning solutions more widely available. There have been some works in science question answering which show that information retrieval methods outperform neural methods, especially in the multiple choice version of this problem. We implement Siamese neural network models and produce a generalised solution to this problem. We compare our supervised model with other text similarity based solutions.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
xEM: Explainable Entity Matching in Customer 360
Authors:
Sukriti Jaitly,
Deepa Mariam George,
Balaji Ganesan,
Muhammad Ameen,
Srinivas Pusapati
Abstract:
Entity matching in Customer 360 is the task of determining if multiple records represent the same real world entity. Entities are typically people, organizations, locations, and events represented as attributed nodes in a graph, though they can also be represented as records in relational data. While probabilistic matching engines and artificial neural network models exist for this task, explainin…
▽ More
Entity matching in Customer 360 is the task of determining if multiple records represent the same real world entity. Entities are typically people, organizations, locations, and events represented as attributed nodes in a graph, though they can also be represented as records in relational data. While probabilistic matching engines and artificial neural network models exist for this task, explaining entity matching has received less attention. In this demo, we present our Explainable Entity Matching (xEM) system and discuss the different AI/ML considerations that went into its implementation.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
Similar Cases Recommendation using Legal Knowledge Graphs
Authors:
Jaspreet Singh Dhani,
Ruchika Bhatt,
Balaji Ganesan,
Parikshet Sirohi,
Vasudha Bhatnagar
Abstract:
A legal knowledge graph constructed from court cases, judgments, laws and other legal documents can enable a number of applications like question answering, document similarity, and search. While the use of knowledge graphs for distant supervision in NLP tasks is well researched, using knowledge graphs for applications like case similarity presents challenges. In this work, we describe our solutio…
▽ More
A legal knowledge graph constructed from court cases, judgments, laws and other legal documents can enable a number of applications like question answering, document similarity, and search. While the use of knowledge graphs for distant supervision in NLP tasks is well researched, using knowledge graphs for applications like case similarity presents challenges. In this work, we describe our solution for predicting similar cases in Indian court judgements. We present our results and also discuss the impact of large language models on this task.
△ Less
Submitted 2 March, 2024; v1 submitted 10 July, 2021;
originally announced July 2021.
-
Reimagining GNN Explanations with ideas from Tabular Data
Authors:
Anjali Singh,
Shamanth R Nayak K,
Balaji Ganesan
Abstract:
Explainability techniques for Graph Neural Networks still have a long way to go compared to explanations available for both neural and decision decision tree-based models trained on tabular data. Using a task that straddles both graphs and tabular data, namely Entity Matching, we comment on key aspects of explainability that are missing in GNN model explanations.
Explainability techniques for Graph Neural Networks still have a long way to go compared to explanations available for both neural and decision decision tree-based models trained on tabular data. Using a task that straddles both graphs and tabular data, namely Entity Matching, we comment on key aspects of explainability that are missing in GNN model explanations.
△ Less
Submitted 23 June, 2021;
originally announced June 2021.
-
Towards Automated Evaluation of Explanations in Graph Neural Networks
Authors:
Vanya BK,
Balaji Ganesan,
Aniket Saxena,
Devbrat Sharma,
Arvind Agarwal
Abstract:
Explaining Graph Neural Networks predictions to end users of AI applications in easily understandable terms remains an unsolved problem. In particular, we do not have well developed methods for automatically evaluating explanations, in ways that are closer to how users consume those explanations. Based on recent application trends and our own experiences in real world problems, we propose automati…
▽ More
Explaining Graph Neural Networks predictions to end users of AI applications in easily understandable terms remains an unsolved problem. In particular, we do not have well developed methods for automatically evaluating explanations, in ways that are closer to how users consume those explanations. Based on recent application trends and our own experiences in real world problems, we propose automatic evaluation approaches for GNN Explanations.
△ Less
Submitted 22 June, 2021;
originally announced June 2021.
-
Document Structure aware Relational Graph Convolutional Networks for Ontology Population
Authors:
Abhay M Shalghar,
Ayush Kumar,
Balaji Ganesan,
Aswin Kannan,
Akshay Parekh,
Shobha G
Abstract:
Ontologies comprising of concepts, their attributes, and relationships are used in many knowledge based AI systems. While there have been efforts towards populating domain specific ontologies, we examine the role of document structure in learning ontological relationships between concepts in any document corpus. Inspired by ideas from hypernym discovery and explainability, our method performs abou…
▽ More
Ontologies comprising of concepts, their attributes, and relationships are used in many knowledge based AI systems. While there have been efforts towards populating domain specific ontologies, we examine the role of document structure in learning ontological relationships between concepts in any document corpus. Inspired by ideas from hypernym discovery and explainability, our method performs about 15 points more accurate than a stand-alone R-GCN model for this task.
△ Less
Submitted 12 April, 2022; v1 submitted 26 April, 2021;
originally announced April 2021.
-
Explainable Link Prediction for Privacy-Preserving Contact Tracing
Authors:
Balaji Ganesan,
Hima Patel,
Sameep Mehta
Abstract:
Contact Tracing has been used to identify people who were in close proximity to those infected with SARS-Cov2 coronavirus. A number of digital contract tracing applications have been introduced to facilitate or complement physical contact tracing. However, there are a number of privacy issues in the implementation of contract tracing applications, which make people reluctant to install or update t…
▽ More
Contact Tracing has been used to identify people who were in close proximity to those infected with SARS-Cov2 coronavirus. A number of digital contract tracing applications have been introduced to facilitate or complement physical contact tracing. However, there are a number of privacy issues in the implementation of contract tracing applications, which make people reluctant to install or update their infection status on these applications. In this concept paper, we present ideas from Graph Neural Networks and explainability, that could improve trust in these applications, and encourage adoption by people.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.
-
Link Prediction using Graph Neural Networks for Master Data Management
Authors:
Balaji Ganesan,
Srinivas Parkala,
Neeraj R Singh,
Sumit Bhatia,
Gayatri Mishra,
Matheen Ahmed Pasha,
Hima Patel,
Somashekar Naganna
Abstract:
Learning graph representations of n-ary relational data has a number of real world applications like anti-money laundering, fraud detection, and customer due diligence. Contact tracing of COVID19 positive persons could also be posed as a Link Prediction problem. Predicting links between people using Graph Neural Networks requires careful ethical and privacy considerations than in domains where GNN…
▽ More
Learning graph representations of n-ary relational data has a number of real world applications like anti-money laundering, fraud detection, and customer due diligence. Contact tracing of COVID19 positive persons could also be posed as a Link Prediction problem. Predicting links between people using Graph Neural Networks requires careful ethical and privacy considerations than in domains where GNNs have typically been applied so far. We introduce novel methods for anonymizing data, model training, explainability and verification for Link Prediction in Master Data Management, and discuss our results.
△ Less
Submitted 28 August, 2020; v1 submitted 7 March, 2020;
originally announced March 2020.
-
Data Augmentation for Personal Knowledge Base Population
Authors:
Lingraj S Vannur,
Balaji Ganesan,
Lokesh Nagalapatti,
Hima Patel,
MN Thippeswamy
Abstract:
Cold start knowledge base population (KBP) is the problem of populating a knowledge base from unstructured documents. While artificial neural networks have led to significant improvements in the different tasks that are part of KBP, the overall F1 of the end-to-end system remains quite low. This problem is more acute in personal knowledge bases, which present additional challenges with regard to d…
▽ More
Cold start knowledge base population (KBP) is the problem of populating a knowledge base from unstructured documents. While artificial neural networks have led to significant improvements in the different tasks that are part of KBP, the overall F1 of the end-to-end system remains quite low. This problem is more acute in personal knowledge bases, which present additional challenges with regard to data protection, fairness and privacy. In this work, we present a system that uses rule based annotators and a graph neural network for missing link prediction, to populate a more complete, fair and diverse knowledge base from the TACRED dataset.
△ Less
Submitted 18 August, 2020; v1 submitted 23 February, 2020;
originally announced February 2020.
-
A Neural Architecture for Person Ontology population
Authors:
Balaji Ganesan,
Riddhiman Dasgupta,
Akshay Parekh,
Hima Patel,
Berthold Reinwald
Abstract:
A person ontology comprising concepts, attributes and relationships of people has a number of applications in data protection, didentification, population of knowledge graphs for business intelligence and fraud prevention. While artificial neural networks have led to improvements in Entity Recognition, Entity Classification, and Relation Extraction, creating an ontology largely remains a manual pr…
▽ More
A person ontology comprising concepts, attributes and relationships of people has a number of applications in data protection, didentification, population of knowledge graphs for business intelligence and fraud prevention. While artificial neural networks have led to improvements in Entity Recognition, Entity Classification, and Relation Extraction, creating an ontology largely remains a manual process, because it requires a fixed set of semantic relations between concepts. In this work, we present a system for automatically populating a person ontology graph from unstructured data using neural models for Entity Classification and Relation Extraction. We introduce a new dataset for these tasks and discuss our results.
△ Less
Submitted 22 January, 2020;
originally announced January 2020.
-
Document Structure Measure for Hypernym discovery
Authors:
Aswin Kannan,
Shanmukha C Guttula,
Balaji Ganesan,
Hima P Karanam,
Arun Kumar
Abstract:
Hypernym discovery is the problem of finding terms that have is-a relationship with a given term. We introduce a new context type, and a relatedness measure to differentiate hypernyms from other types of semantic relationships. Our Document Structure measure is based on hierarchical position of terms in a document, and their presence or otherwise in definition text. This measure quantifies the doc…
▽ More
Hypernym discovery is the problem of finding terms that have is-a relationship with a given term. We introduce a new context type, and a relatedness measure to differentiate hypernyms from other types of semantic relationships. Our Document Structure measure is based on hierarchical position of terms in a document, and their presence or otherwise in definition text. This measure quantifies the document structure using multiple attributes, and classes of weighted distance functions.
△ Less
Submitted 30 November, 2018;
originally announced November 2018.
-
Fine Grained Classification of Personal Data Entities
Authors:
Riddhiman Dasgupta,
Balaji Ganesan,
Aswin Kannan,
Berthold Reinwald,
Arun Kumar
Abstract:
Entity Type Classification can be defined as the task of assigning category labels to entity mentions in documents. While neural networks have recently improved the classification of general entity mentions, pattern matching and other systems continue to be used for classifying personal data entities (e.g. classifying an organization as a media company or a government institution for GDPR, and HIP…
▽ More
Entity Type Classification can be defined as the task of assigning category labels to entity mentions in documents. While neural networks have recently improved the classification of general entity mentions, pattern matching and other systems continue to be used for classifying personal data entities (e.g. classifying an organization as a media company or a government institution for GDPR, and HIPAA compliance). We propose a neural model to expand the class of personal data entities that can be classified at a fine grained level, using the output of existing pattern matching systems as additional contextual features. We introduce new resources, a personal data entities hierarchy with 134 types, and two datasets from the Wikipedia pages of elected representatives and Enron emails. We hope these resource will aid research in the area of personal data discovery, and to that effect, we provide baseline results on these datasets, and compare our method with state of the art models on OntoNotes dataset.
△ Less
Submitted 23 November, 2018;
originally announced November 2018.
-
Collective Learning From Diverse Datasets for Entity Ty** in the Wild
Authors:
Abhishek Abhishek,
Amar Prakash Azad,
Balaji Ganesan,
Ashish Anand,
Amit Awekar
Abstract:
Entity ty** (ET) is the problem of assigning labels to given entity mentions in a sentence. Existing works for ET require knowledge about the domain and target label set for a given test instance. ET in the absence of such knowledge is a novel problem that we address as ET in the wild. We hypothesize that the solution to this problem is to build supervised models that generalize better on the ET…
▽ More
Entity ty** (ET) is the problem of assigning labels to given entity mentions in a sentence. Existing works for ET require knowledge about the domain and target label set for a given test instance. ET in the absence of such knowledge is a novel problem that we address as ET in the wild. We hypothesize that the solution to this problem is to build supervised models that generalize better on the ET task as a whole, rather than a specific dataset. In this direction, we propose a Collective Learning Framework (CLF), which enables learning from diverse datasets in a unified way. The CLF first creates a unified hierarchical label set (UHLS) and a label map** by aggregating label information from all available datasets. Then it builds a single neural network classifier using UHLS, label map**, and a partial loss function. The single classifier predicts the finest possible label across all available domains even though these labels may not be present in any domain-specific dataset. We also propose a set of evaluation schemes and metrics to evaluate the performance of models in this novel problem. Extensive experimentation on seven diverse real-world datasets demonstrates the efficacy of our CLF.
△ Less
Submitted 16 September, 2019; v1 submitted 20 October, 2018;
originally announced October 2018.