-
TAGMol: Target-Aware Gradient-guided Molecule Generation
Authors:
Vineeth Dorna,
D. Subhalingam,
Keshav Kolluru,
Shreshth Tuli,
Mrityunjay Singh,
Saurabh Singal,
N. M. Anoop Krishnan,
Sayan Ranu
Abstract:
3D generative models have shown significant promise in structure-based drug design (SBDD), particularly in discovering ligands tailored to specific target binding sites. Existing algorithms often focus primarily on ligand-target binding, characterized by binding affinity. Moreover, models trained solely on target-ligand distribution may fall short in addressing the broader objectives of drug disco…
▽ More
3D generative models have shown significant promise in structure-based drug design (SBDD), particularly in discovering ligands tailored to specific target binding sites. Existing algorithms often focus primarily on ligand-target binding, characterized by binding affinity. Moreover, models trained solely on target-ligand distribution may fall short in addressing the broader objectives of drug discovery, such as the development of novel ligands with desired properties like drug-likeness, and synthesizability, underscoring the multifaceted nature of the drug design process. To overcome these challenges, we decouple the problem into molecular generation and property prediction. The latter synergistically guides the diffusion sampling process, facilitating guided diffusion and resulting in the creation of meaningful molecules with the desired properties. We call this guided molecular generation process as TAGMol. Through experiments on benchmark datasets, TAGMol demonstrates superior performance compared to state-of-the-art baselines, achieving a 22% improvement in average Vina Score and yielding favorable outcomes in essential auxiliary properties. This establishes TAGMol as a comprehensive framework for drug generation.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
GenToC: Leveraging Partially-Labeled Data for Product Attribute-Value Identification
Authors:
D. Subhalingam,
Keshav Kolluru,
Mausam,
Saurabh Singal
Abstract:
In the e-commerce domain, the accurate extraction of attribute-value pairs from product listings (e.g., Brand: Apple) is crucial for enhancing search and recommendation systems. The automation of this extraction process is challenging due to the vast diversity of product categories and their respective attributes, compounded by the lack of extensive, accurately annotated training datasets and the…
▽ More
In the e-commerce domain, the accurate extraction of attribute-value pairs from product listings (e.g., Brand: Apple) is crucial for enhancing search and recommendation systems. The automation of this extraction process is challenging due to the vast diversity of product categories and their respective attributes, compounded by the lack of extensive, accurately annotated training datasets and the demand for low latency to meet the real-time needs of e-commerce platforms. To address these challenges, we introduce GenToC, a novel two-stage model for extracting attribute-value pairs from product titles. GenToC is designed to train with partially-labeled data, leveraging incomplete attribute-value pairs and obviating the need for a fully annotated dataset. Moreover, we introduce a bootstrap** method that enables GenToC to progressively refine and expand its training dataset. This enhancement substantially improves the quality of data available for training other neural network models that are typically faster but are inherently less capable than GenToC in terms of their capacity to handle partially-labeled data. By supplying an enriched dataset for training, GenToC significantly advances the performance of these alternative models, making them more suitable for real-time deployment. Our results highlight the unique capability of GenToC to learn from a limited set of labeled data and to contribute to the training of more efficient models, marking a significant leap forward in the automated extraction of attribute-value pairs from product titles. GenToC has been successfully integrated into India's largest B2B e-commerce platform, IndiaMART.com, achieving a significant increase of 21.1% in recall over the existing deployed system while maintaining a high precision of 89.5% in this challenging task.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
mOKB6: A Multilingual Open Knowledge Base Completion Benchmark
Authors:
Shubham Mittal,
Keshav Kolluru,
Soumen Chakrabarti,
Mausam
Abstract:
Automated completion of open knowledge bases (Open KBs), which are constructed from triples of the form (subject phrase, relation phrase, object phrase), obtained via open information extraction (Open IE) system, are useful for discovering novel facts that may not be directly present in the text. However, research in Open KB completion (Open KBC) has so far been limited to resource-rich languages…
▽ More
Automated completion of open knowledge bases (Open KBs), which are constructed from triples of the form (subject phrase, relation phrase, object phrase), obtained via open information extraction (Open IE) system, are useful for discovering novel facts that may not be directly present in the text. However, research in Open KB completion (Open KBC) has so far been limited to resource-rich languages like English. Using the latest advances in multilingual Open IE, we construct the first multilingual Open KBC dataset, called mOKB6, containing facts from Wikipedia in six languages (including English). Improving the previous Open KB construction pipeline by doing multilingual coreference resolution and kee** only entity-linked triples, we create a dense Open KB. We experiment with several models for the task and observe a consistent benefit of combining languages with the help of shared embedding space as well as translations of facts. We also observe that current multilingual models struggle to remember facts seen in languages of different scripts.
△ Less
Submitted 28 May, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
"Covid vaccine is against Covid but Oxford vaccine is made at Oxford!" Semantic Interpretation of Proper Noun Compounds
Authors:
Keshav Kolluru,
Gabriel Stanovsky,
Mausam
Abstract:
Proper noun compounds, e.g., "Covid vaccine", convey information in a succinct manner (a "Covid vaccine" is a "vaccine that immunizes against the Covid disease"). These are commonly used in short-form domains, such as news headlines, but are largely ignored in information-seeking applications. To address this limitation, we release a new manually annotated dataset, ProNCI, consisting of 22.5K prop…
▽ More
Proper noun compounds, e.g., "Covid vaccine", convey information in a succinct manner (a "Covid vaccine" is a "vaccine that immunizes against the Covid disease"). These are commonly used in short-form domains, such as news headlines, but are largely ignored in information-seeking applications. To address this limitation, we release a new manually annotated dataset, ProNCI, consisting of 22.5K proper noun compounds along with their free-form semantic interpretations. ProNCI is 60 times larger than prior noun compound datasets and also includes non-compositional examples, which have not been previously explored. We experiment with various neural models for automatically generating the semantic interpretations from proper noun compounds, ranging from few-shot prompting to supervised learning, with varying degrees of knowledge about the constituent nouns. We find that adding targeted knowledge, particularly about the common noun, results in performance gains of upto 2.8%. Finally, we integrate our model generated interpretations with an existing Open IE system and observe an 7.5% increase in yield at a precision of 85%. The dataset and code are available at https://github.com/dair-iitd/pronci.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Essentially entropic lattice Boltzmann model: Theory and simulations
Authors:
Mohammad Atif,
Praveen Kumar Kolluru,
Santosh Ansumali
Abstract:
We present a detailed description of the essentially entropic lattice Boltzmann model. The entropic lattice Boltzmann model guarantees unconditional numerical stability by iteratively solving the nonlinear entropy evolution equation. In this paper we explain the construction of closed-form analytic solutions to this equation. We demonstrate that near equilibrium this exact solution reduces to the…
▽ More
We present a detailed description of the essentially entropic lattice Boltzmann model. The entropic lattice Boltzmann model guarantees unconditional numerical stability by iteratively solving the nonlinear entropy evolution equation. In this paper we explain the construction of closed-form analytic solutions to this equation. We demonstrate that near equilibrium this exact solution reduces to the standard lattice Boltzmann model. We consider a few test cases to show that the exact solution does not exhibit any significant deviation from the iterative solution. We also extend the analytical solution for the ES-BGK model to remove the limitation on the Prandtl number for heat transfer problems. The simplicity of the exact solution removes the computational overhead and algorithmic complexity associated with the entropic lattice Boltzmann models.
△ Less
Submitted 24 March, 2022;
originally announced March 2022.
-
Reduced kinetic model of polyatomic gases
Authors:
Praveen Kumar Kolluru,
Mohammad Atif,
Santosh Ansumali
Abstract:
Kinetic models of polyatomic gas typically account for the internal degrees of freedom at the level of the two-particle distribution function. However, close to the hydrodynamic limit, the internal (rotational) degrees of freedom tend to be well represented just by rotational kinetic energy density. We account for the rotational energy by augmenting the Ellipsoidal-statistical BGK (ES-BGK) model,…
▽ More
Kinetic models of polyatomic gas typically account for the internal degrees of freedom at the level of the two-particle distribution function. However, close to the hydrodynamic limit, the internal (rotational) degrees of freedom tend to be well represented just by rotational kinetic energy density. We account for the rotational energy by augmenting the Ellipsoidal-statistical BGK (ES-BGK) model, an extension of the Bhatnagar-Gross- Krook (BGK) model, at the level of the single-particle distribution function with an advection-diffusion-relaxation equation for the rotational energy. This reduced model respects the H theorem and recovers the compressible hydrodynamics for polyatomic gases as its macroscopic limit. As required for a polyatomic gas model, this extension of the ES-BGK model has not only correct specific heat ratio but also allows for three independent tunable transport coefficients: thermal conductivity, shear viscosity, and bulk viscosity. We illustrate the effectiveness of the model via a lattice Boltzmann method implementation.
△ Less
Submitted 13 January, 2022;
originally announced January 2022.
-
Multilingual Fact Linking
Authors:
Keshav Kolluru,
Martin Rezk,
Pat Verga,
William W. Cohen,
Partha Talukdar
Abstract:
Knowledge-intensive NLP tasks can benefit from linking natural language text with facts from a Knowledge Graph (KG). Although facts themselves are language-agnostic, the fact labels (i.e., language-specific representation of the fact) in the KG are often present only in a few languages. This makes it challenging to link KG facts to sentences in languages other than the limited set of languages. To…
▽ More
Knowledge-intensive NLP tasks can benefit from linking natural language text with facts from a Knowledge Graph (KG). Although facts themselves are language-agnostic, the fact labels (i.e., language-specific representation of the fact) in the KG are often present only in a few languages. This makes it challenging to link KG facts to sentences in languages other than the limited set of languages. To address this problem, we introduce the task of Multilingual Fact Linking (MFL) where the goal is to link fact expressed in a sentence to corresponding fact in the KG, even when the fact label in the KG is not available in the language of the sentence. To facilitate research in this area, we present a new evaluation dataset, IndicLink. This dataset contains 11,293 linked WikiData facts and 6,429 sentences spanning English and six Indian languages. We propose a Retrieval+Generation model, ReFCoG, that can scale to millions of KG facts by combining Dual Encoder based retrieval with a Seq2Seq based generation model which is constrained to output only valid KG facts. ReFCoG outperforms standard Retrieval+Re-ranking models by 10.7 pts in Precision@1. In spite of this gain, the model achieves an overall score of 52.1, showing ample scope for improvement in the task.ReFCoG code and IndicLink data are available at https://github.com/SaiKeshav/mfl
△ Less
Submitted 30 September, 2021; v1 submitted 29 September, 2021;
originally announced September 2021.
-
CEAR: Cross-Entity Aware Reranker for Knowledge Base Completion
Authors:
Keshav Kolluru,
Mayank Singh Chauhan,
Yatin Nandwani,
Parag Singla,
Mausam
Abstract:
Pre-trained language models (LMs) like BERT have shown to store factual knowledge about the world. This knowledge can be used to augment the information present in Knowledge Bases, which tend to be incomplete. However, prior attempts at using BERT for task of Knowledge Base Completion (KBC) resulted in performance worse than embedding based techniques that rely only on the graph structure. In this…
▽ More
Pre-trained language models (LMs) like BERT have shown to store factual knowledge about the world. This knowledge can be used to augment the information present in Knowledge Bases, which tend to be incomplete. However, prior attempts at using BERT for task of Knowledge Base Completion (KBC) resulted in performance worse than embedding based techniques that rely only on the graph structure. In this work we develop a novel model, Cross-Entity Aware Reranker (CEAR), that uses BERT to re-rank the output of existing KBC models with cross-entity attention. Unlike prior work that scores each entity independently, CEAR uses BERT to score the entities together, which is effective for exploiting its factual knowledge. CEAR achieves a new state of art for the OLPBench dataset.
△ Less
Submitted 27 January, 2022; v1 submitted 18 April, 2021;
originally announced April 2021.
-
OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction
Authors:
Keshav Kolluru,
Vaibhav Adlakha,
Samarth Aggarwal,
Mausam,
Soumen Chakrabarti
Abstract:
A recent state-of-the-art neural open information extraction (OpenIE) system generates extractions iteratively, requiring repeated encoding of partial outputs. This comes at a significant computational cost. On the other hand, sequence labeling approaches for OpenIE are much faster, but worse in extraction quality. In this paper, we bridge this trade-off by presenting an iterative labeling-based s…
▽ More
A recent state-of-the-art neural open information extraction (OpenIE) system generates extractions iteratively, requiring repeated encoding of partial outputs. This comes at a significant computational cost. On the other hand, sequence labeling approaches for OpenIE are much faster, but worse in extraction quality. In this paper, we bridge this trade-off by presenting an iterative labeling-based system that establishes a new state of the art for OpenIE, while extracting 10x faster. This is achieved through a novel Iterative Grid Labeling (IGL) architecture, which treats OpenIE as a 2-D grid labeling task. We improve its performance further by applying coverage (soft) constraints on the grid at training time.
Moreover, on observing that the best OpenIE systems falter at handling coordination structures, our OpenIE system also incorporates a new coordination analyzer built with the same IGL architecture. This IGL based coordination analyzer helps our OpenIE system handle complicated coordination structures, while also establishing a new state of the art on the task of coordination analysis, with a 12.3 pts improvement in F1 over previous analyzers. Our OpenIE system, OpenIE6, beats the previous systems by as much as 4 pts in F1, while being much faster.
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
Lattice Boltzmann Method for wave propagation in elastic solids with a regular lattice: Theoretical analysis and validation
Authors:
Maxime Escande,
Praveen Kumar Kolluru,
Louis Marie Cléon,
Pierre Sagaut
Abstract:
The von Neumann stability analysis along with a Chapman-Enskog analysis is proposed for a single-relaxation-time lattice Boltzmann Method (LBM) for wave propagation in isotropic linear elastic solids, using a regular D2Q9 lattice. Different boundary conditions are considered: periodic, free surface, rigid interface. An original absorbing layer model is proposed to prevent spurious wave reflection…
▽ More
The von Neumann stability analysis along with a Chapman-Enskog analysis is proposed for a single-relaxation-time lattice Boltzmann Method (LBM) for wave propagation in isotropic linear elastic solids, using a regular D2Q9 lattice. Different boundary conditions are considered: periodic, free surface, rigid interface. An original absorbing layer model is proposed to prevent spurious wave reflection at domain boundaries. The present method is assessed considering several test cases. First, a spatial Gaussian force modulated in time by a Ricker wavelet is used as a source. Comparisons are made with results obtained using a classical Fourier spectral method. Both P and S waves are shown to be very accurately predicted. The case of Rayleigh surface waves is then addressed to check the accuracy of the method.
△ Less
Submitted 10 September, 2020;
originally announced September 2020.
-
IMoJIE: Iterative Memory-Based Joint Open Information Extraction
Authors:
Keshav Kolluru,
Samarth Aggarwal,
Vipul Rathore,
Mausam,
Soumen Chakrabarti
Abstract:
While traditional systems for Open Information Extraction were statistical and rule-based, recently neural models have been introduced for the task. Our work builds upon CopyAttention, a sequence generation OpenIE model (Cui et. al., 2018). Our analysis reveals that CopyAttention produces a constant number of extractions per sentence, and its extracted tuples often express redundant information.…
▽ More
While traditional systems for Open Information Extraction were statistical and rule-based, recently neural models have been introduced for the task. Our work builds upon CopyAttention, a sequence generation OpenIE model (Cui et. al., 2018). Our analysis reveals that CopyAttention produces a constant number of extractions per sentence, and its extracted tuples often express redundant information.
We present IMoJIE, an extension to CopyAttention, which produces the next extraction conditioned on all previously extracted tuples. This approach overcomes both shortcomings of CopyAttention, resulting in a variable number of diverse extractions per sentence. We train IMoJIE on training data bootstrapped from extractions of several non-neural systems, which have been automatically filtered to reduce redundancy and noise. IMoJIE outperforms CopyAttention by about 18 F1 pts, and a BERT-based strong baseline by 2 F1 pts, establishing a new state of the art for the task.
△ Less
Submitted 17 May, 2020;
originally announced May 2020.
-
Why and when should you pool? Analyzing Pooling in Recurrent Architectures
Authors:
Pratyush Maini,
Keshav Kolluru,
Danish Pruthi,
Mausam
Abstract:
Pooling-based recurrent neural architectures consistently outperform their counterparts without pooling. However, the reasons for their enhanced performance are largely unexamined. In this work, we examine three commonly used pooling techniques (mean-pooling, max-pooling, and attention), and propose max-attention, a novel variant that effectively captures interactions among predictive tokens in a…
▽ More
Pooling-based recurrent neural architectures consistently outperform their counterparts without pooling. However, the reasons for their enhanced performance are largely unexamined. In this work, we examine three commonly used pooling techniques (mean-pooling, max-pooling, and attention), and propose max-attention, a novel variant that effectively captures interactions among predictive tokens in a sentence. We find that pooling-based architectures substantially differ from their non-pooling equivalents in their learning ability and positional biases--which elucidate their performance benefits. By analyzing the gradient propagation, we discover that pooling facilitates better gradient flow compared to BiLSTMs. Further, we expose how BiLSTMs are positionally biased towards tokens in the beginning and the end of a sequence. Pooling alleviates such biases. Consequently, we identify settings where pooling offers large benefits: (i) in low resource scenarios, and (ii) when important words lie towards the middle of the sentence. Among the pooling techniques studied, max-attention is the most effective, resulting in significant performance gains on several text classification tasks.
△ Less
Submitted 27 October, 2020; v1 submitted 30 April, 2020;
originally announced May 2020.
-
Lattice Boltzmann model for weakly compressible flows
Authors:
Praveen Kumar Kolluru,
Mohammad Atif,
Manjusha Namburi,
Santosh Ansumali
Abstract:
We present an energy conserving lattice Boltzmann model based on a crystallographic lattice for simulation of weakly compressible flows. The theoretical requirements and the methodology to construct such a model are discussed. We demonstrate that the model recovers the isentropic sound speed in addition to the effects of viscous heating and heat flux dynamics. Several test cases for acoustics, the…
▽ More
We present an energy conserving lattice Boltzmann model based on a crystallographic lattice for simulation of weakly compressible flows. The theoretical requirements and the methodology to construct such a model are discussed. We demonstrate that the model recovers the isentropic sound speed in addition to the effects of viscous heating and heat flux dynamics. Several test cases for acoustics, thermal and thermoacoustic flows are simulated to show the accuracy of the proposed model.
△ Less
Submitted 15 September, 2019;
originally announced September 2019.
-
Cavity Enhanced Interference of Orthogonal Modes in a Birefringent Medium
Authors:
Kiran Kolluru,
Subhasish Dutta Gupta
Abstract:
Interference of orthogonal modes in a birefringent crystal is known to lead to interesting physical effects (Solli et al., Phys. Rev. Lett. 91, 143906 (2003)). In this paper we show that the cavity with an intra-cavity rotator can enhance the mixing to the extent of normal mode splitting and avoided crossing depending on the orientation of the rotator with respect to the optic axis of the crystal.…
▽ More
Interference of orthogonal modes in a birefringent crystal is known to lead to interesting physical effects (Solli et al., Phys. Rev. Lett. 91, 143906 (2003)). In this paper we show that the cavity with an intra-cavity rotator can enhance the mixing to the extent of normal mode splitting and avoided crossing depending on the orientation of the rotator with respect to the optic axis of the crystal. A high finesse cavity is shown to be capable of resolving small angles. The results are based on direct calculations of the cavity transmissions along with an analysis of its dispersion relation.
△ Less
Submitted 7 August, 2017;
originally announced August 2017.
-
Query Clustering using Segment Specific Context Embeddings
Authors:
S. K Kolluru,
Prasenjit Mukherjee
Abstract:
This paper presents a novel query clustering approach to capture the broad interest areas of users querying search engines. We make use of recent advances in NLP - word2vec and extend it to get query2vec, vector representations of queries, based on query contexts, obtained from the top search results for the query and use a highly scalable Divide & Merge clustering algorithm on top of the query ve…
▽ More
This paper presents a novel query clustering approach to capture the broad interest areas of users querying search engines. We make use of recent advances in NLP - word2vec and extend it to get query2vec, vector representations of queries, based on query contexts, obtained from the top search results for the query and use a highly scalable Divide & Merge clustering algorithm on top of the query vectors, to get the clusters. We have tried this approach on a variety of segments, including Retail, Travel, Health, Phones and found the clusters to be effective in discovering user's interest areas which have high monetization potential.
△ Less
Submitted 5 November, 2016; v1 submitted 3 August, 2016;
originally announced August 2016.