
TAGMol: Target-Aware Gradient-guided Molecule Generation

Authors: Vineeth Dorna, D. Subhalingam, Keshav Kolluru, Shreshth Tuli, Mrityunjay Singh, Saurabh Singal, N. M. Anoop Krishnan, Sayan Ranu

Abstract: 3D generative models have shown significant promise in structure-based drug design (SBDD), particularly in discovering ligands tailored to specific target binding sites. Existing algorithms often focus primarily on ligand-target binding, characterized by binding affinity. Moreover, models trained solely on target-ligand distribution may fall short in addressing the broader objectives of drug discovery, such as the development of novel ligands with desired properties like drug-likeness and synthesizability, underscoring the multifaceted nature of the drug design process. To overcome these challenges, we decouple the problem into molecular generation and property prediction. The latter synergistically guides the diffusion sampling process, facilitating guided diffusion and resulting in the creation of meaningful molecules with the desired properties. We call this guided molecular generation process TAGMol. Through experiments on benchmark datasets, TAGMol demonstrates superior performance compared to state-of-the-art baselines, achieving a 22% improvement in average Vina Score and yielding favorable outcomes in essential auxiliary properties. This establishes TAGMol as a comprehensive framework for drug generation.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.10918 [pdf, other]

GenToC: Leveraging Partially-Labeled Data for Product Attribute-Value Identification

Authors: D. Subhalingam, Keshav Kolluru, Mausam, Saurabh Singal

Abstract: In the e-commerce domain, the accurate extraction of attribute-value pairs from product listings (e.g., Brand: Apple) is crucial for enhancing search and recommendation systems. The automation of this extraction process is challenging due to the vast diversity of product categories and their respective attributes, compounded by the lack of extensive, accurately annotated training datasets and the demand for low latency to meet the real-time needs of e-commerce platforms. To address these challenges, we introduce GenToC, a novel two-stage model for extracting attribute-value pairs from product titles. GenToC is designed to train with partially-labeled data, leveraging incomplete attribute-value pairs and obviating the need for a fully annotated dataset. Moreover, we introduce a bootstrapping method that enables GenToC to progressively refine and expand its training dataset. This enhancement substantially improves the quality of data available for training other neural network models that are typically faster but are inherently less capable than GenToC in terms of their capacity to handle partially-labeled data. By supplying an enriched dataset for training, GenToC significantly advances the performance of these alternative models, making them more suitable for real-time deployment. Our results highlight the unique capability of GenToC to learn from a limited set of labeled data and to contribute to the training of more efficient models, marking a significant leap forward in the automated extraction of attribute-value pairs from product titles.
GenToC has been successfully integrated into India's largest B2B e-commerce platform, IndiaMART.com, achieving a significant increase of 21.1% in recall over the existing deployed system while maintaining a high precision of 89.5% in this challenging task.

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2211.06959 [pdf, other]

mOKB6: A Multilingual Open Knowledge Base Completion Benchmark

Authors: Shubham Mittal, Keshav Kolluru, Soumen Chakrabarti, Mausam

Abstract: Automated completion of open knowledge bases (Open KBs), which are constructed from triples of the form (subject phrase, relation phrase, object phrase) obtained via an open information extraction (Open IE) system, is useful for discovering novel facts that may not be directly present in the text. However, research in Open KB completion (Open KBC) has so far been limited to resource-rich languages like English. Using the latest advances in multilingual Open IE, we construct the first multilingual Open KBC dataset, called mOKB6, containing facts from Wikipedia in six languages (including English). Improving the previous Open KB construction pipeline by doing multilingual coreference resolution and keeping only entity-linked triples, we create a dense Open KB. We experiment with several models for the task and observe a consistent benefit of combining languages with the help of shared embedding space as well as translations of facts. We also observe that current multilingual models struggle to remember facts seen in languages of different scripts.

Submitted 28 May, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

Comments: camera-ready version for ACL 2023

arXiv:2210.13039 [pdf, other]

"Covid vaccine is against Covid but Oxford vaccine is made at Oxford!" Semantic Interpretation of Proper Noun Compounds

Authors: Keshav Kolluru, Gabriel Stanovsky, Mausam

Abstract: Proper noun compounds, e.g., "Covid vaccine", convey information in a succinct manner (a "Covid vaccine" is a "vaccine that immunizes against the Covid disease"). These are commonly used in short-form domains, such as news headlines, but are largely ignored in information-seeking applications. To address this limitation, we release a new manually annotated dataset, ProNCI, consisting of 22.5K proper noun compounds along with their free-form semantic interpretations. ProNCI is 60 times larger than prior noun compound datasets and also includes non-compositional examples, which have not been previously explored. We experiment with various neural models for automatically generating the semantic interpretations from proper noun compounds, ranging from few-shot prompting to supervised learning, with varying degrees of knowledge about the constituent nouns. We find that adding targeted knowledge, particularly about the common noun, results in performance gains of up to 2.8%. Finally, we integrate our model-generated interpretations with an existing Open IE system and observe a 7.5% increase in yield at a precision of 85%. The dataset and code are available at https://github.com/dair-iitd/pronci.

Submitted 24 October, 2022; originally announced October 2022.

Comments: Accepted at EMNLP'22

arXiv:2203.12946 [pdf, other]

doi 10.1103/PhysRevE.106.055307

Essentially entropic lattice Boltzmann model: Theory and simulations

Authors: Mohammad Atif, Praveen Kumar Kolluru, Santosh Ansumali

Abstract: We present a detailed description of the essentially entropic lattice Boltzmann model. The entropic lattice Boltzmann model guarantees unconditional numerical stability by iteratively solving the nonlinear entropy evolution equation. In this paper we explain the construction of closed-form analytic solutions to this equation. We demonstrate that near equilibrium this exact solution reduces to the standard lattice Boltzmann model. We consider a few test cases to show that the exact solution does not exhibit any significant deviation from the iterative solution. We also extend the analytical solution for the ES-BGK model to remove the limitation on the Prandtl number for heat transfer problems. The simplicity of the exact solution removes the computational overhead and algorithmic complexity associated with the entropic lattice Boltzmann models.

Submitted 24 March, 2022; originally announced March 2022.

arXiv:2201.05280 [pdf, ps, other]

doi 10.1017/jfm.2023.323

Reduced kinetic model of polyatomic gases

Authors: Praveen Kumar Kolluru, Mohammad Atif, Santosh Ansumali

Abstract: Kinetic models of polyatomic gas typically account for the internal degrees of freedom at the level of the two-particle distribution function. However, close to the hydrodynamic limit, the internal (rotational) degrees of freedom tend to be well represented just by rotational kinetic energy density. We account for the rotational energy by augmenting the Ellipsoidal-statistical BGK (ES-BGK) model, an extension of the Bhatnagar-Gross-Krook (BGK) model, at the level of the single-particle distribution function with an advection-diffusion-relaxation equation for the rotational energy. This reduced model respects the H theorem and recovers the compressible hydrodynamics for polyatomic gases as its macroscopic limit. As required for a polyatomic gas model, this extension of the ES-BGK model not only has the correct specific heat ratio but also allows for three independent tunable transport coefficients: thermal conductivity, shear viscosity, and bulk viscosity. We illustrate the effectiveness of the model via a lattice Boltzmann method implementation.

Submitted 13 January, 2022; originally announced January 2022.

arXiv:2109.14364 [pdf, other]

Multilingual Fact Linking

Authors: Keshav Kolluru, Martin Rezk, Pat Verga, William W. Cohen, Partha Talukdar

Abstract: Knowledge-intensive NLP tasks can benefit from linking natural language text with facts from a Knowledge Graph (KG). Although facts themselves are language-agnostic, the fact labels (i.e., language-specific representation of the fact) in the KG are often present only in a few languages. This makes it challenging to link KG facts to sentences in languages other than the limited set of languages. To address this problem, we introduce the task of Multilingual Fact Linking (MFL), where the goal is to link a fact expressed in a sentence to the corresponding fact in the KG, even when the fact label in the KG is not available in the language of the sentence. To facilitate research in this area, we present a new evaluation dataset, IndicLink. This dataset contains 11,293 linked WikiData facts and 6,429 sentences spanning English and six Indian languages. We propose a Retrieval+Generation model, ReFCoG, that can scale to millions of KG facts by combining Dual Encoder based retrieval with a Seq2Seq based generation model which is constrained to output only valid KG facts. ReFCoG outperforms standard Retrieval+Re-ranking models by 10.7 pts in Precision@1. In spite of this gain, the model achieves an overall score of 52.1, showing ample scope for improvement in the task. ReFCoG code and IndicLink data are available at https://github.com/SaiKeshav/mfl

Submitted 30 September, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

Comments: AKBC 2021

arXiv:2104.08741 [pdf, other]

CEAR: Cross-Entity Aware Reranker for Knowledge Base Completion

Authors: Keshav Kolluru, Mayank Singh Chauhan, Yatin Nandwani, Parag Singla, Mausam

Abstract: Pre-trained language models (LMs) like BERT have been shown to store factual knowledge about the world. This knowledge can be used to augment the information present in Knowledge Bases, which tend to be incomplete. However, prior attempts at using BERT for the task of Knowledge Base Completion (KBC) resulted in performance worse than embedding-based techniques that rely only on the graph structure. In this work we develop a novel model, Cross-Entity Aware Reranker (CEAR), that uses BERT to re-rank the output of existing KBC models with cross-entity attention. Unlike prior work that scores each entity independently, CEAR uses BERT to score the entities together, which is effective for exploiting its factual knowledge. CEAR achieves a new state of the art for the OLPBench dataset.

Submitted 27 January, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

Comments: We found a bug in the code that invalidates the reported results for FB15k-237 and WN18RR. The results for OLPBench still hold. We are in the process of updating the paper.

arXiv:2010.03147 [pdf, other]

OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction

Authors: Keshav Kolluru, Vaibhav Adlakha, Samarth Aggarwal, Mausam, Soumen Chakrabarti

Abstract: A recent state-of-the-art neural open information extraction (OpenIE) system generates extractions iteratively, requiring repeated encoding of partial outputs. This comes at a significant computational cost. On the other hand, sequence labeling approaches for OpenIE are much faster, but worse in extraction quality. In this paper, we bridge this trade-off by presenting an iterative labeling-based system that establishes a new state of the art for OpenIE, while extracting 10x faster. This is achieved through a novel Iterative Grid Labeling (IGL) architecture, which treats OpenIE as a 2-D grid labeling task. We improve its performance further by applying coverage (soft) constraints on the grid at training time. Moreover, on observing that the best OpenIE systems falter at handling coordination structures, our OpenIE system also incorporates a new coordination analyzer built with the same IGL architecture. This IGL based coordination analyzer helps our OpenIE system handle complicated coordination structures, while also establishing a new state of the art on the task of coordination analysis, with a 12.3 pts improvement in F1 over previous analyzers. Our OpenIE system, OpenIE6, beats the previous systems by as much as 4 pts in F1, while being much faster.

Submitted 7 October, 2020; originally announced October 2020.

Comments: EMNLP 2020 (Long)

arXiv:2009.06404 [pdf, other]

Lattice Boltzmann Method for wave propagation in elastic solids with a regular lattice: Theoretical analysis and validation

Authors: Maxime Escande, Praveen Kumar Kolluru, Louis Marie Cléon, Pierre Sagaut

Abstract: The von Neumann stability analysis along with a Chapman-Enskog analysis is proposed for a single-relaxation-time lattice Boltzmann Method (LBM) for wave propagation in isotropic linear elastic solids, using a regular D2Q9 lattice. Different boundary conditions are considered: periodic, free surface, rigid interface. An original absorbing layer model is proposed to prevent spurious wave reflection at domain boundaries. The present method is assessed considering several test cases. First, a spatial Gaussian force modulated in time by a Ricker wavelet is used as a source. Comparisons are made with results obtained using a classical Fourier spectral method. Both P and S waves are shown to be very accurately predicted. The case of Rayleigh surface waves is then addressed to check the accuracy of the method.

Submitted 10 September, 2020; originally announced September 2020.

arXiv:2005.08178 [pdf, other]

IMoJIE: Iterative Memory-Based Joint Open Information Extraction

Authors: Keshav Kolluru, Samarth Aggarwal, Vipul Rathore, Mausam, Soumen Chakrabarti

Abstract: While traditional systems for Open Information Extraction were statistical and rule-based, recently neural models have been introduced for the task. Our work builds upon CopyAttention, a sequence generation OpenIE model (Cui et al., 2018). Our analysis reveals that CopyAttention produces a constant number of extractions per sentence, and its extracted tuples often express redundant information. We present IMoJIE, an extension to CopyAttention, which produces the next extraction conditioned on all previously extracted tuples. This approach overcomes both shortcomings of CopyAttention, resulting in a variable number of diverse extractions per sentence. We train IMoJIE on training data bootstrapped from extractions of several non-neural systems, which have been automatically filtered to reduce redundancy and noise. IMoJIE outperforms CopyAttention by about 18 F1 pts, and a BERT-based strong baseline by 2 F1 pts, establishing a new state of the art for the task.

Submitted 17 May, 2020; originally announced May 2020.

Journal ref: ACL 2020, Long paper

arXiv:2005.00159 [pdf, other]

Why and when should you pool? Analyzing Pooling in Recurrent Architectures

Authors: Pratyush Maini, Keshav Kolluru, Danish Pruthi, Mausam

Abstract: Pooling-based recurrent neural architectures consistently outperform their counterparts without pooling. However, the reasons for their enhanced performance are largely unexamined. In this work, we examine three commonly used pooling techniques (mean-pooling, max-pooling, and attention), and propose max-attention, a novel variant that effectively captures interactions among predictive tokens in a sentence. We find that pooling-based architectures substantially differ from their non-pooling equivalents in their learning ability and positional biases--which elucidate their performance benefits. By analyzing the gradient propagation, we discover that pooling facilitates better gradient flow compared to BiLSTMs. Further, we expose how BiLSTMs are positionally biased towards tokens in the beginning and the end of a sequence. Pooling alleviates such biases. Consequently, we identify settings where pooling offers large benefits: (i) in low resource scenarios, and (ii) when important words lie towards the middle of the sentence. Among the pooling techniques studied, max-attention is the most effective, resulting in significant performance gains on several text classification tasks.

Submitted 27 October, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

Comments: Accepted to Findings of EMNLP 2020, to be presented at BlackBoxNLP. Updated Version

arXiv:1909.08406 [pdf, other]

doi 10.1103/PhysRevE.101.013309

Lattice Boltzmann model for weakly compressible flows

Authors: Praveen Kumar Kolluru, Mohammad Atif, Manjusha Namburi, Santosh Ansumali

Abstract: We present an energy conserving lattice Boltzmann model based on a crystallographic lattice for simulation of weakly compressible flows. The theoretical requirements and the methodology to construct such a model are discussed. We demonstrate that the model recovers the isentropic sound speed in addition to the effects of viscous heating and heat flux dynamics. Several test cases for acoustics, thermal and thermoacoustic flows are simulated to show the accuracy of the proposed model.

Submitted 15 September, 2019; originally announced September 2019.

Journal ref: Phys. Rev. E 101, 013309 (2020)

arXiv:1708.02006 [pdf, ps, other]

doi 10.1016/j.optcom.2017.11.056

Cavity Enhanced Interference of Orthogonal Modes in a Birefringent Medium

Authors: Kiran Kolluru, Subhasish Dutta Gupta

Abstract: Interference of orthogonal modes in a birefringent crystal is known to lead to interesting physical effects (Solli et al., Phys. Rev. Lett. 91, 143906 (2003)). In this paper we show that the cavity with an intra-cavity rotator can enhance the mixing to the extent of normal mode splitting and avoided crossing depending on the orientation of the rotator with respect to the optic axis of the crystal. A high finesse cavity is shown to be capable of resolving small angles. The results are based on direct calculations of the cavity transmissions along with an analysis of its dispersion relation.

Submitted 7 August, 2017; originally announced August 2017.

arXiv:1608.01247 [pdf, other]

Query Clustering using Segment Specific Context Embeddings

Authors: S. K Kolluru, Prasenjit Mukherjee

Abstract: This paper presents a novel query clustering approach to capture the broad interest areas of users querying search engines. We make use of a recent advance in NLP, word2vec, and extend it to obtain query2vec: vector representations of queries based on query contexts, which are obtained from the top search results for the query. We then apply a highly scalable Divide & Merge clustering algorithm on top of the query vectors to obtain the clusters. We have tried this approach on a variety of segments, including Retail, Travel, Health, and Phones, and found the clusters to be effective in discovering users' interest areas which have high monetization potential.

Submitted 5 November, 2016; v1 submitted 3 August, 2016; originally announced August 2016.

Comments: 9 pages

Showing 1–15 of 15 results for author: Kolluru, K