
Showing 1–37 of 37 results for author: Dhami, D S

Searching in archive cs.
  1. Towards Probabilistic Clearance, Explanation and Optimization

    Authors: Simon Kohaut, Benedict Flade, Devendra Singh Dhami, Julian Eggert, Kristian Kersting

    Abstract: Employing Unmanned Aircraft Systems (UAS) beyond visual line of sight (BVLOS) is an endearing and challenging task. While UAS have the potential to significantly enhance today's logistics and emergency response capabilities, unmanned flying objects above the heads of unprotected pedestrians induce similarly significant safety risks. In this work, we make strides towards improved safety and legal c…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.06107  [pdf, other]

    cs.AI

    EXPIL: Explanatory Predicate Invention for Learning in Games

    Authors: Jingyuan Sha, Hikaru Shindo, Quentin Delfosse, Kristian Kersting, Devendra Singh Dhami

    Abstract: Reinforcement learning (RL) has proven to be a powerful tool for training agents that excel in various games. However, the black-box nature of neural network models often hinders our ability to understand the reasoning behind the agent's actions. Recent research has attempted to address this issue by using the guidance of pretrained neural agents to encode logic-based policies, allowing for interp…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 pages references, 8 figures, 3 tables

  3. Mission Design for Unmanned Aerial Vehicles using Hybrid Probabilistic Logic Program

    Authors: Simon Kohaut, Benedict Flade, Devendra Singh Dhami, Julian Eggert, Kristian Kersting

    Abstract: Advanced Air Mobility (AAM) is a growing field that demands a deep understanding of legal, spatial and temporal concepts in navigation. Hence, any implementation of AAM is forced to deal with the inherent uncertainties of human-inhabited spaces. Enabling growth and innovation requires the creation of a system for safe and robust mission design, i.e., the way we formalize intentions and decide thei…

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2402.15404  [pdf, other]

    cs.LG

    United We Pretrain, Divided We Fail! Representation Learning for Time Series by Pretraining on 75 Datasets at Once

    Authors: Maurice Kraus, Felix Divo, David Steinmann, Devendra Singh Dhami, Kristian Kersting

    Abstract: In natural language processing and vision, pretraining is utilized to learn effective representations. Unfortunately, the success of pretraining does not easily carry over to time series due to potential mismatch between sources and target. Actually, common belief is that multi-dataset pretraining does not work for time series! Au contraire, we introduce a new self-supervised contrastive pretraini…

    Submitted 23 February, 2024; originally announced February 2024.

  5. arXiv:2402.14123  [pdf, other]

    cs.LG cs.AI cs.CV

    DeiSAM: Segment Anything with Deictic Prompting

    Authors: Hikaru Shindo, Manuel Brack, Gopika Sudhakaran, Devendra Singh Dhami, Patrick Schramowski, Kristian Kersting

    Abstract: Large-scale, pre-trained neural networks have demonstrated strong capabilities in various tasks, including zero-shot image segmentation. To identify concrete objects in complex scenes, humans instinctively rely on deictic descriptions in natural language, i.e., referring to something depending on the context such as "The object that is on the desk and behind the cup.". However, deep learning appro…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint

  6. arXiv:2402.08280  [pdf, other]

    cs.AI cs.CV cs.LG

    Pix2Code: Learning to Compose Neural Visual Concepts as Programs

    Authors: Antonia Wüst, Wolfgang Stammer, Quentin Delfosse, Devendra Singh Dhami, Kristian Kersting

    Abstract: The challenge in learning abstract concepts from images in an unsupervised fashion lies in the required integration of visual perception and generalizable relational reasoning. Moreover, the unsupervised nature of this task makes it necessary for human users to be able to understand a model's learnt concepts and potentially revise false behaviours. To tackle both the generalizability and interpret…

    Submitted 13 February, 2024; originally announced February 2024.

  7. arXiv:2310.08377  [pdf, other]

    cs.AI

    Do Not Marginalize Mechanisms, Rather Consolidate!

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Structural causal models (SCMs) are a powerful tool for understanding the complex causal relationships that underlie many real-world systems. As these systems grow in size, the number of variables and complexity of interactions between them does, too. Thus, becoming convoluted and difficult to analyze. This is particularly true in the context of machine learning and artificial intelligence, where…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 19 pages, 8 figures

  8. arXiv:2308.13067  [pdf, other]

    cs.AI cs.CL

    Causal Parrots: Large Language Models May Talk Causality But Are Not Causal

    Authors: Matej Zečević, Moritz Willig, Devendra Singh Dhami, Kristian Kersting

    Abstract: Some argue scale is all what is needed to achieve AI, covering even causal models. We make it clear that large language models (LLMs) cannot be causal and give reason onto why sometimes we might feel otherwise. To this end, we define and exemplify a new subgroup of Structural Causal Model (SCM) that we call meta SCM which encode causal facts about other SCM within their variables. We conjecture th…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Published in Transactions in Machine Learning Research (TMLR) (08/2023). Main paper: 17 pages, References: 3 pages, Appendix: 7 pages. Figures: 5 main, 3 appendix. Tables: 3 main

    Journal ref: Transactions in Machine Learning Research (08/2023)

  9. arXiv:2308.09472  [pdf, other]

    cs.CV cs.AI

    Vision Relation Transformer for Unbiased Scene Graph Generation

    Authors: Gopika Sudhakaran, Devendra Singh Dhami, Kristian Kersting, Stefan Roth

    Abstract: Recent years have seen a growing interest in Scene Graph Generation (SGG), a comprehensive visual scene understanding task that aims to predict entity relationships using a relation encoder-decoder pipeline stacked on top of an object encoder-decoder backbone. Unfortunately, current SGG methods suffer from an information loss regarding the entities local-level cues during the relation encoding pro…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in ICCV 2023

  10. arXiv:2307.00928  [pdf, other]

    cs.LG cs.AI cs.CV

    Learning Differentiable Logic Programs for Abstract Visual Reasoning

    Authors: Hikaru Shindo, Viktor Pfanschilling, Devendra Singh Dhami, Kristian Kersting

    Abstract: Visual reasoning is essential for building intelligent agents that understand the world and perform problem-solving beyond perception. Differentiable forward reasoning has been developed to integrate reasoning with gradient-based machine learning paradigms. However, due to the memory intensity, most existing approaches do not bring the best of the expressivity of first-order logic, excluding a cru…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: under review

  11. arXiv:2306.08397  [pdf, other]

    cs.AI

    Scalable Neural-Probabilistic Answer Set Programming

    Authors: Arseny Skryagin, Daniel Ochs, Devendra Singh Dhami, Kristian Kersting

    Abstract: The goal of combining the robustness of neural networks and the expressiveness of symbolic methods has rekindled the interest in Neuro-Symbolic AI. Deep Probabilistic Programming Languages (DPPLs) have been developed for probabilistic logic programming to be carried out via the probability estimations of deep neural networks. However, recent SOTA DPPL approaches allow only for limited conditional…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 37 pages, 14 figures

  12. arXiv:2306.07743  [pdf, other]

    cs.AI cs.CV cs.LG

    V-LoL: A Diagnostic Dataset for Visual Logical Learning

    Authors: Lukas Helff, Wolfgang Stammer, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Despite the successes of recent developments in visual AI, different shortcomings still exist; from missing exact logical reasoning, to abstract generalization abilities, to understanding complex and noisy scenes. Unfortunately, existing benchmarks, were not designed to capture more than a few of these aspects. Whereas deep learning datasets focus on visually complex data but simple visual reasoni…

    Submitted 3 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  13. arXiv:2212.12570  [pdf, other]

    cs.AI cs.CV

    Pearl Causal Hierarchy on Image Data: Intricacies & Challenges

    Authors: Matej Zečević, Moritz Willig, Devendra Singh Dhami, Kristian Kersting

    Abstract: Many researchers have voiced their support towards Pearl's counterfactual theory of causation as a stepping stone for AI/ML research's ultimate goal of intelligent systems. As in any other growing subfield, patience seems to be a virtue since significant progress on integrating notions from both fields takes time, yet, major challenges such as the lack of ground truth benchmarks or a unified persp…

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Main paper: 9 pages, References: 2 pages. Main paper: 7 figures

  14. arXiv:2211.11650  [pdf, other]

    cs.AI

    Neural Meta-Symbolic Reasoning and Learning

    Authors: Zihan Ye, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Deep neural learning uses an increasing amount of computation and data to solve very specific problems. By stark contrast, human minds solve a wide range of problems using a fixed amount of computation and limited experience. One ability that seems crucial to this kind of general intelligence is meta-reasoning, i.e., our ability to reason about reasoning. To make deep learning do more from less, w…

    Submitted 15 December, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  15. arXiv:2208.13518  [pdf, other]

    cs.AI cs.CL cs.CV cs.LO cs.SC

    LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems

    Authors: Björn Deiseroth, Patrick Schramowski, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Text-to-image models have recently achieved remarkable success with seemingly accurate samples in photo-realistic quality. However as state-of-the-art language models still struggle evaluating precise statements consistently, so do language model based image generation processes. In this work we showcase problems of state-of-the-art text-to-image models like DALL-E with generating accurate samples…

    Submitted 29 August, 2022; originally announced August 2022.

  16. arXiv:2206.12342  [pdf, other]

    cs.LG

    FEATHERS: Federated Architecture and Hyperparameter Search

    Authors: Jonas Seng, Pooja Prasad, Martin Mundt, Devendra Singh Dhami, Kristian Kersting

    Abstract: Deep neural architectures have profound impact on achieved performance in many of today's AI tasks, yet, their design still heavily relies on human prior knowledge and experience. Neural architecture search (NAS) together with hyperparameter optimization (HO) helps to reduce this dependence. However, state of the art NAS and HO rapidly become infeasible with increasing amount of data being stored…

    Submitted 27 March, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Main paper: 8 pages, References: 2 pages, Supplement: 4.5 pages, Main paper: 3 figures, 2 tables, 1 algorithm, Supplement: 2 figure, 4 algorithms, extended previous version by Differential Privacy, theoretical results and more experiments. Updated author list as it was incomplete

  17. arXiv:2206.10591  [pdf, other]

    cs.AI cs.CL cs.LG

    Can Foundation Models Talk Causality?

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Foundation models are subject to an ongoing heated debate, leaving open the question of progress towards AGI and dividing the community into two camps: the ones who see the arguably impressive results as evidence to the scaling hypothesis, and the others who are worried about the lack of interpretability and reasoning capabilities. By investigating to which extent causal representations might be c…

    Submitted 23 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 6 pages, References: 1.5 pages, Supplement: 11.5 pages. Main paper: 4 figures, Supplement: 3 figures, 8 tables

  18. arXiv:2206.07203  [pdf, other]

    cs.LG

    Attributions Beyond Neural Networks: The Linear Program Case

    Authors: Florian Peter Busch, Matej Zečević, Kristian Kersting, Devendra Singh Dhami

    Abstract: Linear Programs (LPs) have been one of the building blocks in machine learning and have championed recent strides in differentiable optimizers for learning systems. While there exist solvers for even high-dimensional LPs, understanding said high-dimensional solutions poses an orthogonal and unresolved problem. We introduce an approach where we consider neural encodings for LPs that justify the app…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 9.5 pages, References: 2 pages, Supplement: 2.5 pages. Main paper: 5 figures, 2 tables, Supplement: 1 figure

  19. arXiv:2206.07196  [pdf, other]

    cs.LG

    Towards a Solution to Bongard Problems: A Causal Approach

    Authors: Salahedine Youssef, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Even though AI has advanced rapidly in recent years displaying success in solving highly complex problems, the class of Bongard Problems (BPs) yet remain largely unsolved by modern ML techniques. In this paper, we propose a new approach in an attempt to not only solve BPs but also extract meaning out of learned representations. This includes the reformulation of the classical BP into a reinforceme…

    Submitted 23 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 12 pages, References: 2 pages, Supplement: 3 pages. Main paper: 9 figures, Supplement: 3 figures

  20. arXiv:2206.07195  [pdf, other]

    cs.LG

    Tearing Apart NOTEARS: Controlling the Graph Prediction via Variance Manipulation

    Authors: Jonas Seng, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Simulations are ubiquitous in machine learning. Especially in graph learning, simulations of Directed Acyclic Graphs (DAG) are being deployed for evaluating new algorithms. In the literature, it was recently argued that continuous-optimization approaches to structure discovery such as NOTEARS might be exploiting the sortability of the variable's variances in the available data due to their use of…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 5.5 pages, References: 1 page, Supplement: 2 pages. Main paper: 3 figures, Supplement: 1 figure, 1 table

  21. arXiv:2206.07194  [pdf, other]

    cs.LG

    Machines Explaining Linear Programs

    Authors: David Steinmann, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: There has been a recent push in making machine learning models more interpretable so that their performance can be trusted. Although successful, these methods have mostly focused on the deep learning methods while the fundamental optimization methods in machine learning such as linear programs (LP) have been left out. Even if LPs can be considered as whitebox or clearbox models, they are not easy…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 9.5 pages, References: 2.5 pages, Supplement: 6 pages. Main paper: 5 figures, 4 tables, Supplement: 3 figures, 6 tables

  22. arXiv:2203.15274  [pdf, other]

    cs.AI

    Finding Structure and Causality in Linear Programs

    Authors: Matej Zečević, Florian Peter Busch, Devendra Singh Dhami, Kristian Kersting

    Abstract: Linear Programs (LP) are celebrated widely, particularly so in machine learning where they have allowed for effectively solving probabilistic inference tasks or imposing structure on end-to-end learning systems. Their potential might seem depleted but we propose a foundational, causal perspective that reveals intriguing intra- and inter-structure relations for LP components. We conduct a systemati…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Main paper: 5 pages, References: 2 pages, Appendix: 1 page. Figures: 8 main, 1 appendix. Tables: 1 appendix

  23. arXiv:2110.12066  [pdf, other]

    cs.LG stat.ML

    The Causal Loss: Driving Correlation to Imply Causation

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Most algorithms in classical and contemporary machine learning focus on correlation-based dependence between features to drive performance. Although success has been observed in many relevant problems, these algorithms fail when the underlying causality is inconsistent with the assumed relations. We propose a novel model-agnostic loss function called Causal Loss that improves the interventional qu…

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Main paper: 8 pages, References: 2 pages, Appendix: 3 pages. Figures: 4 main, 4 appendix. Tables: 2 main

  24. arXiv:2110.12052  [pdf, other]

    cs.LG cs.AI

    A Taxonomy for Inference in Causal Model Families

    Authors: Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Neurally-parameterized Structural Causal Models in the Pearlian notion to causality, referred to as NCM, were recently introduced as a step towards next-generation learning systems. However, said NCM are only concerned with the learning aspect of causal inference but totally miss out on the architecture aspect. That is, actual causal inference within NCM is intractable in that the NCM won't return…

    Submitted 23 December, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: Main paper: 12 pages, References: 3 pages, Appendix: 4 pages. Figures: 3 main, 2 appendix

  25. arXiv:2110.09383  [pdf, other]

    cs.AI cs.CV cs.LG

    Neuro-Symbolic Forward Reasoning

    Authors: Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Reasoning is an essential part of human intelligence and thus has been a long-standing goal in artificial intelligence research. With the recent success of deep learning, incorporating reasoning with deep learning systems, i.e., neuro-symbolic AI has become a major field of interest. We propose the Neuro-Symbolic Forward Reasoner (NSFR), a new approach for reasoning tasks taking advantage of diffe…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Preprint

  26. arXiv:2110.03395  [pdf, other]

    cs.AI

    SLASH: Embracing Probabilistic Circuits into Neural Answer Set Programming

    Authors: Arseny Skryagin, Wolfgang Stammer, Daniel Ochs, Devendra Singh Dhami, Kristian Kersting

    Abstract: The goal of combining the robustness of neural networks and the expressivity of symbolic methods has rekindled the interest in neuro-symbolic AI. Recent advancements in neuro-symbolic AI often consider specifically-tailored architectures consisting of disjoint neural and symbolic components, and thus do not exhibit desired gains that can be achieved by integrating them into a unifying framework. W…

    Submitted 23 November, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 18 pages, 7 figures and 6 tables

    ACM Class: I.2.5; D.3.2

  27. arXiv:2110.02395  [pdf, other]

    cs.LG

    Causal Explanations of Structural Causal Models

    Authors: Matej Zečević, Devendra Singh Dhami, Constantin A. Rothkopf, Kristian Kersting

    Abstract: In explanatory interactive learning (XIL) the user queries the learner, then the learner explains its answer to the user and finally the loop repeats. XIL is attractive for two reasons, (1) the learner becomes better and (2) the user's trust increases. For both reasons to hold, the learner's explanations must be useful to the user and the user must be allowed to ask useful questions. Ideally, both…

    Submitted 23 December, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Main paper: 9 pages, References: 2.5 pages, Supplement: 12 pages. Main paper: 4 figures, Supplement: 6 figures, 2 tables

  28. arXiv:2109.06587  [pdf, other]

    cs.LG

    Sum-Product-Attention Networks: Leveraging Self-Attention in Probabilistic Circuits

    Authors: Zhongjie Yu, Devendra Singh Dhami, Kristian Kersting

    Abstract: Probabilistic circuits (PCs) have become the de-facto standard for learning and inference in probabilistic modeling. We introduce Sum-Product-Attention Networks (SPAN), a new generative model that integrates probabilistic circuits with Transformers. SPAN uses self-attention to select the most relevant parts of a probabilistic circuit, here sum-product networks, to improve the modeling capability o…

    Submitted 14 September, 2021; originally announced September 2021.

  29. arXiv:2109.04173  [pdf, other]

    cs.LG stat.ML

    Relating Graph Neural Networks to Structural Causal Models

    Authors: Matej Zečević, Devendra Singh Dhami, Petar Veličković, Kristian Kersting

    Abstract: Causality can be described in terms of a structural causal model (SCM) that carries information on the variables of interest and their mechanistic relations. For most processes of interest the underlying SCM will only be partially observable, thus causal inference tries leveraging the exposed. Graph neural networks (GNN) as universal approximators on structured input pose a viable candidate for ca…

    Submitted 22 October, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Main paper: 12 pages, References: 2 pages, Appendix: 13 pages; Main paper: 4 figures, Appendix: 2 figures

  30. arXiv:2105.12697  [pdf, other]

    cs.LG cs.CR

    Structural Causal Models Reveal Confounder Bias in Linear Program Modelling

    Authors: Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: The recent years have been marked by extended research on adversarial attacks, especially on deep neural networks. With this work we intend on posing and investigating the question of whether the phenomenon might be more general in nature, that is, adversarial-style attacks outside classical classification tasks. Specifically, we investigate optimization problems as they constitute a fundamental p…

    Submitted 7 November, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Published at the 15th Asian Conference on Machine Learning (ACML 2023) Journal Track. Main paper: 19 pages, References: 2 pages, Supplement: .5 page. Main paper: 3 figures, 3 tables, Supplement: 1 table

  31. arXiv:2103.10916  [pdf, other]

    cs.LG

    Predicting Drug-Drug Interactions from Heterogeneous Data: An Embedding Approach

    Authors: Devendra Singh Dhami, Siwen Yan, Gautam Kunapuli, David Page, Sriraam Natarajan

    Abstract: Predicting and discovering drug-drug interactions (DDIs) using machine learning has been studied extensively. However, most of the approaches have focused on text data or textual representation of the drug structures. We present the first work that uses multiple data sources such as drug structure images, drug structure string representation and relational representation of drug relationships as t…

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 10 pages, 6 figures, Accepted as a short paper to 'Artificial Intelligence in Medicine 2021'

  32. arXiv:2102.10440  [pdf, other]

    cs.LG

    Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models

    Authors: Matej Zečević, Devendra Singh Dhami, Athresh Karanam, Sriraam Natarajan, Kristian Kersting

    Abstract: While probabilistic models are an important tool for studying causality, doing so suffers from the intractability of inference. As a step towards tractable causal models, we consider the problem of learning interventional distributions using sum-product networks (SPNs) that are over-parameterized by gate functions, e.g., neural networks. Providing an arbitrarily intervened causal graph as input, e…

    Submitted 25 October, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Main paper: 10 pages, References: 3 pages, Appendix: 8 pages. Main paper: 6 figures, Appendix: 5 figures

  33. arXiv:2102.07007  [pdf, other]

    cs.LG

    A Statistical Relational Approach to Learning Distance-based GCNs

    Authors: Devendra Singh Dhami, Siwen Yan, Sriraam Natarajan

    Abstract: We consider the problem of learning distance-based Graph Convolutional Networks (GCNs) for relational data. Specifically, we first embed the original graph into the Euclidean space $\mathbb{R}^m$ using a relational density estimation technique thereby constructing a secondary Euclidean graph. The graph vertices correspond to the target triples and edges denote the Euclidean distances between the t…

    Submitted 12 October, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

    Comments: 8 pages, 5 figures, 4 tables; accepted to STARAI workshop

  34. arXiv:2001.00528  [pdf, other]

    cs.LG stat.ML

    Non-Parametric Learning of Gaifman Models

    Authors: Devendra Singh Dhami, Siwen Yan, Gautam Kunapuli, Sriraam Natarajan

    Abstract: We consider the problem of structure learning for Gaifman models and learn relational features that can be used to derive feature representations from a knowledge base. These relational features are first-order rules that are then partially grounded and counted over local neighborhoods of a Gaifman model to obtain the feature representations. We propose a method for learning these relational featu…

    Submitted 15 January, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: 8 pages, 6 figures

  35. arXiv:1911.06356  [pdf, other]

    cs.LG stat.ML

    Beyond Textual Data: Predicting Drug-Drug Interactions from Molecular Structure Images using Siamese Neural Networks

    Authors: Devendra Singh Dhami, Siwen Yan, Gautam Kunapuli, David Page, Sriraam Natarajan

    Abstract: Predicting and discovering drug-drug interactions (DDIs) is an important problem and has been studied extensively both from medical and machine learning point of view. Almost all of the machine learning approaches have focused on text data or textual representation of the structural data of drugs. We present the first work that uses drug structure images as the input and utilizes a Siamese convolu…

    Submitted 29 June, 2020; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: 9 pages, 9 figures

  36. arXiv:1906.01432  [pdf, other]

    cs.LG cs.AI stat.ML

    Knowledge-augmented Column Networks: Guiding Deep Learning with Advice

    Authors: Mayukh Das, Devendra Singh Dhami, Yang Yu, Gautam Kunapuli, Sriraam Natarajan

    Abstract: Recently, deep models have had considerable success in several tasks, especially with low-level representations. However, effective learning from sparse noisy samples is a major challenge in most deep models, especially in domains with structured representations. Inspired by the proven success of human guided machine learning, we propose Knowledge-augmented Column Networks, a relational deep learn…

    Submitted 31 May, 2019; originally announced June 2019.

    Comments: Presented at 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1904.06950

  37. arXiv:1904.06950  [pdf, other]

    cs.LG cs.AI stat.ML

    Human-Guided Learning of Column Networks: Augmenting Deep Learning with Advice

    Authors: Mayukh Das, Yang Yu, Devendra Singh Dhami, Gautam Kunapuli, Sriraam Natarajan

    Abstract: Recently, deep models have been successfully applied in several applications, especially with low-level representations. However, sparse, noisy samples and structured domains (with multiple objects and interactions) are some of the open challenges in most deep models. Column Networks, a deep architecture, can succinctly capture such domain structure and interactions, but may still be prone to sub-…

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Under Review at 'Machine Learning Journal' (MLJ)