Showing 1–17 of 17 results for author: Zečević, M

Searching in archive cs.
  1. arXiv:2310.08377  [pdf, other]

    cs.AI

    Do Not Marginalize Mechanisms, Rather Consolidate!

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Structural causal models (SCMs) are a powerful tool for understanding the complex causal relationships that underlie many real-world systems. As these systems grow in size, the number of variables and complexity of interactions between them does, too. Thus, becoming convoluted and difficult to analyze. This is particularly true in the context of machine learning and artificial intelligence, where…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 19 pages, 8 figures

  2. arXiv:2308.13067  [pdf, other]

    cs.AI cs.CL

    Causal Parrots: Large Language Models May Talk Causality But Are Not Causal

    Authors: Matej Zečević, Moritz Willig, Devendra Singh Dhami, Kristian Kersting

    Abstract: Some argue scale is all what is needed to achieve AI, covering even causal models. We make it clear that large language models (LLMs) cannot be causal and give reason onto why sometimes we might feel otherwise. To this end, we define and exemplify a new subgroup of Structural Causal Model (SCM) that we call meta SCM which encode causal facts about other SCM within their variables. We conjecture th…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Published in Transactions in Machine Learning Research (TMLR) (08/2023). Main paper: 17 pages, References: 3 pages, Appendix: 7 pages. Figures: 5 main, 3 appendix. Tables: 3 main

    Journal ref: Transactions in Machine Learning Research (08/2023)

  3. arXiv:2212.12575  [pdf, other]

    cs.AI

    Continual Causal Abstractions

    Authors: Matej Zečević, Moritz Willig, Jonas Seng, Florian Peter Busch

    Abstract: This short paper discusses continually updated causal abstractions as a potential direction of future research. The key idea is to revise the existing level of causal abstraction to a different level of detail that is both consistent with the history of observed data and more effective in solving a given task.

    Submitted 6 January, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: Main paper: 3 pages, 1 figure. References: 1 page

  4. arXiv:2212.12570  [pdf, other]

    cs.AI cs.CV

    Pearl Causal Hierarchy on Image Data: Intricacies & Challenges

    Authors: Matej Zečević, Moritz Willig, Devendra Singh Dhami, Kristian Kersting

    Abstract: Many researchers have voiced their support towards Pearl's counterfactual theory of causation as a stepping stone for AI/ML research's ultimate goal of intelligent systems. As in any other growing subfield, patience seems to be a virtue since significant progress on integrating notions from both fields takes time, yet, major challenges such as the lack of ground truth benchmarks or a unified persp…

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Main paper: 9 pages, References: 2 pages. Main paper: 7 figures

  5. arXiv:2212.12560  [pdf, other]

    cs.AI

    On How AI Needs to Change to Advance the Science of Drug Discovery

    Authors: Kieran Didi, Matej Zečević

    Abstract: Research around AI for Science has seen significant success since the rise of deep learning models over the past decade, even with longstanding challenges such as protein structure prediction. However, this fast development inevitably made their flaws apparent -- especially in domains of reasoning where understanding the cause-effect relationship is important. One such domain is drug discovery, in…

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Main paper: 6 pages, References: 1.5 pages. Main paper: 3 figures

  6. arXiv:2206.10591  [pdf, other]

    cs.AI cs.CL cs.LG

    Can Foundation Models Talk Causality?

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Foundation models are subject to an ongoing heated debate, leaving open the question of progress towards AGI and dividing the community into two camps: the ones who see the arguably impressive results as evidence to the scaling hypothesis, and the others who are worried about the lack of interpretability and reasoning capabilities. By investigating to which extent causal representations might be c…

    Submitted 23 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 6 pages, References: 1.5 pages, Supplement: 11.5 pages. Main paper: 4 figures, Supplement: 3 figures, 8 tables

  7. arXiv:2206.07203  [pdf, other]

    cs.LG

    Attributions Beyond Neural Networks: The Linear Program Case

    Authors: Florian Peter Busch, Matej Zečević, Kristian Kersting, Devendra Singh Dhami

    Abstract: Linear Programs (LPs) have been one of the building blocks in machine learning and have championed recent strides in differentiable optimizers for learning systems. While there exist solvers for even high-dimensional LPs, understanding said high-dimensional solutions poses an orthogonal and unresolved problem. We introduce an approach where we consider neural encodings for LPs that justify the app…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 9.5 pages, References: 2 pages, Supplement: 2.5 pages. Main paper: 5 figures, 2 tables, Supplement: 1 figure

  8. arXiv:2206.07196  [pdf, other]

    cs.LG

    Towards a Solution to Bongard Problems: A Causal Approach

    Authors: Salahedine Youssef, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Even though AI has advanced rapidly in recent years displaying success in solving highly complex problems, the class of Bongard Problems (BPs) yet remain largely unsolved by modern ML techniques. In this paper, we propose a new approach in an attempt to not only solve BPs but also extract meaning out of learned representations. This includes the reformulation of the classical BP into a reinforceme…

    Submitted 23 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 12 pages, References: 2 pages, Supplement: 3 pages. Main paper: 9 figures, Supplement: 3 figures

  9. arXiv:2206.07195  [pdf, other]

    cs.LG

    Tearing Apart NOTEARS: Controlling the Graph Prediction via Variance Manipulation

    Authors: Jonas Seng, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Simulations are ubiquitous in machine learning. Especially in graph learning, simulations of Directed Acyclic Graphs (DAG) are being deployed for evaluating new algorithms. In the literature, it was recently argued that continuous-optimization approaches to structure discovery such as NOTEARS might be exploiting the sortability of the variable's variances in the available data due to their use of…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 5.5 pages, References: 1 page, Supplement: 2 pages. Main paper: 3 figures, Supplement: 1 figure, 1 table

  10. arXiv:2206.07194  [pdf, other]

    cs.LG

    Machines Explaining Linear Programs

    Authors: David Steinmann, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: There has been a recent push in making machine learning models more interpretable so that their performance can be trusted. Although successful, these methods have mostly focused on the deep learning methods while the fundamental optimization methods in machine learning such as linear programs (LP) have been left out. Even if LPs can be considered as whitebox or clearbox models, they are not easy…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 9.5 pages, References: 2.5 pages, Supplement: 6 pages. Main paper: 5 figures, 4 tables, Supplement: 3 figures, 6 tables

  11. arXiv:2203.15274  [pdf, other]

    cs.AI

    Finding Structure and Causality in Linear Programs

    Authors: Matej Zečević, Florian Peter Busch, Devendra Singh Dhami, Kristian Kersting

    Abstract: Linear Programs (LP) are celebrated widely, particularly so in machine learning where they have allowed for effectively solving probabilistic inference tasks or imposing structure on end-to-end learning systems. Their potential might seem depleted but we propose a foundational, causal perspective that reveals intriguing intra- and inter-structure relations for LP components. We conduct a systemati…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Main paper: 5 pages, References: 2 pages, Appendix: 1 page. Figures: 8 main, 1 appendix. Tables: 1 appendix

  12. arXiv:2110.12066  [pdf, other]

    cs.LG stat.ML

    The Causal Loss: Driving Correlation to Imply Causation

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Most algorithms in classical and contemporary machine learning focus on correlation-based dependence between features to drive performance. Although success has been observed in many relevant problems, these algorithms fail when the underlying causality is inconsistent with the assumed relations. We propose a novel model-agnostic loss function called Causal Loss that improves the interventional qu…

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Main paper: 8 pages, References: 2 pages, Appendix: 3 pages. Figures: 4 main, 4 appendix. Tables: 2 main

  13. arXiv:2110.12052  [pdf, other]

    cs.LG cs.AI

    A Taxonomy for Inference in Causal Model Families

    Authors: Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Neurally-parameterized Structural Causal Models in the Pearlian notion to causality, referred to as NCM, were recently introduced as a step towards next-generation learning systems. However, said NCM are only concerned with the learning aspect of causal inference but totally miss out on the architecture aspect. That is, actual causal inference within NCM is intractable in that the NCM won't return…

    Submitted 23 December, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: Main paper: 12 pages, References: 3 pages, Appendix: 4 pages. Figures: 3 main, 2 appendix

  14. arXiv:2110.02395  [pdf, other]

    cs.LG

    Causal Explanations of Structural Causal Models

    Authors: Matej Zečević, Devendra Singh Dhami, Constantin A. Rothkopf, Kristian Kersting

    Abstract: In explanatory interactive learning (XIL) the user queries the learner, then the learner explains its answer to the user and finally the loop repeats. XIL is attractive for two reasons, (1) the learner becomes better and (2) the user's trust increases. For both reasons to hold, the learner's explanations must be useful to the user and the user must be allowed to ask useful questions. Ideally, both…

    Submitted 23 December, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Main paper: 9 pages, References: 2.5 pages, Supplement: 12 pages. Main paper: 4 figures, Supplement: 6 figures, 2 tables

  15. arXiv:2109.04173  [pdf, other]

    cs.LG stat.ML

    Relating Graph Neural Networks to Structural Causal Models

    Authors: Matej Zečević, Devendra Singh Dhami, Petar Veličković, Kristian Kersting

    Abstract: Causality can be described in terms of a structural causal model (SCM) that carries information on the variables of interest and their mechanistic relations. For most processes of interest the underlying SCM will only be partially observable, thus causal inference tries leveraging the exposed. Graph neural networks (GNN) as universal approximators on structured input pose a viable candidate for ca…

    Submitted 22 October, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Main paper: 12 pages, References: 2 pages, Appendix: 13 pages; Main paper: 4 figures, Appendix: 2 figures

  16. arXiv:2105.12697  [pdf, other]

    cs.LG cs.CR

    Structural Causal Models Reveal Confounder Bias in Linear Program Modelling

    Authors: Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: The recent years have been marked by extended research on adversarial attacks, especially on deep neural networks. With this work we intend on posing and investigating the question of whether the phenomenon might be more general in nature, that is, adversarial-style attacks outside classical classification tasks. Specifically, we investigate optimization problems as they constitute a fundamental p…

    Submitted 7 November, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Published at the 15th Asian Conference on Machine Learning (ACML 2023) Journal Track. Main paper: 19 pages, References: 2 pages, Supplement: 0.5 pages. Main paper: 3 figures, 3 tables, Supplement: 1 table

  17. arXiv:2102.10440  [pdf, other]

    cs.LG

    Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models

    Authors: Matej Zečević, Devendra Singh Dhami, Athresh Karanam, Sriraam Natarajan, Kristian Kersting

    Abstract: While probabilistic models are an important tool for studying causality, doing so suffers from the intractability of inference. As a step towards tractable causal models, we consider the problem of learning interventional distributions using sum-product networks (SPNs) that are over-parameterized by gate functions, e.g., neural networks. Providing an arbitrarily intervened causal graph as input, e…

    Submitted 25 October, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Main paper: 10 pages, References: 3 pages, Appendix: 8 pages. Main paper: 6 figures, Appendix: 5 figures