Showing 1–50 of 82 results for author: Gavves, E

  1. arXiv:2406.11252  [pdf, other]

    cs.CV

    Mining Open Semantics from CLIP: A Relation Transition Perspective for Few-Shot Learning

    Authors: Cilin Yan, Haochen Wang, Xiaolong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves

    Abstract: Contrastive Vision-Language Pre-training (CLIP) demonstrates impressive zero-shot capability. The key to improve the adaptation of CLIP to downstream task with few exemplars lies in how to effectively model and transfer the useful knowledge embedded in CLIP. Previous work mines the knowledge typically based on the limited visual samples and close-set semantics (i.e., within target category set of d…

    Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.06660  [pdf, other]

    cs.LG cs.AI cs.NE

    Space-Time Continuous PDE Forecasting using Equivariant Neural Fields

    Authors: David M. Knigge, David R. Wessels, Riccardo Valperga, Samuele Papa, Jan-Jakob Sonke, Efstratios Gavves, Erik J. Bekkers

    Abstract: Recently, Conditional Neural Fields (NeFs) have emerged as a powerful modelling paradigm for PDEs, by learning solutions as flows in the latent space of the Conditional NeF. Although benefiting from favourable properties of NeFs such as grid-agnosticity and space-time-continuous dynamics modelling, this approach limits the ability to impose known constraints of the PDE on the solutions -- e.g. sym…

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2406.05753  [pdf, other]

    cs.LG cs.AI cs.CV

    Grounding Continuous Representations in Geometry: Equivariant Neural Fields

    Authors: David R Wessels, David M Knigge, Samuele Papa, Riccardo Valperga, Sharvaree Vadgama, Efstratios Gavves, Erik J Bekkers

    Abstract: Recently, Neural Fields have emerged as a powerful modelling paradigm to represent continuous signals. In a conditional neural field, a field is represented by a latent variable that conditions the NeF, whose parametrisation is otherwise shared over an entire dataset. We propose Equivariant Neural Fields based on cross attention transformers, in which NeFs are conditioned on a geometric conditioni…

    Submitted 17 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: Preprint for Neurips submission

  4. arXiv:2406.03818  [pdf, other]

    cs.CV cs.LG cs.MA cs.SC

    Amortized Equation Discovery in Hybrid Dynamical Systems

    Authors: Yongtuo Liu, Sara Magliacane, Miltiadis Kofinas, Efstratios Gavves

    Abstract: Hybrid dynamical systems are prevalent in science and engineering to express complex systems with continuous and discrete states. To learn the laws of systems, all previous methods for equation discovery in hybrid systems follow a two-stage paradigm, i.e. they first group time series into small cluster fragments and then discover equations in each fragment separately through methods in non-hybrid…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 24 pages, 5 figures, accepted by International Conference on Machine Learning (ICML) 2024

  5. arXiv:2406.02490  [pdf, other]

    cs.LG stat.ML

    Ai-Sampler: Adversarial Learning of Markov kernels with involutive maps

    Authors: Evgenii Egorov, Riccardo Valperga, Efstratios Gavves

    Abstract: Markov chain Monte Carlo methods have become popular in statistics as versatile techniques to sample from complicated probability distributions. In this work, we propose a method to parameterize and train transition kernels of Markov chains to achieve efficient sampling and good mixing. This training procedure minimizes the total variation distance between the stationary distribution of the chain…

    Submitted 4 June, 2024; originally announced June 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  6. arXiv:2405.19978  [pdf, other]

    cs.LG stat.ML

    Domain Adaptation with Cauchy-Schwarz Divergence

    Authors: Wenzhe Yin, Shujian Yu, Yicong Lin, Jie Liu, Jan-Jakob Sonke, Efstratios Gavves

    Abstract: Domain adaptation aims to use training data from one or multiple source domains to learn a hypothesis that can be generalized to a different, but related, target domain. As such, having a reliable measure for evaluating the discrepancy of both marginal and conditional distributions is crucial. We introduce Cauchy-Schwarz (CS) divergence to the problem of unsupervised domain adaptation (UDA). The C…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by UAI-24

  7. arXiv:2404.17202  [pdf, other]

    cs.CV

    Self-supervised visual learning in the low-data regime: a comparative evaluation

    Authors: Sotirios Konstantakos, Despina Ioanna Chalkiadaki, Ioannis Mademlis, Yuki M. Asano, Efstratios Gavves, Georgios Th. Papadopoulos

    Abstract: Self-Supervised Learning (SSL) is a valuable and robust training methodology for contemporary Deep Neural Networks (DNNs), enabling unsupervised pretraining on a 'pretext task' that does not require ground-truth labels/annotation. This allows efficient representation learning from massive amounts of unlabeled training data, which in turn leads to increased accuracy in a 'downstream task' by exploi…

    Submitted 26 April, 2024; originally announced April 2024.

  8. arXiv:2404.14082  [pdf, other]

    cs.AI

    Mechanistic Interpretability for AI Safety -- A Review

    Authors: Leonard Bereska, Efstratios Gavves

    Abstract: Understanding AI systems' inner workings is critical for ensuring value alignment and safety. This review explores mechanistic interpretability: reverse-engineering the computational mechanisms and representations learned by neural networks into human-understandable algorithms and concepts to provide a granular, causal understanding. We establish foundational concepts such as features encoding kno…

    Submitted 22 April, 2024; originally announced April 2024.

  9. arXiv:2403.12143  [pdf, other]

    cs.LG cs.AI stat.ML

    Graph Neural Networks for Learning Equivariant Representations of Neural Networks

    Authors: Miltiadis Kofinas, Boris Knyazev, Yan Zhang, Yunlu Chen, Gertjan J. Burghouts, Efstratios Gavves, Cees G. M. Snoek, David W. Zhang

    Abstract: Neural networks that process the parameters of other neural networks find applications in domains as diverse as classifying implicit neural representations, generating neural network weights, and predicting generalization errors. However, existing approaches either overlook the inherent permutation symmetry in the neural network or rely on intricate weight-sharing patterns to achieve equivariance,…

    Submitted 20 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: In ICLR 2024. Source code: https://github.com/mkofinas/neural-graphs

  10. arXiv:2402.13077  [pdf, other]

    cs.LG cs.AI cs.NE

    Mechanistic Neural Networks for Scientific Machine Learning

    Authors: Adeel Pervez, Francesco Locatello, Efstratios Gavves

    Abstract: This paper presents Mechanistic Neural Networks, a neural network design for machine learning applications in the sciences. It incorporates a new Mechanistic Block in standard architectures to explicitly learn governing differential equations as representations, revealing the underlying dynamics of data and enhancing interpretability and efficiency in data modeling. Central to our approach is a no…

    Submitted 20 February, 2024; originally announced February 2024.

  11. arXiv:2401.16051  [pdf, other]

    cs.CV

    Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation

    Authors: Jie Liu, Wenzhe Yin, Haochen Wang, Yunlu Chen, Jan-Jakob Sonke, Efstratios Gavves

    Abstract: Few-shot point cloud segmentation seeks to generate per-point masks for previously unseen categories, using only a minimal set of annotated point clouds as reference. Existing prototype-based methods rely on support prototypes to guide the segmentation of query point clouds, but they encounter challenges when significant object variations exist between the support prototypes and query features. In…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted in 3DV2024, code is available at https://github.com/jliu4ai/DPA

  12. arXiv:2312.10531  [pdf, other]

    cs.CV

    How to Train Neural Field Representations: A Comprehensive Study and Benchmark

    Authors: Samuele Papa, Riccardo Valperga, David Knigge, Miltiadis Kofinas, Phillip Lippe, Jan-Jakob Sonke, Efstratios Gavves

    Abstract: Neural fields (NeFs) have recently emerged as a versatile method for modeling signals of various modalities, including images, shapes, and scenes. Subsequently, a number of works have explored the use of NeFs as representations for downstream tasks, e.g. classifying an image based on the parameters of a NeF that has been fit to it. However, the impact of the NeF hyperparameters on their quality as…

    Submitted 5 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

  13. arXiv:2312.08895  [pdf, other]

    cs.CV

    Motion Flow Matching for Human Motion Synthesis and Editing

    Authors: Vincent Tao Hu, Wenzhe Yin, Pingchuan Ma, Yunlu Chen, Basura Fernando, Yuki M Asano, Efstratios Gavves, Pascal Mettes, Bjorn Ommer, Cees G. M. Snoek

    Abstract: Human motion synthesis is a fundamental task in computer animation. Recent methods based on diffusion models or GPT structure demonstrate commendable performance but exhibit drawbacks in terms of slow sampling speeds and error accumulation. In this paper, we propose Motion Flow Matching, a novel generative model designed for human motion generation featuring efficient sampling and effective…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: WIP

  14. arXiv:2311.08851  [pdf, other]

    cs.LG cs.CV

    Data Augmentations in Deep Weight Spaces

    Authors: Aviv Shamsian, David W. Zhang, Aviv Navon, Yan Zhang, Miltiadis Kofinas, Idan Achituve, Riccardo Valperga, Gertjan J. Burghouts, Efstratios Gavves, Cees G. M. Snoek, Ethan Fetaya, Gal Chechik, Haggai Maron

    Abstract: Learning in weight spaces, where neural networks process the weights of other deep neural networks, has emerged as a promising research direction with applications in various fields, from analyzing and editing neural fields and implicit neural representations, to network pruning and quantization. Recent works designed architectures for effective learning in that space, which takes into account its…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023 Workshop on Symmetry and Geometry in Neural Representations

  15. arXiv:2310.20679  [pdf, other]

    cs.LG stat.ML

    Latent Field Discovery In Interacting Dynamical Systems With Neural Fields

    Authors: Miltiadis Kofinas, Erik J. Bekkers, Naveen Shankar Nagaraja, Efstratios Gavves

    Abstract: Systems of interacting objects often evolve under the influence of field effects that govern their dynamics, yet previous works have abstracted away from such effects, and assume that systems evolve in a vacuum. In this work, we focus on discovering these fields, and infer them from the observed dynamics alone, without directly observing them. We theorize the presence of latent force fields, and p…

    Submitted 20 March, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: In NeurIPS 2023. Source code: https://github.com/mkofinas/aether

  16. arXiv:2308.11796  [pdf, other]

    cs.CV

    Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations

    Authors: Mohammadreza Salehi, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano

    Abstract: Spatially dense self-supervised learning is a rapidly growing problem domain with promising applications for unsupervised segmentation and pretraining for dense downstream tasks. Despite the abundance of temporal data in the form of videos, this information-rich source has been largely overlooked. Our paper aims to address this gap by proposing a novel approach that incorporates temporal consisten…

    Submitted 22 August, 2023; originally announced August 2023.

  17. arXiv:2307.08351  [pdf, other]

    eess.IV cs.CV

    Neural Modulation Fields for Conditional Cone Beam Neural Tomography

    Authors: Samuele Papa, David M. Knigge, Riccardo Valperga, Nikita Moriakov, Miltos Kofinas, Jan-Jakob Sonke, Efstratios Gavves

    Abstract: Conventional Computed Tomography (CT) methods require large numbers of noise-free projections for accurate density reconstructions, limiting their applicability to the more complex class of Cone Beam Geometry CT (CBCT) reconstruction. Recently, deep learning methods have been proposed to overcome these limitations, with methods based on neural fields (NF) showing strong performance, by approximati…

    Submitted 17 July, 2023; originally announced July 2023.

  18. arXiv:2307.01583  [pdf, other]

    cs.LG cs.CV

    Learning Lie Group Symmetry Transformations with Neural Networks

    Authors: Alex Gabel, Victoria Klein, Riccardo Valperga, Jeroen S. W. Lamb, Kevin Webster, Rick Quax, Efstratios Gavves

    Abstract: The problem of detecting and quantifying the presence of symmetries in datasets is useful for model selection, generative modeling, and data analysis, amongst others. While existing methods for hard-coding transformations in neural networks require prior knowledge of the symmetries of the task at hand, this work focuses on discovering and characterizing unknown symmetries present in the dataset, n…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 9 pages, 5 figures, Proceedings of the 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML) at the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. 2023

  19. arXiv:2306.09643  [pdf, other]

    cs.LG cs.AI stat.ME

    BISCUIT: Causal Representation Learning from Binary Interactions

    Authors: Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

    Abstract: Identifying the causal variables of an environment and how to intervene on them is of core value in applications such as robotics and embodied AI. While an agent can commonly interact with the environment and may implicitly perturb the behavior of some of these causal variables, often the targets it affects remain unknown. In this paper, we show that causal variables can still be identified for ma…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Published in: Uncertainty in Artificial Intelligence (UAI 2023). Project page: https://phlippe.github.io/BISCUIT/

  20. arXiv:2306.00370  [pdf, other]

    cs.CV cs.LG cs.MA math.DS

    Graph Switching Dynamical Systems

    Authors: Yongtuo Liu, Sara Magliacane, Miltiadis Kofinas, Efstratios Gavves

    Abstract: Dynamical systems with complex behaviours, e.g. immune system cells interacting with a pathogen, are commonly modelled by splitting the behaviour into different regimes, or modes, each with simpler dynamics, and then learning the switching behaviour from one mode to another. Switching Dynamical Systems (SDS) are a powerful tool that automatically discovers these modes and mode-switching behaviour…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  21. arXiv:2304.11609  [pdf]

    cs.CV

    PiClick: Picking the desired mask from multiple candidates in click-based interactive segmentation

    Authors: Cilin Yan, Haochen Wang, Jie Liu, Xiaolong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves

    Abstract: Click-based interactive segmentation aims to generate target masks via human clicking, which facilitates efficient pixel-level annotation and image editing. In such a task, target ambiguity remains a problem hindering the accuracy and efficiency of segmentation. That is, in scenes with rich context, one click may correspond to multiple potential targets, while most previous interactive segmentors…

    Submitted 17 June, 2024; v1 submitted 23 April, 2023; originally announced April 2023.

  22. arXiv:2304.01715  [pdf, other]

    cs.CV cs.AI

    Towards Open-Vocabulary Video Instance Segmentation

    Authors: Haochen Wang, Cilin Yan, Shuai Wang, Xiaolong Jiang, Xu Tang, Yao Hu, Weidi Xie, Efstratios Gavves

    Abstract: Video Instance Segmentation (VIS) aims at segmenting and categorizing objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categories in real-world videos. To address this limitation, we make the following three contributions. First, we introduce the novel task of Open-Vocabulary Video Instance Segmentation, which aims to simultaneously seg…

    Submitted 6 August, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  23. arXiv:2302.13262  [pdf, other]

    cs.LG stat.ML

    Modulated Neural ODEs

    Authors: Ilze Amanda Auzina, Çağatay Yıldız, Sara Magliacane, Matthias Bethge, Efstratios Gavves

    Abstract: Neural ordinary differential equations (NODEs) have been proven useful for learning non-linear dynamics of arbitrary trajectories. However, current NODE methods capture variations across trajectories only via the initial state value or by auto-regressive encoder updates. In this work, we introduce Modulated Neural ODEs (MoNODEs), a novel framework that sets apart dynamics states from underlying st…

    Submitted 13 November, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

  24. arXiv:2301.10540  [pdf, other]

    cs.CV

    Modelling Long Range Dependencies in $N$D: From Task-Specific to a General Purpose CNN

    Authors: David M. Knigge, David W. Romero, Albert Gu, Efstratios Gavves, Erik J. Bekkers, Jakub M. Tomczak, Mark Hoogendoorn, Jan-Jakob Sonke

    Abstract: Performant Convolutional Neural Network (CNN) architectures must be tailored to specific tasks in order to consider the length, resolution, and dimensionality of the input data. In this work, we tackle the need for problem-specific CNN architectures. We present the Continuous Convolutional Neural Network (CCNN): a single CNN able to process data of arbitrary resolution, dimensionality and length w…

    Submitted 16 April, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  25. arXiv:2301.03194  [pdf, other]

    cs.CV

    Few-shot Semantic Segmentation with Support-induced Graph Convolutional Network

    Authors: Jie Liu, Yanqi Bao, Wenzhe Yin, Haochen Wang, Yang Gao, Jan-Jakob Sonke, Efstratios Gavves

    Abstract: Few-shot semantic segmentation (FSS) aims to achieve novel objects segmentation with only a few annotated samples and has made great progress recently. Most of the existing FSS models focus on the feature matching between support and query to tackle FSS. However, the appearance variations between objects from the same category could be extremely large, leading to unreliable feature matching and qu…

    Submitted 15 March, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted in BMVC2022 as oral presentation

  26. arXiv:2210.02159  [pdf, other]

    cs.LG cs.CV

    Differentiable Mathematical Programming for Object-Centric Representation Learning

    Authors: Adeel Pervez, Phillip Lippe, Efstratios Gavves

    Abstract: We propose topology-aware feature partitioning into $k$ disjoint partitions for given scene features as a method for object-centric representation learning. To this end, we propose to use minimum $s$-$t$ graph cuts as a partitioning method which is represented as a linear program. The method is topologically aware since it explicitly encodes neighborhood relationships in the image graph. To solve…

    Submitted 5 October, 2022; originally announced October 2022.

  27. arXiv:2206.13336  [pdf, other]

    cs.LG

    Continual Learning of Dynamical Systems with Competitive Federated Reservoir Computing

    Authors: Leonard Bereska, Efstratios Gavves

    Abstract: Machine learning recently proved efficient in learning differential equations and dynamical systems from data. However, the data is commonly assumed to originate from a single never-changing system. In contrast, when modeling real-world dynamical processes, the data distribution often shifts due to changes in the underlying system dynamics. Continual learning of these processes aims to rapidly ada…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: CoLLAs 2022

  28. arXiv:2206.06169  [pdf, other]

    cs.LG cs.AI stat.ML

    Causal Representation Learning for Instantaneous and Temporal Effects in Interactive Systems

    Authors: Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

    Abstract: Causal representation learning is the task of identifying the underlying causal variables and their relations from high-dimensional observations, such as images. Recent work has shown that one can reconstruct the causal variables from temporal sequences of observations under the assumption that there are no instantaneous causal relations between them. In practical applications, however, our measur…

    Submitted 7 March, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: Published at International Conference on Learning Representations (ICLR), 2023

  29. arXiv:2206.03398  [pdf, other]

    cs.LG cs.CV

    Towards a General Purpose CNN for Long Range Dependencies in $N$D

    Authors: David W. Romero, David M. Knigge, Albert Gu, Erik J. Bekkers, Efstratios Gavves, Jakub M. Tomczak, Mark Hoogendoorn

    Abstract: The use of Convolutional Neural Networks (CNNs) is widespread in Deep Learning due to a range of desirable model properties which result in an efficient and effective machine learning framework. However, performant CNN architectures must be tailored to specific tasks in order to incorporate considerations such as the input length, resolution, and dimensionality. In this work, we overcome the need…

    Submitted 5 July, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: First two authors contributed equally to this work

  30. arXiv:2205.14297  [pdf, other]

    cs.CV cs.LG

    Fake It Till You Make It: Towards Accurate Near-Distribution Novelty Detection

    Authors: Hossein Mirzaei, Mohammadreza Salehi, Sajjad Shahabi, Efstratios Gavves, Cees G. M. Snoek, Mohammad Sabokrou, Mohammad Hossein Rohban

    Abstract: We aim for image-based novelty detection. Despite considerable progress, existing models either fail or face a dramatic drop under the so-called "near-distribution" setting, where the differences between normal and anomalous samples are subtle. We first demonstrate existing methods experience up to 20% decrease in performance in the near-distribution setting. Next, we propose to exploit a score-ba…

    Submitted 28 November, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

  31. arXiv:2204.10638  [pdf, other]

    cs.CV

    Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation

    Authors: Jie Liu, Yanqi Bao, Guo-Sen Xie, Huan Xiong, Jan-Jakob Sonke, Efstratios Gavves

    Abstract: The key challenge for few-shot semantic segmentation (FSS) is how to tailor a desirable interaction among support and query features and/or their prototypes, under the episodic training scenario. Most existing FSS methods implement such support-query interactions by solely leveraging plain operations - e.g., cosine similarity and feature concatenation - for segmenting the query objects. However, t…

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted in CVPR2022. Code will be available soon

  32. arXiv:2204.09331  [pdf, other]

    cs.CV cs.AI

    NFormer: Robust Person Re-identification with Neighbor Transformer

    Authors: Haochen Wang, Jiayi Shen, Yongtuo Liu, Yan Gao, Efstratios Gavves

    Abstract: Person re-identification aims to retrieve persons in highly varying settings across different cameras and scenarios, in which robust and discriminative representation learning is crucial. Most research considers learning representations from single images, ignoring any potential interactions between them. However, due to the high intra-identity variations, ignoring such interactions typically lead…

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 8 pages, 7 figures, CVPR2022 poster

  33. arXiv:2203.17178  [pdf, other]

    cs.CV cs.AI

    3D Equivariant Graph Implicit Functions

    Authors: Yunlu Chen, Basura Fernando, Hakan Bilen, Matthias Nießner, Efstratios Gavves

    Abstract: In recent years, neural implicit representations have made remarkable progress in modeling of 3D shapes with arbitrary topology. In this work, we address two key limitations of such representations, in failing to capture local 3D geometric fine details, and to learn from and generalize to shapes with unseen 3D transformations. To this end, we introduce a novel family of graph implicit functions wi…

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: Video: https://youtu.be/W7goOzZP2Kc

  34. arXiv:2203.09594  [pdf, other]

    cs.CV cs.LG

    Delta Distillation for Efficient Video Processing

    Authors: Amirhossein Habibian, Haitam Ben Yahia, Davide Abati, Efstratios Gavves, Fatih Porikli

    Abstract: This paper aims to accelerate video stream processing, such as object detection and semantic segmentation, by leveraging the temporal redundancies that exist between video frames. Instead of propagating and warping features using motion alignment, such as optical flow, we propose a novel knowledge distillation schema coined as Delta Distillation. In our proposal, the student learns the variations…

    Submitted 17 March, 2022; originally announced March 2022.

  35. arXiv:2203.03355  [pdf, other]

    cs.AI cs.LG cs.MA

    Reliably Re-Acting to Partner's Actions with the Social Intrinsic Motivation of Transfer Empowerment

    Authors: Tessa van der Heiden, Herke van Hoof, Efstratios Gavves, Christoph Salge

    Abstract: We consider multi-agent reinforcement learning (MARL) for cooperative communication and coordination tasks. MARL agents can be brittle because they can overfit their training partners' policies. This overfitting can produce agents that adopt policies that act under the expectation that other agents will act in a certain way rather than react to their actions. Our objective is to bias the learning…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2012.08255

  36. arXiv:2202.03169  [pdf, other]

    cs.LG cs.AI stat.ME

    CITRIS: Causal Identifiability from Temporal Intervened Sequences

    Authors: Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

    Abstract: Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments. In this paper, we propose CITRIS, a variational autoencoder framework that learns causal representations from temporal sequences of images in which underlying causal factors have possibly been intervened upon. In contrast to the recen…

    Submitted 15 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Accepted at the International Conference on Machine Learning (ICML), 2022

  37. arXiv:2110.14961  [pdf, other]

    cs.LG stat.ML

    Roto-translated Local Coordinate Frames For Interacting Dynamical Systems

    Authors: Miltiadis Kofinas, Naveen Shankar Nagaraja, Efstratios Gavves

    Abstract: Modelling interactions is critical in learning complex dynamical systems, namely systems of interacting objects with highly non-linear and time-dependent behaviour. A large class of such systems can be formalized as geometric graphs, i.e., graphs with nodes positioned in the Euclidean space given an arbitrarily chosen global coordinate system, for instance vehicles…

    Submitted 20 March, 2024; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: In NeurIPS 2021. Source code: https://github.com/mkofinas/locs

  38. arXiv:2109.05892  [pdf, other]

    eess.IV cs.CV

    WeakSTIL: Weak whole-slide image level stromal tumor infiltrating lymphocyte scores are all you need

    Authors: Yoni Schirris, Mendel Engelaer, Andreas Panteli, Hugo Mark Horlings, Efstratios Gavves, Jonas Teuwen

    Abstract: We present WeakSTIL, an interpretable two-stage weak label deep learning pipeline for scoring the percentage of stromal tumor infiltrating lymphocytes (sTIL%) in H&E-stained whole-slide images (WSIs) of breast cancer tissue. The sTIL% score is a prognostic and predictive biomarker for many solid tumor types. However, due to the high labeling efforts and high intra- and interobserver variability wi…

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: 8 pages, 8 figures, 1 table, 4 pages supplementary

  39. arXiv:2107.10483  [pdf, other]

    cs.LG cs.AI stat.ML

    Efficient Neural Causal Discovery without Acyclicity Constraints

    Authors: Phillip Lippe, Taco Cohen, Efstratios Gavves

    Abstract: Learning the structure of a causal graphical model using both observational and interventional data is a fundamental problem in many scientific fields. A promising direction is continuous optimization for score-based methods, which, however, require constrained optimization to enforce acyclicity or lack convergence guarantees. In this paper, we present ENCO, an efficient structure learning method…

    Submitted 25 February, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Published as a conference paper at the International Conference on Learning Representations (ICLR), 2022

  40. DeepSMILE: Contrastive self-supervised pre-training benefits MSI and HRD classification directly from H&E whole-slide images in colorectal and breast cancer

    Authors: Yoni Schirris, Efstratios Gavves, Iris Nederlof, Hugo Mark Horlings, Jonas Teuwen

    Abstract: We propose a Deep learning-based weak label learning method for analyzing whole slide images (WSIs) of Hematoxylin and Eosin (H&E) stained tumor tissue not requiring pixel-level or tile-level annotations using Self-supervised pre-training and heterogeneity-aware deep Multiple Instance LEarning (DeepSMILE). We apply DeepSMILE to the task of Homologous recombination deficiency (HRD) and microsatelli…

    Submitted 28 June, 2023; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: Main paper: 14 pages, 2 tables, 1 algorithm, 3 figures. Supplementary material: 3 pages

    Journal ref: Medical Image Analysis Volume 79, July 2022, 102464

  41. arXiv:2107.06724  [pdf, other]

    cs.LG cs.DC

    Federated Mixture of Experts

    Authors: Matthias Reisser, Christos Louizos, Efstratios Gavves, Max Welling

    Abstract: Federated learning (FL) has emerged as the predominant approach for collaborative training of neural network models across multiple users, without the need to gather the data at a central location. One of the important challenges in this setting is data heterogeneity, i.e. different users have different data characteristics. For this reason, training and using a single global model might be subopt…

    Submitted 14 July, 2021; originally announced July 2021.

  42. Unsharp Mask Guided Filtering

    Authors: Zenglin Shi, Yunlu Chen, Efstratios Gavves, Pascal Mettes, Cees G. M. Snoek

    Abstract: The goal of this paper is guided image filtering, which emphasizes the importance of structure transfer during filtering by means of an additional guidance image. Where classical guided filters transfer structures using hand-designed functions, recent guided filters have been considerably advanced through parametric learning of deep networks. The state-of-the-art leverages deep networks to estimat…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: IEEE Transactions on Image Processing, 2021

  43. arXiv:2104.10425  [pdf, other]

    cs.LG cs.CV

    Sparse-shot Learning with Exclusive Cross-Entropy for Extremely Many Localisations

    Authors: Andreas Panteli, Jonas Teuwen, Hugo Horlings, Efstratios Gavves

    Abstract: Object localisation, in the context of regular images, often depicts objects like people or cars. In these images, there is typically a relatively small number of objects per class, which usually is manageable to annotate. However, outside the setting of regular images, we are often confronted with a different situation. In computational pathology, digitised tissue sections are extremely large ima…

    Submitted 23 August, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: 11 pages, 5 figures, 3 tables

  44. arXiv:2102.13382  [pdf, other]

    stat.ML cs.LG

    Batch Bayesian Optimization on Permutations using the Acquisition Weighted Kernel

    Authors: Changyong Oh, Roberto Bondesan, Efstratios Gavves, Max Welling

    Abstract: In this work we propose a batch Bayesian optimization method for combinatorial problems on permutations, which is well suited for expensive-to-evaluate objectives. We first introduce LAW, an efficient batch acquisition method based on determinantal point processes using the acquisition weighted kernel. Relying on multiple parallel evaluations, LAW enables accelerated search on combinatorial spaces…

    Submitted 25 January, 2023; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2022

  45. arXiv:2102.12792  [pdf, other]

    stat.ML cs.LG

    Mixed Variable Bayesian Optimization with Frequency Modulated Kernels

    Authors: Changyong Oh, Efstratios Gavves, Max Welling

    Abstract: The sample efficiency of Bayesian optimization (BO) is often boosted by Gaussian Process (GP) surrogate models. However, on mixed variable spaces, surrogate models other than GPs are prevalent, mainly due to the lack of kernels which can model complex dependencies across different types of variables. In this paper, we propose the frequency modulated (FM) kernel flexibly modeling dependencies among d…

    Submitted 18 July, 2022; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, PMLR 161:950-960, 2021

  46. arXiv:2012.13078  [pdf, other]

    cs.CV

    Rotation Equivariant Siamese Networks for Tracking

    Authors: Deepak K. Gupta, Devanshu Arya, Efstratios Gavves

    Abstract: Rotation is among the long prevailing, yet still unresolved, hard challenges encountered in visual object tracking. The existing deep learning-based tracking algorithms use regular CNNs that are inherently translation equivariant, but not designed to tackle rotations. In this paper, we first demonstrate that in the presence of rotation instances in videos, the performance of existing trackers is s…

    Submitted 23 December, 2020; originally announced December 2020.

  47. arXiv:2012.08255  [pdf, other]

    cs.MA

    Robust Multi-Agent Reinforcement Learning with Social Empowerment for Coordination and Communication

    Authors: T. van der Heiden, C. Salge, E. Gavves, H. van Hoof

    Abstract: We consider the problem of robust multi-agent reinforcement learning (MARL) for cooperative communication and coordination tasks. MARL agents, mainly those trained in a centralized way, can be brittle because they can adopt policies that act under the expectation that other agents will act a certain way rather than react to their actions. Our objective is to bias the learning process towards findi…

    Submitted 15 December, 2020; originally announced December 2020.

  48. arXiv:2011.11721  [pdf, other]

    cs.CV

    Siamese Tracking with Lingual Object Constraints

    Authors: Maximilian Filtenborg, Efstratios Gavves, Deepak Gupta

    Abstract: Classically, visual object tracking involves following a target object throughout a given video, and it provides us the motion trajectory of the object. However, for many practical applications, this output is often insufficient since additional semantic information is required to act on the video material. Example applications of this are surveillance and target-specific video summarization, wher…

    Submitted 23 November, 2020; originally announced November 2020.

  49. arXiv:2010.08750  [pdf, other]

    cs.CV

    Self-Selective Context for Interaction Recognition

    Authors: Mert Kilickaya, Noureldien Hussein, Efstratios Gavves, Arnold Smeulders

    Abstract: Human-object interaction recognition aims to identify the relationship between a human subject and an object. Researchers incorporate global scene context into the early layers of deep Convolutional Neural Networks as a solution. They report a significant increase in the performance since generally interactions are correlated with the scene (i.e., riding bicycle on the city street). However, thi…

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: Accepted at ICPR'20

  50. arXiv:2008.06374  [pdf, other]

    cs.CV

    PointMixup: Augmentation for Point Clouds

    Authors: Yunlu Chen, Vincent Tao Hu, Efstratios Gavves, Thomas Mensink, Pascal Mettes, Pengwan Yang, Cees G. M. Snoek

    Abstract: This paper introduces data augmentation for point clouds by interpolation between examples. Data augmentation by interpolation has been shown to be a simple and effective approach in the image domain. Such a mixup is however not directly transferable to point clouds, as we do not have a one-to-one correspondence between the points of two different objects. In this paper, we define data augmentation bet…

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: Accepted as Spotlight presentation at European Conference on Computer Vision (ECCV), 2020