Showing 1–50 of 55 results for author: Buhmann, J

Searching in archive cs.
  1. arXiv:2406.03932  [pdf, other]

    cs.LG

    Breeding Programs Optimization with Reinforcement Learning

    Authors: Omar G. Younis, Luca Corinzia, Ioannis N. Athanasiadis, Andreas Krause, Joachim M. Buhmann, Matteo Turchetta

    Abstract: Crop breeding is crucial in improving agricultural productivity while potentially decreasing land usage, greenhouse gas emissions, and water consumption. However, breeding programs are challenging due to long turnover times, high-dimensional decision spaces, long-term objectives, and the need to adapt to rapid climate change. This paper introduces the use of Reinforcement Learning (RL) to optimize…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning

  2. arXiv:2406.03394  [pdf, other]

    cs.CV

    Gaussian Representation for Deformable Image Registration

    Authors: Jihe Li, Fabian Zhang, Xia Li, Tianhao Zhang, Ye Zhang, Joachim Buhmann

    Abstract: Deformable image registration (DIR) is a fundamental task in radiotherapy, with existing methods often struggling to balance computational efficiency, registration accuracy, and speed effectively. We introduce a novel DIR approach employing parametric 3D Gaussian control points achieving a better tradeoff. It provides an explicit and flexible representation for spatial deformation fields between 3…

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2405.19614  [pdf, other]

    cs.RO

    TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM

    Authors: Peifeng Jiang, Hong Liu, Xia Li, Ti Wang, Fabian Zhang, Joachim M. Buhmann

    Abstract: The limited robustness of 3D Gaussian Splatting (3DGS) to motion blur and camera noise, along with its poor real-time performance, restricts its application in robotic SLAM tasks. Upon analysis, the primary causes of these issues are the density of views with motion blur and the cumulative errors in dense pose estimation from calculating losses based on noisy original images and rendering results,…

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.15385  [pdf, other]

    cs.CV physics.med-ph

    CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation

    Authors: Xia Li, Runzhao Yang, Xiangtai Li, Antony Lomax, Ye Zhang, Joachim Buhmann

    Abstract: Motion information from 4D medical imaging offers critical insights into dynamic changes in patient anatomy for clinical assessments and radiotherapy planning and, thereby, enhances the capabilities of 3D image analysis. However, inherent physical and technical constraints of imaging hardware often necessitate a compromise between temporal resolution and image quality. Frame interpolation emerges…

    Submitted 24 May, 2024; originally announced May 2024.

  5. arXiv:2405.14477  [pdf, other]

    cs.LG cs.CV

    LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models

    Authors: Seyedmorteza Sadat, Jakob Buhmann, Derek Bradley, Otmar Hilliges, Romann M. Weber

    Abstract: Advances in latent diffusion models (LDMs) have revolutionized high-resolution image generation, but the design space of the autoencoder that is central to these systems remains underexplored. In this paper, we introduce LiteVAE, a family of autoencoders for LDMs that leverage the 2D discrete wavelet transform to enhance scalability and computational efficiency over standard variational autoencode…

    Submitted 23 May, 2024; originally announced May 2024.

  6. arXiv:2405.00430  [pdf]

    physics.med-ph cs.CV

    Continuous sPatial-Temporal Deformable Image Registration (CPT-DIR) for motion modelling in radiotherapy: beyond classic voxel-based methods

    Authors: Xia Li, Muheng Li, Antony Lomax, Joachim Buhmann, Ye Zhang

    Abstract: Background and purpose: Deformable image registration (DIR) is a crucial tool in radiotherapy for extracting and modelling organ motion. However, when significant changes and sliding boundaries are present, it faces compromised accuracy and uncertainty, determining the subsequential contour propagation and dose accumulation procedures. Materials and methods: We propose an implicit neural represent…

    Submitted 1 May, 2024; originally announced May 2024.

  7. arXiv:2404.12352  [pdf, other]

    cs.CV

    Point-In-Context: Understanding Point Cloud via In-Context Learning

    Authors: Mengyuan Liu, Zhongbin Fang, Xia Li, Joachim M. Buhmann, Xiangtai Li, Chen Change Loy

    Abstract: With the emergence of large-scale models trained on diverse datasets, in-context learning has emerged as a promising paradigm for multitasking, notably in natural language processing and image processing. However, its application in 3D point cloud tasks remains largely unexplored. In this work, we introduce Point-In-Context (PIC), a novel framework for 3D point cloud understanding via in-context l…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Project page: https://fanglaosi.github.io/Point-In-Context_Pages. arXiv admin note: text overlap with arXiv:2306.08659

  8. arXiv:2402.06974  [pdf, other]

    cs.LG

    Hypernetwork-Driven Model Fusion for Federated Domain Generalization

    Authors: Marc Bartholet, Taehyeon Kim, Ami Beuret, Se-Young Yun, Joachim M. Buhmann

    Abstract: Federated Learning (FL) faces significant challenges with domain shifts in heterogeneous data, degrading performance. Traditional domain generalization aims to learn domain-invariant features, but the federated nature of model averaging often limits this due to its linear aggregation of local learning. To address this, we propose a robust framework, coined as hypernetwork-based Federated Fusion (h…

    Submitted 28 May, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  9. arXiv:2402.05568  [pdf, other]

    physics.med-ph cs.CV

    Neural Graphics Primitives-based Deformable Image Registration for On-the-fly Motion Extraction

    Authors: Xia Li, Fabian Zhang, Muheng Li, Damien Weber, Antony Lomax, Joachim Buhmann, Ye Zhang

    Abstract: Intra-fraction motion in radiotherapy is commonly modeled using deformable image registration (DIR). However, existing methods often struggle to balance speed and accuracy, limiting their applicability in clinical scenarios. This study introduces a novel approach that harnesses Neural Graphics Primitives (NGP) to optimize the displacement vector field (DVF). Our method leverages learned primitives…

    Submitted 8 February, 2024; originally announced February 2024.

  10. arXiv:2401.07363  [pdf, other]

    cs.CL

    PersonalityChat: Conversation Distillation for Personalized Dialog Modeling with Facts and Traits

    Authors: Ehsan Lotfi, Maxime De Bruyn, Jeska Buhmann, Walter Daelemans

    Abstract: The new wave of Large Language Models (LLM) has offered an efficient tool to curate sizeable conversational datasets. So far studies have mainly focused on task-oriented or generic open-domain dialogs, and have not fully explored the ability of LLMs in following complicated prompts. In this work, we focus on personalization, and employ LLMs to curate a dataset which is difficult and costly to crow…

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: GEM workshop @ EMNLP23

  11. arXiv:2312.14329  [pdf, other]

    cs.LG

    Invariant Anomaly Detection under Distribution Shifts: A Causal Perspective

    Authors: João B. S. Carvalho, Mengtao Zhang, Robin Geyer, Carlos Cotrini, Joachim M. Buhmann

    Abstract: Anomaly detection (AD) is the machine learning task of identifying highly discrepant abnormal samples by solely relying on the consistency of the normal training samples. Under the constraints of a distribution shift, the assumption that training samples and test samples are drawn from the same distribution breaks down. In this work, by leveraging tools from causal inference we attempt to increase…

    Submitted 21 December, 2023; originally announced December 2023.

  12. arXiv:2310.17347  [pdf, other]

    cs.CV

    CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling

    Authors: Seyedmorteza Sadat, Jakob Buhmann, Derek Bradley, Otmar Hilliges, Romann M. Weber

    Abstract: While conditional diffusion models are known to have good coverage of the data distribution, they still face limitations in output diversity, particularly when sampled with a high classifier-free guidance scale for optimal image quality or when trained on small datasets. We attribute this problem to the role of the conditioning signal in inference and offer an improved sampling strategy for diffus…

    Submitted 13 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at ICLR 2024

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR 2024)

  13. arXiv:2308.09189  [pdf, ps, other]

    cs.LG cs.AI

    Regularizing Adversarial Imitation Learning Using Causal Invariance

    Authors: Ivan Ovinnikov, Joachim M. Buhmann

    Abstract: Imitation learning methods are used to infer a policy in a Markov decision process from a dataset of expert demonstrations by minimizing a divergence measure between the empirical state occupancy measures of the expert and the policy. The guiding signal to the policy is provided by the discriminator used as part of an adversarial optimization procedure. We observe that this model is prone to absorbi…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Published at the ICML 2023 Workshop on Spurious Correlations, Invariance, and Stability

  14. arXiv:2306.09035  [pdf, other]

    cs.CV

    Improving Explainability of Disentangled Representations using Multipath-Attribution Mappings

    Authors: Lukas Klein, João B. S. Carvalho, Mennatallah El-Assady, Paolo Penna, Joachim M. Buhmann, Paul F. Jaeger

    Abstract: Explainable AI aims to render model behavior understandable by humans, which can be seen as an intermediate step in extracting causal relations from correlative patterns. Due to the high risk of possible fatal decisions in image-based clinical diagnostics, it is necessary to integrate explainable AI into these safety-critical systems. Current explanatory methods typically assign attribution scores…

    Submitted 15 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of The 5th International Conference on Medical Imaging with Deep Learning, PMLR 172:689-712, 2022

  15. arXiv:2306.08659  [pdf, other]

    cs.CV

    Explore In-Context Learning for 3D Point Cloud Understanding

    Authors: Zhongbin Fang, Xiangtai Li, Xia Li, Joachim M. Buhmann, Chen Change Loy, Mengyuan Liu

    Abstract: With the rise of large-scale models trained on broad data, in-context learning has become a new learning paradigm that has demonstrated significant potential in natural language processing and computer vision tasks. Meanwhile, in-context learning is still largely unexplored in the 3D point cloud domain. Although masked modeling has been successfully applied for in-context learning in 2D vision, di…

    Submitted 27 December, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Project page: https://github.com/fanglaosi/Point-In-Context

  16. arXiv:2210.03649  [pdf, other]

    cs.LG cs.AI cs.MA cs.RO

    How to Enable Uncertainty Estimation in Proximal Policy Optimization

    Authors: Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann

    Abstract: While deep reinforcement learning (RL) agents have showcased strong results across many domains, a major concern is their inherent opaqueness and the safety of such systems in real-world use cases. To overcome these issues, we need agents that can quantify their uncertainty and detect out-of-distribution (OOD) states. Existing uncertainty estimation techniques, like Monte-Carlo Dropout or Deep Ens…

    Submitted 7 October, 2022; originally announced October 2022.

    ACM Class: I.2; I.2.6; I.2.8; I.2.9; I.2.10

  17. arXiv:2209.12590  [pdf, other]

    cs.LG

    Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs

    Authors: Đorđe Miladinović, Kumar Shridhar, Kushal Jain, Max B. Paulus, Joachim M. Buhmann, Mrinmaya Sachan, Carl Allen

    Abstract: In principle, applying variational autoencoders (VAEs) to sequential data offers a method for controlled sequence generation, manipulation, and structured representation learning. However, training sequence VAEs is challenging: autoregressive decoders can often explain the data without utilizing the latent space, known as posterior collapse. To mitigate this, state-of-the-art models weaken the pow…

    Submitted 16 December, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022

  18. arXiv:2209.05185  [pdf, other]

    cs.CL

    Open-Domain Dialog Evaluation using Follow-Ups Likelihood

    Authors: Maxime De Bruyn, Ehsan Lotfi, Jeska Buhmann, Walter Daelemans

    Abstract: Automatic evaluation of open-domain dialogs remains an unsolved problem. Moreover, existing methods do not correlate strongly with human annotations. This paper presents a new automated evaluation method using follow-ups: we measure the probability that a language model will continue the conversation with a fixed set of follow-ups (e.g., not really relevant here, what are you trying to say). When…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: Accepted at COLING 2022

  19. arXiv:2208.10481  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV cs.RO

    BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based Reinforcement Learning

    Authors: Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann

    Abstract: Robustness to adversarial perturbations has been explored in many areas of computer vision. This robustness is particularly relevant in vision-based reinforcement learning, as the actions of autonomous agents might be safety-critical or impactful in the real world. We investigate the susceptibility of vision-based reinforcement learning agents to gradient-based adversarial attacks and evaluate a pot…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 5 pages, 2 figures, 3 tables

    ACM Class: I.2.6; I.2.8; I.2.9; I.2.10; I.5.4

  20. arXiv:2206.12444  [pdf, other]

    cs.LG

    Gated Domain Units for Multi-source Domain Generalization

    Authors: Simon Föll, Alina Dubatovka, Eugen Ernst, Siu Lun Chau, Martin Maritsch, Patrik Okanovic, Gudrun Thäter, Joachim M. Buhmann, Felix Wortmann, Krikamol Muandet

    Abstract: The phenomenon of distribution shift (DS) occurs when a dataset at test time differs from the dataset at training time, which can significantly impair the performance of a machine learning model in practical settings due to a lack of knowledge about the data's distribution at test time. To address this problem, we postulate that real-world distributions are composed of latent Invariant Elementary…

    Submitted 16 May, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

  21. arXiv:2110.02067  [pdf, other]

    cs.CL

    Teach Me What to Say and I Will Learn What to Pick: Unsupervised Knowledge Selection Through Response Generation with Pretrained Generative Models

    Authors: Ehsan Lotfi, Maxime De Bruyn, Jeska Buhmann, Walter Daelemans

    Abstract: Knowledge Grounded Conversation Models (KGCM) are usually based on a selection/retrieval module and a generation module, trained separately or simultaneously, with or without having access to a gold knowledge option. With the introduction of large pre-trained generative models, the selection and generation part have become more and more entangled, shifting the focus towards enhancing knowledge inc…

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted at ConvAI workshop (EMNLP 2021)

  22. arXiv:2109.12870  [pdf, other]

    cs.CL

    MFAQ: a Multilingual FAQ Dataset

    Authors: Maxime De Bruyn, Ehsan Lotfi, Jeska Buhmann, Walter Daelemans

    Abstract: In this paper, we present the first multilingual FAQ dataset publicly available. We collected around 6M FAQ pairs from the web, in 21 different languages. Although this is significantly larger than existing FAQ retrieval datasets, it comes with its own challenges: duplication of content and uneven distribution of topics. We adopt a similar setup as Dense Passage Retrieval (DPR) and test various bi…

    Submitted 5 October, 2021; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: Accepted at MRQA workshop (EMNLP 2021)

  23. arXiv:2108.00719  [pdf, other]

    cs.CL

    ConveRT for FAQ Answering

    Authors: Maxime De Bruyn, Ehsan Lotfi, Jeska Buhmann, Walter Daelemans

    Abstract: Knowledgeable FAQ chatbots are a valuable resource to any organization. While powerful and efficient retrieval-based models exist for English, it is rarely the case for other languages for which the same amount of training data is not available. In this paper, we propose a novel pre-training procedure to adapt ConveRT, an English conversational retriever model, to other languages with less trainin…

    Submitted 14 October, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted at BNAIC/BeNeLearn 2021

  24. arXiv:2103.11713  [pdf, other]

    eess.IV cs.CV

    Spatially Dependent U-Nets: Highly Accurate Architectures for Medical Imaging Segmentation

    Authors: João B. S. Carvalho, João A. Santinha, Đorđe Miladinović, Joachim M. Buhmann

    Abstract: In clinical practice, regions of interest in medical imaging often need to be identified through a process of precise image segmentation. The quality of this image segmentation step critically affects the subsequent clinical assessment of the patient status. To enable high accuracy, automatic image segmentation, we introduce a novel deep neural network architecture that exploits the inherent spati…

    Submitted 22 March, 2021; originally announced March 2021.

  25. arXiv:2103.08877  [pdf, other]

    cs.CV cs.AI cs.LG

    Spatial Dependency Networks: Neural Layers for Improved Generative Image Modeling

    Authors: Đorđe Miladinović, Aleksandar Stanić, Stefan Bauer, Jürgen Schmidhuber, Joachim M. Buhmann

    Abstract: How to improve generative modeling by better exploiting spatial regularities and coherence in images? We introduce a novel neural network for building image generators (decoders) and apply it to variational autoencoders (VAEs). In our spatial dependency networks (SDNs), feature maps at each level of a deep neural net are computed in a spatially coherent way, using a sequential gating-based mechani…

    Submitted 16 March, 2021; originally announced March 2021.

    Journal ref: International Conference on Learning Representations (2021)

  26. arXiv:2101.09994  [pdf, ps, other]

    cs.IT cs.AI cs.LG

    On maximum-likelihood estimation in the all-or-nothing regime

    Authors: Luca Corinzia, Paolo Penna, Wojciech Szpankowski, Joachim M. Buhmann

    Abstract: We study the problem of estimating a rank-1 additive deformation of a Gaussian tensor according to the \emph{maximum-likelihood estimator} (MLE). The analysis is carried out in the sparse setting, where the underlying signal has a support that scales sublinearly with the total number of dimensions. We show that for Bernoulli distributed signals, the MLE undergoes an \emph{all-or-nothing} (AoN) pha…

    Submitted 25 January, 2021; originally announced January 2021.

  27. arXiv:2011.11500  [pdf, other]

    cs.LG cs.DS cs.IT

    Statistical and computational thresholds for the planted $k$-densest sub-hypergraph problem

    Authors: Luca Corinzia, Paolo Penna, Wojciech Szpankowski, Joachim M. Buhmann

    Abstract: In this work, we consider the problem of recovering a planted $k$-densest sub-hypergraph on $d$-uniform hypergraphs. This fundamental problem appears in different contexts, e.g., community detection, average-case complexity, and neuroscience applications as a structural variant of tensor-PCA problem. We provide tight \emph{information-theoretic} upper and lower bounds for the exact recovery threshol…

    Submitted 28 January, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

  28. arXiv:2009.08371  [pdf, other]

    cs.CV cs.LG eess.IV q-bio.QM

    Microtubule Tracking in Electron Microscopy Volumes

    Authors: Nils Eckstein, Julia Buhmann, Matthew Cook, Jan Funke

    Abstract: We present a method for microtubule tracking in electron microscopy volumes. Our method first identifies a sparse set of voxels that likely belong to microtubules. Similar to prior work, we then enumerate potential edges between these voxels, which we represent in a candidate graph. Tracks of microtubules are found by selecting nodes and edges in the candidate graph by solving a constrained optimi…

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted at MICCAI 2020

  29. arXiv:2008.05867  [pdf, other]

    eess.IV cs.LG

    Neural collaborative filtering for unsupervised mitral valve segmentation in echocardiography

    Authors: Luca Corinzia, Fabian Laumer, Alessandro Candreva, Maurizio Taramasso, Francesco Maisano, Joachim M. Buhmann

    Abstract: The segmentation of the mitral valve annulus and leaflets specifies a crucial first step to establish a machine learning pipeline that can support physicians in performing multiple tasks, e.g., diagnosis of mitral valve diseases, surgical planning, and intraoperative procedures. Current methods for mitral valve segmentation on 2D echocardiography videos require extensive interaction with annotator…

    Submitted 13 August, 2020; originally announced August 2020.

  30. arXiv:2006.13474  [pdf, other]

    cs.LG stat.ML

    Continuous Submodular Function Maximization

    Authors: Yatao Bian, Joachim M. Buhmann, Andreas Krause

    Abstract: Continuous submodular functions are a category of generally non-convex/non-concave functions with a wide spectrum of applications. The celebrated property of this class of functions - continuous submodularity - enables both exact minimization and approximate maximization in polynomial time. Continuous submodularity is obtained by generalizing the notion of submodularity from discrete domains to continu…

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 64 pages

  31. arXiv:2006.01293  [pdf, other]

    cs.LG stat.ML

    From Sets to Multisets: Provable Variational Inference for Probabilistic Integer Submodular Models

    Authors: Aytunc Sahin, Yatao Bian, Joachim M. Buhmann, Andreas Krause

    Abstract: Submodular functions have been studied extensively in machine learning and data mining. In particular, the optimization of submodular functions over the integer lattice (integer submodular functions) has recently attracted much interest, because this domain relates naturally to many practical problem settings, such as multilabel graph cut, budget allocation and revenue maximization with discrete a…

    Submitted 1 June, 2020; originally announced June 2020.

  32. arXiv:2003.09820  [pdf, other]

    cs.GR

    Rig-space Neural Rendering

    Authors: Dominik Borer, Lu Yuhang, Laura Wuelfroth, Jakob Buhmann, Martin Guay

    Abstract: Movie productions use high resolution 3d characters with complex proprietary rigs to create the highest quality images possible for large displays. Unfortunately, these 3d assets are typically not compatible with real-time graphics engines used for games, mixed reality and real-time pre-visualization. Consequently, the 3d characters need to be re-modeled and re-rigged for these new applications, r…

    Submitted 22 March, 2020; originally announced March 2020.

  33. arXiv:1906.06268  [pdf, other]

    cs.LG stat.ML

    Variational Federated Multi-Task Learning

    Authors: Luca Corinzia, Ami Beuret, Joachim M. Buhmann

    Abstract: In federated learning, a central server coordinates the training of a single model on a massively distributed network of devices. This setting can be naturally extended to a multi-task learning framework, to handle real-world federated datasets that typically show strong statistical heterogeneity among devices. Despite federated multi-task learning being shown to be an effective paradigm for real-…

    Submitted 4 February, 2021; v1 submitted 14 June, 2019; originally announced June 2019.

  34. arXiv:1906.03255  [pdf, other]

    stat.ML cs.LG

    Disentangled State Space Representations

    Authors: Đorđe Miladinović, Muhammad Waleed Gondal, Bernhard Schölkopf, Joachim M. Buhmann, Stefan Bauer

    Abstract: Sequential data often originates from diverse domains across which statistical regularities and domain specifics exist. To specifically learn cross-domain sequence representations, we introduce disentangled state space models (DSSM) -- a class of SSM in which domain-invariant state dynamics is explicitly disentangled from domain-specific information governing that dynamics. We analyze how such sep…

    Submitted 7 June, 2019; originally announced June 2019.

  35. arXiv:1904.03266  [pdf, other]

    cs.AI cs.CL cs.MA

    Domain Authoring Assistant for Intelligent Virtual Agents

    Authors: Sepehr Janghorbani, Ashutosh Modi, Jakob Buhmann, Mubbasir Kapadia

    Abstract: Developing intelligent virtual characters has attracted a lot of attention in recent years. The process of creating such characters often involves a team of creative authors who describe different aspects of the characters in natural language, and planning experts that translate this description into a planning domain. This can be quite challenging as the team of creative authors should dilige…

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: 8+1 pages, Accepted at 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019)

  36. Learning Counterfactual Representations for Estimating Individual Dose-Response Curves

    Authors: Patrick Schwab, Lorenz Linhardt, Stefan Bauer, Joachim M. Buhmann, Walter Karlen

    Abstract: Estimating what would be an individual's potential response to varying levels of exposure to a treatment is of high practical relevance for several important fields, such as healthcare, economics and public policy. However, existing methods for learning to estimate counterfactual outcomes from observational data are either focused on estimating average dose-response curves, or limited to settings…

    Submitted 10 December, 2020; v1 submitted 3 February, 2019; originally announced February 2019.

    Comments: published at AAAI 2020

  37. arXiv:1901.06799  [pdf, other]

    cs.IT

    Exact Recovery for a Family of Community-Detection Generative Models

    Authors: Luca Corinzia, Paolo Penna, Luca Mondada, Joachim M. Buhmann

    Abstract: Generative models for networks with communities have been studied extensively for being a fertile ground to establish information-theoretic and computational thresholds. In this paper we propose a new toy model for planted generative models called planted Random Energy Model (REM), inspired by Derrida's REM. For this model we provide the asymptotic behaviour of the probability of error for the max…

    Submitted 30 April, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

  38. arXiv:1806.08205  [pdf, other]

    cs.CV

    Synaptic partner prediction from point annotations in insect brains

    Authors: Julia Buhmann, Renate Krause, Rodrigo Ceballos Lentini, Nils Eckstein, Matthew Cook, Srinivas Turaga, Jan Funke

    Abstract: High-throughput electron microscopy allows recording of large stacks of neural tissue with sufficient resolution to extract the wiring diagram of the underlying neural network. Current efforts to automate this process focus mainly on the segmentation of neurons. However, in order to recover a wiring diagram, synaptic partners need to be identified as well. This is especially challenging in ins…

    Submitted 16 July, 2018; v1 submitted 21 June, 2018; originally announced June 2018.

  39. arXiv:1805.07482  [pdf, other]

    cs.LG stat.ML

    Optimal DR-Submodular Maximization and Applications to Provable Mean Field Inference

    Authors: An Bian, Joachim M. Buhmann, Andreas Krause

    Abstract: Mean field inference in probabilistic models is generally a highly nonconvex problem. Existing optimization methods, e.g., coordinate ascent algorithms, can only generate local optima. In this work we propose provable mean field methods for probabilistic log-submodular models and its posterior agreement (PA) with strong approximation guarantees. The main algorithmic technique is a new Double Gre…

    Submitted 29 November, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: 28 pages

  40. arXiv:1804.04378  [pdf, other]

    stat.ML cs.LG

    Fast Gaussian Process Based Gradient Matching for Parameter Identification in Systems of Nonlinear ODEs

    Authors: Philippe Wenk, Alkis Gotovos, Stefan Bauer, Nico Gorbach, Andreas Krause, Joachim M. Buhmann

    Abstract: Parameter identification and comparison of dynamical systems is a challenging task in many fields. Bayesian approaches based on Gaussian process regression over time-series data have been successfully applied to infer the parameters of a dynamical system without explicitly solving it. While the benefits in computational cost are well established, a rigorous mathematical framework has been missing.…

    Submitted 1 March, 2019; v1 submitted 12 April, 2018; originally announced April 2018.

    Comments: accepted at AISTATS 2019

  41. arXiv:1711.02515  [pdf, other]

    cs.LG cs.AI stat.ML

    Continuous DR-submodular Maximization: Structure and Algorithms

    Authors: An Bian, Kfir Y. Levy, Andreas Krause, Joachim M. Buhmann

    Abstract: DR-submodular continuous functions are important objectives with wide real-world applications spanning MAP inference in determinantal point processes (DPPs), and mean-field inference for probabilistic submodular models, amongst others. DR-submodularity captures a subclass of non-convex functions that enables both exact minimization and approximate maximization in polynomial time. In this work we…

    Submitted 24 May, 2019; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Published in NIPS 2017

  42. arXiv:1703.02100  [pdf, other]

    cs.DM cs.AI cs.DS cs.LG math.OC

    Guarantees for Greedy Maximization of Non-submodular Functions with Applications

    Authors: Andrew An Bian, Joachim M. Buhmann, Andreas Krause, Sebastian Tschiatschek

    Abstract: We investigate the performance of the standard Greedy algorithm for cardinality constrained maximization of non-submodular nondecreasing set functions. While there are strong theoretical guarantees on the performance of Greedy for maximizing submodular functions, there are few guarantees for non-submodular ones. However, Greedy enjoys strong empirical performance for many important non-submodular…

    Submitted 14 May, 2019; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: Published at ICML 2017. First author is now known as Yatao Bian. ORCID: https://orcid.org/0000-0002-2368-4084

  43. arXiv:1611.06652  [pdf, other]

    stat.ML cs.LG

    Scalable Adaptive Stochastic Optimization Using Random Projections

    Authors: Gabriel Krummenacher, Brian McWilliams, Yannic Kilcher, Joachim M. Buhmann, Nicolai Meinshausen

    Abstract: Adaptive stochastic gradient methods such as AdaGrad have gained popularity in particular for training deep neural networks. The most commonly used and studied variant maintains a diagonal matrix approximation to second order information by accumulating past gradients which are used to tune the step size adaptively. In certain situations the full-matrix variant of AdaGrad is expected to attain bet…

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: To appear in Advances in Neural Information Processing Systems 29 (NIPS 2016)

  44. arXiv:1609.00810  [pdf, other]

    cs.DS cs.DM cs.IT

    Greedy MAXCUT Algorithms and their Information Content

    Authors: Yatao Bian, Alexey Gronskiy, Joachim M. Buhmann

    Abstract: MAXCUT defines a classical NP-hard problem for graph partitioning and it serves as a typical case of the symmetric non-monotone Unconstrained Submodular Maximization (USM) problem. Applications of MAXCUT are abundant in machine learning, computer vision and statistical physics. Greedy algorithms to approximately solve MAXCUT rely on greedy vertex labelling or on an edge contraction strategy. These…

    Submitted 3 September, 2016; originally announced September 2016.

    Comments: This is a longer version of the paper published in 2015 IEEE Information Theory Workshop (ITW)

  45. arXiv:1606.05615  [pdf, other]

    cs.LG cs.DS

    Guaranteed Non-convex Optimization: Submodular Maximization over Continuous Domains

    Authors: Andrew An Bian, Baharan Mirzasoleiman, Joachim M. Buhmann, Andreas Krause

    Abstract: Submodular continuous functions are a category of (generally) non-convex/non-concave functions with a wide spectrum of applications. We characterize these functions and demonstrate that they can be maximized efficiently with approximation guarantees. Specifically, i) We introduce the weak DR property that gives a unified characterization of submodularity for all set, integer-lattice and continuous…

    Submitted 6 May, 2019; v1 submitted 17 June, 2016; originally announced June 2016.

    Comments: Appears in the 20th International Conference on Artificial Intelligence and Statistics (AISTATS) 2017

  46. arXiv:1606.00897  [pdf, other]

    q-bio.QM cs.LG q-bio.TO stat.ML

    Multi-Organ Cancer Classification and Survival Analysis

    Authors: Stefan Bauer, Nicolas Carion, Peter Schüffler, Thomas Fuchs, Peter Wild, Joachim M. Buhmann

    Abstract: Accurate and robust cell nuclei classification is the cornerstone for a wider range of tasks in digital and Computational Pathology. However, most machine learning systems require extensive labeling from expert pathologists for each individual problem at hand, with no or limited abilities for knowledge transfer between datasets and organ sites. In this paper we implement and evaluate a variety of…

    Submitted 2 December, 2016; v1 submitted 2 June, 2016; originally announced June 2016.

  47. arXiv:1604.06318  [pdf, other]

    cs.CV

    TI-POOLING: transformation-invariant pooling for feature learning in Convolutional Neural Networks

    Authors: Dmitry Laptev, Nikolay Savinov, Joachim M. Buhmann, Marc Pollefeys

    Abstract: In this paper we present a deep neural network topology that incorporates a simple to implement transformation invariant pooling operator (TI-POOLING). This operator is able to efficiently handle prior knowledge on nuisance variations in the data, such as rotation or scale changes. Most current methods usually make use of dataset augmentation to address this issue, but this requires larger number…

    Submitted 22 September, 2016; v1 submitted 21 April, 2016; originally announced April 2016.

    Comments: Accepted at CVPR 2016. The first two authors assert equal contribution and joint first authorship

  48. Computational Pathology: Challenges and Promises for Tissue Analysis

    Authors: Thomas J. Fuchs, Joachim M. Buhmann

    Abstract: The histological assessment of human tissue has emerged as the key challenge for detection and treatment of cancer. A plethora of different data sources ranging from tissue microarray data to gene expression, proteomics or metabolomics data provide a detailed overview of the health status of a patient. Medical doctors need to assess these information sources and they rely on data driven automatic…

    Submitted 31 December, 2015; originally announced January 2016.

    Journal ref: Computerized Medical Imaging and Graphics, vol. 35, 7-8, p. 515-530, 2011

  49. arXiv:1411.6191  [pdf, other]

    cs.LG cs.NE q-bio.NC

    Kickback cuts Backprop's red-tape: Biologically plausible credit assignment in neural networks

    Authors: David Balduzzi, Hastagiri Vanchinathan, Joachim Buhmann

    Abstract: Error backpropagation is an extremely effective algorithm for assigning credit in artificial neural networks. However, weight updates under Backprop depend on lengthy recursive computations and require separate output and error messages -- features not shared by biological neurons, that are perhaps unnecessary. In this paper, we revisit Backprop and the credit assignment problem. We first decompos…

    Submitted 22 November, 2014; originally announced November 2014.

    Comments: 7 pages. To appear, AAAI-15

  50. arXiv:1306.5554  [pdf, ps, other]

    stat.ML cs.LG

    Correlated random features for fast semi-supervised learning

    Authors: Brian McWilliams, David Balduzzi, Joachim M. Buhmann

    Abstract: This paper presents Correlated Nystrom Views (XNV), a fast semi-supervised algorithm for regression and classification. The algorithm draws on two main ideas. First, it generates two views consisting of computationally inexpensive random features. Second, XNV applies multiview regression using Canonical Correlation Analysis (CCA) on unlabeled data to bias the regression towards useful features. It…

    Submitted 5 November, 2013; v1 submitted 24 June, 2013; originally announced June 2013.

    Comments: 15 pages, 3 figures, 6 tables