-
FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods
Authors:
Robin Hesse,
Simone Schaub-Meyer,
Stefan Roth
Abstract:
The field of explainable artificial intelligence (XAI) aims to uncover the inner workings of complex deep neural models. While being crucial for safety-critical domains, XAI inherently lacks ground-truth explanations, making its automatic evaluation an unsolved problem. We address this challenge by proposing a novel synthetic vision dataset, named FunnyBirds, and accompanying automatic evaluation…
▽ More
The field of explainable artificial intelligence (XAI) aims to uncover the inner workings of complex deep neural models. While being crucial for safety-critical domains, XAI inherently lacks ground-truth explanations, making its automatic evaluation an unsolved problem. We address this challenge by proposing a novel synthetic vision dataset, named FunnyBirds, and accompanying automatic evaluation protocols. Our dataset allows performing semantically meaningful image interventions, e.g., removing individual object parts, which has three important implications. First, it enables analyzing explanations on a part level, which is closer to human comprehension than existing methods that evaluate on a pixel level. Second, by comparing the model output for inputs with removed parts, we can estimate ground-truth part importances that should be reflected in the explanations. Third, by map** individual explanations into a common space of part importances, we can analyze a variety of different explanation types in a single common framework. Using our tools, we report results for 24 different combinations of neural models and XAI methods, demonstrating the strengths and weaknesses of the assessed methods in a fully automatic and systematic manner.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
Almost Sure Averaging for Fast-slow Stochastic Differential Equations via Controlled Rough Path
Authors:
Bin Pei,
Robert Hesse,
Bjoern Schmalfuss,
Yong Xu
Abstract:
This paper establishes the averaging method to a coupled system consisting of two stochastic differential equations which has a slow component driven by fractional Brownian motion (FBM) with less regularity $1/3< H \leq 1/2$ and a fast dynamics under additive FBM with Hurst-index $1/3< \hat H \leq 1/2$. We prove that the solution of the slow component converges almost surely to the solution of the…
▽ More
This paper establishes the averaging method to a coupled system consisting of two stochastic differential equations which has a slow component driven by fractional Brownian motion (FBM) with less regularity $1/3< H \leq 1/2$ and a fast dynamics under additive FBM with Hurst-index $1/3< \hat H \leq 1/2$. We prove that the solution of the slow component converges almost surely to the solution of the corresponding averaged equation using the approach of time discretization and controlled rough path. To do this, we employ the random dynamical system (RDS) to obtain a stationary solution by an exponentially attracting random fixed point of the RDS generated by the non-Markovian fast component.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Content-Adaptive Downsampling in Convolutional Neural Networks
Authors:
Robin Hesse,
Simone Schaub-Meyer,
Stefan Roth
Abstract:
Many convolutional neural networks (CNNs) rely on progressive downsampling of their feature maps to increase the network's receptive field and decrease computational cost. However, this comes at the price of losing granularity in the feature maps, limiting the ability to correctly understand images or recover fine detail in dense prediction tasks. To address this, common practice is to replace the…
▽ More
Many convolutional neural networks (CNNs) rely on progressive downsampling of their feature maps to increase the network's receptive field and decrease computational cost. However, this comes at the price of losing granularity in the feature maps, limiting the ability to correctly understand images or recover fine detail in dense prediction tasks. To address this, common practice is to replace the last few downsampling operations in a CNN with dilated convolutions, allowing to retain the feature map resolution without reducing the receptive field, albeit increasing the computational cost. This allows to trade off predictive performance against cost, depending on the output feature resolution. By either regularly downsampling or not downsampling the entire feature map, existing work implicitly treats all regions of the input image and subsequent feature maps as equally important, which generally does not hold. We propose an adaptive downsampling scheme that generalizes the above idea by allowing to process informative regions at a higher resolution than less informative ones. In a variety of experiments, we demonstrate the versatility of our adaptive downsampling strategy and empirically show that it improves the cost-accuracy trade-off of various established CNNs.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Fast Axiomatic Attribution for Neural Networks
Authors:
Robin Hesse,
Simone Schaub-Meyer,
Stefan Roth
Abstract:
Mitigating the dependence on spurious correlations present in the training dataset is a quickly emerging and important topic of deep learning. Recent approaches include priors on the feature attribution of a deep neural network (DNN) into the training process to reduce the dependence on unwanted features. However, until now one needed to trade off high-quality attributions, satisfying desirable ax…
▽ More
Mitigating the dependence on spurious correlations present in the training dataset is a quickly emerging and important topic of deep learning. Recent approaches include priors on the feature attribution of a deep neural network (DNN) into the training process to reduce the dependence on unwanted features. However, until now one needed to trade off high-quality attributions, satisfying desirable axioms, against the time required to compute them. This in turn either led to long training times or ineffective attribution priors. In this work, we break this trade-off by considering a special class of efficiently axiomatically attributable DNNs for which an axiomatic feature attribution can be computed with only a single forward/backward pass. We formally prove that nonnegatively homogeneous DNNs, here termed $\mathcal{X}$-DNNs, are efficiently axiomatically attributable and show that they can be effortlessly constructed from a wide range of regular DNNs by simply removing the bias term of each layer. Various experiments demonstrate the advantages of $\mathcal{X}$-DNNs, beating state-of-the-art generic attribution methods on regular DNNs for training with attribution priors.
△ Less
Submitted 15 November, 2021;
originally announced November 2021.
-
Global solutions for semilinear rough partial differential equations
Authors:
Robert Hesse,
Alexandra Neamtu
Abstract:
We construct global-in-time solutions for semilinear parabolic rough partial differential equations. We work on a scale of Banach spaces tailored to the controlled rough path approach and derive suitable a-priori estimates of the solution which do not contain quadratic terms.
We construct global-in-time solutions for semilinear parabolic rough partial differential equations. We work on a scale of Banach spaces tailored to the controlled rough path approach and derive suitable a-priori estimates of the solution which do not contain quadratic terms.
△ Less
Submitted 28 July, 2021;
originally announced July 2021.
-
Global solutions and random dynamical systems for rough evolution equations
Authors:
Robert Hesse,
Alexandra Neamtu
Abstract:
We consider infinite-dimensional parabolic rough evolution equations. Using regularizing properties of analytic semigroups we prove global-in-time existence of solutions and investigate random dynamical systems for such equations.
We consider infinite-dimensional parabolic rough evolution equations. Using regularizing properties of analytic semigroups we prove global-in-time existence of solutions and investigate random dynamical systems for such equations.
△ Less
Submitted 5 April, 2019; v1 submitted 23 November, 2018;
originally announced November 2018.
-
Local mild solutions for rough stochastic partial differential equations
Authors:
Robert Hesse,
Alexandra Neamtu
Abstract:
We investigate mild solutions for stochastic evolution equations driven by a fractional Brownian motion (fBm) with Hurst parameter H in (1/3, 1/2] in infinite-dimensional Banach spaces. Using elements from rough paths theory we introduce an appropriate integral with respect to the fBm. This allows us to solve pathwise our stochastic evolution equation in a suitable function space.
We investigate mild solutions for stochastic evolution equations driven by a fractional Brownian motion (fBm) with Hurst parameter H in (1/3, 1/2] in infinite-dimensional Banach spaces. Using elements from rough paths theory we introduce an appropriate integral with respect to the fBm. This allows us to solve pathwise our stochastic evolution equation in a suitable function space.
△ Less
Submitted 5 April, 2019; v1 submitted 23 September, 2018;
originally announced September 2018.
-
Proximal Heterogeneous Block Input-Output Method and application to Blind Ptychographic Diffraction Imaging
Authors:
Robert Hesse,
D. Russell Luke,
Shoham Sabach,
Matthew K. Tam
Abstract:
We propose a general alternating minimization algorithm for nonconvex optimization problems with separable structure and nonconvex coupling between blocks of variables. To fix our ideas, we apply the methodology to the problem of blind ptychographic imaging. Compared to other schemes in the literature, our approach differs in two ways: (i) it is posed within a clear mathematical framework with pra…
▽ More
We propose a general alternating minimization algorithm for nonconvex optimization problems with separable structure and nonconvex coupling between blocks of variables. To fix our ideas, we apply the methodology to the problem of blind ptychographic imaging. Compared to other schemes in the literature, our approach differs in two ways: (i) it is posed within a clear mathematical framework with practically verifiable assumptions, and (ii) under the given assumptions, it is provably convergent to critical points. A numerical comparison of our proposed algorithm with the current state-of-the-art on simulated and experimental data validates our approach and points toward directions for further improvement.
△ Less
Submitted 8 August, 2014;
originally announced August 2014.
-
Alternating Projections and Douglas-Rachford for Sparse Affine Feasibility
Authors:
Robert Hesse,
D. Russell Luke,
Patrick Neumann
Abstract:
The problem of finding a vector with the fewest nonzero elements that satisfies an underdetermined system of linear equations is an NP-complete problem that is typically solved numerically via convex heuristics or nicely-behaved nonconvex relaxations. In this work we consider elementary methods based on projections for solving a sparse feasibility problem without employing convex heuristics. In a…
▽ More
The problem of finding a vector with the fewest nonzero elements that satisfies an underdetermined system of linear equations is an NP-complete problem that is typically solved numerically via convex heuristics or nicely-behaved nonconvex relaxations. In this work we consider elementary methods based on projections for solving a sparse feasibility problem without employing convex heuristics. In a recent paper Bauschke, Luke, Phan and Wang (2014) showed that, locally, the fundamental method of alternating projections must converge linearly to a solution to the sparse feasibility problem with an affine constraint. In this paper we apply different analytical tools that allow us to show global linear convergence of alternating projections under familiar constraint qualifications. These analytical tools can also be applied to other algorithms. This is demonstrated with the prominent Douglas-Rachford algorithm where we establish local linear convergence of this method applied to the sparse affine feasibility problem.
△ Less
Submitted 14 March, 2014; v1 submitted 8 July, 2013;
originally announced July 2013.
-
Nonconvex notions of regularity and convergence of fundamental algorithms for feasibility problems
Authors:
Robert Hesse,
D. Russell Luke
Abstract:
We consider projection algorithms for solving (nonconvex) feasibility problems in Euclidean spaces. Of special interest are the Method of Alternating Projections (MAP) and the Douglas-Rachford or Averaged Alternating Reflection Algorithm (AAR). In the case of convex feasibility, firm nonexpansiveness of projection map**s is a global property that yields global convergence of MAP and for consiste…
▽ More
We consider projection algorithms for solving (nonconvex) feasibility problems in Euclidean spaces. Of special interest are the Method of Alternating Projections (MAP) and the Douglas-Rachford or Averaged Alternating Reflection Algorithm (AAR). In the case of convex feasibility, firm nonexpansiveness of projection map**s is a global property that yields global convergence of MAP and for consistent problems AAR. Based on (ε, δ)-regularity of sets developed by Bauschke, Luke, Phan and Wang in 2012, a relaxed local version of firm nonexpansiveness with respect to the intersection is introduced for consistent feasibility problems. Together with a coercivity condition that relates to the regularity of the intersection, this yields local linear convergence of MAP for a wide class of nonconvex problems,
△ Less
Submitted 24 June, 2013; v1 submitted 13 December, 2012;
originally announced December 2012.