Search | arXiv e-print repository

Instance-wise Linearization of Neural Network for Model Interpretation

Authors: Zhimin Li, Shusen Liu, Kailkhura Bhavya, Timo Bremer, Valerio Pascucci

Abstract: Neural networks have achieved remarkable success in many scientific fields. However, the interpretability of neural network models remains a major bottleneck to deploying such techniques in daily life. The challenge stems from the non-linear behavior of the neural network, which raises a critical question: how does a model use its input features to make a decision? The classical approach to this challenge is feature attribution, which assigns an importance score to each input feature and reveals its importance for the current prediction. However, current feature attribution approaches often indicate the importance of each input feature without detailing how the features are actually processed inside the model. This raises the concern of whether these approaches highlight the correct features for a model prediction. In a neural network model, the non-linear behavior is often caused by the model's non-linear activation units. However, the computation behind a single prediction is locally linear, because one prediction has only one activation pattern. Based on this observation, we propose an instance-wise linearization approach that reformulates the forward computation of a neural network prediction. This approach reformulates the different layers of a convolutional neural network as linear matrix multiplications. Aggregating the computation of all layers, the complex operations a convolutional neural network performs for a prediction can be described as a linear matrix multiplication $F(x) = W \cdot x + b$. This equation not only provides a feature attribution map that highlights the importance of the input features but also tells exactly how each input feature contributes to a prediction. Furthermore, we discuss the application of this technique to both supervised classification and unsupervised neural network learning, namely parametric t-SNE dimension reduction.

Submitted 24 October, 2023; originally announced October 2023.
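
The locally linear form $F(x) = W \cdot x + b$ described in the abstract above can be illustrated with a minimal sketch. It assumes a small fully connected ReLU network as a stand-in for the convolutional layers the abstract discusses (a convolution is also a linear map), and it is not the authors' implementation; `forward`, `linearize`, and the layer sizes are hypothetical names and values chosen for illustration.

```python
import numpy as np

def forward(weights, biases, x):
    """Ordinary forward pass: ReLU after every layer except the last."""
    h = x
    for i, (W, b) in enumerate(zip(weights, biases)):
        h = W @ h + b
        if i < len(weights) - 1:
            h = np.maximum(h, 0.0)
    return h

def linearize(weights, biases, x):
    """Return (W_eff, b_eff) with forward(x) == W_eff @ x + b_eff for this x."""
    W_eff = np.eye(x.shape[0])
    b_eff = np.zeros(x.shape[0])
    h = x
    for i, (W, b) in enumerate(zip(weights, biases)):
        pre = W @ h + b
        # Fold this layer's affine map into the running affine map.
        W_eff = W @ W_eff
        b_eff = W @ b_eff + b
        if i < len(weights) - 1:
            # The activation pattern of this input turns ReLU into a
            # fixed 0/1 diagonal scaling, which is linear.
            mask = (pre > 0).astype(pre.dtype)
            W_eff = mask[:, None] * W_eff
            b_eff = mask * b_eff
            h = mask * pre
        else:
            h = pre
    return W_eff, b_eff

# Hypothetical toy network: 4 inputs, two hidden layers, 3 outputs.
rng = np.random.default_rng(0)
sizes = [4, 8, 6, 3]
weights = [rng.normal(size=(m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
biases = [rng.normal(size=m) for m in sizes[1:]]
x = rng.normal(size=sizes[0])

W, b = linearize(weights, biases, x)
output = forward(weights, biases, x)
# attribution[c, j] is the contribution of input feature j to output c.
attribution = W * x
print(np.allclose(W @ x + b, output))
```

Here `W` and `b` are valid only for the activation pattern of this particular `x`; a different input generally yields a different pair. Each row of `attribution` sums, together with the matching entry of `b`, to the corresponding network output.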

arXiv:2305.08614 [pdf, ps, other]

doi 10.1017/jfm.2023.766

The influence of spectral bandwidth and shape on deep-water wave breaking onset

Authors: M. L. McAllister, N. Pizzo, S. Draycott, T. S. van den Bremer

Abstract: Deep-water surface wave breaking affects the transfer of mass, momentum, energy and heat between the air and sea. Understanding when and how the onset of wave breaking will occur remains a challenge. The mechanisms that form steep waves, i.e. nonlinearity or dispersion, are thought to have a strong influence on the onset of wave breaking. In two dimensions and on deep water, spectral bandwidth is the main factor that affects the roles these mechanisms play. Existing studies, in which the relationship between spectral bandwidth and wave breaking onset is investigated, present varied and sometimes conflicting results. We perform numerical simulations of two-dimensional focused wave groups on deep water to better understand this relationship, with the aim of reconciling existing studies. We show that the way in which steepness is defined may be the main source of confusion in the literature. At breaking onset, locally defined steepness reduces as a function of bandwidth, and globally defined steepness increases. The relationship between global breaking onset steepness and spectral shape (using the parameters bandwidth and spectral skewness) is too complex to parameterise in a general sense. However, we find that the local surface slope of maximally steep non-breaking waves, of all spectral bandwidths and shapes (constant-steepness, constant-amplitude, and JONSWAP), approaches a limit of $1/\tan(\pi/3)\approx0.5774$. This slope-based threshold is simple to measure and may be used as an alternative to existing kinematic breaking onset thresholds. There is a potential link between slope-based and kinematic breaking onset thresholds, which future work should seek to better understand.

Submitted 15 May, 2023; originally announced May 2023.

Journal ref: Journal of Fluid Mechanics. 2023;974:A14
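
The slope limit quoted in the abstract above, $1/\tan(\pi/3)$, equals $1/\sqrt{3}$, i.e. a surface inclined at 30 degrees. The sketch below checks that arithmetic and shows one way a sampled surface elevation could be compared against the threshold. The helper `max_local_slope`, the central-difference estimate, and the sinusoidal test wave are assumptions made for illustration, not the paper's procedure.

```python
import math

# Slope limit from the abstract: 1 / tan(pi / 3), which equals 1 / sqrt(3).
SLOPE_LIMIT = 1.0 / math.tan(math.pi / 3.0)

def max_local_slope(eta, dx):
    """Largest absolute slope of a surface elevation sampled every dx,
    estimated with central differences (hypothetical helper)."""
    return max(
        abs(eta[i + 1] - eta[i - 1]) / (2.0 * dx)
        for i in range(1, len(eta) - 1)
    )

# Hypothetical test case: one wavelength of a linear wave with steepness ak = 0.3.
k = 2.0 * math.pi          # wavenumber for a unit wavelength
a = 0.3 / k                # amplitude chosen so that a * k = 0.3
dx = 0.001
eta = [a * math.cos(k * i * dx) for i in range(1001)]

steepest = max_local_slope(eta, dx)
limit_angle_deg = math.degrees(math.atan(SLOPE_LIMIT))
print(f"limit {SLOPE_LIMIT:.4f} ({limit_angle_deg:.1f} deg), measured {steepest:.4f}")
print("below slope threshold:", steepest < SLOPE_LIMIT)
```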

arXiv:2301.10603 [pdf, other]

Stochastic particle transport by deep-water irregular breaking waves

Authors: D. Eeltink, R. Calvert, J. E. Swagemakers, Qian Xiao, T. S. van den Bremer

Abstract: Correct prediction of particle transport by surface waves is crucial in many practical applications such as search and rescue or salvage operations and pollution tracking and clean-up efforts. Recent results have indicated transport by deep-water breaking waves is enhanced compared to non-breaking waves. To model particle transport in irregular waves, some of which break, we develop a stochastic differential equation describing both mean particle transport and its uncertainty. The equation combines a Brownian motion, which captures non-breaking drift-diffusion effects, and a compound Poisson process, which captures jumps in particle positions due to breaking. We corroborate these predictions with new experiments, in which we track large numbers of particles in irregular breaking waves. For breaking and non-breaking wave fields, our experiments confirm that the variance of the particle position grows linearly with time, in accordance with Taylor's single-particle dispersion theory. For wave fields that include breaking, the compound Poisson process increases the linear growth rate of the mean and variance and introduces a finite skewness of the particle position distribution.

Submitted 25 January, 2023; originally announced January 2023.
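
The structure of the model in the abstract above, a Brownian motion with drift plus a compound Poisson process, can be sketched as a small Monte Carlo experiment. This is a generic jump-diffusion sketch under stated assumptions, not the paper's calibrated model: the exponential jump-size distribution, all parameter values, and the name `sample_position` are hypothetical. For such a process the mean grows as (drift + rate × E[Y])·t and the variance as (σ² + rate × E[Y²])·t, where Y is the jump size.

```python
import math
import random
import statistics

def sample_position(t_end, drift, sigma, jump_rate, jump_mean, rng):
    """Draw X(t_end) for dX = drift dt + sigma dW + dJ, with X(0) = 0.

    J is a compound Poisson process with intensity jump_rate and
    exponentially distributed jump sizes (an assumption of this sketch).
    """
    # The Brownian-with-drift part can be sampled exactly at t_end.
    x = drift * t_end + sigma * math.sqrt(t_end) * rng.gauss(0.0, 1.0)
    # Jump times of a Poisson process have exponential inter-arrival times.
    t = rng.expovariate(jump_rate)
    while t <= t_end:
        x += rng.expovariate(1.0 / jump_mean)  # forward jump, mean jump_mean
        t += rng.expovariate(jump_rate)
    return x

# Hypothetical parameter values.
drift, sigma = 0.1, 0.5
jump_rate, jump_mean = 0.2, 1.0
t_end = 10.0

rng = random.Random(0)
samples = [
    sample_position(t_end, drift, sigma, jump_rate, jump_mean, rng)
    for _ in range(20000)
]

# Theoretical linear growth rates; E[Y^2] = 2 * jump_mean**2 for exponential Y.
mean_rate = drift + jump_rate * jump_mean
var_rate = sigma**2 + jump_rate * 2.0 * jump_mean**2

sample_mean = statistics.fmean(samples)
sample_var = statistics.variance(samples)
third_moment = statistics.fmean((s - sample_mean) ** 3 for s in samples)

print("mean:", sample_mean, "theory:", mean_rate * t_end)
print("variance:", sample_var, "theory:", var_rate * t_end)
print("third central moment (positive means right-skewed):", third_moment)
```

Setting `jump_rate` close to zero recovers the non-breaking case, in which only the Brownian term contributes and the distribution is symmetric.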

arXiv:2109.10428 [pdf, ps, other]

doi 10.3847/1538-3881/ac235c

Variability in Proto-Planetary Nebulae: VIII. A New Sample of Southern Hemisphere Objects

Authors: Bruce J. Hrivnak, Gary Henson, Todd C. Hillwig, Wenxian Lu, Matthew T. Bremer, David M. Vogl, Peyton J. Grimm, Sean M. Egan

Abstract: As part of our continuing study of light variability in proto-planetary nebulae (PPNe), we present the results from a long-term study of nine southern hemisphere objects. We have monitored their light variations over a nine-year interval from 2010-2018. These were supplemented by data from the ASAS-SN and ASAS-3 surveys, leading to combined light curves from 2000 to 2020. Pulsation periods were found in seven of the objects, although the three shortest must be regarded as tentative. The periods range from 24 to 73 days. When compared with the results of previous studies of the light variations in PPNe, we find that they show the same trends of shorter period and smaller light variations with higher temperatures. Luminosities were calculated based on the spectral energy distributions, reddening, and Gaia distances, and these confirm the identification of all but one as post-AGB objects. Three of the stars possess long-period variations of 5 to 19 years. These are most likely due to the periodic obscuration of the star by a disk, suggesting the presence of a binary companion and a circumbinary disk.

Submitted 4 October, 2021; v1 submitted 21 September, 2021; originally announced September 2021.

Comments: 27 pages, 6 tables, 8 figures. Accepted Sep 1, 2021 for publication in the Astronomical Journal: revised in attempt to correct pdf file

arXiv:2102.09836 [pdf, ps, other]

The increased wave-induced drift of floating marine litter: A mechanism for the increased wave-induced drift of floating marine litter

Authors: R. Calvert, M. L. McAllister, C. Whittaker, A. Raby, A. G. L. Borthwick, T. S. van den Bremer

Abstract: Periodic water waves generate Stokes drift as manifest from the orbits of Lagrangian particles not fully closing. Stokes drift can contribute to the transport of floating marine litter, including plastic. Previously, marine litter objects have been considered to be perfect Lagrangian tracers, travelling with the Stokes drift of the waves. However, floating marine litter objects have large ranges of sizes and densities, which potentially result in different rates of transport by waves due to the non-Lagrangian behaviour of the objects. Through a combination of theory and experiments for idealised spherical objects in deep-water waves, we show that different objects are transported at different rates depending on their size and density, and that larger buoyant objects can have increased drift compared with Lagrangian tracers. We show that the mechanism for the increased drift observed in our experiments comprises the variable submergence and the corresponding dynamic buoyancy force components in a direction perpendicular to the local water surface. This leads to an amplification of the drift of these objects compared to the Stokes drift when averaged over the wave cycle. Using an expansion in wave steepness, we derive a closed-form approximation for this increased drift, which can be included in ocean-scale models of marine litter transport.

Submitted 19 February, 2021; originally announced February 2021.

Comments: Accepted and In press, Journal of Fluid Mechanics
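
For orientation, the baseline that the abstract above compares against, the Stokes drift of a perfect Lagrangian tracer, has a standard leading-order expression for a monochromatic deep-water wave: $u_S(z) = a^2 \omega k \, e^{2kz}$ with $\omega^2 = gk$ and $z \le 0$. The sketch below evaluates only this classical formula; it does not implement the paper's closed-form correction for finite-size buoyant objects, and the function name and wave parameters are hypothetical.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2

def stokes_drift(amplitude, wavenumber, z=0.0):
    """Leading-order Stokes drift (m/s) of a Lagrangian tracer in a
    monochromatic deep-water wave, at vertical position z <= 0."""
    omega = math.sqrt(G * wavenumber)  # deep-water dispersion relation
    return amplitude**2 * omega * wavenumber * math.exp(2.0 * wavenumber * z)

# Hypothetical wave: 100 m wavelength, 1 m amplitude.
wavelength = 100.0
k = 2.0 * math.pi / wavelength
a = 1.0

surface_drift = stokes_drift(a, k)
drift_at_depth = stokes_drift(a, k, z=-10.0)
print(f"surface drift: {surface_drift:.4f} m/s")
print(f"drift at 10 m depth: {drift_at_depth:.4f} m/s")
```

The surface value equals the squared steepness $(ak)^2$ times the phase speed, which is why the drift is a second-order effect in wave steepness.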

arXiv:1912.07748 [pdf, other]

MimicGAN: Robust Projection onto Image Manifolds with Corruption Mimicking

Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

Abstract: In the past few years, Generative Adversarial Networks (GANs) have dramatically advanced our ability to represent and parameterize high-dimensional, non-linear image manifolds. As a result, they have been widely adopted across a variety of applications, ranging from challenging inverse problems like image completion, to problems such as anomaly detection and adversarial defense. A recurring theme in many of these applications is the notion of projecting an image observation onto the manifold that is inferred by the generator. In this context, Projected Gradient Descent (PGD) has been the most popular approach, which essentially optimizes for a latent vector that minimizes the discrepancy between a generated image and the given observation. However, PGD is a brittle optimization technique that fails to identify the right projection (or latent vector) when the observation is corrupted, or perturbed even by a small amount. Such corruptions are common in the real world, for example images in the wild come with unknown crops, rotations, missing pixels, or other kinds of non-linear distributional shifts which break current encoding methods, rendering downstream applications unusable. To address this, we propose corruption mimicking -- a new robust projection technique, that utilizes a surrogate network to approximate the unknown corruption directly at test time, without the need for additional supervision or data augmentation. The proposed method is significantly more robust than PGD and other competing methods under a wide variety of corruptions, thereby enabling a more effective use of GANs in real-world applications. More importantly, we show that our approach produces state-of-the-art performance in several GAN-based applications -- anomaly detection, domain adaptation, and adversarial defense, that benefit from an accurate projection.

Submitted 30 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

Comments: International Journal on Computer Vision's (IJCV) Special Issue on GANs

arXiv:1812.07423 [pdf, other]

doi 10.1073/pnas.1821970116

Directional Soliton and Breather Beams

Authors: Amin Chabchoub, Kento Mozumi, Norbert Hoffmann, Alexander V. Babanin, Alessandro Toffoli, James N. Steer, Ton S. van den Bremer, Nail Akhmediev, Miguel Onorato, Takuji Waseda

Abstract: Solitons and breathers are nonlinear modes that exist in a wide range of physical systems. They are fundamental solutions of a number of nonlinear wave evolution equations, including the uni-directional nonlinear Schrödinger equation (NLSE). We report the observation of slanted solitons and breathers propagating at an angle with respect to the direction of propagation of the wave field. As the coherence is diagonal, the scale in the crest direction becomes finite, consequently, a beam dynamics forms. Spatio-temporal measurements of the water surface elevation are obtained by stereo-reconstructing the positions of the floating markers placed on a regular lattice and recorded with two synchronized high-speed cameras. Experimental results, based on the predictions obtained from the (2D+1) hyperbolic NLSE equation, are in excellent agreement with the theory. Our study proves the existence of such unique and coherent wave packets and has serious implications for practical applications in optical sciences and physical oceanography. Moreover, unstable wave fields in this geometry may explain the formation of directional large amplitude rogue waves with a finite crest length within a wide range of nonlinear dispersive media, such as Bose-Einstein condensates, plasma, hydrodynamics and optics.

Submitted 13 December, 2018; originally announced December 2018.

arXiv:1811.08484 [pdf, other]

MimicGAN: Corruption-Mimicking for Blind Image Recovery & Adversarial Defense

Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

Abstract: Solving inverse problems continues to be a central challenge in computer vision. Existing techniques either explicitly construct an inverse mapping using prior knowledge about the corruption, or learn the inverse directly using a large collection of examples. However, in practice, the nature of corruption may be unknown, and thus it is challenging to regularize the problem of inferring a plausible solution. On the other hand, collecting task-specific training data is tedious for known corruptions and impossible for unknown ones. We present MimicGAN, an unsupervised technique to solve general inverse problems based on image priors in the form of generative adversarial networks (GANs). Using a GAN prior, we show that one can reliably recover solutions to underdetermined inverse problems through a surrogate network that learns to mimic the corruption at test time. Our system successively estimates the corruption and the clean image without the need for supervisory training, while outperforming existing baselines in blind image recovery. We also demonstrate that MimicGAN improves upon recent GAN-based defenses against adversarial attacks and represents one of the strongest test-time defenses available today.

Submitted 20 November, 2018; originally announced November 2018.

arXiv:1805.07281 [pdf, other]

An Unsupervised Approach to Solving Inverse Problems using Generative Adversarial Networks

Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

Abstract: Solving inverse problems continues to be a challenge in a wide array of applications ranging from deblurring, image inpainting, source separation etc. Most existing techniques solve such inverse problems by either explicitly or implicitly finding the inverse of the model. The former class of techniques require explicit knowledge of the measurement process which can be unrealistic, and rely on strong analytical regularizers to constrain the solution space, which often do not generalize well. The latter approaches have had remarkable success in part due to deep learning, but require a large collection of source-observation pairs, which can be prohibitively expensive. In this paper, we propose an unsupervised technique to solve inverse problems with generative adversarial networks (GANs). Using a pre-trained GAN in the space of source signals, we show that one can reliably recover solutions to underdetermined problems in a `blind' fashion, i.e., without knowledge of the measurement process. We solve this by making successive estimates on the model and the solution in an iterative fashion. We show promising results in three challenging applications -- blind source separation, image deblurring, and recovering an image from its edge map, and perform better than several baselines.

Submitted 4 June, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

arXiv:1711.10388 [pdf, other]

Lose The Views: Limited Angle CT Reconstruction via Implicit Sinogram Completion

Authors: Rushil Anirudh, Hyojin Kim, Jayaraman J. Thiagarajan, K. Aditya Mohan, Kyle Champley, Timo Bremer

Abstract: Computed Tomography (CT) reconstruction is a fundamental component to a wide variety of applications ranging from security, to healthcare. The classical techniques require measuring projections, called sinograms, from a full 180$^\circ$ view of the object. This is impractical in a limited angle scenario, when the viewing angle is less than 180$^\circ$, which can occur due to different factors including restrictions on scanning time, limited flexibility of scanner rotation, etc. The sinograms obtained as a result, cause existing techniques to produce highly artifact-laden reconstructions. In this paper, we propose to address this problem through implicit sinogram completion, on a challenging real world dataset containing scans of common checked-in luggage. We propose a system, consisting of 1D and 2D convolutional neural networks, that operates on a limited angle sinogram to directly produce the best estimate of a reconstruction. Next, we use the x-ray transform on this reconstruction to obtain a "completed" sinogram, as if it came from a full 180$^\circ$ measurement. We feed this to standard analytical and iterative reconstruction techniques to obtain the final reconstruction. We show with extensive experimentation that this combined strategy outperforms many competitive baselines. We also propose a measure of confidence for the reconstruction that enables a practitioner to gauge the reliability of a prediction made by our network. We show that this measure is a strong indicator of quality as measured by the PSNR, while not requiring ground truth at test time. Finally, using a segmentation experiment, we show that our reconstruction preserves the 3D structure of objects effectively.

Submitted 11 July, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

Comments: Spotlight presentation at CVPR 2018

Showing 1–10 of 10 results for author: Bremer, T