-
Moving Frame Net: SE(3)-Equivariant Network for Volumes
Authors:
Mateus Sangalli,
Samy Blusseau,
Santiago Velasco-Forero,
Jesus Angulo
Abstract:
Equivariance of neural networks to transformations helps to improve their performance and reduce generalization error in computer vision tasks, as they apply to datasets presenting symmetries (e.g. scalings, rotations, translations). The method of moving frames is a classical tool for deriving operators invariant to the action of a Lie group on a manifold. Recently, a rotation and translation equivariant neural network for image data was proposed based on the moving frames approach. In this paper we significantly improve that approach by reducing the computation of moving frames to only one, at the input stage, instead of repeated computations at each layer. The equivariance of the resulting architecture is proved theoretically and we build a rotation and translation equivariant neural network to process volumes, i.e. signals on the 3D space. Our trained model outperforms the benchmarks in medical volume classification on most of the tested datasets from MedMNIST3D.
Submitted 7 November, 2022;
originally announced November 2022.
-
The optically elusive, changing-look active nucleus in NGC 4156
Authors:
Giulia Tozzi,
Elisabeta Lusso,
Lapo Casetti,
Marco Romoli,
Gloria Andreuzzi,
Isabel Montoya A.,
Emanuele Nardini,
Giovanni Cresci,
Riccardo Middei,
Silvia Bertolini,
Paolo Calabretto,
Vieri Cammelli,
Francisco Cuadra,
Marco Dalla Ragione,
Cosimo Marconcini,
Adriano Miceli,
Irene Mini,
Martina Palazzini,
Giorgio Rotellini,
Andrea Saccardi,
Lavinia Samà,
Mattia Sangalli,
Lorenzo Serafini,
Fabio Spaccino
Abstract:
We report on the changing-look nature of the active galactic nucleus (AGN) in the galaxy NGC 4156, as serendipitously discovered thanks to data acquired in 2019 at the Telescopio Nazionale Galileo (TNG) during a students' observing programme. Previous optical spectra had never shown any signatures of broad-line emission, and evidence of the AGN had come only from X-ray observations, since the optical narrow-line flux ratios could not unambiguously classify this galaxy as a Seyfert. Our 2019 TNG data unexpectedly revealed the appearance of broad-line components in both the H$\alpha$ and H$\beta$ profiles, along with a rise of the continuum, thus implying a changing-look AGN transitioning from a type 2 (no broad-line emission) towards a (nearly) type 1. The broad-line emission was then confirmed by our 2022 follow-up observations, whereas the rising continuum was no longer detected, which hints at a further evolution back to a (nearly) type 2. The presence of broad-line components also allowed us to obtain the first single-epoch estimate of the black hole mass ($\log(M_{\rm BH}/M_\odot) \sim 8.1$) in this source. The observed spectral variability might be the result of a change in the accretion activity of NGC 4156, although variable absorption cannot be completely excluded.
Submitted 17 October, 2022;
originally announced October 2022.
-
Scale Equivariant U-Net
Authors:
Mateus Sangalli,
Samy Blusseau,
Santiago Velasco-Forero,
Jesus Angulo
Abstract:
In neural networks, the property of being equivariant to transformations improves generalization when the corresponding symmetry is present in the data. In particular, scale-equivariant networks are suited to computer vision tasks where the same classes of objects appear at different scales, as in most semantic segmentation tasks. Recently, convolutional layers equivariant to a semigroup of scalings and translations have been proposed. However, the equivariance of subsampling and upsampling has never been explicitly studied even though they are necessary building blocks in some segmentation architectures. The U-Net is a representative example of such architectures, which includes the basic elements used for state-of-the-art semantic segmentation. Therefore, this paper introduces the Scale Equivariant U-Net (SEU-Net), a U-Net that is made approximately equivariant to a semigroup of scales and translations through careful application of subsampling and upsampling layers and the use of the aforementioned scale-equivariant layers. Moreover, a scale-dropout is proposed in order to improve generalization to different scales in approximately scale-equivariant architectures. The proposed SEU-Net is trained for semantic segmentation of the Oxford Pet IIIT dataset and for cell segmentation of the DIC-C2DH-HeLa dataset. The generalization metric to unseen scales is dramatically improved in comparison to the U-Net, even when the U-Net is trained with scale jittering, and to a scale-equivariant architecture that does not apply upsampling operators inside the equivariant pipeline. The scale-dropout induces better generalization on the scale-equivariant models in the Pet experiment, but not in the cell segmentation experiment.
Submitted 10 October, 2022;
originally announced October 2022.
-
Differential invariants for SE(2)-equivariant networks
Authors:
Mateus Sangalli,
Samy Blusseau,
Santiago Velasco-Forero,
Jesús Angulo
Abstract:
Symmetry is present in many tasks in computer vision, where the same class of objects can appear transformed, e.g. rotated due to different camera orientations, or scaled due to perspective. The knowledge of such symmetries in data coupled with equivariance of neural networks can improve their generalization to new samples. Differential invariants are equivariant operators computed from the partial derivatives of a function. In this paper we use differential invariants to define equivariant operators that form the layers of an equivariant neural network. Specifically, we derive invariants of the Special Euclidean Group SE(2), composed of rotations and translations, and apply them to construct an SE(2)-equivariant network, called SE(2) Differential Invariants Network (SE2DINNet). The network is subsequently tested in classification tasks which require a degree of equivariance or invariance to rotations. The results compare favorably with the state-of-the-art, even though the proposed SE2DINNet has far fewer parameters than the compared models.
Submitted 22 July, 2022; v1 submitted 27 June, 2022;
originally announced June 2022.
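As an illustrative aside on the entry above: the abstract describes differential invariants as operators built from partial derivatives. The sketch below is not the SE2DINNet implementation; it only checks numerically that two textbook examples, the squared gradient magnitude and the Laplacian, take the same value before and after a rotation. The test function `f`, the helper names, and the finite-difference step `h` are all assumptions chosen for illustration.

```python
import math

def f(x, y):
    # Arbitrary smooth test function (an assumption, not from the paper).
    return x**3 - 2*x*y + 0.5*y**2

def rotate(x, y, theta):
    # Rotate the point (x, y) counter-clockwise by theta.
    c, s = math.cos(theta), math.sin(theta)
    return c*x - s*y, s*x + c*y

def rotated(func, theta):
    # Action of a rotation on a function: (R.f)(p) = f(R^{-1} p).
    def g(x, y):
        xr, yr = rotate(x, y, -theta)
        return func(xr, yr)
    return g

def invariants(func, x, y, h=1e-3):
    # Central finite differences for first and second partial derivatives.
    fx = (func(x + h, y) - func(x - h, y)) / (2*h)
    fy = (func(x, y + h) - func(x, y - h)) / (2*h)
    fxx = (func(x + h, y) - 2*func(x, y) + func(x - h, y)) / h**2
    fyy = (func(x, y + h) - 2*func(x, y) + func(x, y - h)) / h**2
    # Squared gradient magnitude and Laplacian.
    return fx*fx + fy*fy, fxx + fyy

theta = 0.7
p = (0.3, -0.4)
q = rotate(p[0], p[1], theta)

before = invariants(f, *p)
after = invariants(rotated(f, theta), *q)
print(before, after)  # the two pairs should agree up to discretization error
```

Evaluating the rotated function at the rotated point is what makes this an equivariance check: the invariant's value follows the point as the image rotates.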
-
Scale Equivariant Neural Networks with Morphological Scale-Spaces
Authors:
Mateus Sangalli,
Samy Blusseau,
Santiago Velasco-Forero,
Jesus Angulo
Abstract:
The translation equivariance of convolutions can make convolutional neural networks translation equivariant or invariant. Equivariance to other transformations (e.g. rotations, affine transformations, scalings) may also be desirable as soon as we know a priori that transformed versions of the same objects appear in the data. The semigroup cross-correlation, which is a linear operator equivariant to semigroup actions, was recently proposed and applied in conjunction with the Gaussian scale-space to create architectures which are equivariant to discrete scalings. In this paper, a generalization using a broad class of liftings, including morphological scale-spaces, is proposed. The architectures obtained from different scale-spaces are tested and compared in supervised classification and semantic segmentation tasks where objects in test images appear at different scales compared to training images. In both classification and segmentation tasks, the scale-equivariant architectures dramatically improve the generalization to unseen scales compared to a convolutional baseline. Moreover, in our experiments morphological scale-spaces outperformed the Gaussian scale-space in geometrical tasks.
Submitted 4 May, 2021;
originally announced May 2021.
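As an illustrative aside on the entry above: the abstract starts from the translation equivariance of convolutions. A minimal sketch of that property, assuming a 1D signal with circular (wrap-around) boundaries, is given below. The helper names `shift` and `cross_correlate` are hypothetical, and this is the ordinary cross-correlation, not the semigroup cross-correlation proposed in the paper.

```python
def shift(signal, k):
    """Circularly shift a list k positions to the right."""
    n = len(signal)
    return [signal[(i - k) % n] for i in range(n)]

def cross_correlate(signal, kernel):
    """Circular cross-correlation: out[i] = sum_j kernel[j] * signal[(i + j) % n]."""
    n = len(signal)
    return [sum(w * signal[(i + j) % n] for j, w in enumerate(kernel))
            for i in range(n)]

signal = [1, 4, 2, 0, 3, 5]
kernel = [1, -2, 1]

# Equivariance: filtering a shifted signal equals shifting the filtered signal.
shift_then_filter = cross_correlate(shift(signal, 2), kernel)
filter_then_shift = shift(cross_correlate(signal, kernel), 2)
print(shift_then_filter == filter_then_shift)
```

With zero padding instead of circular boundaries the identity only holds away from the borders, which is why the circular case is used here.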
-
Smooth Principal Component Analysis over two-dimensional manifolds with an application to Neuroimaging
Authors:
Eardi Lila,
John A. D. Aston,
Laura M. Sangalli
Abstract:
Motivated by the analysis of high-dimensional neuroimaging signals located over the cortical surface, we introduce a novel Principal Component Analysis technique that can handle functional data located over a two-dimensional manifold. For this purpose a regularization approach is adopted, introducing a smoothing penalty coherent with the geodesic distance over the manifold. The model introduced can be applied to any manifold topology, and can naturally handle missing data and functional samples evaluated on different grids of points. We approach the discretization task by means of finite element analysis and propose an efficient iterative algorithm for its resolution. We compare the performance of the proposed algorithm with other approaches classically adopted in the literature. We finally apply the proposed method to resting state functional magnetic resonance imaging data from the Human Connectome Project, where the method shows substantial differential variations between brain regions that were not apparent with other approaches.
Submitted 12 September, 2016; v1 submitted 14 January, 2016;
originally announced January 2016.
-
Functional Data Analysis of Amplitude and Phase Variation
Authors:
J. S. Marron,
James O. Ramsay,
Laura M. Sangalli,
Anuj Srivastava
Abstract:
The abundance of functional observations in scientific endeavors has led to a significant development in tools for functional data analysis (FDA). This kind of data comes with several challenges: infinite-dimensionality of function spaces, observation noise, and so on. However, there is another interesting phenomenon that creates problems in FDA. Functional data often come with lateral displacements/deformations in curves, a phenomenon which is different from the height or amplitude variability and is termed phase variation. The presence of phase variability often artificially inflates data variance, blurs underlying data structures, and distorts principal components. While the separation and/or removal of phase from amplitude data is desirable, this is a difficult problem. In particular, a commonly used alignment procedure, based on minimizing the $\mathbb{L}^2$ norm between functions, does not provide satisfactory results. In this paper we motivate the importance of dealing with the phase variability and summarize several current ideas for separating phase and amplitude components. These approaches differ in the following: (1) the definition and mathematical representation of phase variability, (2) the objective functions that are used in functional data alignment, and (3) the algorithmic tools for solving estimation/optimization problems. We use simple examples to illustrate various approaches and to provide useful contrast between them.
Submitted 10 December, 2015;
originally announced December 2015.
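As an illustrative aside on the entry above: the abstract notes that phase variation blurs underlying data structures. The toy sketch below, which is not taken from the paper, shows the effect on a pointwise mean. It assumes identical Gaussian bumps that differ only by a known lateral shift; the bump width, grid, and shift values are arbitrary choices, and "alignment" here simply undoes the known shifts, which real data would not provide.

```python
import math

def bump(t, center, width=0.05):
    # Gaussian bump of height 1 centred at `center`.
    return math.exp(-((t - center) ** 2) / (2 * width ** 2))

grid = [-0.5 + 0.01 * i for i in range(101)]
shifts = [-0.2, -0.1, 0.0, 0.1, 0.2]

# Same shape, different lateral positions: pure phase variation.
unaligned = [[bump(t, s) for t in grid] for s in shifts]
# Idealized alignment: every curve moved back to a common centre.
aligned = [[bump(t, 0.0) for t in grid] for _ in shifts]

def pointwise_mean(curves):
    # Cross-sectional mean of the curves at each grid point.
    return [sum(values) / len(values) for values in zip(*curves)]

peak_unaligned = max(pointwise_mean(unaligned))
peak_aligned = max(pointwise_mean(aligned))
print(peak_unaligned, peak_aligned)
```

The mean of the unaligned curves has a much lower peak than any individual curve, although every curve has the same amplitude; the aligned mean recovers the common shape.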
-
Generalized Spatial Regression with Differential Regularization
Authors:
Matthieu Wilhelm,
Laura M. Sangalli
Abstract:
We aim at analyzing geostatistical and areal data observed over irregularly shaped spatial domains and having a distribution within the exponential family. We propose a generalized additive model that can account for spatially-varying covariate information. The model is fitted by maximizing a penalized log-likelihood function, with a roughness penalty term that involves a differential quantity of the spatial field, computed over the domain of interest. Efficient estimation of the spatial field is achieved by resorting to the finite element method, which provides a basis for piecewise polynomial surfaces. The proposed model is illustrated by an application to the study of criminality in the city of Portland, Oregon, USA.
Submitted 21 April, 2016; v1 submitted 9 November, 2015;
originally announced November 2015.
-
IGS: an IsoGeometric approach for Smoothing on surfaces
Authors:
Matthieu Wilhelm,
Luca Dedè,
Laura M. Sangalli,
Pierre Wilhelm
Abstract:
We propose an Isogeometric approach for smoothing on surfaces, namely estimating a function starting from noisy and discrete measurements. More precisely, we aim at estimating functions lying on a surface represented by NURBS, which are geometrical representations commonly used in industrial applications. The estimation is based on the minimization of a penalized least-square functional. The latter is equivalent to solving a 4th-order Partial Differential Equation (PDE). In this context, we use Isogeometric Analysis (IGA) for the numerical approximation of such surface PDE, leading to an IsoGeometric Smoothing (IGS) method for fitting data spatially distributed on a surface. Indeed, IGA facilitates encapsulating the exact geometrical representation of the surface in the analysis and also allows the use of at least globally $C^1$-continuous NURBS basis functions for which the 4th-order PDE can be solved using the standard Galerkin method. We show the performance of the proposed IGS method by means of numerical simulations and we apply it to the estimation of the pressure coefficient and associated aerodynamic force on a winglet of the SOAR space shuttle.
Submitted 1 January, 2016; v1 submitted 21 August, 2015;
originally announced August 2015.
-
Latent diffusion models for survival analysis
Authors:
Gareth O. Roberts,
Laura M. Sangalli
Abstract:
We consider Bayesian hierarchical models for survival analysis, where the survival times are modeled through an underlying diffusion process which determines the hazard rate. We show how these models can be efficiently treated by means of Markov chain Monte Carlo techniques.
Submitted 8 October, 2010;
originally announced October 2010.