Search | arXiv e-print repository

Moving Frame Net: SE(3)-Equivariant Network for Volumes

Authors: Mateus Sangalli, Samy Blusseau, Santiago Velasco-Forero, Jesus Angulo

Abstract: Equivariance of neural networks to transformations helps to improve their performance and reduce generalization error in computer vision tasks, as they apply to datasets presenting symmetries (e.g. scalings, rotations, translations). The method of moving frames is classical for deriving operators invariant to the action of a Lie group in a manifold.Recently, a rotation and translation equivariant… ▽ More Equivariance of neural networks to transformations helps to improve their performance and reduce generalization error in computer vision tasks, as they apply to datasets presenting symmetries (e.g. scalings, rotations, translations). The method of moving frames is classical for deriving operators invariant to the action of a Lie group in a manifold.Recently, a rotation and translation equivariant neural network for image data was proposed based on the moving frames approach. In this paper we significantly improve that approach by reducing the computation of moving frames to only one, at the input stage, instead of repeated computations at each layer. The equivariance of the resulting architecture is proved theoretically and we build a rotation and translation equivariant neural network to process volumes, i.e. signals on the 3D space. Our trained model overperforms the benchmarks in the medical volume classification of most of the tested datasets from MedMNIST3D. △ Less

Submitted 7 November, 2022; originally announced November 2022.

Journal ref: Symmetry and Geometry in Neural Representations (NeurReps), Dec 2022, New Orleans, United States

arXiv:2210.09341 [pdf, other]

doi 10.1051/0004-6361/202244987

The optically elusive, changing-look active nucleus in NGC 4156

Authors: Giulia Tozzi, Elisabeta Lusso, Lapo Casetti, Marco Romoli, Gloria Andreuzzi, Isabel Montoya A., Emanuele Nardini, Giovanni Cresci, Riccardo Middei, Silvia Bertolini, Paolo Calabretto, Vieri Cammelli, Francisco Cuadra, Marco Dalla Ragione, Cosimo Marconcini, Adriano Miceli, Irene Mini, Martina Palazzini, Giorgio Rotellini, Andrea Saccardi, Lavinia Samà, Mattia Sangalli, Lorenzo Serafini, Fabio Spaccino

Abstract: We report on the changing-look nature of the active galactic nucleus (AGN) in the galaxy NGC 4156, as serendipitously discovered thanks to data acquired in 2019 at the Telescopio Nazionale Galileo (TNG) during a students' observing programme. Previous optical spectra had never shown any signatures of broad-line emission, and evidence of the AGN had come only from X-ray observations, being the opti… ▽ More We report on the changing-look nature of the active galactic nucleus (AGN) in the galaxy NGC 4156, as serendipitously discovered thanks to data acquired in 2019 at the Telescopio Nazionale Galileo (TNG) during a students' observing programme. Previous optical spectra had never shown any signatures of broad-line emission, and evidence of the AGN had come only from X-ray observations, being the optical narrow-line flux ratios unable to unambiguously denote this galaxy as a Seyfert. Our 2019 TNG data unexpectedly revealed the appearance of broad-line components in both the H$α$ and H$β$ profiles, along with a rise of the continuum, thus implying a changing-look AGN transitioning from a type 2 (no broad-line emission) towards a (nearly) type 1. The broad-line emission has then been confirmed by our 2022 follow-up observations, whereas the rising continuum has no longer been detected, which hints at a further evolution backwards to a (nearly) type 2. The presence of broad-line components also allowed us to obtain the first single-epoch estimate of the black hole mass (log(MBH/Msun) $\sim$ 8.1) in this source. The observed spectral variability might be the result of a change in the accretion activity of NGC 4156, although variable absorption cannot be completely excluded. △ Less

Submitted 17 October, 2022; originally announced October 2022.

Comments: 9 pages, 6 figures. Accepted for publication in A&A Letters

Journal ref: A&A 667, L12 (2022)

arXiv:2210.04508 [pdf, other]

Scale Equivariant U-Net

Authors: Mateus Sangalli, Samy Blusseau, Santiago Velasco-Forero, Jesus Angulo

Abstract: In neural networks, the property of being equivariant to transformations improves generalization when the corresponding symmetry is present in the data. In particular, scale-equivariant networks are suited to computer vision tasks where the same classes of objects appear at different scales, like in most semantic segmentation tasks. Recently, convolutional layers equivariant to a semigroup of scal… ▽ More In neural networks, the property of being equivariant to transformations improves generalization when the corresponding symmetry is present in the data. In particular, scale-equivariant networks are suited to computer vision tasks where the same classes of objects appear at different scales, like in most semantic segmentation tasks. Recently, convolutional layers equivariant to a semigroup of scalings and translations have been proposed. However, the equivariance of subsampling and upsampling has never been explicitly studied even though they are necessary building blocks in some segmentation architectures. The U-Net is a representative example of such architectures, which includes the basic elements used for state-of-the-art semantic segmentation. Therefore, this paper introduces the Scale Equivariant U-Net (SEU-Net), a U-Net that is made approximately equivariant to a semigroup of scales and translations through careful application of subsampling and upsampling layers and the use of aforementioned scale-equivariant layers. Moreover, a scale-dropout is proposed in order to improve generalization to different scales in approximately scale-equivariant architectures. The proposed SEU-Net is trained for semantic segmentation of the Oxford Pet IIIT and the DIC-C2DH-HeLa dataset for cell segmentation. The generalization metric to unseen scales is dramatically improved in comparison to the U-Net, even when the U-Net is trained with scale jittering, and to a scale-equivariant architecture that does not perform upsampling operators inside the equivariant pipeline. The scale-dropout induces better generalization on the scale-equivariant models in the Pet experiment, but not on the cell segmentation experiment. △ Less

Submitted 10 October, 2022; originally announced October 2022.

Journal ref: 33rd British Machine Vision Conference, Nov 2022, Londres, United Kingdom

arXiv:2206.13279 [pdf, other]

Differential invariants for SE(2)-equivariant networks

Authors: Mateus Sangalli, Samy Blusseau, Santiago Velasco-Forero, Jesús Angulo

Abstract: Symmetry is present in many tasks in computer vision, where the same class of objects can appear transformed, e.g. rotated due to different camera orientations, or scaled due to perspective. The knowledge of such symmetries in data coupled with equivariance of neural networks can improve their generalization to new samples. Differential invariants are equivariant operators computed from the partia… ▽ More Symmetry is present in many tasks in computer vision, where the same class of objects can appear transformed, e.g. rotated due to different camera orientations, or scaled due to perspective. The knowledge of such symmetries in data coupled with equivariance of neural networks can improve their generalization to new samples. Differential invariants are equivariant operators computed from the partial derivatives of a function. In this paper we use differential invariants to define equivariant operators that form the layers of an equivariant neural network. Specifically, we derive invariants of the Special Euclidean Group SE(2), composed of rotations and translations, and apply them to construct a SE(2)-equivariant network, called SE(2) Differential Invariants Network (SE2DINNet). The network is subsequently tested in classification tasks which require a degree of equivariance or invariance to rotations. The results compare positively with the state-of-the-art, even though the proposed SE2DINNet has far less parameters than the compared models. △ Less

Submitted 22 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

Journal ref: 29th IEEE International Conference on Image Processing (IEEE ICIP), Oct 2022, Bordeaux, France

arXiv:2105.01335 [pdf, other]

Scale Equivariant Neural Networks with Morphological Scale-Spaces

Authors: Mateus Sangalli, Samy Blusseau, Santiago Velasco-Forero, Jesus Angulo

Abstract: The translation equivariance of convolutions can make convolutional neural networks translation equivariant or invariant. Equivariance to other transformations (e.g. rotations, affine transformations, scalings) may also be desirable as soon as we know a priori that transformed versions of the same objects appear in the data. The semigroup cross-correlation, which is a linear operator equivariant… ▽ More The translation equivariance of convolutions can make convolutional neural networks translation equivariant or invariant. Equivariance to other transformations (e.g. rotations, affine transformations, scalings) may also be desirable as soon as we know a priori that transformed versions of the same objects appear in the data. The semigroup cross-correlation, which is a linear operator equivariant to semigroup actions, was recently proposed and applied in conjunction with the Gaussian scale-space to create architectures which are equivariant to discrete scalings. In this paper, a generalization using a broad class of liftings, including morphological scale-spaces, is proposed. The architectures obtained from different scale-spaces are tested and compared in supervised classification and semantic segmentation tasks where objects in test images appear at different scales compared to training images. In both classification and segmentation tasks, the scale-equivariant architectures improve dramatically the generalization to unseen scales compared to a convolutional baseline. Besides, in our experiments morphological scale-spaces outperformed the Gaussian scale-space in geometrical tasks. △ Less

Submitted 4 May, 2021; originally announced May 2021.

Journal ref: IAPR International Conference on Discrete Geometry and Mathematical Morphology (DGMM), 2021, May 2021, Uppsala, Sweden

arXiv:1601.03670 [pdf, other]

doi 10.1214/16-AOAS975

Smooth Principal Component Analysis over two-dimensional manifolds with an application to Neuroimaging

Authors: Eardi Lila, John A. D. Aston, Laura M. Sangalli

Abstract: Motivated by the analysis of high-dimensional neuroimaging signals located over the cortical surface, we introduce a novel Principal Component Analysis technique that can handle functional data located over a two-dimensional manifold. For this purpose a regularization approach is adopted, introducing a smoothing penalty coherent with the geodesic distance over the manifold. The model introduced ca… ▽ More Motivated by the analysis of high-dimensional neuroimaging signals located over the cortical surface, we introduce a novel Principal Component Analysis technique that can handle functional data located over a two-dimensional manifold. For this purpose a regularization approach is adopted, introducing a smoothing penalty coherent with the geodesic distance over the manifold. The model introduced can be applied to any manifold topology, can naturally handle missing data and functional samples evaluated in different grids of points. We approach the discretization task by means of finite element analysis and propose an efficient iterative algorithm for its resolution. We compare the performances of the proposed algorithm with other approaches classically adopted in literature. We finally apply the proposed method to resting state functional magnetic resonance imaging data from the Human Connectome Project, where the method shows substantial differential variations between brain regions that were not apparent with other approaches. △ Less

Submitted 12 September, 2016; v1 submitted 14 January, 2016; originally announced January 2016.

Comments: 33 pages

arXiv:1512.03216 [pdf, ps, other]

doi 10.1214/15-STS524

Functional Data Analysis of Amplitude and Phase Variation

Authors: J. S. Marron, James O. Ramsay, Laura M. Sangalli, Anuj Srivastava

Abstract: The abundance of functional observations in scientific endeavors has led to a significant development in tools for functional data analysis (FDA). This kind of data comes with several challenges: infinite-dimensionality of function spaces, observation noise, and so on. However, there is another interesting phenomena that creates problems in FDA. The functional data often comes with lateral displac… ▽ More The abundance of functional observations in scientific endeavors has led to a significant development in tools for functional data analysis (FDA). This kind of data comes with several challenges: infinite-dimensionality of function spaces, observation noise, and so on. However, there is another interesting phenomena that creates problems in FDA. The functional data often comes with lateral displacements/deformations in curves, a phenomenon which is different from the height or amplitude variability and is termed phase variation. The presence of phase variability artificially often inflates data variance, blurs underlying data structures, and distorts principal components. While the separation and/or removal of phase from amplitude data is desirable, this is a difficult problem. In particular, a commonly used alignment procedure, based on minimizing the $\mathbb{L}^2$ norm between functions, does not provide satisfactory results. In this paper we motivate the importance of dealing with the phase variability and summarize several current ideas for separating phase and amplitude components. These approaches differ in the following: (1) the definition and mathematical representation of phase variability, (2) the objective functions that are used in functional data alignment, and (3) the algorithmic tools for solving estimation/optimization problems. We use simple examples to illustrate various approaches and to provide useful contrast between them. △ Less

Submitted 10 December, 2015; originally announced December 2015.

Comments: Published at http://dx.doi.org/10.1214/15-STS524 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-STS-STS524

Journal ref: Statistical Science 2015, Vol. 30, No. 4, 468-484

arXiv:1511.02688 [pdf, other]

Generalized Spatial Regression with Differential Regularization

Authors: Matthieu Wilhelm, Laura M. Sangalli

Abstract: We aim at analyzing geostatistical and areal data observed over irregularly shaped spatial domains and having a distribution within the exponential family. We propose a generalized additive model that allows to account for spatially-varying covariate information. The model is fitted by maximizing a penalized log-likelihood function, with a roughness penalty term that involves a differential quanti… ▽ More We aim at analyzing geostatistical and areal data observed over irregularly shaped spatial domains and having a distribution within the exponential family. We propose a generalized additive model that allows to account for spatially-varying covariate information. The model is fitted by maximizing a penalized log-likelihood function, with a roughness penalty term that involves a differential quantity of the spatial field, computed over the domain of interest. Efficient estimation of the spatial field is achieved resorting to the finite element method, which provides a basis for piecewise polynomial surfaces. The proposed model is illustrated by an application to the study of criminality in the city of Portland, Oregon, USA. △ Less

Submitted 21 April, 2016; v1 submitted 9 November, 2015; originally announced November 2015.

arXiv:1508.05214 [pdf, other]

doi 10.1016/j.cma.2015.12.028

IGS: an IsoGeometric approach for Smoothing on surfaces

Authors: Matthieu Wilhelm, Luca Dedè, Laura M. Sangalli, Pierre Wilhelm

Abstract: We propose an Isogeometric approach for smoothing on surfaces, namely estimating a function starting from noisy and discrete measurements. More precisely, we aim at estimating functions lying on a surface represented by NURBS, which are geometrical representations commonly used in industrial applications. The estimation is based on the minimization of a penalized least-square functional. The latte… ▽ More We propose an Isogeometric approach for smoothing on surfaces, namely estimating a function starting from noisy and discrete measurements. More precisely, we aim at estimating functions lying on a surface represented by NURBS, which are geometrical representations commonly used in industrial applications. The estimation is based on the minimization of a penalized least-square functional. The latter is equivalent to solve a 4th-order Partial Differential Equation (PDE). In this context, we use Isogeometric Analysis (IGA) for the numerical approximation of such surface PDE, leading to an IsoGeometric Smoothing (IGS) method for fitting data spatially distributed on a surface. Indeed, IGA facilitates encapsulating the exact geometrical representation of the surface in the analysis and also allows the use of at least globally $C^1-$continuous NURBS basis functions for which the 4th-order PDE can be solved using the standard Galerkin method. We show the performance of the proposed IGS method by means of numerical simulations and we apply it to the estimation of the pressure coefficient, and associated aerodynamic force on a winglet of the SOAR space shuttle. △ Less

Submitted 1 January, 2016; v1 submitted 21 August, 2015; originally announced August 2015.

arXiv:1010.1688 [pdf, ps, other]

doi 10.3150/09-BEJ217

Latent diffusion models for survival analysis

Authors: Gareth O. Roberts, Laura M. Sangalli

Abstract: We consider Bayesian hierarchical models for survival analysis, where the survival times are modeled through an underlying diffusion process which determines the hazard rate. We show how these models can be efficiently treated by means of Markov chain Monte Carlo techniques. We consider Bayesian hierarchical models for survival analysis, where the survival times are modeled through an underlying diffusion process which determines the hazard rate. We show how these models can be efficiently treated by means of Markov chain Monte Carlo techniques. △ Less

Submitted 8 October, 2010; originally announced October 2010.

Comments: Published in at http://dx.doi.org/10.3150/09-BEJ217 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

Report number: IMS-BEJ-BEJ217

Journal ref: Bernoulli 2010, Vol. 16, No. 2, 435-458

Showing 1–10 of 10 results for author: Sangalli, M