Search | arXiv e-print repository

Q-Learning to navigate turbulence without a map

Authors: Marco Rando, Martin James, Alessandro Verri, Lorenzo Rosasco, Agnese Seminara

Abstract: We consider the problem of olfactory searches in a turbulent environment. We focus on agents that respond solely to odor stimuli, with no access to spatial perception nor prior information about the odor location. We ask whether navigation strategies to a target can be learned robustly within a sequential decision making framework. We develop a reinforcement learning algorithm using a small set of… ▽ More We consider the problem of olfactory searches in a turbulent environment. We focus on agents that respond solely to odor stimuli, with no access to spatial perception nor prior information about the odor location. We ask whether navigation strategies to a target can be learned robustly within a sequential decision making framework. We develop a reinforcement learning algorithm using a small set of interpretable olfactory states and train it with realistic turbulent odor cues. By introducing a temporal memory, we demonstrate that two salient features of odor traces, discretized in few olfactory states, are sufficient to learn navigation in a realistic odor plume. Performance is dictated by the sparse nature of turbulent plumes. An optimal memory exists which ignores blanks within the plume and activates a recovery strategy outside the plume. We obtain the best performance by letting agents learn their recovery strategy and show that it is mostly casting cross wind, similar to behavior observed in flying insects. The optimal strategy is robust to substantial changes in the odor plumes, suggesting minor parameter tuning may be sufficient to adapt to different environments. △ Less

Submitted 26 April, 2024; originally announced April 2024.

Comments: 18 pages, 8 figures

arXiv:2309.07192 [pdf, other]

The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease detection

Authors: Rosanna Turrisi, Alessandro Verri, Annalisa Barla

Abstract: Machine Learning (ML) has emerged as a promising approach in healthcare, outperforming traditional statistical techniques. However, to establish ML as a reliable tool in clinical practice, adherence to best practices regarding data handling, experimental design, and model evaluation is crucial. This work summarizes and strictly observes such practices to ensure reproducible and reliable ML. Specif… ▽ More Machine Learning (ML) has emerged as a promising approach in healthcare, outperforming traditional statistical techniques. However, to establish ML as a reliable tool in clinical practice, adherence to best practices regarding data handling, experimental design, and model evaluation is crucial. This work summarizes and strictly observes such practices to ensure reproducible and reliable ML. Specifically, we focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare. We investigate the impact of different data augmentation techniques and model complexity on the overall performance. We consider MRI data from ADNI dataset to address a classification problem employing 3D Convolutional Neural Network (CNN). The experiments are designed to compensate for data scarcity and initial random parameters by utilizing cross-validation and multiple training trials. Within this framework, we train 15 predictive models, considering three different data augmentation strategies and five distinct 3D CNN architectures, each varying in the number of convolutional layers. Specifically, the augmentation strategies are based on affine transformations, such as zoom, shift, and rotation, applied concurrently or separately. The combined effect of data augmentation and model complexity leads to a variation in prediction performance up to 10% of accuracy. When affine transformation are applied separately, the model is more accurate, independently from the adopted architecture. For all strategies, the model accuracy followed a concave behavior at increasing number of convolutional layers, peaking at an intermediate value of layers. The best model (8 CL, (B)) is the most stable across cross-validation folds and training trials, reaching excellent performance both on the testing set and on an external test set. △ Less

Submitted 13 September, 2023; originally announced September 2023.

arXiv:2204.00495 [pdf, other]

Physics Informed Shallow Machine Learning for Wind Speed Prediction

Authors: Daniele Lagomarsino-Oneto, Giacomo Meanti, Nicolò Pagliana, Alessandro Verri, Andrea Mazzino, Lorenzo Rosasco, Agnese Seminara

Abstract: The ability to predict wind is crucial for both energy production and weather forecasting. Mechanistic models that form the basis of traditional forecasting perform poorly near the ground. In this paper, we take an alternative data-driven approach based on supervised learning. We analyze a massive dataset of wind measured from anemometers located at 10 m height in 32 locations in two central and n… ▽ More The ability to predict wind is crucial for both energy production and weather forecasting. Mechanistic models that form the basis of traditional forecasting perform poorly near the ground. In this paper, we take an alternative data-driven approach based on supervised learning. We analyze a massive dataset of wind measured from anemometers located at 10 m height in 32 locations in two central and north west regions of Italy (Abruzzo and Liguria). We train supervised learning algorithms using the past history of wind to predict its value at a future time (horizon). Using data from a single location and time horizon we compare systematically several algorithms where we vary the input/output variables, the memory of the input and the linear vs non-linear learning model. We then compare performance of the best algorithms across all locations and forecasting horizons. We find that the optimal design as well as its performance vary with the location. We demonstrate that the presence of a reproducible diurnal cycle provides a rationale to understand this variation. We conclude with a systematic comparison with state of the art algorithms and show that, when the model is accurately designed, shallow algorithms are competitive with more complex deep architectures. △ Less

Submitted 1 April, 2022; originally announced April 2022.

Comments: 26 pages, 11 figures

arXiv:2203.16591 [pdf, other]

Spectral analysis in broken sheared waveguides

Authors: Diana C. S. Bello, Alessandra A. Verri

Abstract: Let $Ω\subset \mathbb R^3$ be a broken sheared waveguide, i.e., it is built by translating a cross-section in a constant direction along a broken line in $\mathbb R^3$. We prove that the discrete spectrum of the Dirichlet Laplacian operator in $Ω$ is non-empty and finite. Furthermore, we show a particular geometry for $Ω$ which implies that the total multiplicity of the discrete spectrum is equals… ▽ More Let $Ω\subset \mathbb R^3$ be a broken sheared waveguide, i.e., it is built by translating a cross-section in a constant direction along a broken line in $\mathbb R^3$. We prove that the discrete spectrum of the Dirichlet Laplacian operator in $Ω$ is non-empty and finite. Furthermore, we show a particular geometry for $Ω$ which implies that the total multiplicity of the discrete spectrum is equals 1. △ Less

Submitted 17 July, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

Comments: In this version, we add a result which shows a particular geometry for $Ω$ which implies that the total multiplicity of the discrete spectrum of the operator is equals 1

arXiv:2111.13471 [pdf, other]

Spectral analysis on ruled surfaces with combined Dirichlet and Neumann boundary conditions

Authors: Rafael T. Amorim, Alessandra A. Verri

Abstract: Let $Ω$ be an unbounded two dimensional strip on a ruled surface in $\mathbb{R}^d$, $d\geq2$. Consider the Laplacian operator in $Ω$ with Dirichlet and Neumann boundary conditions on opposite sides of $Ω$. We prove some results on the existence and absence of the discrete spectrum of the operator; which are influenced by the twisted and bent effects of $Ω$. Provided that $Ω$ is thin enough, we sho… ▽ More Let $Ω$ be an unbounded two dimensional strip on a ruled surface in $\mathbb{R}^d$, $d\geq2$. Consider the Laplacian operator in $Ω$ with Dirichlet and Neumann boundary conditions on opposite sides of $Ω$. We prove some results on the existence and absence of the discrete spectrum of the operator; which are influenced by the twisted and bent effects of $Ω$. Provided that $Ω$ is thin enough, we show an asymptotic behavior of the eigenvalues. The interest in those considerations lies on the difference from the purely Dirichlet case. Finally, we perform an appropriate dilatation in $Ω$ and we compare the results. △ Less

Submitted 26 November, 2021; originally announced November 2021.

arXiv:2110.13655 [pdf, other]

Bridging the gap to real-world for network intrusion detection systems with data-centric approach

Authors: Gustavo de Carvalho Bertoli, Lourenço Alves Pereira Junior, Filipe Alves Neto Verri, Aldri Luiz dos Santos, Osamu Saotome

Abstract: Most research using machine learning (ML) for network intrusion detection systems (NIDS) uses well-established datasets such as KDD-CUP99, NSL-KDD, UNSW-NB15, and CICIDS-2017. In this context, the possibilities of machine learning techniques are explored, aiming for metrics improvements compared to the published baselines (model-centric approach). However, those datasets present some limitations a… ▽ More Most research using machine learning (ML) for network intrusion detection systems (NIDS) uses well-established datasets such as KDD-CUP99, NSL-KDD, UNSW-NB15, and CICIDS-2017. In this context, the possibilities of machine learning techniques are explored, aiming for metrics improvements compared to the published baselines (model-centric approach). However, those datasets present some limitations as aging that make it unfeasible to transpose those ML-based solutions to real-world applications. This paper presents a systematic data-centric approach to address the current limitations of NIDS research, specifically the datasets. This approach generates NIDS datasets composed of the most recent network traffic and attacks, with the labeling process integrated by design. △ Less

Submitted 8 January, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

Comments: Camera-ready version from Data-centric AI workshop at NeurIPS 2021, see https://datacentricai.org/papers/104_CameraReady_dcaicamera-ready.pdf

arXiv:2010.00034 [pdf, other]

Existence of discrete eigenvalues for the Dirichlet Laplacian in a two-dimensional twisted strip

Authors: Rafael T. Amorim, Alessandra A. Verri

Abstract: We study the spectrum of the Dirichlet Laplacian operator in a two-dimensional twisted strip embedded in $\mathbb R^d$ with $d \geq 2$. It is shown that a local twisting perturbation can create discrete eigenvalues for the operator. In particular, we also study the case where the twisted effect "grows" at infinity while the width of the strip goes to zero. In this situation, we find an asymptotic… ▽ More We study the spectrum of the Dirichlet Laplacian operator in a two-dimensional twisted strip embedded in $\mathbb R^d$ with $d \geq 2$. It is shown that a local twisting perturbation can create discrete eigenvalues for the operator. In particular, we also study the case where the twisted effect "grows" at infinity while the width of the strip goes to zero. In this situation, we find an asymptotic behavior for the eigenvalues. △ Less

Submitted 30 August, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

Comments: We corrected some imprecision in the proof of Proposition 4 and added some references

arXiv:2005.04772 [pdf, other]

Spectrum of the Dirichlet Laplacian in waveguides with parallel cross-sections

Authors: Alessandra A. Verri

Abstract: Let $Ω\subset \mathbb R^3$ be a waveguide which is obtained by translating a cross-section in a constant direction along an unbounded spatial curve. Consider $-Δ_Ω^D$ the Dirichlet Laplacian operator in $Ω$. Under the condition that the tangent vector of the reference curve admits a finite limit at infinity, we find the essential spectrum of $-Δ_Ω^D$. Then, we state sufficient conditions that give… ▽ More Let $Ω\subset \mathbb R^3$ be a waveguide which is obtained by translating a cross-section in a constant direction along an unbounded spatial curve. Consider $-Δ_Ω^D$ the Dirichlet Laplacian operator in $Ω$. Under the condition that the tangent vector of the reference curve admits a finite limit at infinity, we find the essential spectrum of $-Δ_Ω^D$. Then, we state sufficient conditions that give rise to a non-empty discrete spectrum for $-Δ_Ω^D$; in particular, we show that the number of discrete eigenvalues can be arbitrarily large since the waveguide is thin enough. △ Less

Submitted 10 May, 2020; originally announced May 2020.

arXiv:1802.04186 [pdf, other]

doi 10.1140/epjs/s11734-021-00154-5

Network community detection via iterative edge removal in a flocking-like system

Authors: Filipe Alves Neto Verri, Roberto Alves Gueleri, Qiusheng Zheng, Junbao Zhang, Liang Zhao

Abstract: We present a network community-detection technique based on properties that emerge from a nature-inspired system of aligning particles. Initially, each vertex is assigned a random-direction unit vector. A nonlinear dynamic law is established so that neighboring vertices try to become aligned with each other. After some time, the system stops and edges that connect the least-aligned pairs of vertic… ▽ More We present a network community-detection technique based on properties that emerge from a nature-inspired system of aligning particles. Initially, each vertex is assigned a random-direction unit vector. A nonlinear dynamic law is established so that neighboring vertices try to become aligned with each other. After some time, the system stops and edges that connect the least-aligned pairs of vertices are removed. Then the evolution starts over without the removed edges, and after enough number of removal rounds, each community becomes a connected component. The proposed approach is evaluated using widely-accepted benchmarks and real-world networks. Experimental results reveal that the method is robust and excels on a wide variety of networks. Moreover, for large sparse networks, the edge-removal process runs in quasilinear time, which enables application in large-scale networks. △ Less

Submitted 12 February, 2018; originally announced February 2018.

arXiv:1802.03987 [pdf, other]

doi 10.1145/3219819.3220121

Latent Variable Time-varying Network Inference

Authors: Federico Tomasi, Veronica Tozzo, Saverio Salzo, Alessandro Verri

Abstract: In many applications of finance, biology and sociology, complex systems involve entities interacting with each other. These processes have the peculiarity of evolving over time and of comprising latent factors, which influence the system without being explicitly measured. In this work we present latent variable time-varying graphical lasso (LTGL), a method for multivariate time-series graphical mo… ▽ More In many applications of finance, biology and sociology, complex systems involve entities interacting with each other. These processes have the peculiarity of evolving over time and of comprising latent factors, which influence the system without being explicitly measured. In this work we present latent variable time-varying graphical lasso (LTGL), a method for multivariate time-series graphical modelling that considers the influence of hidden or unmeasurable factors. The estimation of the contribution of the latent factors is embedded in the model which produces both sparse and low-rank components for each time point. In particular, the first component represents the connectivity structure of observable variables of the system, while the second represents the influence of hidden factors, assumed to be few with respect to the observed variables. Our model includes temporal consistency on both components, providing an accurate evolutionary pattern of the system. We derive a tractable optimisation algorithm based on alternating direction method of multipliers, and develop a scalable and efficient implementation which exploits proximity operators in closed form. LTGL is extensively validated on synthetic data, achieving optimal performance in terms of accuracy, structure learning and scalability with respect to ground truth and state-of-the-art methods for graphical inference. We conclude with the application of LTGL to real case studies, from biology and finance, to illustrate how our method can be successfully employed to gain insights on multivariate time-series data. △ Less

Submitted 2 August, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

Comments: 9 pages, 5 figures, 1 table

Journal ref: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD 2018). ACM, New York, NY, USA, 2338-2346

arXiv:1710.09300 [pdf, other]

doi 10.1109/CEC.2018.8477891

Feature learning in feature-sample networks using multi-objective optimization

Authors: Filipe Alves Neto Verri, Renato Tinós, Liang Zhao

Abstract: Data and knowledge representation are fundamental concepts in machine learning. The quality of the representation impacts the performance of the learning model directly. Feature learning transforms or enhances raw data to structures that are effectively exploited by those models. In recent years, several works have been using complex networks for data representation and analysis. However, no featu… ▽ More Data and knowledge representation are fundamental concepts in machine learning. The quality of the representation impacts the performance of the learning model directly. Feature learning transforms or enhances raw data to structures that are effectively exploited by those models. In recent years, several works have been using complex networks for data representation and analysis. However, no feature learning method has been proposed for such category of techniques. Here, we present an unsupervised feature learning mechanism that works on datasets with binary features. First, the dataset is mapped into a feature--sample network. Then, a multi-objective optimization process selects a set of new vertices to produce an enhanced version of the network. The new features depend on a nonlinear function of a combination of preexisting features. Effectively, the process projects the input data into a higher-dimensional space. To solve the optimization problem, we design two metaheuristics based on the lexicographic genetic algorithm and the improved strength Pareto evolutionary algorithm (SPEA2). We show that the enhanced network contains more information and can be exploited to improve the performance of machine learning methods. The advantages and disadvantages of each optimization strategy are discussed. △ Less

Submitted 25 October, 2017; originally announced October 2017.

Comments: 7 pages, 4 figures

arXiv:1704.07844 [pdf, ps, other]

Influence of the bound states in the Neumann Laplacian in a thin waveguide

Authors: Carlos R. Mamani, Alessandra A. Verri

Abstract: We study the Neumann Laplacian operator $-Δ_Ω^N$ restricted to a twisted waveguide $Ω$. The goal is to find the effective operator when the diameter of $Ω$ tends to zero. However, when $Ω$ is "squeezed" there are divergent eigenvalues due to the transverse oscillations. We show that each one of these eigenvalues influences the action of the effective operator in a different way. In the case where… ▽ More We study the Neumann Laplacian operator $-Δ_Ω^N$ restricted to a twisted waveguide $Ω$. The goal is to find the effective operator when the diameter of $Ω$ tends to zero. However, when $Ω$ is "squeezed" there are divergent eigenvalues due to the transverse oscillations. We show that each one of these eigenvalues influences the action of the effective operator in a different way. In the case where $Ω$ is periodic and sufficiently thin, we find information about the absolutely continuous spectrum of $-Δ_Ω^N$ and the existence and location of band gaps in its structure. △ Less

Submitted 25 April, 2017; originally announced April 2017.

arXiv:1612.00615 [pdf, other]

A temporal model for multiple sclerosis course evolution

Authors: Samuele Fiorini, Andrea Tacchino, Giampaolo Brichetto, Alessandro Verri, Annalisa Barla

Abstract: Multiple Sclerosis is a degenerative condition of the central nervous system that affects nearly 2.5 million of individuals in terms of their physical, cognitive, psychological and social capabilities. Researchers are currently investigating on the use of patient reported outcome measures for the assessment of impact and evolution of the disease on the life of the patients. To date, a clear unders… ▽ More Multiple Sclerosis is a degenerative condition of the central nervous system that affects nearly 2.5 million of individuals in terms of their physical, cognitive, psychological and social capabilities. Researchers are currently investigating on the use of patient reported outcome measures for the assessment of impact and evolution of the disease on the life of the patients. To date, a clear understanding on the use of such measures to predict the evolution of the disease is still lacking. In this work we resort to regularized machine learning methods for binary classification and multiple output regression. We propose a pipeline that can be used to predict the disease progression from patient reported measures. The obtained model is tested on a data set collected from an ongoing clinical research project. △ Less

Submitted 2 December, 2016; originally announced December 2016.

Comments: NIPS Machine Learning for health Workshop 2016

arXiv:1606.04513 [pdf, ps, other]

A note on the spectrum of the Neumann Laplacian in periodic waveguides

Authors: Alessandra A. Verri, Carlos R. Mamani

Abstract: We study the Neumann Laplacian $-Δ^N$ restricted to a periodic waveguide. In this situation its spectrum $σ(-Δ^N)$ presents a band structure. Our goal and strategy is to get spectral information from an analysis of the asymptotic behavior of these bands provided that the waveguide is sufficiently thin. We study the Neumann Laplacian $-Δ^N$ restricted to a periodic waveguide. In this situation its spectrum $σ(-Δ^N)$ presents a band structure. Our goal and strategy is to get spectral information from an analysis of the asymptotic behavior of these bands provided that the waveguide is sufficiently thin. △ Less

Submitted 27 August, 2017; v1 submitted 14 June, 2016; originally announced June 2016.

Comments: We corrected some details about the analyticity of the eigenvalues. arXiv admin note: text overlap with arXiv:1508.02574

arXiv:1603.01182 [pdf, other]

doi 10.1109/TNNLS.2016.2626341

Network Unfolding Map by Edge Dynamics Modeling

Authors: Filipe Alves Neto Verri, Paulo Roberto Urio, Liang Zhao

Abstract: The emergence of collective dynamics in neural networks is a mechanism of the animal and human brain for information processing. In this paper, we develop a computational technique using distributed processing elements in a complex network, which are called particles, to solve semi-supervised learning problems. Three actions govern the particles' dynamics: generation, walking, and absorption. Labe… ▽ More The emergence of collective dynamics in neural networks is a mechanism of the animal and human brain for information processing. In this paper, we develop a computational technique using distributed processing elements in a complex network, which are called particles, to solve semi-supervised learning problems. Three actions govern the particles' dynamics: generation, walking, and absorption. Labeled vertices generate new particles that compete against rival particles for edge domination. Active particles randomly walk in the network until they are absorbed by either a rival vertex or an edge currently dominated by rival particles. The result from the model evolution consists of sets of edges arranged by the label dominance. Each set tends to form a connected subnetwork to represent a data class. Although the intrinsic dynamics of the model is a stochastic one, we prove there exists a deterministic version with largely reduced computational complexity; specifically, with linear growth. Furthermore, the edge domination process corresponds to an unfolding map in such way that edges "stretch" and "shrink" according to the vertex-edge dynamics. Consequently, the unfolding effect summarizes the relevant relationships between vertices and the uncovered data classes. The proposed model captures important details of connectivity patterns over the vertex-edge dynamics evolution, in contrast to previous approaches which focused on only vertex or only edge dynamics. Computer simulations reveal that the new model can identify nonlinear features in both real and artificial data, including boundaries between distinct classes and overlap** structures of data. △ Less

Submitted 19 February, 2018; v1 submitted 3 March, 2016; originally announced March 2016.

Comments: Published version in http://ieeexplore.ieee.org/document/7762202/

Journal ref: IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 2, pp. 405-418, Feb. 2018. doi: 10.1109/TNNLS.2016.2626341

arXiv:1508.02574 [pdf, other]

Absolute continuity and band gaps of the spectrum of the Dirichlet Laplacian in periodic waveguides

Authors: Carlos R. Mamani, Alessandra A. Verri

Abstract: Consider the Dirichlet Laplacian operator $-Δ^D$ in a periodic waveguide $Ω$. On the condition that $Ω$ is sufficiently thin, we show that its spectrum $σ(-Δ^D)$ is absolutely continuous (in each finite region). In addition, we ensure the existence of at least one gap in $σ(-Δ^D)$ and locate it. Consider the Dirichlet Laplacian operator $-Δ^D$ in a periodic waveguide $Ω$. On the condition that $Ω$ is sufficiently thin, we show that its spectrum $σ(-Δ^D)$ is absolutely continuous (in each finite region). In addition, we ensure the existence of at least one gap in $σ(-Δ^D)$ and locate it. △ Less

Submitted 7 July, 2017; v1 submitted 11 August, 2015; originally announced August 2015.

Comments: We corrected some details about the analyticity of the eigenvalues

arXiv:1402.5047 [pdf, other]

Real-time Automatic Emotion Recognition from Body Gestures

Authors: Stefano Piana, Alessandra Staglianò, Francesca Odone, Alessandro Verri, Antonio Camurri

Abstract: Although psychological research indicates that bodily expressions convey important affective information, to date research in emotion recognition focused mainly on facial expression or voice analysis. In this paper we propose an approach to realtime automatic emotion recognition from body movements. A set of postural, kinematic, and geometrical features are extracted from sequences 3D skeletons an… ▽ More Although psychological research indicates that bodily expressions convey important affective information, to date research in emotion recognition focused mainly on facial expression or voice analysis. In this paper we propose an approach to realtime automatic emotion recognition from body movements. A set of postural, kinematic, and geometrical features are extracted from sequences 3D skeletons and fed to a multi-class SVM classifier. The proposed method has been assessed on data acquired through two different systems: a professionalgrade optical motion capture system, and Microsoft Kinect. The system has been assessed on a "six emotions" recognition problem, and using a leave-one-subject-out cross validation strategy, reached an overall recognition rate of 61.3% which is very close to the recognition rate of 61.9% obtained by human observers. To provide further testing of the system, two games were developed, where one or two users have to interact to understand and express emotions with their body. △ Less

Submitted 20 February, 2014; originally announced February 2014.

Report number: IDGEI/2014/02

arXiv:1305.4902 [pdf, ps, other]

Complex $Γ$-convergence and magnetic Dirichlet Laplacian in bounded thin tubes

Authors: R. Bedoya, C. R. de Oliveira, A. A. Verri

Abstract: The resolvent convergence of self-adjoint operators via the technique of $Γ$-convergence of quadratic forms is adapted to incorporate complex Hilbert spaces. As an application, we find effective operators to the Dirichlet Laplacian with magnetic potentials in very thin bounded tubular regions in space built along smooth closed curves; relatively weak regularity is asked for the potentials, and the… ▽ More The resolvent convergence of self-adjoint operators via the technique of $Γ$-convergence of quadratic forms is adapted to incorporate complex Hilbert spaces. As an application, we find effective operators to the Dirichlet Laplacian with magnetic potentials in very thin bounded tubular regions in space built along smooth closed curves; relatively weak regularity is asked for the potentials, and the convergence is in the norm resolvent sense as the cross sections of the tubes go uniformly to zero. △ Less

Submitted 17 November, 2013; v1 submitted 21 May, 2013; originally announced May 2013.

Comments: 22 pages; to appear in Journal of Spectral Theory

MSC Class: 81Q15; 49R50; 35P20; 47B99

arXiv:1209.0368 [pdf, other]

Proximal methods for the latent group lasso penalty

Authors: Silvia Villa, Lorenzo Rosasco, Sofia Mosci, Alessandro Verri

Abstract: We consider a regularized least squares problem, with regularization by structured sparsity-inducing norms, which extend the usual $\ell_1$ and the group lasso penalty, by allowing the subsets to overlap. Such regularizations lead to nonsmooth problems that are difficult to optimize, and we propose in this paper a suitable version of an accelerated proximal method to solve them. We prove convergen… ▽ More We consider a regularized least squares problem, with regularization by structured sparsity-inducing norms, which extend the usual $\ell_1$ and the group lasso penalty, by allowing the subsets to overlap. Such regularizations lead to nonsmooth problems that are difficult to optimize, and we propose in this paper a suitable version of an accelerated proximal method to solve them. We prove convergence of a nested procedure, obtained composing an accelerated proximal method with an inner algorithm for computing the proximity operator. By exploiting the geometrical properties of the penalty, we devise a new active set strategy, thanks to which the inner iteration is relatively fast, thus guaranteeing good computational performances of the overall algorithm. Our approach allows to deal with high dimensional problems without pre-processing for dimensionality reduction, leading to better computational and prediction performances with respect to the state-of-the art methods, as shown empirically both on toy and real data. △ Less

Submitted 3 September, 2012; originally announced September 2012.

Comments: 4 figures

MSC Class: 65K10; 90C25

arXiv:1208.2572 [pdf, other]

Nonparametric sparsity and regularization

Authors: Lorenzo Rosasco, Silvia Villa, Sofia Mosci, Matteo Santoro, Alessandro verri

Abstract: In this work we are interested in the problems of supervised learning and variable selection when the input-output dependence is described by a nonlinear function depending on a few variables. Our goal is to consider a sparse nonparametric model, hence avoiding linear or additive models. The key idea is to measure the importance of each variable in the model by making use of partial derivatives. B… ▽ More In this work we are interested in the problems of supervised learning and variable selection when the input-output dependence is described by a nonlinear function depending on a few variables. Our goal is to consider a sparse nonparametric model, hence avoiding linear or additive models. The key idea is to measure the importance of each variable in the model by making use of partial derivatives. Based on this intuition we propose a new notion of nonparametric sparsity and a corresponding least squares regularization scheme. Using concepts and results from the theory of reproducing kernel Hilbert spaces and proximal methods, we show that the proposed learning algorithm corresponds to a minimization problem which can be provably solved by an iterative procedure. The consistency properties of the obtained estimator are studied both in terms of prediction and selection performance. An extensive empirical analysis shows that the proposed method performs favorably with respect to the state-of-the-art methods. △ Less

Submitted 13 August, 2012; originally announced August 2012.

Comments: 45 pages, 11 figures

arXiv:1205.6437 [pdf, ps, other]

doi 10.1063/1.4719976

Mathematical predominance of Dirichlet condition for the one-dimensional Coulomb potential

Authors: Cesar R. de Oliveira, Alessandra A. Verri

Abstract: We restrict a quantum particle under a coulombian potential (i.e., the Schrödinger operator with inverse of the distance potential) to three dimensional tubes along the x-axis and diameter $\varepsilon$, and study the confining limit $\varepsilon\to0$. In the repulsive case we prove a strong resolvent convergence to a one-dimensional limit operator, which presents Dirichlet boundary condition at t… ▽ More We restrict a quantum particle under a coulombian potential (i.e., the Schrödinger operator with inverse of the distance potential) to three dimensional tubes along the x-axis and diameter $\varepsilon$, and study the confining limit $\varepsilon\to0$. In the repulsive case we prove a strong resolvent convergence to a one-dimensional limit operator, which presents Dirichlet boundary condition at the origin. Due to the possibility of the falling of the particle in the center of force, in the attractive case we need to regularize the potential and also prove a norm resolvent convergence to the Dirichlet operator at the origin. Thus, it is argued that, among the infinitely many self-adjoint realizations of the corresponding problem in one dimension, the Dirichlet boundary condition at the origin is the reasonable one-dimensional limit. △ Less

Submitted 29 May, 2012; originally announced May 2012.

Comments: 30 pages; no figures

Journal ref: Journal of Mathematical Physics 53, 052104 (2012)

arXiv:1103.2934 [pdf, ps, other]

doi 10.1016/j.jmaa.2011.03.022

On the spectrum and weakly effective operator for Dirichlet Laplacian in thin deformed tubes

Authors: Cesar R. de Oliveira, Alessandra A. Verri

Abstract: We study the Laplacian in deformed thin (bounded or unbounded) tubes in ?$\R^3$, i.e., tubular regions along a curve $r(s)$ whose cross sections are multiplied by an appropriate deformation function $h(s)> 0$. One the main requirements on $h(s)$ is that it has a single point of global maximum. We find the asymptotic behaviors of the eigenvalues and weakly effective operators as the diameters of th… ▽ More We study the Laplacian in deformed thin (bounded or unbounded) tubes in ?$\R^3$, i.e., tubular regions along a curve $r(s)$ whose cross sections are multiplied by an appropriate deformation function $h(s)> 0$. One the main requirements on $h(s)$ is that it has a single point of global maximum. We find the asymptotic behaviors of the eigenvalues and weakly effective operators as the diameters of the tubes tend to zero. It is shown that such behaviors are not influenced by some geometric features of the tube, such as curvature, torsion and twisting, and so a huge amount of different deformed tubes are asymptotically described by the same weakly effective operator. △ Less

Submitted 15 March, 2011; originally announced March 2011.

arXiv:1011.3728 [pdf, other]

PADDLE: Proximal Algorithm for Dual Dictionaries LEarning

Authors: Curzio Basso, Matteo Santoro, Alessandro Verri, Silvia Villa

Abstract: Recently, considerable research efforts have been devoted to the design of methods to learn from data overcomplete dictionaries for sparse coding. However, learned dictionaries require the solution of an optimization problem for coding new data. In order to overcome this drawback, we propose an algorithm aimed at learning both a dictionary and its dual: a linear map** directly performing the cod… ▽ More Recently, considerable research efforts have been devoted to the design of methods to learn from data overcomplete dictionaries for sparse coding. However, learned dictionaries require the solution of an optimization problem for coding new data. In order to overcome this drawback, we propose an algorithm aimed at learning both a dictionary and its dual: a linear map** directly performing the coding. By leveraging on proximal methods, our algorithm jointly minimizes the reconstruction error of the dictionary and the coding error of its dual; the sparsity of the representation is induced by an $\ell_1$-based penalty on its coefficients. The results obtained on synthetic data and real images show that the algorithm is capable of recovering the expected dictionaries. Furthermore, on a benchmark dataset, we show that the image features obtained from the dual matrix yield state-of-the-art classification performance while being much less computational intensive. △ Less

Submitted 16 November, 2010; originally announced November 2010.

Report number: DISI-TR-2010-06

arXiv:0809.1777 [pdf, ps, other]

A Regularized Method for Selecting Nested Groups of Relevant Genes from Microarray Data

Authors: Christine De Mol, Sofia Mosci, Magali Traskine, Alessandro Verri

Abstract: Gene expression analysis aims at identifying the genes able to accurately predict biological parameters like, for example, disease subty** or progression. While accurate prediction can be achieved by means of many different techniques, gene identification, due to gene correlation and the limited number of available samples, is a much more elusive problem. Small changes in the expression values… ▽ More Gene expression analysis aims at identifying the genes able to accurately predict biological parameters like, for example, disease subty** or progression. While accurate prediction can be achieved by means of many different techniques, gene identification, due to gene correlation and the limited number of available samples, is a much more elusive problem. Small changes in the expression values often produce different gene lists, and solutions which are both sparse and stable are difficult to obtain. We propose a two-stage regularization method able to learn linear models characterized by a high prediction performance. By varying a suitable parameter these linear models allow to trade sparsity for the inclusion of correlated genes and to produce gene lists which are almost perfectly nested. Experimental results on synthetic and microarray data confirm the interesting properties of the proposed method and its potential as a starting point for further biological investigations △ Less

Submitted 10 September, 2008; originally announced September 2008.

Comments: 17 pages, 8 Post-script figures

Report number: DISI-TR-07-04B

arXiv:0806.2764 [pdf, ps, other]

doi 10.1016/j.aop.2008.06.001

Self-adjoint extensions of Coulomb systems in 1,2 and 3 dimensions

Authors: Cesar R. de Oliveira, Alessandra A. Verri

Abstract: We study the nonrelativistic quantum Coulomb hamiltonian (i.e., inverse of distance potential) in $R^n$, n = 1, 2, 3. We characterize their self-adjoint extensions and, in the unidimensional case, present a discussion of controversies in the literature, particularly the question of the permeability of the origin. Potentials given by fundamental solutions of Laplace equation are also briefly cons… ▽ More We study the nonrelativistic quantum Coulomb hamiltonian (i.e., inverse of distance potential) in $R^n$, n = 1, 2, 3. We characterize their self-adjoint extensions and, in the unidimensional case, present a discussion of controversies in the literature, particularly the question of the permeability of the origin. Potentials given by fundamental solutions of Laplace equation are also briefly considered. △ Less

Submitted 17 June, 2008; originally announced June 2008.

Comments: 23 pages; Annals of Physics (NY)

Showing 1–25 of 25 results for author: verri, A