-
Symmetries in Overparametrized Neural Networks: A Mean-Field View
Authors:
Javier Maass Martínez,
Joaquin Fontbona
Abstract:
We develop a Mean-Field (MF) view of the learning dynamics of overparametrized Artificial Neural Networks (NN) under data symmetric in law wrt the action of a general compact group $G$. We consider for this a class of generalized shallow NNs given by an ensemble of $N$ multi-layer units, jointly trained using stochastic gradient descent (SGD) and possibly symmetry-leveraging (SL) techniques, such…
▽ More
We develop a Mean-Field (MF) view of the learning dynamics of overparametrized Artificial Neural Networks (NN) under data symmetric in law wrt the action of a general compact group $G$. We consider for this a class of generalized shallow NNs given by an ensemble of $N$ multi-layer units, jointly trained using stochastic gradient descent (SGD) and possibly symmetry-leveraging (SL) techniques, such as Data Augmentation (DA), Feature Averaging (FA) or Equivariant Architectures (EA). We introduce the notions of weakly and strongly invariant laws (WI and SI) on the parameter space of each single unit, corresponding, respectively, to $G$-invariant distributions, and to distributions supported on parameters fixed by the group action (which encode EA). This allows us to define symmetric models compatible with taking $N\to\infty$ and give an interpretation of the asymptotic dynamics of DA, FA and EA in terms of Wasserstein Gradient Flows describing their MF limits. When activations respect the group action, we show that, for symmetric data, DA, FA and freely-trained models obey the exact same MF dynamic, which stays in the space of WI laws and minimizes therein the population risk. We also give a counterexample to the general attainability of an optimum over SI laws. Despite this, quite remarkably, we show that the set of SI laws is also preserved by the MF dynamics even when freely trained. This sharply contrasts the finite-$N$ setting, in which EAs are generally not preserved by unconstrained SGD. We illustrate the validity of our findings as $N$ gets larger in a teacher-student experimental setting, training a student NN to learn from a WI, SI or arbitrary teacher model through various SL schemes. We last deduce a data-driven heuristic to discover the largest subspace of parameters supporting SI distributions for a problem, that could be used for designing EA with minimal generalization error.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Authors:
Pablo Marcos-Manchón,
Roberto Alcover-Couso,
Juan C. SanMiguel,
Jose M. Martínez
Abstract:
Diffusion models represent a new paradigm in text-to-image generation. Beyond generating high-quality images from text prompts, models such as Stable Diffusion have been successfully extended to the joint generation of semantic segmentation pseudo-masks. However, current extensions primarily rely on extracting attentions linked to prompt words used for image synthesis. This approach limits the gen…
▽ More
Diffusion models represent a new paradigm in text-to-image generation. Beyond generating high-quality images from text prompts, models such as Stable Diffusion have been successfully extended to the joint generation of semantic segmentation pseudo-masks. However, current extensions primarily rely on extracting attentions linked to prompt words used for image synthesis. This approach limits the generation of segmentation masks derived from word tokens not contained in the text prompt. In this work, we introduce Open-Vocabulary Attention Maps (OVAM)-a training-free method for text-to-image diffusion models that enables the generation of attention maps for any word. In addition, we propose a lightweight optimization process based on OVAM for finding tokens that generate accurate attention maps for an object class with a single annotation. We evaluate these tokens within existing state-of-the-art Stable Diffusion extensions. The best-performing model improves its mIoU from 52.1 to 86.6 for the synthetic images' pseudo-masks, demonstrating that our optimized tokens are an efficient way to improve the performance of existing methods without architectural changes or retraining.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
On almost $p$-rational characters in principal blocks
Authors:
Attila Maróti,
J. Miquel Martínez,
A. A. Schaeffer Fry,
Carolina Vallejo
Abstract:
Let p be a prime. In this paper we provide a lower bound for the number of almost p-rational characters of degree coprime to p in the principal p-block of a finite group of order divisible by p. We further describe the p-local structure of the groups for which the above-mentioned bound is sharp.
Let p be a prime. In this paper we provide a lower bound for the number of almost p-rational characters of degree coprime to p in the principal p-block of a finite group of order divisible by p. We further describe the p-local structure of the groups for which the above-mentioned bound is sharp.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
The Alperin Weight Conjecture and the Glauberman correspondence via character triples
Authors:
J. Miquel Martínez,
N. Rizo,
D. Rossi
Abstract:
Recently, G. Navarro introduced a new conjecture that unifies the Alperin Weight Conjecture and the Glauberman correspondence into a single statement. In this paper, we reduce this problem to simple groups and prove it for several classes of groups and blocks. Our reduction can be divided into two steps. First, we show that assuming the so-called Inductive (Blockwise) Alperin Weight Condition for…
▽ More
Recently, G. Navarro introduced a new conjecture that unifies the Alperin Weight Conjecture and the Glauberman correspondence into a single statement. In this paper, we reduce this problem to simple groups and prove it for several classes of groups and blocks. Our reduction can be divided into two steps. First, we show that assuming the so-called Inductive (Blockwise) Alperin Weight Condition for finite simple groups, we obtain an analogous statement for arbitrary finite groups, that is, an automorphism-equivariant version of the Alperin Weight Conjecture inducing isomorphisms of modular character triples. Then, we show that the latter implies Navarro's conjecture for each finite group.
△ Less
Submitted 30 May, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
The blocks with five irreducible characters
Authors:
J. Miquel Martínez,
Noelia Rizo,
Lucia Sanus
Abstract:
Let $G$ be a finite group, $p$ a prime and $B$ a Brauer $p$-block of $G$ with defect group $D$. We prove that if the number of irreducible ordinary characters in $B$ is $5$ then $D\cong C_5, C_7, D_8$ or $Q_8$, assuming that the Alperin--McKay conjecture holds for $B$.
Let $G$ be a finite group, $p$ a prime and $B$ a Brauer $p$-block of $G$ with defect group $D$. We prove that if the number of irreducible ordinary characters in $B$ is $5$ then $D\cong C_5, C_7, D_8$ or $Q_8$, assuming that the Alperin--McKay conjecture holds for $B$.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Characters of prime power degree in principal blocks
Authors:
J. Miquel Martínez
Abstract:
We describe finite groups whose principal block contains only characters of prime power degree.
We describe finite groups whose principal block contains only characters of prime power degree.
△ Less
Submitted 3 April, 2023; v1 submitted 24 March, 2023;
originally announced March 2023.
-
Soft labelling for semantic segmentation: Bringing coherence to label down-sampling
Authors:
Roberto Alcover-Couso,
Marcos Escudero-Vinolo,
Juan C. SanMiguel,
Jose M. Martinez
Abstract:
In semantic segmentation, training data down-sampling is commonly performed due to limited resources, the need to adapt image size to the model input, or improve data augmentation. This down-sampling typically employs different strategies for the image data and the annotated labels. Such discrepancy leads to mismatches between the down-sampled color and label images. Hence, the training performanc…
▽ More
In semantic segmentation, training data down-sampling is commonly performed due to limited resources, the need to adapt image size to the model input, or improve data augmentation. This down-sampling typically employs different strategies for the image data and the annotated labels. Such discrepancy leads to mismatches between the down-sampled color and label images. Hence, the training performance significantly decreases as the down-sampling factor increases. In this paper, we bring together the down-sampling strategies for the image data and the training labels. To that aim, we propose a novel framework for label down-sampling via soft-labeling that better conserves label information after down-sampling. Therefore, fully aligning soft-labels with image data to keep the distribution of the sampled pixels. This proposal also produces reliable annotations for under-represented semantic classes. Altogether, it allows training competitive models at lower resolutions. Experiments show that the proposal outperforms other down-sampling strategies. Moreover, state-of-the-art performance is achieved for reference benchmarks, but employing significantly less computational resources than foremost approaches. This proposal enables competitive research for semantic segmentation under resource constraints.
△ Less
Submitted 19 February, 2024; v1 submitted 27 February, 2023;
originally announced February 2023.
-
The Analytical Method algorithm for trigger primitives generation at the LHC Drift Tubes detector
Authors:
G. Abbiendi,
J. Alcaraz Maestre,
A. Álvarez Fernández,
B. Álvarez González,
N. Amapane,
I. Bachiller,
L. Barcellan,
C. Baldanza,
C. Battilana,
M. Bellato,
G. Bencze,
M. Benettoni,
N. Beni,
A. Benvenuti,
A. Bergnoli,
L. C. Blanco Ramos,
L. Borgonovi,
A. Bragagnolo,
V. Cafaro,
A. Calderon,
E. Calvo,
R. Carlin,
C. A. Carrillo Montoya,
F. R. Cavallo,
J. M. Cela Ruiz
, et al. (121 additional authors not shown)
Abstract:
The Compact Muon Solenoid (CMS) experiment prepares its Phase-2 upgrade for the high-luminosity era of the LHC operation (HL-LHC). Due to the increase of occupancy, trigger latency and rates, the full electronics of the CMS Drift Tube (DT) chambers will need to be replaced. In the new design, the time bin for the digitisation of the chamber signals will be of around 1~ns, and the totality of the s…
▽ More
The Compact Muon Solenoid (CMS) experiment prepares its Phase-2 upgrade for the high-luminosity era of the LHC operation (HL-LHC). Due to the increase of occupancy, trigger latency and rates, the full electronics of the CMS Drift Tube (DT) chambers will need to be replaced. In the new design, the time bin for the digitisation of the chamber signals will be of around 1~ns, and the totality of the signals will be forwarded asynchronously to the service cavern at full resolution. The new backend system will be in charge of building the trigger primitives of each chamber. These trigger primitives contain the information at chamber level about the muon candidates position, direction, and collision time, and are used as input in the L1 CMS trigger. The added functionalities will improve the robustness of the system against ageing. An algorithm based on analytical solutions for reconstructing the DT trigger primitives, called Analytical Method, has been implemented both as a software C++ emulator and in firmware. Its performance has been estimated using the software emulator with simulated and real data samples, and through hardware implementation tests. Measured efficiencies are 96 to 98\% for all qualities and time and spatial resolutions are close to the ultimate performance of the DT chambers. A prototype chain of the HL-LHC electronics using the Analytical Method for trigger primitive generation has been installed during Long Shutdown 2 of the LHC and operated in CMS cosmic data taking campaigns in 2020 and 2021. Results from this validation step, the so-called Slice Test, are presented.
△ Less
Submitted 3 February, 2023;
originally announced February 2023.
-
Graph Convolutional Network for Multi-Target Multi-Camera Vehicle Tracking
Authors:
Elena Luna,
Juan Carlos San Miguel,
José María Martínez,
Marcos Escudero-Viñolo
Abstract:
This letter focuses on the task of Multi-Target Multi-Camera vehicle tracking. We propose to associate single-camera trajectories into multi-camera global trajectories by training a Graph Convolutional Network. Our approach simultaneously processes all cameras providing a global solution, and it is also robust to large cameras unsynchronizations. Furthermore, we design a new loss function to deal…
▽ More
This letter focuses on the task of Multi-Target Multi-Camera vehicle tracking. We propose to associate single-camera trajectories into multi-camera global trajectories by training a Graph Convolutional Network. Our approach simultaneously processes all cameras providing a global solution, and it is also robust to large cameras unsynchronizations. Furthermore, we design a new loss function to deal with class imbalance. Our proposal outperforms the related work showing better generalization and without requiring ad-hoc manual annotations or thresholds, unlike compared approaches.
△ Less
Submitted 28 November, 2022;
originally announced November 2022.
-
Degree divisibility in Alperin-McKay correspondences
Authors:
J. Miquel Martìnez,
Damiano Rossi
Abstract:
Let p be a prime, B a p-block of a finite group G and b its Brauer correspondent. According to the Alperin-McKay Conjecture, there exists a bijection between the set of irreducible ordinary characters of height zero of B and those of b. In this paper, we show that whenever G is p-solvable such a bijection can be found, both for ordinary and Brauer characters, with the additional property of being…
▽ More
Let p be a prime, B a p-block of a finite group G and b its Brauer correspondent. According to the Alperin-McKay Conjecture, there exists a bijection between the set of irreducible ordinary characters of height zero of B and those of b. In this paper, we show that whenever G is p-solvable such a bijection can be found, both for ordinary and Brauer characters, with the additional property of being compatible with divisibility of character degrees. In this case, we also show that the dimension of b divides the dimension of B.
△ Less
Submitted 15 December, 2022; v1 submitted 21 April, 2022;
originally announced April 2022.
-
Graph Neural Networks for Cross-Camera Data Association
Authors:
Elena Luna,
Juan C. SanMiguel,
José M. Martínez,
Pablo Carballeira
Abstract:
Cross-camera image data association is essential for many multi-camera computer vision tasks, such as multi-camera pedestrian detection, multi-camera multi-target tracking, 3D pose estimation, etc. This association task is typically stated as a bipartite graph matching problem and often solved by applying minimum-cost flow techniques, which may be computationally inefficient with large data. Furth…
▽ More
Cross-camera image data association is essential for many multi-camera computer vision tasks, such as multi-camera pedestrian detection, multi-camera multi-target tracking, 3D pose estimation, etc. This association task is typically stated as a bipartite graph matching problem and often solved by applying minimum-cost flow techniques, which may be computationally inefficient with large data. Furthermore, cameras are usually treated by pairs, obtaining local solutions, rather than finding a global solution at once. Other key issue is that of the affinity measurement: the widespread usage of non-learnable pre-defined distances, such as the Euclidean and Cosine ones. This paper proposes an efficient approach for cross-cameras data-association focused on a global solution, instead of processing cameras by pairs. To avoid the usage of fixed distances, we leverage the connectivity of Graph Neural Networks, previously unused in this scope, using a Message Passing Network to jointly learn features and similarity. We validate the proposal for pedestrian multi-view association, showing results over the EPFL multi-camera pedestrian dataset. Our approach considerably outperforms the literature data association techniques, without requiring to be trained in the same scenario in which it is tested. Our code is available at \url{http://www-vpu.eps.uam.es/publications/gnn_cca}.
△ Less
Submitted 17 January, 2022;
originally announced January 2022.
-
Inexact Restoration for Minimization with Inexact Evaluation both of the Objective Function and the Constraints
Authors:
L. F. Bueno,
F. Larreal,
J. M. Martínez
Abstract:
In a recent paper an Inexact Restoration method for solving continuous constrained optimization problems was analyzed from the point of view of worst-case functional complexity and convergence. On the other hand, the Inexact Restoration methodology was employed, in a different research,to handle minimization problems with inexact evaluation and simple constraints. These two methodologies are combi…
▽ More
In a recent paper an Inexact Restoration method for solving continuous constrained optimization problems was analyzed from the point of view of worst-case functional complexity and convergence. On the other hand, the Inexact Restoration methodology was employed, in a different research,to handle minimization problems with inexact evaluation and simple constraints. These two methodologies are combined in the present report, for constrained minimization problems in which both the objective function and the constraints, as well as their derivatives, are subject to evaluation errors. Together with a complete description of the method, complexity and convergence results will be proved.
△ Less
Submitted 19 September, 2023; v1 submitted 4 January, 2022;
originally announced January 2022.
-
Block Coordinate Descent for smooth nonconvex constrained minimization
Authors:
E. G. Birgin,
J. M. Martínez
Abstract:
At each iteration of a Block Coordinate Descent method one minimizes an approximation of the objective function with respect to a generally small set of variables subject to constraints in which these variables are involved. The unconstrained case and the case in which the constraints are simple were analyzed in the recent literature. In this paper we address the problem in which block constraints…
▽ More
At each iteration of a Block Coordinate Descent method one minimizes an approximation of the objective function with respect to a generally small set of variables subject to constraints in which these variables are involved. The unconstrained case and the case in which the constraints are simple were analyzed in the recent literature. In this paper we address the problem in which block constraints are not simple and, moreover, the case in which they are not defined by global sets of equations and inequations. A general algorithm that minimizes quadratic models with quadratric regularization over blocks of variables is defined and convergence and complexity are proved. In particular, given tolerances $δ>0$ and $\varepsilon>0$ for feasibility/complementarity and optimality, respectively, it is shown that a measure of $(δ,0)$-criticality tends to zero; and the the number of iterations and functional evaluations required to achieve $(δ,\varepsilon)$-criticality is $O(\varepsilon^2)$. Numerical experiments in which the proposed method is used to solve a continuous version of the traveling salesman problem are presented.
△ Less
Submitted 25 November, 2021;
originally announced November 2021.
-
Character degrees in blocks and defect groups
Authors:
Eugenio Giannelli,
J. Miquel Martínez,
A. A. Schaeffer Fry
Abstract:
A recent question of Gabriel Navarro asks whether it is true that the derived length of a defect group is less than or equal to the number of degrees of irreducible characters in a block. In this article, we bring new evidence towards the validity of this statement.
A recent question of Gabriel Navarro asks whether it is true that the derived length of a defect group is less than or equal to the number of degrees of irreducible characters in a block. In this article, we bring new evidence towards the validity of this statement.
△ Less
Submitted 9 November, 2021; v1 submitted 25 October, 2021;
originally announced October 2021.
-
The blocks with four irreducible characters
Authors:
J. Miquel Martínez,
Noelia Rizo,
Lucía Sanus
Abstract:
Suppose that $B$ is a Brauer $p$-block with defect group $D$. If $B$ exactly contains 4 irreducible characters, then we show that $D$ has order 4 or 5, assuming the Alperin--McKay conjecture.
Suppose that $B$ is a Brauer $p$-block with defect group $D$. If $B$ exactly contains 4 irreducible characters, then we show that $D$ has order 4 or 5, assuming the Alperin--McKay conjecture.
△ Less
Submitted 27 January, 2022; v1 submitted 22 September, 2021;
originally announced September 2021.
-
Seeing poverty from space, how much can it be tuned?
Authors:
Tomas Sako,
Arturo Jr M. Martinez
Abstract:
Since the United Nations launched the Sustainable Development Goals (SDG) in 2015, numerous universities, NGOs and other organizations have attempted to develop tools for monitoring worldwide progress in achieving them. Led by advancements in the fields of earth observation techniques, data sciences and the emergence of artificial intelligence, a number of research teams have developed innovative…
▽ More
Since the United Nations launched the Sustainable Development Goals (SDG) in 2015, numerous universities, NGOs and other organizations have attempted to develop tools for monitoring worldwide progress in achieving them. Led by advancements in the fields of earth observation techniques, data sciences and the emergence of artificial intelligence, a number of research teams have developed innovative tools for highlighting areas of vulnerability and tracking the implementation of SDG targets. In this paper we demonstrate that individuals with no organizational affiliation and equipped only with common hardware, publicly available datasets and cloud-based computing services can participate in the improvement of predicting machine-learning-based approaches to predicting local poverty levels in a given agro-ecological environment. The approach builds upon several pioneering efforts over the last five years related to map** poverty by deep learning to process satellite imagery and "ground-truth" data from the field to link features with incidence of poverty in a particular context. The approach employs new methods for object identification in order to optimize the modeled results and achieve significantly high accuracy. A key goal of the project was to intentionally keep costs as low as possible - by using freely available resources - so that citizen scientists, students and organizations could replicate the method in other areas of interest. Moreover, for simplicity, the input data used were derived from just a handful of sources (involving only earth observation and population headcounts). The results of the project could therefore certainly be strengthened further through the integration of proprietary data from social networks, mobile phone providers, and other sources.
△ Less
Submitted 30 July, 2021;
originally announced July 2021.
-
Accelerated derivative-free spectral residual method for nonlinear systems of equations
Authors:
Ernesto G. Birgin,
John L. Gardenghi,
Diaulas S. Marcondes,
José M. Martínez
Abstract:
Spectral residual methods are powerful tools for solving nonlinear systems of equations without derivatives. In a recent paper, it was shown that an acceleration technique based on the Sequential Secant Method can greatly improve its efficiency and robustness. In the present work, an R implementation of the method is presented. Numerical experiments with a widely used test bed compares the present…
▽ More
Spectral residual methods are powerful tools for solving nonlinear systems of equations without derivatives. In a recent paper, it was shown that an acceleration technique based on the Sequential Secant Method can greatly improve its efficiency and robustness. In the present work, an R implementation of the method is presented. Numerical experiments with a widely used test bed compares the presented approach with its plain (i.e. non-accelerated) version that makes part of the R package BB. Additional numerical experiments compare the proposed method with NITSOL, a state-of-the-art solver for nonlinear systems. The comparison shows that the acceleration process greatly improves the robustness of its counterpart included in the existent R package. As a by-product, an interface is provided between R and the consolidated CUTEst collection, which contains over a thousand nonlinear programming problems of all types and represents a standard for evaluating the performance of optimization methods.
△ Less
Submitted 27 April, 2021;
originally announced April 2021.
-
Accelerated derivative-free nonlinear least-squares applied to the estimation of Manning coefficients
Authors:
E. G. Birgin,
J. M. Martínez
Abstract:
A general framework for solving nonlinear least squares problems without the employment of derivatives is proposed in the present paper together with a new general global convergence theory. With the aim to cope with the case in which the number of variables is big (for the standards of derivative-free optimization), two dimension-reduction procedures are introduced. One of them is based on iterat…
▽ More
A general framework for solving nonlinear least squares problems without the employment of derivatives is proposed in the present paper together with a new general global convergence theory. With the aim to cope with the case in which the number of variables is big (for the standards of derivative-free optimization), two dimension-reduction procedures are introduced. One of them is based on iterative subspace minimization and the other one is based on spline interpolation with variable nodes. Each iteration based on those procedures is followed by an acceleration step inspired in the Sequential Secant Method. The practical motivation for this work is the estimation of parameters in Hydraulic models applied to dam breaking problems. Numerical examples of the application of the new method to those problems are given.
△ Less
Submitted 6 April, 2021;
originally announced April 2021.
-
Online Clustering-based Multi-Camera Vehicle Tracking in Scenarios with overlap** FOVs
Authors:
Elena Luna,
Juan C. SanMiguel,
Jose M. Martínez,
Marcos Escudero-Viñolo
Abstract:
Multi-Target Multi-Camera (MTMC) vehicle tracking is an essential task of visual traffic monitoring, one of the main research fields of Intelligent Transportation Systems. Several offline approaches have been proposed to address this task; however, they are not compatible with real-world applications due to their high latency and post-processing requirements. In this paper, we present a new low-la…
▽ More
Multi-Target Multi-Camera (MTMC) vehicle tracking is an essential task of visual traffic monitoring, one of the main research fields of Intelligent Transportation Systems. Several offline approaches have been proposed to address this task; however, they are not compatible with real-world applications due to their high latency and post-processing requirements. In this paper, we present a new low-latency online approach for MTMC tracking in scenarios with partially overlap** fields of view (FOVs), such as road intersections. Firstly, the proposed approach detects vehicles at each camera. Then, the detections are merged between cameras by applying cross-camera clustering based on appearance and location. Lastly, the clusters containing different detections of the same vehicle are temporally associated to compute the tracks on a frame-by-frame basis. The experiments show promising low-latency results while addressing real-world challenges such as the a priori unknown and time-varying number of targets and the continuous state estimation of them without performing any post-processing of the trajectories.
△ Less
Submitted 8 February, 2021;
originally announced February 2021.
-
Secant acceleration of sequential residual methods for solving large-scale nonlinear systems of equations
Authors:
Ernesto G. Birgin,
J. M. Martínez
Abstract:
Sequential Residual Methods try to solve nonlinear systems of equations $F(x)=0$ by iteratively updating the current approximate solution along a residual-related direction. Therefore, memory requirements are minimal and, consequently, these methods are attractive for solving large-scale nonlinear systems. However, the convergence of these algorithms may be slow in critical cases; therefore, accel…
▽ More
Sequential Residual Methods try to solve nonlinear systems of equations $F(x)=0$ by iteratively updating the current approximate solution along a residual-related direction. Therefore, memory requirements are minimal and, consequently, these methods are attractive for solving large-scale nonlinear systems. However, the convergence of these algorithms may be slow in critical cases; therefore, acceleration procedures are welcome. In this paper, we suggest to employ a variation of the Sequential Secant Method in order to accelerate Sequential Residual Methods. The performance of the resulting algorithm is illustrated by applying it to the solution of very large problems coming from the discretization of partial differential equations.
△ Less
Submitted 29 July, 2021; v1 submitted 24 December, 2020;
originally announced December 2020.
-
Economic inexact restoration for derivative-free expensive function minimization and applications
Authors:
Ernesto G. Birgin,
Natasa Krejić,
José Mario Martínez
Abstract:
The Inexact Restoration approach has proved to be an adequate tool for handling the problem of minimizing an expensive function within an arbitrary feasible set by using different degrees of precision in the objective function. The Inexact Restoration framework allows one to obtain suitable convergence and complexity results for an approach that rationally combines low- and high-precision evaluati…
▽ More
The Inexact Restoration approach has proved to be an adequate tool for handling the problem of minimizing an expensive function within an arbitrary feasible set by using different degrees of precision in the objective function. The Inexact Restoration framework allows one to obtain suitable convergence and complexity results for an approach that rationally combines low- and high-precision evaluations. In the present research, it is recognized that many problems with expensive objective functions are nonsmooth and, sometimes, even discontinuous. Having this in mind, the Inexact Restoration approach is extended to the nonsmooth or discontinuous case. Although optimization phases that rely on smoothness cannot be used in this case, basic convergence and complexity results are recovered. A derivative-free optimization phase is defined and the subproblems that arise at this phase are solved using a regularization approach that take advantage of different notions of stationarity. The new methodology is applied to the problem of reproducing a controlled experiment that mimics the failure of a dam.
△ Less
Submitted 3 June, 2021; v1 submitted 18 September, 2020;
originally announced September 2020.
-
On complexity and convergence of high-order coordinate descent algorithms for smooth nonconvex box-constrained minimization
Authors:
V. S. Amaral,
R. Andreani,
E. G. Birgin,
D. S. Marcondes,
J. M. Martínez
Abstract:
Coordinate descent methods have considerable impact in global optimization because global (or, at least, almost global) minimization is affordable for low-dimensional problems. Coordinate descent methods with high-order regularized models for smooth nonconvex box-constrained minimization are introduced in this work. High-order stationarity asymptotic convergence and first-order stationarity worst-…
▽ More
Coordinate descent methods have considerable impact in global optimization because global (or, at least, almost global) minimization is affordable for low-dimensional problems. Coordinate descent methods with high-order regularized models for smooth nonconvex box-constrained minimization are introduced in this work. High-order stationarity asymptotic convergence and first-order stationarity worst-case evaluation complexity bounds are established. The computer work that is necessary for obtaining first-order $\varepsilon$-stationarity with respect to the variables of each coordinate-descent block is $O(\varepsilon^{-(p+1)/p})$ whereas the computer work for getting first-order $\varepsilon$-stationarity with respect to all the variables simultaneously is $O(\varepsilon^{-(p+1)})$. Numerical examples involving multidimensional scaling problems are presented. The numerical performance of the methods is enhanced by means of coordinate-descent strategies for choosing initial points.
△ Less
Submitted 2 February, 2022; v1 submitted 3 September, 2020;
originally announced September 2020.
-
Group testing with nested pools
Authors:
Inés Armendáriz,
Pablo A. Ferrari,
Daniel Fraiman,
José M. Martínez,
Silvina Ponce Dawson
Abstract:
In order to identify the infected individuals of a population, their samples are divided in equally sized groups called pools and a single laboratory test is applied to each pool. Individuals whose samples belong to pools that test negative are declared healthy, while each pool that tests positive is divided into smaller, equally sized pools which are tested in the next stage. In the $(k+1)$-th st…
▽ More
In order to identify the infected individuals of a population, their samples are divided in equally sized groups called pools and a single laboratory test is applied to each pool. Individuals whose samples belong to pools that test negative are declared healthy, while each pool that tests positive is divided into smaller, equally sized pools which are tested in the next stage. In the $(k+1)$-th stage all remaining samples are tested. If $p<1-3^{-1/3}$, we minimize the expected number of tests per individual as a function of the number $k+1$ of stages, and of the pool sizes in the first $k$ stages. We show that for each $p\in (0, 1-3^{-1/3})$ the optimal choice is one of four possible schemes, which are explicitly described. We conjecture that for each $p$, the optimal choice is one of the two sequences of pool sizes $(3^k\text{ or }3^{k-1}4,3^{k-1},\dots,3^2,3 )$, with a precise description of the range of $p$'s where each is optimal. The conjecture is supported by overwhelming numerical evidence for $p>2^{-51}$. We also show that the cost of the best among the schemes $(3^k,\dots,3)$ is of order $O\big(p\log(1/p)\big)$, comparable to the information theoretical lower bound $p\log_2(1/p)+(1-p)\log_2(1/(1-p))$, the entropy of a Bernoulli$(p)$ random variable.
△ Less
Submitted 4 October, 2021; v1 submitted 27 May, 2020;
originally announced May 2020.
-
An infinite family of counterexamples to a conjecture on positivity
Authors:
J. Miquel Martínez
Abstract:
Recently, G. Mason has produced a counterexample of order 128 to a conjecture in conformal field theory and tensor category theory in [Ma]. Here we easily produce an infinite family of counterexamples, the smallest of which has order 72.
Recently, G. Mason has produced a counterexample of order 128 to a conjecture in conformal field theory and tensor category theory in [Ma]. Here we easily produce an infinite family of counterexamples, the smallest of which has order 72.
△ Less
Submitted 22 October, 2019;
originally announced October 2019.
-
GNU-Octave Como Alternativa de Simulación de Sistemas Dinámicos No Lineales en la Enseñanza de la Ingeniería
Authors:
Felipe de Jesús Torres,
Monserrat Sugey Arredondo,
José Manuel Martinez,
Víctor Manuel Ocampo
Abstract:
This paper presents a proposed alternative to simulate non-linear dynamical systems. This has an application in bachelor programs like: Electrical and mechanical engineering, Networks and Telecommunications engineering, Mechanical engineering and more, than they are taught in several public universities in Guerrero state. Commonly, the computer devices used for simulations require of high hardware…
▽ More
This paper presents a proposed alternative to simulate non-linear dynamical systems. This has an application in bachelor programs like: Electrical and mechanical engineering, Networks and Telecommunications engineering, Mechanical engineering and more, than they are taught in several public universities in Guerrero state. Commonly, the computer devices used for simulations require of high hardware capacity to support the simulation software. Moreover, the simulation software in the majority of the cases is under license permission. For these reasons, implementing a simulation lab in a public university is very high cost. Thus, we show an alternative by using a commercial development board Raspberry Pi supporting the GNU-Octave software, which is a free software, to simulate non-linear dynamical systems like a 4 grades of freedom SCARA robot and a rotational inverted pendulum. The comparision of the simulated dynamical models in both the specialized software and the proposed free software, exhibit the viability of the proposed alternative.
△ Less
Submitted 29 August, 2019;
originally announced September 2019.
-
Complexity and performance of an Augmented Lagrangian algorithm
Authors:
E. G. Birgin,
J. M. Martínez
Abstract:
Algencan is a well established safeguarded Augmented Lagrangian algorithm introduced in [R. Andreani, E. G. Birgin, J. M. Martínez and M. L. Schuverdt, On Augmented Lagrangian methods with general lower-level constraints, SIAM Journal on Optimization 18, pp. 1286-1309, 2008]. Complexity results that report its worst-case behavior in terms of iterations and evaluations of functions and derivatives…
▽ More
Algencan is a well established safeguarded Augmented Lagrangian algorithm introduced in [R. Andreani, E. G. Birgin, J. M. Martínez and M. L. Schuverdt, On Augmented Lagrangian methods with general lower-level constraints, SIAM Journal on Optimization 18, pp. 1286-1309, 2008]. Complexity results that report its worst-case behavior in terms of iterations and evaluations of functions and derivatives that are necessary to obtain suitable stop** criteria are presented in this work. In addition, the computational performance of a new version of the method is presented, which shows that the updated software is a useful tool for solving large-scale constrained optimization problems.
△ Less
Submitted 4 July, 2019;
originally announced July 2019.
-
On guiding video object segmentation
Authors:
Diego Ortego,
Kevin McGuinness,
Juan C. SanMiguel,
Eric Arazo,
José M. Martínez,
Noel E. O'Connor
Abstract:
This paper presents a novel approach for segmenting moving objects in unconstrained environments using guided convolutional neural networks. This guiding process relies on foreground masks from independent algorithms (i.e. state-of-the-art algorithms) to implement an attention mechanism that incorporates the spatial location of foreground and background to compute their separated representations.…
▽ More
This paper presents a novel approach for segmenting moving objects in unconstrained environments using guided convolutional neural networks. This guiding process relies on foreground masks from independent algorithms (i.e. state-of-the-art algorithms) to implement an attention mechanism that incorporates the spatial location of foreground and background to compute their separated representations. Our approach initially extracts two kinds of features for each frame using colour and optical flow information. Such features are combined following a multiplicative scheme to benefit from their complementarity. These unified colour and motion features are later processed to obtain the separated foreground and background representations. Then, both independent representations are concatenated and decoded to perform foreground segmentation. Experiments conducted on the challenging DAVIS 2016 dataset demonstrate that our guided representations not only outperform non-guided, but also recent and top-performing video object segmentation algorithms.
△ Less
Submitted 25 April, 2019;
originally announced April 2019.
-
Gaia16apd -- a link between fast-and slowly-declining type I superluminous supernovae
Authors:
T. Kangas,
N. Blagorodnova,
S. Mattila,
P. Lundqvist,
M. Fraser,
U. Burgaz,
E. Cappellaro,
J. M. Carrasco Martínez,
N. Elias-Rosa,
L. K. Hardy,
J. Harmanen,
E. Y. Hsiao,
J. Isern,
E. Kankare,
Z. Kołaczkowski,
M. B. Nielsen,
T. M. Reynolds,
L. Rhodes,
A. Somero,
M. D. Stritzinger,
Ł. Wyrzykowski
Abstract:
We present ultraviolet, optical and infrared photometry and optical spectroscopy of the type Ic superluminous supernova (SLSN) Gaia16apd (= SN 2016eay), covering its evolution from 26 d before the $g$-band peak to 234.1 d after the peak. Gaia16apd was followed as a part of the NOT Unbiased Transient Survey (NUTS). It is one of the closest SLSNe known ($z = 0.102\pm0.001$), with detailed optical an…
▽ More
We present ultraviolet, optical and infrared photometry and optical spectroscopy of the type Ic superluminous supernova (SLSN) Gaia16apd (= SN 2016eay), covering its evolution from 26 d before the $g$-band peak to 234.1 d after the peak. Gaia16apd was followed as a part of the NOT Unbiased Transient Survey (NUTS). It is one of the closest SLSNe known ($z = 0.102\pm0.001$), with detailed optical and ultraviolet (UV) observations covering the peak. Gaia16apd is a spectroscopically typical type Ic SLSN, exhibiting the characteristic blue early spectra with O II absorption, and reaches a peak $M_{g} = -21.8 \pm 0.1$ mag. However, photometrically it exhibits an evolution intermediate between the fast- and slowly-declining type Ic SLSNe, with an early evolution closer to the fast-declining events. Together with LSQ12dlf, another SLSN with similar properties, it demonstrates a possible continuum between fast- and slowly-declining events. It is unusually UV-bright even for a SLSN, reaching a non-$K$-corrected $M_{uvm2} \simeq -23.3$ mag, the only other type Ic SLSN with similar UV brightness being SN 2010gx. Assuming that Gaia16apd was powered by magnetar spin-down, we derive a period of $P = 1.9\pm0.2$ ms and a magnetic field of $B = 1.9\pm0.2 \times 10^{14}$ G for the magnetar. The estimated ejecta mass is between 8 and 16 $\mathrm{M}_{\odot}$ and the kinetic energy between 1.3 and $2.5 \times 10^{52}$ erg, depending on opacity and assuming that the entire ejecta is swept up into a thin shell. Despite the early photometric differences, the spectra at late times are similar to slowly-declining type Ic SLSNe, implying that the two subclasses originate from similar progenitors.
△ Less
Submitted 5 June, 2017; v1 submitted 30 November, 2016;
originally announced November 2016.
-
Noise-assisted quantum electron transfer in photosynthetic complexes
Authors:
Alexander I. Nesterov,
Gennady P. Berman,
José Manuel Sánchez Martínez,
Richard T. Sayre
Abstract:
Electron transfer (ET) between primary electron donors and acceptors is modeled in the photosystem II reaction center (RC). Our model includes (i) two discrete energy levels associated with donor and acceptor, interacting through a dipole-type matrix element and (ii) two continuum manifolds of electron energy levels ("sinks"), which interact directly with the donor and acceptor. Namely, two discre…
▽ More
Electron transfer (ET) between primary electron donors and acceptors is modeled in the photosystem II reaction center (RC). Our model includes (i) two discrete energy levels associated with donor and acceptor, interacting through a dipole-type matrix element and (ii) two continuum manifolds of electron energy levels ("sinks"), which interact directly with the donor and acceptor. Namely, two discrete energy levels of the donor and acceptor are embedded in their independent sinks through the corresponding interaction matrix elements. We also introduce classical (external) noise which acts simultaneously on the donor and acceptor (collective interaction). We derive a closed system of integro-differential equations which describes the non-Markovian quantum dynamics of the ET. A region of parameters is found in which the ET dynamics can be simplified, and described by coupled ordinary differential equations. Using these simplified equations, both sharp and flat redox potentials are analyzed. We analytically and numerically obtain the characteristic parameters that optimize the ET rates and efficiency in this system.
△ Less
Submitted 29 April, 2013;
originally announced April 2013.
-
Dynamics of a magnetic dimer with exchange, dipolar and Dzyalozhinski-Moriya interaction
Authors:
A. F. Franco,
J. M. Martinez,
J. L. Déjardin,
H. Kachkachi
Abstract:
We investigate the dynamics of a magnetic system consisting of two magnetic moments coupled by either exchange, dipole-dipole, or Dzyalozhinski-Moriya interaction. We compare the switching mechanisms and switching rates as induced by the three couplings. For each coupling and each configuration of the two anisotropy axes, we describe the switching modes and, using the kinetic theory of Langer, we…
▽ More
We investigate the dynamics of a magnetic system consisting of two magnetic moments coupled by either exchange, dipole-dipole, or Dzyalozhinski-Moriya interaction. We compare the switching mechanisms and switching rates as induced by the three couplings. For each coupling and each configuration of the two anisotropy axes, we describe the switching modes and, using the kinetic theory of Langer, we provide (semi-)analytical expressions for the switching rate. We then compare the three interactions with regard to their efficiency in the reversal of the net magnetic moment of the dimer. We also investigate how the energy barriers vary with the coupling. For the dipole-dipole interaction we find that the energy barrier may either increase or decrease with the coupling depending on whether the latter is weak or strong. Finally, upon comparing the various switching rates, we find that the dipole-dipole coupling leads to the slowest magnetic dimer, as far as the switching of its net magnetic moment is concerned.
△ Less
Submitted 24 October, 2011; v1 submitted 23 June, 2011;
originally announced June 2011.
-
Continuing dynamic assimilation of the inner region data in hydrodynamics modelling: Optimization approach
Authors:
F. I. Pisnichenko,
I. A. Pisnichenko,
J. M. Martinez,
S. A. Santos
Abstract:
In meteorological and oceanological studies the classical approach for finding the numerical solution of the regional model consists in formulating and solving the Cauchy-Dirichlet problem. The related boundary conditions are obtained by linear interpolation of data available on a coarse grid (global data), to the boundary of regional model. Errors, in boundary conditions, appearing owing to lin…
▽ More
In meteorological and oceanological studies the classical approach for finding the numerical solution of the regional model consists in formulating and solving the Cauchy-Dirichlet problem. The related boundary conditions are obtained by linear interpolation of data available on a coarse grid (global data), to the boundary of regional model. Errors, in boundary conditions, appearing owing to linear interpolation may lead to increasing errors in numerical solution during integration. The methods developed to reduce these errors deal with continuous dynamic assimilation of known global data available inside the regional domain. Essentially, this assimilation procedure performs a nudging of large-scale component of regional model solution to large-scale global data component by introducing the relaxation forcing terms into the regional model equations. As a result, the obtained solution is not a valid numerical solution of the original regional model. In this work we propose the optimization approach which is free from the above-mentioned shortcoming. The formulation of the joint problem of finding the regional model solution and data assimilation, as a PDE-constrained optimization problem, gives the possibility to obtain the exact numerical solution of the regional model. Three simple model examples (ODE Burgers equation, Rossby-Oboukhov equation, Korteweg-de Vries equation) were considered in this paper. The result of performed numerical experiments indicates that the optimization approach can significantly improve the precision of the sought numerical solution, even in the cases in which the solution of Cauchy-Dirichlet problem is very sensitive to the errors in the boundary condition.
△ Less
Submitted 10 January, 2008;
originally announced January 2008.