Search | arXiv e-print repository

MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection

Authors: Heitor R. Medeiros, David Latortue, Fidel Guerrero Pena, Eric Granger, Marco Pedersoli

Abstract: In this paper, we present a different way to use two modalities, in which either one modality or the other is seen by a single model. This can be useful when adapting an unimodal model to leverage more information while respecting a limited computational budget. This would mean having a single model that is able to deal with any modalities. To describe this, we coined the term anymodal learning. A… ▽ More In this paper, we present a different way to use two modalities, in which either one modality or the other is seen by a single model. This can be useful when adapting an unimodal model to leverage more information while respecting a limited computational budget. This would mean having a single model that is able to deal with any modalities. To describe this, we coined the term anymodal learning. An example of this, is a use case where, surveillance in a room when the lights are off would be much more valuable using an infrared modality while a visible one would provide more discriminative information when lights are on. This work investigates how to efficiently leverage visible and infrared/thermal modalities for transformer-based object detection backbone to create an anymodal architecture. Our work does not create any inference overhead during the testing while exploring an effective way to exploit the two modalities during the training. To accomplish such a task, we introduce the novel anymodal training technique: Mixed Patches (MiPa), in conjunction with a patch-wise domain agnostic module, which is responsible of learning the best way to find a common representation of both modalities. This approach proves to be able to balance modalities by reaching competitive results on individual modality benchmarks with the alternative of using an unimodal architecture on three different visible-infrared object detection datasets. Finally, our proposed method, when used as a regularization for the strongest modality, can beat the performance of multimodal fusion methods while only requiring a single modality during inference. Notably, MiPa became the state-of-the-art on the LLVIP visible/infrared benchmark. Code: https://github.com/heitorrapela/MiPa △ Less

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.01492 [pdf, other]

Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge

Authors: Heitor Rapela Medeiros, Masih Aminbeidokhti, Fidel Guerrero Pena, David Latortue, Eric Granger, Marco Pedersoli

Abstract: A common practice in deep learning consists of training large neural networks on massive datasets to perform accurately for different domains and tasks. While this methodology may work well in numerous application areas, it only applies across modalities due to a larger distribution shift in data captured using different sensors. This paper focuses on the problem of adapting a large object detecti… ▽ More A common practice in deep learning consists of training large neural networks on massive datasets to perform accurately for different domains and tasks. While this methodology may work well in numerous application areas, it only applies across modalities due to a larger distribution shift in data captured using different sensors. This paper focuses on the problem of adapting a large object detection model to one or multiple modalities while being efficient. To do so, we propose ModTr as an alternative to the common approach of fine-tuning large models. ModTr consists of adapting the input with a small transformation network trained to minimize the detection loss directly. The original model can therefore work on the translated inputs without any further change or fine-tuning to its parameters. Experimental results on translating from IR to RGB images on two well-known datasets show that this simple ModTr approach provides detectors that can perform comparably or better than the standard fine-tuning without forgetting the original knowledge. This opens the doors to a more flexible and efficient service-based detection pipeline in which, instead of using a different detector for each modality, a unique and unaltered server is constantly running, where multiple modalities with the corresponding translations can query it. Code: https://github.com/heitorrapela/ModTr. △ Less

Submitted 11 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

arXiv:2310.06670 [pdf, other]

Domain Generalization by Rejecting Extreme Augmentations

Authors: Masih Aminbeidokhti, Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Eric Granger, Marco Pedersoli

Abstract: Data augmentation is one of the most effective techniques for regularizing deep learning models and improving their recognition performance in a variety of tasks and domains. However, this holds for standard in-domain settings, in which the training and test data follow the same distribution. For the out-of-domain case, where the test data follow a different and unknown distribution, the best reci… ▽ More Data augmentation is one of the most effective techniques for regularizing deep learning models and improving their recognition performance in a variety of tasks and domains. However, this holds for standard in-domain settings, in which the training and test data follow the same distribution. For the out-of-domain case, where the test data follow a different and unknown distribution, the best recipe for data augmentation is unclear. In this paper, we show that for out-of-domain and domain generalization settings, data augmentation can provide a conspicuous and robust improvement in performance. To do that, we propose a simple training procedure: (i) use uniform sampling on standard data augmentation transformations; (ii) increase the strength transformations to account for the higher data variance expected when working out-of-domain, and (iii) devise a new reward function to reject extreme transformations that can harm the training. With this procedure, our data augmentation scheme achieves a level of accuracy that is comparable to or better than state-of-the-art methods on benchmark domain generalization datasets. Code: \url{https://github.com/Masseeh/DCAug} △ Less

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.04662 [pdf, other]

HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information

Authors: Heitor Rapela Medeiros, Fidel A. Guerrero Pena, Masih Aminbeidokhti, Thomas Dubail, Eric Granger, Marco Pedersoli

Abstract: A powerful way to adapt a visual recognition model to a new domain is through image translation. However, common image translation approaches only focus on generating data from the same distribution as the target domain. Given a cross-modal application, such as pedestrian detection from aerial images, with a considerable shift in data distribution between infrared (IR) to visible (RGB) images, a t… ▽ More A powerful way to adapt a visual recognition model to a new domain is through image translation. However, common image translation approaches only focus on generating data from the same distribution as the target domain. Given a cross-modal application, such as pedestrian detection from aerial images, with a considerable shift in data distribution between infrared (IR) to visible (RGB) images, a translation focused on generation might lead to poor performance as the loss focuses on irrelevant details for the task. In this paper, we propose HalluciDet, an IR-RGB image translation model for object detection. Instead of focusing on reconstructing the original image on the IR modality, it seeks to reduce the detection loss of an RGB detector, and therefore avoids the need to access RGB data. This model produces a new image representation that enhances objects of interest in the scene and greatly improves detection performance. We empirically compare our approach against state-of-the-art methods for image translation and for fine-tuning on IR, and show that our HalluciDet improves detection accuracy in most cases by exploiting the privileged information encoded in a pre-trained RGB detector. Code: https://github.com/heitorrapela/HalluciDet △ Less

Submitted 22 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2024

arXiv:2306.13603 [pdf, ps, other]

Eigenvalue for a problem involving the fractional (p,q)-Laplacian operator and nonlinearity with a singular and a supercritical Sobolev growth

Authors: A. L. A. de Araujo, Aldo H. S. Medeiros

Abstract: In this paper, we are interested in studying the multiplicity, uniqueness, and nonexistence of solutions for a class of singular elliptic eigenvalue problem for the Dirichlet fractional $(p,q)$-Laplacian. The nonlinearity considered involves supercritical Sobolev growth. Our approach is variational togheter with the sub- and supesolution methods, and in this way we can address a wide range of prob… ▽ More In this paper, we are interested in studying the multiplicity, uniqueness, and nonexistence of solutions for a class of singular elliptic eigenvalue problem for the Dirichlet fractional $(p,q)$-Laplacian. The nonlinearity considered involves supercritical Sobolev growth. Our approach is variational togheter with the sub- and supesolution methods, and in this way we can address a wide range of problems not yet contained in the literature. Even when $W^{s_1,p}_0(Ω) \hookrightarrow L^{\infty}\left(Ω\right)$ failing, we establish $\|u\|_{L^{\infty}\left(Ω\right)} \leq C[u]_{s_1,p}$ (for some $C>0$ ), when $u$ is a solution. △ Less

Submitted 23 June, 2023; originally announced June 2023.

MSC Class: 35J75 35R11 35J67 35A15

arXiv:2306.10639 [pdf, ps, other]

A problem involving competing and intrinsic operators

Authors: Aldo H. S. Medeiros, Dumitru Motreanu

Abstract: The main result establishes the existence of a solution in a generalized sense for a nonlinear Dirichlet problem driven by a competing operator and exhibiting a convection term composed with an intrinsic operator. A finite dimensional approximation is constructed. Applications regarding nonhomogeneous Dirichlet problems and equations with convolution are given by choosing an adequate intrinsic ope… ▽ More The main result establishes the existence of a solution in a generalized sense for a nonlinear Dirichlet problem driven by a competing operator and exhibiting a convection term composed with an intrinsic operator. A finite dimensional approximation is constructed. Applications regarding nonhomogeneous Dirichlet problems and equations with convolution are given by choosing an adequate intrinsic operator. △ Less

Submitted 18 June, 2023; originally announced June 2023.

MSC Class: 35J92 47F10 35B45

arXiv:2304.00315 [pdf, ps, other]

On the limiting problems for two eigenvalue systems and variations

Authors: Hamilton P Bueno, Aldo H S Medeiros

Abstract: Let $Ω$ be a bounded, smooth domain. Supposing that $α(p) + β(p) = p$, $\forall\, p \in \left(\frac{N}{s},\infty\right)$ and $\displaystyle\lim_{p \to \infty} α(p)/{p} = θ\in (0,1)$, we consider two systems for the fractional $p$-Laplacian and a variation on the first system. The first system is the following.… ▽ More Let $Ω$ be a bounded, smooth domain. Supposing that $α(p) + β(p) = p$, $\forall\, p \in \left(\frac{N}{s},\infty\right)$ and $\displaystyle\lim_{p \to \infty} α(p)/{p} = θ\in (0,1)$, we consider two systems for the fractional $p$-Laplacian and a variation on the first system. The first system is the following. $$\left\{\begin{array}{ll} (-Δ_p)^{s}u(x) = λα(p) \vert u \vert^{α(p)-2} u \vert v(x_0)\vert^{β(p)} & {\rm in} \ \ Ω,\\ (-Δ_p)^{t}v(x) = λβ(p) \left(\displaystyle\int_Ω\vert u \vert^{α(p)} d x\right) \vert v(x_0) \vert^{β(p)-2} v(x_0) δ_{x_0} & {\rm in} \ \ Ω,\\ u= v=0 & {\rm in} \ \mathbb{R}^N\setminusΩ, \end{array}\right. $$ where $x_0$ is a point in $\overlineΩ$, $λ$ is a parameter, $0<s\leq t<1$, $δ_x$ denotes the Dirac delta distribution centered at $x$ and $p>N/s$. A variation on this system is obtained by considering $x_0$ to be a point where the function $v$ attains its maximum. The second one is the system $$\left\{\begin{array}{ll} (-Δ_p)^{s}u(x) = λα(p) \vert u(x_1) \vert^{α(p)-2} u(x_1) \vert v(x_2) \vert^{β(p)} δ_{x_1} & {\rm in} \ \ Ω,\\ (-Δ_p)^{t}v(x) = λβ(p) \vert u(x_1) \vert^{α(p)} \vert v(x_2) \vert^{β(p)-2} v(x_2) δ_{x_2} & {\rm in} \ \ Ω,\\ u= v=0 & {\rm in} \ \mathbb{R}^N\setminusΩ, \end{array}\right. $$ where $x_1,x_2\in Ω$ are arbitrary, $x_1\neq x_2$. Although we not consider here, a variation similar to that on the first system can be solved by practically the same method we apply. We obtain solutions for the systems (including the variation on the first system) and consider the asymptotic behavior of these solutions as $p\to\infty$. We prove that they converge, in the viscosity sense, to solutions of problems on $u$ and $v$. △ Less

Submitted 1 April, 2023; originally announced April 2023.

Comments: 18 pages

MSC Class: 35R11; 35A15; 35D40

arXiv:2303.05099 [pdf, other]

doi 10.1093/mnras/stad708

Observations of two super fast rotator NEAs: 2021 NY$_1$ and 2022 AB

Authors: J. Licandro, M. Popescu, E. Tatsumi, M. R. Alarcon, M. Serra-Ricart, H. Medeiros, D. Morate, J. de Leon

Abstract: In the framework of the Visible NEAs Observations Survey (ViNOS) that uses several telescopes at the Canary Islands observatories since 2018, we observed two super fast rotator NEAs, 2021 NY$_1$ and 2022 AB. We obtained photometry and spectrophotometry of both targets and visible spectroscopy of 2022 AB. Light curves of 2021 NY$_1$ obtained in 4 different nights between Sept. 30 and Oct. 16, 2021… ▽ More In the framework of the Visible NEAs Observations Survey (ViNOS) that uses several telescopes at the Canary Islands observatories since 2018, we observed two super fast rotator NEAs, 2021 NY$_1$ and 2022 AB. We obtained photometry and spectrophotometry of both targets and visible spectroscopy of 2022 AB. Light curves of 2021 NY$_1$ obtained in 4 different nights between Sept. 30 and Oct. 16, 2021 return a rotation period $P=13.3449\pm0.0013$ minutes and a light curve amplitude $A = 1.00$ mag. We found that 2021 NY$_1$ is a very elongated super fast rotator with an axis ratio $a/b \ge 3.6$. We also report colours $(g-r) = 0.664 \pm 0.013$, $(r-i) = 0.186 \pm 0.013$, and $(i-z_s) = -0.117 \pm 0.012$ mag. These are compatible with an S-type asteroid. The light curves of 2022 AB obtained on Jan. 5 and Jan. 8, 2021 show a rotation period $P=3.0304\pm0.0008$ minutes, with amplitudes $A = 0.52$ and $A =0.54$ mag. 2022 AB is also an elongated object with axis ratio $a/b \ge 1.6$. The obtained colours are $(g-r) = 0.400 \pm 0.017$, $(r-i) = 0.133 \pm 0.017$, and $(i-z_s) = 0.093 \pm 0.016$. These colours are similar to those of the X-types, but with an unusually high $(g-r)$ value. Spectra obtained on Jan. 12 and Jan. 14, 2022, are consistent with the reported colours. The spectral upturn over the 0.4 - 0.6 $μm $ region of 2022 AB does not fit with any known asteroid taxonomical class or meteorite spectrum, confirming its unusual surface properties. △ Less

Submitted 9 March, 2023; originally announced March 2023.

Comments: 9 pages, 7 figures

arXiv:2301.00190

doi 10.1109/TSMC.2022.3225252

Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security Checkpoints

Authors: Abubakar Siddique, Henry Medeiros

Abstract: We introduce a novel framework to track multiple objects in overhead camera videos for airport checkpoint security scenarios where targets correspond to passengers and their baggage items. We propose a Self-Supervised Learning (SSL) technique to provide the model information about instance segmentation uncertainty from overhead images. Our SSL approach improves object detection by employing a test… ▽ More We introduce a novel framework to track multiple objects in overhead camera videos for airport checkpoint security scenarios where targets correspond to passengers and their baggage items. We propose a Self-Supervised Learning (SSL) technique to provide the model information about instance segmentation uncertainty from overhead images. Our SSL approach improves object detection by employing a test-time data augmentation and a regression-based, rotation-invariant pseudo-label refinement technique. Our pseudo-label generation method provides multiple geometrically-transformed images as inputs to a Convolutional Neural Network (CNN), regresses the augmented detections generated by the network to reduce localization errors, and then clusters them using the mean-shift algorithm. The self-supervised detector model is used in a single-camera tracking algorithm to generate temporal identifiers for the targets. Our method also incorporates a multi-view trajectory association mechanism to maintain consistent temporal identifiers as passengers travel across camera views. An evaluation of detection, tracking, and association performances on videos obtained from multiple overhead cameras in a realistic airport checkpoint environment demonstrates the effectiveness of the proposed approach. Our results show that self-supervision improves object detection accuracy by up to $42\%$ without increasing the inference time of the model. Our multi-camera association method achieves up to $89\%$ multi-object tracking accuracy with an average computation time of less than $15$ ms. △ Less

Submitted 30 September, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

Comments: Mistaken upload. See arXiv:2007.07924 for the latest version

Journal ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Early Access, 14 December 2022

arXiv:2212.12042 [pdf, other]

Re-basin via implicit Sinkhorn differentiation

Authors: Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli

Abstract: The recent emergence of new algorithms for permuting models into functionally equivalent regions of the solution space has shed some light on the complexity of error surfaces, and some promising properties like mode connectivity. However, finding the right permutation is challenging, and current optimization techniques are not differentiable, which makes it difficult to integrate into a gradient-b… ▽ More The recent emergence of new algorithms for permuting models into functionally equivalent regions of the solution space has shed some light on the complexity of error surfaces, and some promising properties like mode connectivity. However, finding the right permutation is challenging, and current optimization techniques are not differentiable, which makes it difficult to integrate into a gradient-based optimization, and often leads to sub-optimal solutions. In this paper, we propose a Sinkhorn re-basin network with the ability to obtain the transportation plan that better suits a given objective. Unlike the current state-of-art, our method is differentiable and, therefore, easy to adapt to any task within the deep learning domain. Furthermore, we show the advantage of our re-basin method by proposing a new cost function that allows performing incremental learning by exploiting the linear mode connectivity property. The benefit of our method is compared against similar approaches from the literature, under several conditions for both optimal transport finding and linear mode connectivity. The effectiveness of our continual learning method based on re-basin is also shown for several common benchmark datasets, providing experimental results that are competitive with state-of-art results from the literature. △ Less

Submitted 22 December, 2022; originally announced December 2022.

arXiv:2211.16946 [pdf, ps, other]

Existence of nonnegative solutions for fractional Schrödinger equations with Neumann condition

Authors: Hamilton Bueno, Aldo H. S. Medeiros

Abstract: In this paper we study a Neumann problem for the fractional Laplacian, namely \begin{equation}\left\{ \begin{array}{rcll} \varepsilon^{2s}(- Δ)^{s}u + u &=& f(u) \ \ &\mbox{in} \ \ Ω\\ \mathcal{N}_{s}u &=& 0 , \,\, &\text{in} \,\, \mathbb{R}^{N}\backslash Ω\end{array}\right. \end{equation} where $Ω\subset \mathbb{R}^{N}$ is a smooth bounded domain, $N>2s$, $s \in (0,1)$, $\varepsilon > 0$ is a par… ▽ More In this paper we study a Neumann problem for the fractional Laplacian, namely \begin{equation}\left\{ \begin{array}{rcll} \varepsilon^{2s}(- Δ)^{s}u + u &=& f(u) \ \ &\mbox{in} \ \ Ω\\ \mathcal{N}_{s}u &=& 0 , \,\, &\text{in} \,\, \mathbb{R}^{N}\backslash Ω\end{array}\right. \end{equation} where $Ω\subset \mathbb{R}^{N}$ is a smooth bounded domain, $N>2s$, $s \in (0,1)$, $\varepsilon > 0$ is a parameter and $\mathcal{N}_{s}$ is the nonlocal normal derivative introduced by Dipierro, Ros-Oton, and Valdinoci. We establish the existence of a nonnegative, non-constant small energy solution $u_{\varepsilon}$, and we use the Moser-Nash iteration procedure to show that $u_{\varepsilon} \in L^{\infty}(Ω)$. △ Less

Submitted 30 November, 2022; originally announced November 2022.

Comments: 15 pages

MSC Class: 35R11; 35A01; 35B45

arXiv:2211.14817 [pdf, other]

Principal curves to fractional $m$-Laplacian systems and related maximum and comparison principles

Authors: Anderson Luis Albuquerque de Araujo, Edir Junior Ferreira Leite, Aldo Henrique de Souza Medeiros

Abstract: In this paper we develop a comprehensive study on principal eigenvalues and both the (weak and strong) maximum and comparison principles related to an important class of nonlinear systems involving fractional $m$-Laplacian operators. Explicit lower bounds for principal eigenvalues of this system in terms of the diameter of $Ω$ are also proved. As application, given $λ,μ\geq 0$ we measure explicitl… ▽ More In this paper we develop a comprehensive study on principal eigenvalues and both the (weak and strong) maximum and comparison principles related to an important class of nonlinear systems involving fractional $m$-Laplacian operators. Explicit lower bounds for principal eigenvalues of this system in terms of the diameter of $Ω$ are also proved. As application, given $λ,μ\geq 0$ we measure explicitly how small has to be $\text{diam}(Ω)$ so that weak and strong maximum principles associated to this problem hold in $Ω$. △ Less

Submitted 19 April, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

Comments: 21 pages

MSC Class: 35B50; 35B51; 93B60; 35J70; 35J92; 35P15

arXiv:2211.01435 [pdf, other]

doi 10.1093/mnras/stac3199

A Targeted Search for Main Belt Comets

Authors: Léa Ferellec, Colin Snodgrass, Alan Fitzsimmons, Agata Rożek, Daniel Gardener, Richard Smith, Hissa Medeiros, Cyrielle Opitom, Henry H. Hsieh

Abstract: Main Belt Comets (MBCs) exhibit sublimation-driven activity while occupying asteroid-like orbits in the Main Asteroid Belt. MBCs and candidates show stronger clustering of their longitudes of perihelion around 15° than other objects from the Outer Main Belt (OMB). This potential property of MBCs could facilitate the discovery of new candidates by observing objects in similar orbits. We acquired de… ▽ More Main Belt Comets (MBCs) exhibit sublimation-driven activity while occupying asteroid-like orbits in the Main Asteroid Belt. MBCs and candidates show stronger clustering of their longitudes of perihelion around 15° than other objects from the Outer Main Belt (OMB). This potential property of MBCs could facilitate the discovery of new candidates by observing objects in similar orbits. We acquired deep r-band images of 534 targeted asteroids using the INT/WFC between 2018 and 2020. Our sample is comprised of OMB objects observed near perihelion, with longitudes of perihelion between 0° and 30° and orbital parameters similar to knowns MBCs. Our pipeline applied activity detection methods to 319 of these objects to look for tails or comae, and we visually inspected the remaining asteroids. Our activity detection pipeline highlighted a faint anti-solar tail-like feature around 2001 NL19 (279870) observed on 2018 November 07, six months after perihelion. This is consistent with cometary activity but additional observations of this object will be needed during its next perihelion to investigate its potential MBC status. If it is active our survey yields a detection rate of $\sim$1:300, which is higher than previous similar surveys, supporting the idea of dynamical clustering of MBCs. If not, it is consistent with previously estimated abundance rates of MBCs in the OMB (<1:500). △ Less

Submitted 2 November, 2022; originally announced November 2022.

Comments: 13 pages, 9 figures, accepted for publication in MNRAS

arXiv:2209.11335 [pdf, other]

Privacy-Preserving Person Detection Using Low-Resolution Infrared Cameras

Authors: Thomas Dubail, Fidel Alejandro Guerrero Peña, Heitor Rapela Medeiros, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli

Abstract: In intelligent building management, knowing the number of people and their location in a room are important for better control of its illumination, ventilation, and heating with reduced costs and improved comfort. This is typically achieved by detecting people using compact embedded devices that are installed on the room's ceiling, and that integrate low-resolution infrared camera, which conceals… ▽ More In intelligent building management, knowing the number of people and their location in a room are important for better control of its illumination, ventilation, and heating with reduced costs and improved comfort. This is typically achieved by detecting people using compact embedded devices that are installed on the room's ceiling, and that integrate low-resolution infrared camera, which conceals each person's identity. However, for accurate detection, state-of-the-art deep learning models still require supervised training using a large annotated dataset of images. In this paper, we investigate cost-effective methods that are suitable for person detection based on low-resolution infrared images. Results indicate that for such images, we can reduce the amount of supervision and computation, while still achieving a high level of detection accuracy. Going from single-shot detectors that require bounding box annotations of each person in an image, to auto-encoders that only rely on unlabelled images that do not contain people, allows for considerable savings in terms of annotation costs, and for models with lower computational costs. We validate these experimental findings on two challenging top-view datasets with low-resolution infrared images. △ Less

Submitted 22 September, 2022; originally announced September 2022.

arXiv:2209.04618 [pdf, other]

doi 10.1109/LRA.2022.3217000

Self-supervised Learning for Panoptic Segmentation of Multiple Fruit Flower Species

Authors: Abubakar Siddique, Amy Tabb, Henry Medeiros

Abstract: Convolutional neural networks trained using manually generated labels are commonly used for semantic or instance segmentation. In precision agriculture, automated flower detection methods use supervised models and post-processing techniques that may not perform consistently as the appearance of the flowers and the data acquisition conditions vary. We propose a self-supervised learning strategy to… ▽ More Convolutional neural networks trained using manually generated labels are commonly used for semantic or instance segmentation. In precision agriculture, automated flower detection methods use supervised models and post-processing techniques that may not perform consistently as the appearance of the flowers and the data acquisition conditions vary. We propose a self-supervised learning strategy to enhance the sensitivity of segmentation models to different flower species using automatically generated pseudo-labels. We employ a data augmentation and refinement approach to improve the accuracy of the model predictions. The augmented semantic predictions are then converted to panoptic pseudo-labels to iteratively train a multi-task model. The self-supervised model predictions can be refined with existing post-processing approaches to further improve their accuracy. An evaluation on a multi-species fruit tree flower dataset demonstrates that our method outperforms state-of-the-art models without computationally expensive post-processing steps, providing a new baseline for flower detection applications. △ Less

Submitted 10 September, 2022; originally announced September 2022.

Comments: 8 pages, 7 figures

Journal ref: IEEE Robotics and Automation Letters ( Volume: 7, Issue: 4, October 2022)

arXiv:2109.00463 [pdf, ps, other]

doi 10.1051/0004-6361/202140991

Properties of slowly rotating asteroids from the Convex Inversion Thermophysical Model

Authors: A. Marciniak, J. Ďurech, V. Alí-Lagoa, W. Ogłoza, R. Szakáts, T. G. Müller, L. Molnár, A. Pál, F. Monteiro, P. Arcoverde, R. Behrend, Z. Benkhaldoun, L. Bernasconi, J. Bosch, S. Brincat, L. Brunetto, M. Butkiewicz - Bąk, F. Del Freo, R. Duffard, M. Evangelista-Santana, G. Farroni, S. Fauvaud, M. Fauvaud, M. Ferrais, S. Geier , et al. (51 additional authors not shown)

Abstract: Results from the TESS mission showed that previous studies strngly underestimated the number of slow rotators, revealing the importance of studying those asteroids. For most slowly rotating asteroids (P > 12), no spin and shape model is available because of observation selection effects. This hampers determination of their thermal parameters and accurate sizes. We continue our campaign in minimi… ▽ More Results from the TESS mission showed that previous studies strngly underestimated the number of slow rotators, revealing the importance of studying those asteroids. For most slowly rotating asteroids (P > 12), no spin and shape model is available because of observation selection effects. This hampers determination of their thermal parameters and accurate sizes. We continue our campaign in minimising selection effects among main belt asteroids. Our targets are slow rotators with low light-curve amplitudes. The goal is to provide their scaled spin and shape models together with thermal inertia, albedo, and surface roughness to complete the statistics. Rich multi-apparition datasets of dense light curves are supplemented with data from Kepler and TESS. In addition to data in the visible range, we also use thermal data from infrared space observatories (IRAS, Akari and WISE) in a combined optimisation process using the Convex Inversion Thermophysical Model (CITPM). This novel method has so far been applied to only a few targets, and in this work we further validate the method. We present the models of 16 slow rotators. All provide good fits to both thermal and visible data. The obtained sizes are on average accurate at the 5% precision, with diameters in the range from 25 to 145 km. The rotation periods of our targets range from 11 to 59 hours, and the thermal inertia covers a wide range of values, from 2 to <400 SI units, not showing any correlation with the period. With this work we increase the sample of slow rotators with reliable spin and shape models and known thermal inertia by 40%. The thermal inertia values of our sample do not display a previously suggested increasing trend with rotation period, which might be due to their small skin depth. △ Less

Submitted 1 September, 2021; originally announced September 2021.

Comments: Accepted to Astronomy & Astrophysics. 10 pages + appendices

Journal ref: A&A 654, A87 (2021)

arXiv:2108.01968 [pdf, other]

doi 10.1103/PhysRevB.104.195307

Electric field-induced edge state oscillations in GaSb/InAs quantum wells

Authors: Marcos H. L. de Medeiros, Raphael L. R. C. Teixeira, Guilherme M. Sipahi, Luis G. G. V. Dias da Silva

Abstract: Inverted-gap GaSb/InAs quantum wells have long been predicted to show quantum spin Hall insulator (QSHI) behavior. The experimental characterization of the QSHI phase in these systems has relied on the presence of quantized edge transport near charge neutrality. However, experimental data showing the presence of edge conductance in the \emph{trivial} regime suggest that additional experimental sig… ▽ More Inverted-gap GaSb/InAs quantum wells have long been predicted to show quantum spin Hall insulator (QSHI) behavior. The experimental characterization of the QSHI phase in these systems has relied on the presence of quantized edge transport near charge neutrality. However, experimental data showing the presence of edge conductance in the \emph{trivial} regime suggest that additional experimental signatures are needed to characterize the QSHI phase. Here we show that electric field- induced gap oscillations can be used as an indicator of the presence of helical edge states in system. By studying a realistic low-energy model GaSb/InAs quantum wells derived from $k \cdot p$ band theory, we show that such oscillations are bound to appear in narrow samples as the system is driven to the the the topological phase by the electric field. Our results can serve as a guide for the search of additional experimental signatures of the presence of topologically-protected helical edge states in GaSb/InAs systems. △ Less

Submitted 4 August, 2021; originally announced August 2021.

Comments: 9 pages, 9 figures

Journal ref: Phys. Rev. B 104 195307 (2021)

arXiv:2107.02984 [pdf, ps, other]

doi 10.1016/j.cviu.2022.103479

Deep Convolutional Correlation Iterative Particle Filter for Visual Tracking

Authors: Reza Jalil Mozhdehi, Henry Medeiros

Abstract: This work proposes a novel framework for visual tracking based on the integration of an iterative particle filter, a deep convolutional neural network, and a correlation filter. The iterative particle filter enables the particles to correct themselves and converge to the correct target position. We employ a novel strategy to assess the likelihood of the particles after the iterations by applying K… ▽ More This work proposes a novel framework for visual tracking based on the integration of an iterative particle filter, a deep convolutional neural network, and a correlation filter. The iterative particle filter enables the particles to correct themselves and converge to the correct target position. We employ a novel strategy to assess the likelihood of the particles after the iterations by applying K-means clustering. Our approach ensures a consistent support for the posterior distribution. Thus, we do not need to perform resampling at every video frame, improving the utilization of prior distribution information. Experimental results on two different benchmark datasets show that our tracker performs favorably against state-of-the-art methods. △ Less

Submitted 3 January, 2023; v1 submitted 6 July, 2021; originally announced July 2021.

Comments: 25 pages, 10 figures, 1 table

Journal ref: Computer Vision and Image Understanding (ELSEVIER), Volume 222, 103479, 2022

arXiv:2010.13855 [pdf, other]

doi 10.1051/0004-6361/202039263

Spectral characterisation of 14 V-type candidate asteroids from the MOVIS catalogue

Authors: Pavol Matlovič, Julia de Leon, Hissa Medeiros, Marcel Popescu, Juan Luis Rizos, Jad-Alexandru Mansour

Abstract: Most of the currently known basaltic (V-type) asteroids are believed to be past or present members of the Vesta dynamical family. The rising discoveries of V-type asteroids that are not dynamically linked to the Vesta family suggest that a number of major basaltic bodies may have been present during the early stages of the solar system. In this work, we aim to provide a spectral analysis of 14 V-t… ▽ More Most of the currently known basaltic (V-type) asteroids are believed to be past or present members of the Vesta dynamical family. The rising discoveries of V-type asteroids that are not dynamically linked to the Vesta family suggest that a number of major basaltic bodies may have been present during the early stages of the solar system. In this work, we aim to provide a spectral analysis of 14 V-type candidates of various dynamical types, selected from the Moving Objects from VISTA Survey (MOVIS) catalogue. The computed visible and near-infrared (NIR) spectral parameters are used to investigate evidence of space-weathering or mineralogical differences from the expected basaltic composition. Based on the analysis of their visible spectra, we confirm 11 new V-type asteroids: six low-i asteroids - (3188) Jekabsons, (3331) Kvistaberg, (4693) Drummond, (7223) Dolgorukij, (9007) James Bond, and (29733) 1999 BA4; along with four inner-other asteroids - (5524) Lecacheux, (19983) 1990 DW, (51742) 2001 KE$_{55}$, and (90023) 2003 BD$_{13}$; as well as one fugitive - (2275) Cuitlahuac. Additionally, we analysed three peculiar outer main belt candidates based on their visible + NIR spectra. We confirm the diogenite-like composition of (2452) Lyot. The spectrum of asteroid (7302) is inconsistent with a basaltic composition and likely reflects an S-type body. The spectrum of (14390) 1990 QP$_{10}$ shows unique features that suggest a peculiar, unclassified composition. Overall, our results demonstrate the efficiency of the MOVIS catalogue in identifying V-type objects, with a success rate of over 85\%. The identification of V-types in the inner main-belt is more likely due to the presence of the Vesta family and other nearby asteroids that had escaped from the family. In the middle and outer main belt, where the amount of data is more limited, the proportion of false positives increases. △ Less

Submitted 26 October, 2020; originally announced October 2020.

arXiv:2008.12624 [pdf, other]

A Framework for Studying Reinforcement Learning and Sim-to-Real in Robot Soccer

Authors: Hansenclever F. Bassani, Renie A. Delgado, José Nilton de O. Lima Junior, Heitor R. Medeiros, Pedro H. M. Braga, Mateus G. Machado, Lucas H. C. Santos, Alain Tapp

Abstract: This article introduces an open framework, called VSSS-RL, for studying Reinforcement Learning (RL) and sim-to-real in robot soccer, focusing on the IEEE Very Small Size Soccer (VSSS) league. We propose a simulated environment in which continuous or discrete control policies can be trained to control the complete behavior of soccer agents and a sim-to-real method based on domain adaptation to adap… ▽ More This article introduces an open framework, called VSSS-RL, for studying Reinforcement Learning (RL) and sim-to-real in robot soccer, focusing on the IEEE Very Small Size Soccer (VSSS) league. We propose a simulated environment in which continuous or discrete control policies can be trained to control the complete behavior of soccer agents and a sim-to-real method based on domain adaptation to adapt the obtained policies to real robots. Our results show that the trained policies learned a broad repertoire of behaviors that are difficult to implement with handcrafted control policies. With VSSS-RL, we were able to beat human-designed policies in the 2019 Latin American Robotics Competition (LARC), achieving 4th place out of 21 teams, being the first to apply Reinforcement Learning (RL) successfully in this competition. Both environment and hardware specifications are available open-source to allow reproducibility of our results and further studies. △ Less

Submitted 18 August, 2020; originally announced August 2020.

arXiv:2007.07924 [pdf, other]

doi 10.1109/TSMC.2022.3225252

Tracking Passengers and Baggage Items Using Multiple Overhead Cameras at Security Checkpoints

Authors: Abubakar Siddique, Henry Medeiros

Abstract: We introduce a novel framework to track multiple objects in overhead camera videos for airport checkpoint security scenarios where targets correspond to passengers and their baggage items. We propose a self-supervised learning (SSL) technique to provide the model information about instance segmentation uncertainty from overhead images. Our SSL approach improves object detection by employing a test… ▽ More We introduce a novel framework to track multiple objects in overhead camera videos for airport checkpoint security scenarios where targets correspond to passengers and their baggage items. We propose a self-supervised learning (SSL) technique to provide the model information about instance segmentation uncertainty from overhead images. Our SSL approach improves object detection by employing a test-time data augmentation and a regression-based, rotation-invariant pseudo-label refinement technique. Our pseudo-label generation method provides multiple geometrically transformed images as inputs to a convolutional neural network (CNN), regresses the augmented detections generated by the network to reduce localization errors, and then clusters them using the mean-shift algorithm. The self-supervised detector model is used in a single-camera tracking algorithm to generate temporal identifiers for the targets. Our method also incorporates a multiview trajectory association mechanism to maintain consistent temporal identifiers as passengers travel across camera views. An evaluation of detection, tracking, and association performances on videos obtained from multiple overhead cameras in a realistic airport checkpoint environment demonstrates the effectiveness of the proposed approach. Our results show that self-supervision improves object detection accuracy by up to 42% without increasing the inference time of the model. Our multicamera association method achieves up to 89% multiobject tracking accuracy with an average computation time of less than 15 ms. △ Less

Submitted 27 February, 2024; v1 submitted 15 July, 2020; originally announced July 2020.

Comments: 16 pages, 16 figures

Journal ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems ( Volume: 53, Issue: 6, June 2023)

arXiv:2007.07175 [pdf, ps, other]

Unsupervised Spatio-temporal Latent Feature Clustering for Multiple-object Tracking and Segmentation

Authors: Abubakar Siddique, Reza Jalil Mozhdehi, Henry Medeiros

Abstract: Assigning consistent temporal identifiers to multiple moving objects in a video sequence is a challenging problem. A solution to that problem would have immediate ramifications in multiple object tracking and segmentation problems. We propose a strategy that treats the temporal identification task as a spatio-temporal clustering problem. We propose an unsupervised learning approach using a convolu… ▽ More Assigning consistent temporal identifiers to multiple moving objects in a video sequence is a challenging problem. A solution to that problem would have immediate ramifications in multiple object tracking and segmentation problems. We propose a strategy that treats the temporal identification task as a spatio-temporal clustering problem. We propose an unsupervised learning approach using a convolutional and fully connected autoencoder, which we call deep heterogeneous autoencoder, to learn discriminative features from segmentation masks and detection bounding boxes. We extract masks and their corresponding bounding boxes from a pretrained instance segmentation network and train the autoencoders jointly using task-dependent uncertainty weights to generate common latent features. We then construct constraints graphs that encourage associations among objects that satisfy a set of known temporal conditions. The feature vectors and the constraints graphs are then provided to the kmeans clustering algorithm to separate the corresponding data points in the latent space. We evaluate the performance of our method using challenging synthetic and real-world multiple-object video datasets. Our results show that our technique outperforms several state-of-the-art methods. △ Less

Submitted 4 November, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

Comments: 10 pages, 5 figures, accepted by BMVC 2021

arXiv:2006.13682 [pdf, other]

Deep Categorization with Semi-Supervised Self-Organizing Maps

Authors: Pedro H. M. Braga, Heitor R. Medeiros, Hansenclever F. Bassani

Abstract: Nowadays, with the advance of technology, there is an increasing amount of unstructured data being generated every day. However, it is a painful job to label and organize it. Labeling is an expensive, time-consuming, and difficult task. It is usually done manually, which collaborates with the incorporation of noise and errors to the data. Hence, it is of great importance to develo** intelligent… ▽ More Nowadays, with the advance of technology, there is an increasing amount of unstructured data being generated every day. However, it is a painful job to label and organize it. Labeling is an expensive, time-consuming, and difficult task. It is usually done manually, which collaborates with the incorporation of noise and errors to the data. Hence, it is of great importance to develo** intelligent models that can benefit from both labeled and unlabeled data. Currently, works on unsupervised and semi-supervised learning are still being overshadowed by the successes of purely supervised learning. However, it is expected that they become far more important in the longer term. This article presents a semi-supervised model, called Batch Semi-Supervised Self-Organizing Map (Batch SS-SOM), which is an extension of a SOM incorporating some advances that came with the rise of Deep Learning, such as batch training. The results show that Batch SS-SOM is a good option for semi-supervised classification and clustering. It performs well in terms of accuracy and clustering error, even with a small number of labeled samples, as well as when presented to unsupervised data, and shows competitive results in transfer learning scenarios in traditional image classification benchmark datasets. △ Less

Submitted 17 June, 2020; originally announced June 2020.

Comments: Accepted for publication at the 2020 International Joint Conference on Neural Networks (IJCNN)

arXiv:2006.06746 [pdf, ps, other]

Deep Convolutional Likelihood Particle Filter for Visual Tracking

Authors: Reza Jalil Mozhdehi, Henry Medeiros

Abstract: We propose a novel particle filter for convolutional-correlation visual trackers. Our method uses correlation response maps to estimate likelihood distributions and employs these likelihoods as proposal densities to sample particles. Likelihood distributions are more reliable than proposal densities based on target transition distributions because correlation response maps provide additional infor… ▽ More We propose a novel particle filter for convolutional-correlation visual trackers. Our method uses correlation response maps to estimate likelihood distributions and employs these likelihoods as proposal densities to sample particles. Likelihood distributions are more reliable than proposal densities based on target transition distributions because correlation response maps provide additional information regarding the target's location. Additionally, our particle filter searches for multiple modes in the likelihood distribution, which improves performance in target occlusion scenarios while decreasing computational costs by more efficiently sampling particles. In other challenging scenarios such as those involving motion blur, where only one mode is present but a larger search area may be necessary, our particle filter allows for the variance of the likelihood distribution to increase. We tested our algorithm on the Visual Tracker Benchmark v1.1 (OTB100) and our experimental results demonstrate that our framework outperforms state-of-the-art methods. △ Less

Submitted 11 June, 2020; originally announced June 2020.

Comments: Accepted in Transactions on Computational Science & Computational Intelligence, 11 pages, 7 figures

arXiv:2005.05856 [pdf, other]

Probabilistic Semantic Segmentation Refinement by Monte Carlo Region Growing

Authors: Philipe A. Dias, Henry Medeiros

Abstract: Semantic segmentation with fine-grained pixel-level accuracy is a fundamental component of a variety of computer vision applications. However, despite the large improvements provided by recent advances in the architectures of convolutional neural networks, segmentations provided by modern state-of-the-art methods still show limited boundary adherence. We introduce a fully unsupervised post-process… ▽ More Semantic segmentation with fine-grained pixel-level accuracy is a fundamental component of a variety of computer vision applications. However, despite the large improvements provided by recent advances in the architectures of convolutional neural networks, segmentations provided by modern state-of-the-art methods still show limited boundary adherence. We introduce a fully unsupervised post-processing algorithm that exploits Monte Carlo sampling and pixel similarities to propagate high-confidence pixel labels into regions of low-confidence classification. Our algorithm, which we call probabilistic Region Growing Refinement (pRGR), is based on a rigorous mathematical foundation in which clusters are modelled as multivariate normally distributed sets of pixels. Exploiting concepts of Bayesian estimation and variance reduction techniques, pRGR performs multiple refinement iterations at varied receptive fields sizes, while updating cluster statistics to adapt to local image features. Experiments using multiple modern semantic segmentation networks and benchmark datasets demonstrate the effectiveness of our approach for the refinement of segmentation predictions at different levels of coarseness, as well as the suitability of the variance estimates obtained in the Monte Carlo iterations as uncertainty measures that are highly correlated with segmentation accuracy. △ Less

Submitted 12 May, 2020; originally announced May 2020.

Comments: Submitted to IEEE Transactions on Image Processing (April 2020)

arXiv:2003.11102 [pdf, other]

Learning to Play Soccer by Reinforcement and Applying Sim-to-Real to Compete in the Real World

Authors: Hansenclever F. Bassani, Renie A. Delgado, Jose Nilton de O. Lima Junior, Heitor R. Medeiros, Pedro H. M. Braga, Alain Tapp

Abstract: This work presents an application of Reinforcement Learning (RL) for the complete control of real soccer robots of the IEEE Very Small Size Soccer (VSSS), a traditional league in the Latin American Robotics Competition (LARC). In the VSSS league, two teams of three small robots play against each other. We propose a simulated environment in which continuous or discrete control policies can be train… ▽ More This work presents an application of Reinforcement Learning (RL) for the complete control of real soccer robots of the IEEE Very Small Size Soccer (VSSS), a traditional league in the Latin American Robotics Competition (LARC). In the VSSS league, two teams of three small robots play against each other. We propose a simulated environment in which continuous or discrete control policies can be trained, and a Sim-to-Real method to allow using the obtained policies to control a robot in the real world. The results show that the learned policies display a broad repertoire of behaviors that are difficult to specify by hand. This approach, called VSSS-RL, was able to beat the human-designed policy for the striker of the team ranked 3rd place in the 2018 LARC, in 1-vs-1 matches. △ Less

Submitted 24 March, 2020; originally announced March 2020.

Journal ref: LatinX in AI Research Workshop at NeurIPS 2019

arXiv:1909.09225 [pdf, other]

Gaze Estimation for Assisted Living Environments

Authors: Philipe A. Dias, Damiano Malafronte, Henry Medeiros, Francesca Odone

Abstract: Effective assisted living environments must be able to perform inferences on how their occupants interact with one another as well as with surrounding objects. To accomplish this goal using a vision-based automated approach, multiple tasks such as pose estimation, object segmentation and gaze estimation must be addressed. Gaze direction in particular provides some of the strongest indications of h… ▽ More Effective assisted living environments must be able to perform inferences on how their occupants interact with one another as well as with surrounding objects. To accomplish this goal using a vision-based automated approach, multiple tasks such as pose estimation, object segmentation and gaze estimation must be addressed. Gaze direction in particular provides some of the strongest indications of how a person interacts with the environment. In this paper, we propose a simple neural network regressor that estimates the gaze direction of individuals in a multi-camera assisted living scenario, relying only on the relative positions of facial keypoints collected from a single pose estimation model. To handle cases of keypoint occlusion, our model exploits a novel confidence gated unit in its input layer. In addition to the gaze direction, our model also outputs an estimation of its own prediction uncertainty. Experimental results on a public benchmark demonstrate that our approach performs on pair with a complex, dataset-specific baseline, while its uncertainty predictions are highly correlated to the actual angular error of corresponding estimations. Finally, experiments on images from a real assisted living environment demonstrate the higher suitability of our model for its final application. △ Less

Submitted 19 September, 2019; originally announced September 2019.

Comments: Work to be published in its final version at WACV '20

arXiv:1906.09147 [pdf, ps, other]

Existence, regularity, asymptotic decay and radiality of solutions to some extension problems

Authors: Hamilton Bueno, Aldo H. S. Medeiros, G. A. Pereira

Abstract: Supposing only that $\displaystyle\lim_{t \to 0} \frac{f(t)}{t} = 0$ and $\displaystyle\lim_{t \to \infty} \frac{f(t)}{t^{p}} = 0$, for some $p \in \left(1,\frac{N+1}{N-1}\right)$, we prove that solutions to the extension problem \begin{equation*}\left\{ \begin{array}{rcll} -Δu+ m^2u &=& 0, &\mbox{in} \ \ \mathbb{R}^{N+1}_{+} \\ -\frac{\partial u}{\partial{x}} (0,y)& =& f(u(0,y)), & y \in \mathbb{… ▽ More Supposing only that $\displaystyle\lim_{t \to 0} \frac{f(t)}{t} = 0$ and $\displaystyle\lim_{t \to \infty} \frac{f(t)}{t^{p}} = 0$, for some $p \in \left(1,\frac{N+1}{N-1}\right)$, we prove that solutions to the extension problem \begin{equation*}\left\{ \begin{array}{rcll} -Δu+ m^2u &=& 0, &\mbox{in} \ \ \mathbb{R}^{N+1}_{+} \\ -\frac{\partial u}{\partial{x}} (0,y)& =& f(u(0,y)), & y \in \mathbb{R}^{N}, \end{array}\right. \end{equation*} and also to the extension Hartree problem \begin{equation*} \left\{\begin{aligned} -Δu +m^2u&=0, &&\mbox{in} \ \mathbb{R}^{N+1}_+,\\ -\displaystyle\frac{\partial u}{\partial x}(0,y)&=-V_\infty u(0,y)+\left(\frac{1}{|y|^{N-α}}*F(u(0,y))\right)f(u(0,y)) &&\mbox{in} \ \mathbb{R}^{N}\end{aligned}\right. \end{equation*} are radially symmetric in $\mathbb{R}^N$. In the last problem, $V_\infty>0$ is a constant and $F$ the primitive of $f$. Under the same hypotheses, regularity and exponential decay of solutions to the first problem is also proved and, supposing the traditional Ambrosetti-Rabinowitz condition, also existence of a ground state solution. △ Less

Submitted 20 June, 2019; originally announced June 2019.

Comments: 23 pages. arXiv admin note: text overlap with arXiv:1802.03963

MSC Class: 35J20; 35Q55; 35B65; 35R11

arXiv:1906.07785 [pdf, ps, other]

doi 10.3233/ASY-201632

On the behavior of least energy solutions of a fractional $(p,q(p))$-Laplacian problem as p goes to infinity

Authors: Grey Ercole, Aldo H. S. Medeiros, Gilberto A. Pereira

Abstract: We study the behavior as $p\rightarrow\infty$ of $u_{p},$ a positive least energy solution of the problem \[ \left\{\begin{array} [c]{lll} \left[ \left( -Δ_{p}\right) ^α+\left( -Δ_{q(p)}\right) ^β\right] u=μ_{p}\left\Vert u\right\Vert _{\infty}^{p-2} u(x_{u})δ_{x_{u}} & \mathrm{in} & Ω\\ u=0 & \mathrm{in} & \mathbb{R}^{N}\setminusΩ\\ \left\vert u(x_{u})\right\vert =\left\Vert u\right\Vert _{\infty… ▽ More We study the behavior as $p\rightarrow\infty$ of $u_{p},$ a positive least energy solution of the problem \[ \left\{\begin{array} [c]{lll} \left[ \left( -Δ_{p}\right) ^α+\left( -Δ_{q(p)}\right) ^β\right] u=μ_{p}\left\Vert u\right\Vert _{\infty}^{p-2} u(x_{u})δ_{x_{u}} & \mathrm{in} & Ω\\ u=0 & \mathrm{in} & \mathbb{R}^{N}\setminusΩ\\ \left\vert u(x_{u})\right\vert =\left\Vert u\right\Vert _{\infty}, & & \end{array} \right. \] where $Ω\subset\mathbb{R}^{N}$ is a bounded, smooth domain, $δ_{x_{u}}$ is the Dirac delta distribution supported at $x_{u},$ \[ \lim_{p\rightarrow\infty}\frac{q(p)}{p}=Q\in\left\{ \begin{array} [c]{lll} (0,1) & \mathrm{if} & 0<β<α<1\\ (1,\infty) & \mathrm{if} & 0<α<β<1 \end{array} \right. \] and \[ \lim_{p\rightarrow\infty}\sqrt[p]{μ_{p}}>R^{-α}, \] with $R$ denoting the inradius of $Ω.$ △ Less

Submitted 18 June, 2019; originally announced June 2019.

Comments: 24 pages

MSC Class: 35D40; 35J60; 35R11

arXiv:1903.06811 [pdf, other]

Multi-camera calibration with pattern rigs, including for non-overlap** cameras: CALICO

Authors: Amy Tabb, Henry Medeiros, Mitchell J. Feldmann, Thiago T. Santos

Abstract: This paper describes CALICO, a method for multi-camera calibration suitable for challenging contexts: stationary and mobile multi-camera systems, cameras without overlap** fields of view, and non-synchronized cameras. Recent approaches are roughly divided into infrastructure- and pattern-based. Infrastructure-based approaches use the scene's features to calibrate, while pattern-based approaches… ▽ More This paper describes CALICO, a method for multi-camera calibration suitable for challenging contexts: stationary and mobile multi-camera systems, cameras without overlap** fields of view, and non-synchronized cameras. Recent approaches are roughly divided into infrastructure- and pattern-based. Infrastructure-based approaches use the scene's features to calibrate, while pattern-based approaches use calibration patterns. Infrastructure-based approaches are not suitable for stationary camera systems, and pattern-based approaches may constrain camera placement because shared fields of view or extremely large patterns are required. CALICO is a pattern-based approach, where the multi-calibration problem is formulated using rigidity constraints between patterns and cameras. We use a {\it pattern rig}: several patterns rigidly attached to each other or some structure. We express the calibration problem as that of algebraic and reprojection error minimization problems. Simulated and real experiments demonstrate the method in a variety of settings. CALICO compared favorably to Kalibr. Mean reconstruction accuracy error was $\le 0.71$ mm for real camera rigs, and $\le 1.11$ for simulated camera rigs. Code and data releases are available at \cite{tabb_amy_2019_3520866} and \url{https://github.com/amy-tabb/calico}. △ Less

Submitted 27 March, 2024; v1 submitted 15 March, 2019; originally announced March 2019.

Comments: 11 pages

arXiv:1903.00815 [pdf, other]

doi 10.1109/ICRA.2019.8794116

Detecting Invasive Insects with Unmanned Aerial Vehicles

Authors: Brian Stumph, Miguel Hernandez Virto, Henry Medeiros, Amy Tabb, Scott Wolford, Kevin Rice, Tracy Leskey

Abstract: A key aspect to controlling and reducing the effects invasive insect species have on agriculture is to obtain knowledge about the migration patterns of these species. Current state-of-the-art methods of studying these migration patterns involve a mark-release-recapture technique, in which insects are released after being marked and researchers attempt to recapture them later. However, this approac… ▽ More A key aspect to controlling and reducing the effects invasive insect species have on agriculture is to obtain knowledge about the migration patterns of these species. Current state-of-the-art methods of studying these migration patterns involve a mark-release-recapture technique, in which insects are released after being marked and researchers attempt to recapture them later. However, this approach involves a human researcher manually searching for these insects in large fields and results in very low recapture rates. In this paper, we propose an automated system for detecting released insects using an unmanned aerial vehicle. This system utilizes ultraviolet lighting technology, digital cameras, and lightweight computer vision algorithms to more quickly and accurately detect insects compared to the current state of the art. The efficiency and accuracy that this system provides will allow for a more comprehensive understanding of invasive insect species migration patterns. Our experimental results demonstrate that our system can detect real target insects in field conditions with high precision and recall rates. △ Less

Submitted 15 August, 2019; v1 submitted 2 March, 2019; originally announced March 2019.

Comments: IEEE ICRA 2019. 7 pages

arXiv:1902.06806 [pdf, other]

doi 10.1109/WACV.2019.00010

FreeLabel: A Publicly Available Annotation Tool based on Freehand Traces

Authors: Philipe A. Dias, Zhou Shen, Amy Tabb, Henry Medeiros

Abstract: Large-scale annotation of image segmentation datasets is often prohibitively expensive, as it usually requires a huge number of worker hours to obtain high-quality results. Abundant and reliable data has been, however, crucial for the advances on image understanding tasks achieved by deep learning models. In this paper, we introduce FreeLabel, an intuitive open-source web interface that allows use… ▽ More Large-scale annotation of image segmentation datasets is often prohibitively expensive, as it usually requires a huge number of worker hours to obtain high-quality results. Abundant and reliable data has been, however, crucial for the advances on image understanding tasks achieved by deep learning models. In this paper, we introduce FreeLabel, an intuitive open-source web interface that allows users to obtain high-quality segmentation masks with just a few freehand scribbles, in a matter of seconds. The efficacy of FreeLabel is quantitatively demonstrated by experimental results on the PASCAL dataset as well as on a dataset from the agricultural domain. Designed to benefit the computer vision community, FreeLabel can be used for both crowdsourced or private annotation and has a modular structure that can be easily adapted for any image dataset. △ Less

Submitted 11 March, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

Comments: Accepted and presented at 2019 IEEE Winter Conference on Applications of Computer Vision (WACV). 10 pages

Journal ref: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV)

arXiv:1812.01466 [pdf, ps, other]

Local behaviour and existence of solutions of the fractional (p,q)-Laplacian

Authors: Emerson Abreu, A. H. Souza Medeiros

Abstract: In this paper, we consider the regularity of weak solutions (in an appropriate space) to the elliptic partial differential equation \begin{equation*} (-Δ_{p})^{s} u + (-Δ_{q})^{s} u = f(x) \quad \text{in} \quad \mathbb{R}^{N}, \end{equation*} where $0<s<1$ and $ 2 \leq q \leq p < N/s$. We prove that these solutions are locally in $C^{0,α}(\mathbb{R}^N)$, which seems to be optimal. Furthermore, we… ▽ More In this paper, we consider the regularity of weak solutions (in an appropriate space) to the elliptic partial differential equation \begin{equation*} (-Δ_{p})^{s} u + (-Δ_{q})^{s} u = f(x) \quad \text{in} \quad \mathbb{R}^{N}, \end{equation*} where $0<s<1$ and $ 2 \leq q \leq p < N/s$. We prove that these solutions are locally in $C^{0,α}(\mathbb{R}^N)$, which seems to be optimal. Furthermore, we prove the existence of solutions to the problem \begin{equation*} (-Δ_{p})^{s} u + (-Δ_{q})^{s} u = \vert u \vert^{p^{*}_{s}-2}u + λg(x) \vert u \vert^{r-2}u \,\,\, \text{in} \,\,\,\, \mathbb{R}^{N}, \end{equation*} where $1 < q\leq p < N/s$, $λ$ is a parameter and $g$ satisfies some conditions of integrability. We also show that, if $g$ is bounded, then the solutions are continuous and bounded. △ Less

Submitted 3 December, 2018; originally announced December 2018.

Comments: arXiv admin note: text overlap with arXiv:1411.2956 by other authors

MSC Class: 35D10; 35R11; 35J20

arXiv:1810.07597 [pdf, ps, other]

Pohozaev identities for a pseudo-relativistic Schrödinger operator and applications

Authors: H. Bueno, G. A Pereira, A. H. Souza Medeiros

Abstract: In this paper we prove a Pohozaev-type identity for both the problem $(-Δ+m^2)^su=f(u)$ in $\mathbb{R}^N$ and its harmonic extension to $\mathbb{R}^{N+1}_+$ when $0<s<1$. So, our setting includes the pseudo-relativistic operator $\sqrt{-Δ+m^2}$ and the results showed here are original, to the best of our knowledge. The identity is first obtained in the extension setting and then "translated" into… ▽ More In this paper we prove a Pohozaev-type identity for both the problem $(-Δ+m^2)^su=f(u)$ in $\mathbb{R}^N$ and its harmonic extension to $\mathbb{R}^{N+1}_+$ when $0<s<1$. So, our setting includes the pseudo-relativistic operator $\sqrt{-Δ+m^2}$ and the results showed here are original, to the best of our knowledge. The identity is first obtained in the extension setting and then "translated" into the original problem. In order to do that, we develop a specific Fourier transform theory for the fractionary operator $(-Δ+m^2)^s$, which lead us to define a weak solution $u$ of the original problem if the identity \begin{equation}\label{defsola}\int_{\mathbb{R}^N}(-Δ+m^2)^{s/2}u(-Δ+m^2)^{s/2}v\dd x=\int_{ \mathbb{R}^N}f(u)v\dd x\tag{S}\end{equation} is satisfied by all $v\in H^{s}(\mathbb{R}^N)$. The obtained Pohozaev-type identity is then applied to prove both a result of nonexistence of solution to the case $f(u)=|u|^{p-2}u$ if $p\geq 2^{*}_s$ and a result of existence of a ground state, if $f$ is modeled by $κu^3/(1+u^2)$, for a constant $κ$. In this last case, we apply the Nehari-Pohozaev manifold introduced by D. Ruiz. Finally, we prove that positive solutions of $(-Δ+m^2)^su=f(u)$ are radially symmetric and decreasing with respect to the origin, if $f$ is modeled by functions like $t^α$, $α\in(1,2^{*}_s-1)$ or $t\ln t$. △ Less

Submitted 5 April, 2019; v1 submitted 17 October, 2018; originally announced October 2018.

Comments: 35 pages

MSC Class: 35J20; 35Q55; 35R11; 35B38

arXiv:1809.10080 [pdf, other]

doi 10.1109/LRA.2018.2849498

Multispecies fruit flower detection using a refined semantic segmentation network

Authors: Philipe A. Dias, Amy Tabb, Henry Medeiros

Abstract: In fruit production, critical crop management decisions are guided by bloom intensity, i.e., the number of flowers present in an orchard. Despite its importance, bloom intensity is still typically estimated by means of human visual inspection. Existing automated computer vision systems for flower identification are based on hand-engineered techniques that work only under specific conditions and wi… ▽ More In fruit production, critical crop management decisions are guided by bloom intensity, i.e., the number of flowers present in an orchard. Despite its importance, bloom intensity is still typically estimated by means of human visual inspection. Existing automated computer vision systems for flower identification are based on hand-engineered techniques that work only under specific conditions and with limited performance. This work proposes an automated technique for flower identification that is robust to uncontrolled environments and applicable to different flower species. Our method relies on an end-to-end residual convolutional neural network (CNN) that represents the state-of-the-art in semantic segmentation. To enhance its sensitivity to flowers, we fine-tune this network using a single dataset of apple flower images. Since CNNs tend to produce coarse segmentations, we employ a refinement method to better distinguish between individual flower instances. Without any pre-processing or dataset-specific training, experimental results on images of apple, peach and pear flowers, acquired under different conditions demonstrate the robustness and broad applicability of our method. △ Less

Submitted 19 September, 2018; originally announced September 2018.

Comments: 8 pages

Journal ref: IEEE Robotics and Automation Letters, vol. 3, no. 4, pp. 3003-3010, Oct. 2018

arXiv:1809.06357 [pdf, other]

doi 10.1016/j.compind.2018.03.010

Apple Flower Detection using Deep Convolutional Networks

Authors: Philipe A. Dias, Amy Tabb, Henry Medeiros

Abstract: To optimize fruit production, a portion of the flowers and fruitlets of apple trees must be removed early in the growing season. The proportion to be removed is determined by the bloom intensity, i.e., the number of flowers present in the orchard. Several automated computer vision systems have been proposed to estimate bloom intensity, but their overall performance is still far from satisfactory e… ▽ More To optimize fruit production, a portion of the flowers and fruitlets of apple trees must be removed early in the growing season. The proportion to be removed is determined by the bloom intensity, i.e., the number of flowers present in the orchard. Several automated computer vision systems have been proposed to estimate bloom intensity, but their overall performance is still far from satisfactory even in relatively controlled environments. With the goal of devising a technique for flower identification which is robust to clutter and to changes in illumination, this paper presents a method in which a pre-trained convolutional neural network is fine-tuned to become specially sensitive to flowers. Experimental results on a challenging dataset demonstrate that our method significantly outperforms three approaches that represent the state of the art in flower detection, with recall and precision rates higher than $90\%$. Moreover, a performance assessment on three additional datasets previously unseen by the network, which consist of different flower species and were acquired under different conditions, reveals that the proposed method highly surpasses baseline approaches in terms of generalization capability. △ Less

Submitted 17 September, 2018; originally announced September 2018.

Comments: 14 pages

Journal ref: Computers in Industry, vol. 99, pp. 17-28, Aug. 2018

arXiv:1806.04065 [pdf, other]

doi 10.1093/mnras/sty1565

Basaltic material in the main belt: a tale of two (or more) parent bodies?

Authors: S. Ieva, E. Dotto, D. Lazzaro, D. Fulvio, D. Perna, E. Mazzotta Epifani, H. Medeiros, M. Fulchignoni

Abstract: The majority of basaltic objects in the main belt are dynamically connected to Vesta, the largest differentiated asteroid known. Others, due to their current orbital parameters, cannot be easily dynamically linked to Vesta. This is particularly true for all the basaltic asteroids located beyond 2.5 au, where lies the 3:1 mean motion resonance with Jupiter. In order to investigate the presence of o… ▽ More The majority of basaltic objects in the main belt are dynamically connected to Vesta, the largest differentiated asteroid known. Others, due to their current orbital parameters, cannot be easily dynamically linked to Vesta. This is particularly true for all the basaltic asteroids located beyond 2.5 au, where lies the 3:1 mean motion resonance with Jupiter. In order to investigate the presence of other V-type asteroids in the middle and outer main belt (MOVs) we started an observational campaign to spectroscopically characterize in the visible range MOV candidates. We observed 18 basaltic candidates from TNG and ESO - NTT between 2015 and 2016. We derived spectral parameters using the same approach adopted in our recent statistical analysis and we compared our data with orbital parameters to look for possible clusters of MOVs in the main belt, symptomatic for a new basaltic family. Our analysis seemed to point out that MOVs show different spectral parameters respect to other basaltic bodies in the main belt, which could account for a diverse mineralogy than Vesta; moreover, some of them belong to the Eos family, suggesting the possibility of another basaltic progenitor. This could have strong repercussions on the temperature gradient present in the early Solar System, and on our current understanding of differentiation processes. △ Less

Submitted 11 June, 2018; originally announced June 2018.

Comments: 15 pages, 4 figures, accepted for pubblication to MNRAS

arXiv:1802.07789 [pdf, other]

Semantic Segmentation Refinement by Monte Carlo Region Growing of High Confidence Detections

Authors: Philipe A. Dias, Henry Medeiros

Abstract: Despite recent improvements using fully convolutional networks, in general, the segmentation produced by most state-of-the-art semantic segmentation methods does not show satisfactory adherence to the object boundaries. We propose a method to refine the segmentation results generated by such deep learning models. Our method takes as input the confidence scores generated by a pixel-dense segmentati… ▽ More Despite recent improvements using fully convolutional networks, in general, the segmentation produced by most state-of-the-art semantic segmentation methods does not show satisfactory adherence to the object boundaries. We propose a method to refine the segmentation results generated by such deep learning models. Our method takes as input the confidence scores generated by a pixel-dense segmentation network and re-labels pixels with low confidence levels. The re-labeling approach employs a region growing mechanism that aggregates these pixels to neighboring areas with high confidence scores and similar appearance. In order to correct the labels of pixels that were incorrectly classified with high confidence level by the semantic segmentation algorithm, we generate multiple region growing steps through a Monte Carlo sampling of the seeds of the regions. Our method improves the accuracy of a state-of-the-art fully convolutional semantic segmentation approach on the publicly available COCO and PASCAL datasets, and it shows significantly better results on selected sequences of the finely-annotated DAVIS dataset. △ Less

Submitted 21 February, 2018; originally announced February 2018.

arXiv:1707.05368 [pdf, other]

doi 10.1109/IROS.2017.8206497

A robotic vision system to measure tree traits

Authors: Amy Tabb, Henry Medeiros

Abstract: The autonomous measurement of tree traits, such as branching structure, branch diameters, branch lengths, and branch angles, is required for tasks such as robotic pruning of trees as well as structural phenoty**. We propose a robotic vision system called the Robotic System for Tree Shape Estimation (RoTSE) to determine tree traits in field settings. The process is composed of the following stage… ▽ More The autonomous measurement of tree traits, such as branching structure, branch diameters, branch lengths, and branch angles, is required for tasks such as robotic pruning of trees as well as structural phenoty**. We propose a robotic vision system called the Robotic System for Tree Shape Estimation (RoTSE) to determine tree traits in field settings. The process is composed of the following stages: image acquisition with a mobile robot unit, segmentation, reconstruction, curve skeletonization, conversion to a graph representation, and then computation of traits. Quantitative and qualitative results on apple trees are shown in terms of accuracy, computation time, and robustness. Compared to ground truth measurements, the RoTSE produced the following estimates: branch diameter (root mean-squared error $2.97$ mm), branch length (root mean-squared error $136.92$ mm), and branch angle (mean-squared error $31.07$ degrees). The average run time was $8.47$ minutes when the voxel resolution was $3$ mm$^3$. △ Less

Submitted 1 November, 2021; v1 submitted 17 July, 2017; originally announced July 2017.

Comments: 9 pages, IEEE/RSJ IROS 2017 conference paper, added Erratum 11/1/2021

Journal ref: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

arXiv:1704.06718 [pdf, other]

Hierarchical Bayesian Data Fusion for Robotic Platform Navigation

Authors: Andres F. Echeverri, Henry Medeiros, Ryan Walsh, Yevgeniy Reznichenko, Richard Povinelli

Abstract: Data fusion has become an active research topic in recent years. Growing computational performance has allowed the use of redundant sensors to measure a single phenomenon. While Bayesian fusion approaches are common in general applications, the computer vision field has largely relegated this approach. Most object following algorithms have gone towards pure machine learning fusion techniques that… ▽ More Data fusion has become an active research topic in recent years. Growing computational performance has allowed the use of redundant sensors to measure a single phenomenon. While Bayesian fusion approaches are common in general applications, the computer vision field has largely relegated this approach. Most object following algorithms have gone towards pure machine learning fusion techniques that tend to lack flexibility. Consequently, a more general data fusion scheme is needed. Within this work, a hierarchical Bayesian fusion approach is proposed, which outperforms individual trackers by using redundant measurements. The adaptive framework is achieved by relying on each measurement's local statistics and a global softened majority voting. The proposed approach was validated in a simulated application and two robotic platforms. △ Less

Submitted 21 April, 2017; originally announced April 2017.

Comments: 8 pages, 9 figures

arXiv:1702.07619 [pdf, other]

doi 10.1109/WACV.2018.00214

Fast and robust curve skeletonization for real-world elongated objects

Authors: Amy Tabb, Henry Medeiros

Abstract: We consider the problem of extracting curve skeletons of three-dimensional, elongated objects given a noisy surface, which has applications in agricultural contexts such as extracting the branching structure of plants. We describe an efficient and robust method based on breadth-first search that can determine curve skeletons in these contexts. Our approach is capable of automatically detecting jun… ▽ More We consider the problem of extracting curve skeletons of three-dimensional, elongated objects given a noisy surface, which has applications in agricultural contexts such as extracting the branching structure of plants. We describe an efficient and robust method based on breadth-first search that can determine curve skeletons in these contexts. Our approach is capable of automatically detecting junction points as well as spurious segments and loops. All of that is accomplished with only one user-adjustable parameter. The run time of our method ranges from hundreds of milliseconds to less than four seconds on large, challenging datasets, which makes it appropriate for situations where real-time decision making is needed. Experiments on synthetic models as well as on data from real world objects, some of which were collected in challenging field conditions, show that our approach compares favorably to classical thinning algorithms as well as to recent contributions to the field. △ Less

Submitted 19 March, 2018; v1 submitted 24 February, 2017; originally announced February 2017.

Comments: 47 pages; IEEE WACV 2018, main paper and supplementary material

arXiv:1702.07611 [pdf, other]

doi 10.1016/j.compind.2018.03.002

Automatic segmentation of trees in dynamic outdoor environments

Authors: Amy Tabb, Henry Medeiros

Abstract: Segmentation in dynamic outdoor environments can be difficult when the illumination levels and other aspects of the scene cannot be controlled. Specifically in orchard and vineyard automation contexts, a background material is often used to shield a camera's field of view from other rows of crops. In this paper, we describe a method that uses superpixels to determine low texture regions of the ima… ▽ More Segmentation in dynamic outdoor environments can be difficult when the illumination levels and other aspects of the scene cannot be controlled. Specifically in orchard and vineyard automation contexts, a background material is often used to shield a camera's field of view from other rows of crops. In this paper, we describe a method that uses superpixels to determine low texture regions of the image that correspond to the background material, and then show how this information can be integrated with the color distribution of the image to compute optimal segmentation parameters to segment objects of interest. Quantitative and qualitative experiments demonstrate the suitability of this approach for dynamic outdoor environments, specifically for tree reconstruction and apple flower detection applications. △ Less

Submitted 3 April, 2018; v1 submitted 24 February, 2017; originally announced February 2017.

Comments: 14 pages

Journal ref: Computers in Industry 98, 90-99. 2018

Showing 1–42 of 42 results for author: Medeiros, H