-
Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation
Authors:
Tao Lian,
Jose L. Gómez,
Antonio M. López
Abstract:
The last mile of unsupervised domain adaptation (UDA) for semantic segmentation is the challenge of solving the syn-to-real domain gap. Recent UDA methods have progressed significantly, yet they often rely on strategies customized for synthetic single-source datasets (e.g., GTA5), which limits their generalisation to multi-source datasets. Conversely, synthetic multi-source datasets hold promise f…
▽ More
The last mile of unsupervised domain adaptation (UDA) for semantic segmentation is the challenge of solving the syn-to-real domain gap. Recent UDA methods have progressed significantly, yet they often rely on strategies customized for synthetic single-source datasets (e.g., GTA5), which limits their generalisation to multi-source datasets. Conversely, synthetic multi-source datasets hold promise for advancing the last mile of UDA but remain underutilized in current research. Thus, we propose DEC, a flexible UDA framework for multi-source datasets. Following a divide-and-conquer strategy, DEC simplifies the task by categorizing semantic classes, training models for each category, and fusing their outputs by an ensemble model trained exclusively on synthetic datasets to obtain the final segmentation mask. DEC can integrate with existing UDA methods, achieving state-of-the-art performance on Cityscapes, BDD100K, and Mapillary Vistas, significantly narrowing the syn-to-real domain gap.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations
Authors:
Daniel Coelho,
Miguel Oliveira,
Vitor Santos,
Antonio M. Lopez
Abstract:
The development of Autonomous Driving (AD) systems in simulated environments like CARLA is crucial for advancing real-world automotive technologies. To drive innovation, CARLA introduced Leaderboard 2.0, significantly more challenging than its predecessor. However, current AD methods have struggled to achieve satisfactory outcomes due to a lack of sufficient ground truth data. Human driving logs p…
▽ More
The development of Autonomous Driving (AD) systems in simulated environments like CARLA is crucial for advancing real-world automotive technologies. To drive innovation, CARLA introduced Leaderboard 2.0, significantly more challenging than its predecessor. However, current AD methods have struggled to achieve satisfactory outcomes due to a lack of sufficient ground truth data. Human driving logs provided by CARLA are insufficient, and previously successful expert agents like Autopilot and Roach, used for collecting datasets, have seen reduced effectiveness under these more demanding conditions. To overcome these data limitations, we introduce PRIBOOT, an expert agent that leverages limited human logs with privileged information. We have developed a novel BEV representation specifically tailored to meet the demands of this new benchmark and processed it as an RGB image to facilitate the application of transfer learning techniques, instead of using a set of masks. Additionally, we propose the Infraction Rate Score (IRS), a new evaluation metric designed to provide a more balanced assessment of driving performance over extended routes. PRIBOOT is the first model to achieve a Route Completion (RC) of 75% in Leaderboard 2.0, along with a Driving Score (DS) and IRS of 20% and 45%, respectively. With PRIBOOT, researchers can now generate extensive datasets, potentially solving the data availability issues that have hindered progress in this benchmark.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation
Authors:
Yufan Zhu,
Chongzhi Ran,
Mingtao Feng,
Fangfang Wu,
Le Dong,
Weisheng Dong,
Antonio M. López,
Guangming Shi
Abstract:
Virtual engines can generate dense depth maps for various synthetic scenes, making them invaluable for training depth estimation models. However, discrepancies between synthetic and real-world colors pose significant challenges for depth estimation in real-world scenes, especially in complex and uncertain environments encountered in unsupervised monocular depth estimation tasks. To address this is…
▽ More
Virtual engines can generate dense depth maps for various synthetic scenes, making them invaluable for training depth estimation models. However, discrepancies between synthetic and real-world colors pose significant challenges for depth estimation in real-world scenes, especially in complex and uncertain environments encountered in unsupervised monocular depth estimation tasks. To address this issue, we propose Back2Color, a framework that predicts realistic colors from depth using a model trained on real-world data, thus transforming synthetic colors into their real-world counterparts. Additionally, we introduce the Syn-Real CutMix method for joint training with both real-world unsupervised and synthetic supervised depth samples, enhancing monocular depth estimation performance in real-world scenes. Furthermore, to mitigate the impact of non-rigid motions on depth estimation, we present an auto-learning uncertainty temporal-spatial fusion method (Auto-UTSF), which leverages the strengths of unsupervised learning in both temporal and spatial dimensions. We also designed VADepth, based on the Vision Attention Network, which offers lower computational complexity and higher accuracy than transformers. Our Back2Color framework achieves state-of-the-art performance on the Kitti dataset, as evidenced by improvements in performance metrics and the production of fine-grained details. This is particularly evident on more challenging datasets such as Cityscapes for unsupervised depth estimation.
△ Less
Submitted 3 July, 2024; v1 submitted 11 June, 2024;
originally announced June 2024.
-
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Authors:
Yachan Guo,
Yi Xiao,
Danna Xue,
Jose Luis Gomez Zurita,
Antonio M. López
Abstract:
Unsupervised Domain Adaptation (UDA) aims to transfer knowledge learned from a labeled source domain to an unlabeled target domain. While UDA methods for synthetic to real-world domains (synth-to-real) show remarkable performance in tasks such as semantic segmentation and object detection, very few were proposed for instance segmentation in the field of vision-based autonomous driving, and the exi…
▽ More
Unsupervised Domain Adaptation (UDA) aims to transfer knowledge learned from a labeled source domain to an unlabeled target domain. While UDA methods for synthetic to real-world domains (synth-to-real) show remarkable performance in tasks such as semantic segmentation and object detection, very few were proposed for instance segmentation in the field of vision-based autonomous driving, and the existing ones are based on a suboptimal baseline, which severely limits the performance. In this paper, we introduce UDA4Inst, a strong baseline of synth-to-real UDA for instance segmentation. UDA4Inst adopts cross-domain bidirectional data mixing at the instance level to effectively utilize data from both source and target domains. Rare-class balancing and category module training are also employed to further improve the performance. It is worth noting that we are the first to demonstrate results on two new synth-to-real instance segmentation benchmarks, with 39.0 mAP on UrbanSyn->Cityscapes and 35.7 mAP on Synscapes->Cityscapes. Our method outperforms the source-only Mask2Former model by +7 mAP and +7.6 mAP, respectively. On SYNTHIA->Cityscapes, our method improves the source-only Mask2Former by +6.7 mAP, achieving state-of-the-art results.Our code will be released soon.
△ Less
Submitted 5 July, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Exact solution of long-range stabilizer Rényi entropy in the dual-unitary XXZ model
Authors:
Jordi Arnau Montañà López,
Pavel Kos
Abstract:
Quantum systems can not be efficiently simulated classically due to the presence of entanglement and nonstabilizerness, also known as quantum magic. Here we study the generation of magic under evolution by a quantum circuit. To be able to provide exact solutions, we focus on the dual-unitary XXZ model and a measure of magic called stabilizer Rényi entropy (SRE). Moreover, we focus also on long-ran…
▽ More
Quantum systems can not be efficiently simulated classically due to the presence of entanglement and nonstabilizerness, also known as quantum magic. Here we study the generation of magic under evolution by a quantum circuit. To be able to provide exact solutions, we focus on the dual-unitary XXZ model and a measure of magic called stabilizer Rényi entropy (SRE). Moreover, we focus also on long-range SRE, which cannot be removed by short-depth quantum circuits. To obtain exact solutions we use a ZX-calculus representation and graphical rules for the evaluation of the required expressions. We obtain exact results for SRE after short-time evolution in the thermodynamic limit and for long-range SRE for all times and all Rényi parameters for a particular partition of the state. Since the numerical evaluation of these quantities is exponentially costly in the Rényi parameter, we verify this numerically for low Rényi parameters and accessible system sizes and provide numerical results for the long-range SRE in other bipartitions.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Guiding Attention in End-to-End Driving Models
Authors:
Diego Porres,
Yi Xiao,
Gabriel Villalonga,
Alexandre Levy,
Antonio M. López
Abstract:
Vision-based end-to-end driving models trained by imitation learning can lead to affordable solutions for autonomous driving. However, training these well-performing models usually requires a huge amount of data, while still lacking explicit and intuitive activation maps to reveal the inner workings of these models while driving. In this paper, we study how to guide the attention of these models t…
▽ More
Vision-based end-to-end driving models trained by imitation learning can lead to affordable solutions for autonomous driving. However, training these well-performing models usually requires a huge amount of data, while still lacking explicit and intuitive activation maps to reveal the inner workings of these models while driving. In this paper, we study how to guide the attention of these models to improve their driving quality and obtain more intuitive activation maps by adding a loss term during training using salient semantic maps. In contrast to previous work, our method does not require these salient semantic maps to be available during testing time, as well as removing the need to modify the model's architecture to which it is applied. We perform tests using perfect and noisy salient semantic maps with encouraging results in both, the latter of which is inspired by possible errors encountered with real data. Using CIL++ as a representative state-of-the-art model and the CARLA simulator with its standard benchmarks, we conduct experiments that show the effectiveness of our method in training better autonomous driving models, especially when data and computational resources are scarce.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
A Big Ring on the Sky
Authors:
Alexia M. Lopez,
Roger G. Clowes,
Gerard M. Williger
Abstract:
We present the discovery of `A Big Ring on the Sky' (BR), the second ultra-large-scale structure (uLSS) found in MgII-absorber catalogues, following the previously reported Giant Arc (GA). In cosmological terms the BR is close to the GA - at the same redshift $z \sim 0.8$ and with a separation on the sky of only $\sim 12^\circ$. Two extraordinary uLSSs in such close configuration raises the possib…
▽ More
We present the discovery of `A Big Ring on the Sky' (BR), the second ultra-large-scale structure (uLSS) found in MgII-absorber catalogues, following the previously reported Giant Arc (GA). In cosmological terms the BR is close to the GA - at the same redshift $z \sim 0.8$ and with a separation on the sky of only $\sim 12^\circ$. Two extraordinary uLSSs in such close configuration raises the possibility that together they form an even more extraordinary cosmological system. The BR is a striking circular, annulus-like, structure of diameter $\sim 400$ Mpc (proper size, present epoch). The method of discovery is as described in the GA paper, but here using the new MgII-absorber catalogues restricted to DR16Q quasars. Using the Convex Hull of Member Spheres (CHMS) algorithm, we estimate that the annulus and inner absorbers of the BR have departures from random expectations, at the density of the control field, of up to $5.2σ$. We present the discovery of the BR, assess its significance using the CHMS, Minimal Spanning Tree (MST), FilFinder and Cuzick & Edwards (CE) methods, show it in the context of the GA+BR system, and suggest some implications for the origins of uLSS and for our understanding of cosmology. For example, it may be that unusual geometric patterns, such as these uLSSs, have an origin in cosmic strings.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Critical mobility in policy making for epidemic containment
Authors:
Jesús A. Moreno López,
Sandro Meloni,
Jose J. Ramasco
Abstract:
When considering airborne epidemic spreading in social systems, a natural connection arises between mobility and epidemic contacts. As individuals travel, possibilities to encounter new people either at the final destination or during the transportation process appear. Such contacts can lead to new contagion events. In fact, mobility has been a crucial target for early non-pharmaceutical containme…
▽ More
When considering airborne epidemic spreading in social systems, a natural connection arises between mobility and epidemic contacts. As individuals travel, possibilities to encounter new people either at the final destination or during the transportation process appear. Such contacts can lead to new contagion events. In fact, mobility has been a crucial target for early non-pharmaceutical containment measures against the recent COVID-19 pandemic, with a degree of intensity ranging from public transportation line closures to regional, city or even home confinements. Nonetheless, quantitative knowledge on the relationship between mobility-contagions and, consequently, on the efficiency of containment measures remains elusive. Here we introduce an agent-based model with a simple interaction between mobility and contacts. Despite its simplicity our model shows the emergence of a critical mobility level, inducing major outbreaks when surpassed. We explore the interplay between mobility restrictions and the infection in recent intervention policies seen across many countries, and how interventions in the form of closures triggered by incidence rates can guide the epidemic into an oscillatory regime with recurrent waves. We consider how the different interventions impact societal well-being, the economy and the population. Finally, we propose a mitigation framework based on the critical nature of mobility in an epidemic, able to suppress incidence and oscillations at will, preventing extreme incidence peaks with potential to saturate health care resources.
△ Less
Submitted 9 May, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction
Authors:
Muhammad Naveed Riaz,
Maciej Wielgosz,
Abel Garcia Romera,
Antonio M. Lopez
Abstract:
Pedestrian intention prediction is crucial for autonomous driving. In particular, knowing if pedestrians are going to cross in front of the ego-vehicle is core to performing safe and comfortable maneuvers. Creating accurate and fast models that predict such intentions from sequential images is challenging. A factor contributing to this is the lack of datasets with diverse crossing and non-crossing…
▽ More
Pedestrian intention prediction is crucial for autonomous driving. In particular, knowing if pedestrians are going to cross in front of the ego-vehicle is core to performing safe and comfortable maneuvers. Creating accurate and fast models that predict such intentions from sequential images is challenging. A factor contributing to this is the lack of datasets with diverse crossing and non-crossing (C/NC) scenarios. We address this scarceness by introducing a framework, named ARCANE, which allows programmatically generating synthetic datasets consisting of C/NC video clip samples. As an example, we use ARCANE to generate a large and diverse dataset named PedSynth. We will show how PedSynth complements widely used real-world datasets such as JAAD and PIE, so enabling more accurate models for C/NC prediction. Considering the onboard deployment of C/NC prediction models, we also propose a deep model named PedGNN, which is fast and has a very low memory footprint. PedGNN is based on a GNN-GRU architecture that takes a sequence of pedestrian skeletons as input to predict crossing intentions.
△ Less
Submitted 15 June, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes
Authors:
Jose L. Gómez,
Manuel Silva,
Antonio Seoane,
Agnès Borrás,
Mario Noriega,
Germán Ros,
Jose A. Iglesias-Guitian,
Antonio M. López
Abstract:
We introduce UrbanSyn, a photorealistic dataset acquired through semi-procedurally generated synthetic urban driving scenarios. Developed using high-quality geometry and materials, UrbanSyn provides pixel-level ground truth, including depth, semantic segmentation, and instance segmentation with object bounding boxes and occlusion degree. It complements GTAV and Synscapes datasets to form what we c…
▽ More
We introduce UrbanSyn, a photorealistic dataset acquired through semi-procedurally generated synthetic urban driving scenarios. Developed using high-quality geometry and materials, UrbanSyn provides pixel-level ground truth, including depth, semantic segmentation, and instance segmentation with object bounding boxes and occlusion degree. It complements GTAV and Synscapes datasets to form what we coin as the 'Three Musketeers'. We demonstrate the value of the Three Musketeers in unsupervised domain adaptation for image semantic segmentation. Results on real-world datasets, Cityscapes, Mapillary Vistas, and BDD100K, establish new benchmarks, largely attributed to UrbanSyn. We make UrbanSyn openly and freely accessible (www.urbansyn.org).
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Discriminatory or Samaritan -- which AI is needed for humanity? An Evolutionary Game Theory Analysis of Hybrid Human-AI populations
Authors:
Tim Booker,
Manuel Miranda,
Jesús A. Moreno López,
José María Ramos Fernández,
Max Reddel,
Valeria Widler,
Filippo Zimmaro,
Alberto Antonioni,
The Anh Han
Abstract:
As artificial intelligence (AI) systems are increasingly embedded in our lives, their presence leads to interactions that shape our behaviour, decision-making, and social interactions. Existing theoretical research has primarily focused on human-to-human interactions, overlooking the unique dynamics triggered by the presence of AI. In this paper, resorting to methods from evolutionary game theory,…
▽ More
As artificial intelligence (AI) systems are increasingly embedded in our lives, their presence leads to interactions that shape our behaviour, decision-making, and social interactions. Existing theoretical research has primarily focused on human-to-human interactions, overlooking the unique dynamics triggered by the presence of AI. In this paper, resorting to methods from evolutionary game theory, we study how different forms of AI influence the evolution of cooperation in a human population playing the one-shot Prisoner's Dilemma game in both well-mixed and structured populations. We found that Samaritan AI agents that help everyone unconditionally, including defectors, can promote higher levels of cooperation in humans than Discriminatory AI that only help those considered worthy/cooperative, especially in slow-moving societies where change is viewed with caution or resistance (small intensities of selection). Intuitively, in fast-moving societies (high intensities of selection), Discriminatory AIs promote higher levels of cooperation than Samaritan AIs.
△ Less
Submitted 3 July, 2023; v1 submitted 30 June, 2023;
originally announced June 2023.
-
CARLA-BSP: a simulated dataset with pedestrians
Authors:
Maciej Wielgosz,
Antonio M. López,
Muhammad Naveed Riaz
Abstract:
We present a sample dataset featuring pedestrians generated using the ARCANE framework, a new framework for generating datasets in CARLA (0.9.13). We provide use cases for pedestrian detection, autoencoding, pose estimation, and pose lifting. We also showcase baseline results. For more information, visit https://project-arcane.eu/.
We present a sample dataset featuring pedestrians generated using the ARCANE framework, a new framework for generating datasets in CARLA (0.9.13). We provide use cases for pedestrian detection, autoencoding, pose estimation, and pose lifting. We also showcase baseline results. For more information, visit https://project-arcane.eu/.
△ Less
Submitted 29 April, 2023;
originally announced May 2023.
-
On the Shadowableness of Flows With Hyperbolic Singularities
Authors:
Alexander Arbieto,
Andrés M. López,
Elias Rego,
Yeison Sánchez
Abstract:
In this work we study the existence of singular flows satisfying shadowing-like properties. More precisely, we prove that if C1 -vector field on a closed manifold induces a chain-recurrent flow containing an attached hyperbolic singularity of stable or unstable index-one, then this flow cannot satisfy the shadowing property. If the manifold is non-compact, the vector field is complete and non-wand…
▽ More
In this work we study the existence of singular flows satisfying shadowing-like properties. More precisely, we prove that if C1 -vector field on a closed manifold induces a chain-recurrent flow containing an attached hyperbolic singularity of stable or unstable index-one, then this flow cannot satisfy the shadowing property. If the manifold is non-compact, the vector field is complete and non-wandering, we prove that we prove that the existence of index-one hyperbolic singularities prevents the induced flow to satisfy the rescaled-shadowing property introduced in [6].
△ Less
Submitted 10 March, 2023;
originally announced March 2023.
-
On the Metrics for Evaluating Monocular Depth Estimation
Authors:
Akhil Gurram,
Antonio M. Lopez
Abstract:
Monocular Depth Estimation (MDE) is performed to produce 3D information that can be used in downstream tasks such as those related to on-board perception for Autonomous Vehicles (AVs) or driver assistance. Therefore, a relevant arising question is whether the standard metrics for MDE assessment are a good indicator of the accuracy of future MDE-based driving-related perception tasks. We address th…
▽ More
Monocular Depth Estimation (MDE) is performed to produce 3D information that can be used in downstream tasks such as those related to on-board perception for Autonomous Vehicles (AVs) or driver assistance. Therefore, a relevant arising question is whether the standard metrics for MDE assessment are a good indicator of the accuracy of future MDE-based driving-related perception tasks. We address this question in this paper. In particular, we take the task of 3D object detection on point clouds as a proxy of on-board perception. We train and test state-of-the-art 3D object detectors using 3D point clouds coming from MDE models. We confront the ranking of object detection results with the ranking given by the depth estimation metrics of the MDE models. We conclude that, indeed, MDE evaluation metrics give rise to a ranking of methods that reflects relatively well the 3D object detection results we may expect. Among the different metrics, the absolute relative (abs-rel) error seems to be the best for that purpose.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
Scaling Vision-based End-to-End Driving with Multi-View Attention Learning
Authors:
Yi Xiao,
Felipe Codevilla,
Diego Porres,
Antonio M. Lopez
Abstract:
On end-to-end driving, human driving demonstrations are used to train perception-based driving models by imitation learning. This process is supervised on vehicle signals (e.g., steering angle, acceleration) but does not require extra costly supervision (human labeling of sensor data). As a representative of such vision-based end-to-end driving models, CILRS is commonly used as a baseline to compa…
▽ More
On end-to-end driving, human driving demonstrations are used to train perception-based driving models by imitation learning. This process is supervised on vehicle signals (e.g., steering angle, acceleration) but does not require extra costly supervision (human labeling of sensor data). As a representative of such vision-based end-to-end driving models, CILRS is commonly used as a baseline to compare with new driving models. So far, some latest models achieve better performance than CILRS by using expensive sensor suites and/or by using large amounts of human-labeled data for training. Given the difference in performance, one may think that it is not worth pursuing vision-based pure end-to-end driving. However, we argue that this approach still has great value and potential considering cost and maintenance. In this paper, we present CIL++, which improves on CILRS by both processing higher-resolution images using a human-inspired HFOV as an inductive bias and incorporating a proper attention mechanism. CIL++ achieves competitive performance compared to models which are more costly to develop. We propose to replace CILRS with CIL++ as a strong vision-based pure end-to-end driving baseline supervised by only vehicle signals and trained by conditional imitation learning.
△ Less
Submitted 22 July, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Modeling Backgrounds for the MAJORANA DEMONSTRATOR
Authors:
C. R. Haufe,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
K. H. Bhimani,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe
, et al. (33 additional authors not shown)
Abstract:
The MAJORANA DEMONSTRATOR is a neutrinoless double-beta decay ($0νββ$) experiment containing $\sim$30 kg of p-type point contact germanium detectors enriched to 88% in 76Ge and $\sim$14 kg of natural germanium detectors. The detectors are housed in two electroformed copper cryostats and surrounded by a graded passive shield with active muon veto. An extensive radioassay campaign was performed prio…
▽ More
The MAJORANA DEMONSTRATOR is a neutrinoless double-beta decay ($0νββ$) experiment containing $\sim$30 kg of p-type point contact germanium detectors enriched to 88% in 76Ge and $\sim$14 kg of natural germanium detectors. The detectors are housed in two electroformed copper cryostats and surrounded by a graded passive shield with active muon veto. An extensive radioassay campaign was performed prior to installation to insure the use of ultra-clean materials. The DEMONSTRATOR achieved one of the lowest background rates in the region of the $0νββ$ Q-value, 15.7 $\pm$ 1.4 cts/(FWHM t y) from the low-background configuration spanning most of the 64.5 kg-yr active exposure. Nevertheless this background rate is a factor of five higher than the projected background rate. This discrepancy arises from an excess of events from the 232Th decay chain. Background model fits aim to understand this deviation from assay-based projections, potentially determine the source(s) of observed backgrounds, and allow a precision measurement of the two-neutrino double-beta decay half-life. The fits agree with earlier simulation studies, which indicate the origin of the 232Th excess is not from a near-detector component and have informed design decisions for the next-generation LEGEND experiment. Recent findings have narrowed the suspected locations for the excess activity, motivating a final simulation and assay campaign to complete the background model.
△ Less
Submitted 11 January, 2023; v1 submitted 21 September, 2022;
originally announced September 2022.
-
Unstructured Road Segmentation using Hypercolumn based Random Forests of Local experts
Authors:
Prassanna Ganesh Ravishankar,
Antonio M. Lopez,
Gemma M. Sanchez
Abstract:
Monocular vision based road detection methods are mostly based on machine learning methods, relying on classification and feature extraction accuracy, and suffer from appearance, illumination and weather changes. Traditional methods introduce the predictions into conditional random fields or markov random fields models to improve the intermediate predictions based on structure. These methods are o…
▽ More
Monocular vision based road detection methods are mostly based on machine learning methods, relying on classification and feature extraction accuracy, and suffer from appearance, illumination and weather changes. Traditional methods introduce the predictions into conditional random fields or markov random fields models to improve the intermediate predictions based on structure. These methods are optimization based and therefore resource heavy and slow, making it unsuitable for real time applications. We propose a method to detect and segment roads with a random forest classifier of local experts with superpixel based machine-learned features. The random forest takes in machine learnt descriptors from a pre-trained convolutional neural network - VGG-16. The features are also pooled into their respective superpixels, allowing for local structure to be continuous. We compare our algorithm against Nueral Network based methods and Traditional approaches (based on Hand-crafted features), on both Structured Road (CamVid and Kitti) and Unstructured Road Datasets. Finally, we introduce a Road Scene Dataset with 1000 annotated images, and verify that our algorithm works well in non-urban and rural road scenarios.
△ Less
Submitted 23 July, 2022;
originally announced July 2022.
-
Final Result of the MAJORANA DEMONSTRATOR's Search for Neutrinoless Double-$β$ Decay in $^{76}$Ge
Authors:
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
P. J. Barton,
K. H. Bhimani,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe
, et al. (35 additional authors not shown)
Abstract:
The MAJORANA DEMONSTRATOR searched for neutrinoless double-$β$ decay ($0νββ$) of $^{76}$Ge using modular arrays of high-purity Ge detectors operated in vacuum cryostats in a low-background shield. The arrays operated with up to 40.4 kg of detectors (27.2 kg enriched to $\sim$88\% in $^{76}$Ge). From these measurements, the DEMONSTRATOR has accumulated 64.5 kg yr of enriched active exposure. With a…
▽ More
The MAJORANA DEMONSTRATOR searched for neutrinoless double-$β$ decay ($0νββ$) of $^{76}$Ge using modular arrays of high-purity Ge detectors operated in vacuum cryostats in a low-background shield. The arrays operated with up to 40.4 kg of detectors (27.2 kg enriched to $\sim$88\% in $^{76}$Ge). From these measurements, the DEMONSTRATOR has accumulated 64.5 kg yr of enriched active exposure. With a world-leading energy resolution of 2.52 keV FWHM at the 2039 keV $Q_{ββ}$ (0.12\%), we set a half-life limit of $0νββ$ in $^{76}$Ge at $T_{1/2}>8.3\times10^{25}$ yr (90\% C.L.). This provides a range of upper limits on $m_{ββ}$ of $(113-269)$ meV (90\% C.L.), depending on the choice of nuclear matrix elements.
△ Less
Submitted 10 February, 2023; v1 submitted 15 July, 2022;
originally announced July 2022.
-
Is the Observable Universe Consistent with the Cosmological Principle?
Authors:
Pavan Kumar Aluri,
Paolo Cea,
Pravabati Chingangbam,
Ming-Chung Chu,
Roger G. Clowes,
Damien Hutsemékers,
Joby P. Kochappan,
Alexia M. Lopez,
Lang Liu,
Niels C. M. Martens,
C. J. A. P. Martins,
Konstantinos Migkas,
Eoin Ó Colgáin,
Pratyush Pranav,
Lior Shamir,
Ashok K. Singal,
M. M. Sheikh-Jabbari,
Jenny Wagner,
Shao-Jiang Wang,
David L. Wiltshire,
Shek Yeung,
Lu Yin,
Wen Zhao
Abstract:
The Cosmological Principle (CP) -- the notion that the Universe is spatially isotropic and homogeneous on large scales -- underlies a century of progress in cosmology. It is conventionally formulated through the Friedmann-Lemaître-Robertson-Walker (FLRW) cosmologies as the spacetime metric, and culminates in the successful and highly predictive $Λ$-Cold-Dark-Matter ($Λ$CDM) model. Yet, tensions ha…
▽ More
The Cosmological Principle (CP) -- the notion that the Universe is spatially isotropic and homogeneous on large scales -- underlies a century of progress in cosmology. It is conventionally formulated through the Friedmann-Lemaître-Robertson-Walker (FLRW) cosmologies as the spacetime metric, and culminates in the successful and highly predictive $Λ$-Cold-Dark-Matter ($Λ$CDM) model. Yet, tensions have emerged within the $Λ$CDM model, most notably a statistically significant discrepancy in the value of the Hubble constant, $H_0$. Since the notion of cosmic expansion determined by a single parameter is intimately tied to the CP, implications of the $H_0$ tension may extend beyond $Λ$CDM to the CP itself. This review surveys current observational hints for deviations from the expectations of the CP, highlighting synergies and disagreements that warrant further study. Setting aside the debate about individual large structures, potential deviations from the CP include variations of cosmological parameters on the sky, discrepancies in the cosmic dipoles, and mysterious alignments in quasar polarizations and galaxy spins. While it is possible that a host of observational systematics are impacting results, it is equally plausible that precision cosmology may have outgrown the FLRW paradigm, an extremely pragmatic but non-fundamental symmetry assumption.
△ Less
Submitted 27 February, 2023; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Exotic dark matter search with the Majorana Demonstrator
Authors:
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
K. H. Bhimani,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe,
C. R. Haufe
, et al. (34 additional authors not shown)
Abstract:
With excellent energy resolution and ultra-low level radiogenic backgrounds, the high-purity germanium detectors in the Majorana Demonstrator enable searches for several classes of exotic dark matter (DM) models. In this work we report new experimental limits on keV-scale sterile neutrino DM via the transition magnetic moment from conversion to active neutrinos, $ν_s \rightarrow ν_a$. We report ne…
▽ More
With excellent energy resolution and ultra-low level radiogenic backgrounds, the high-purity germanium detectors in the Majorana Demonstrator enable searches for several classes of exotic dark matter (DM) models. In this work we report new experimental limits on keV-scale sterile neutrino DM via the transition magnetic moment from conversion to active neutrinos, $ν_s \rightarrow ν_a$. We report new limits on fermionic dark matter absorption ($χ+ A \rightarrow ν+ A$) and sub-GeV DM-nucleus 3$\rightarrow$2 scattering ($χ+ χ+ A \rightarrow φ+ A$), and new exclusion limits for bosonic dark matter (axionlike particles and dark photons). These searches utilize the 1--100 keV low energy region of a 37.5 kg-y exposure collected by the Demonstrator between May 2016 and Nov. 2019, using a set of $^{76}$Ge-enriched detectors whose surface exposure time was carefully controlled, resulting in extremely low levels of cosmogenic activation.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
Search for solar axions via axion-photon coupling with the Majorana Demonstrator
Authors:
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
K. H. Bhimani,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe,
C. R. Haufe
, et al. (33 additional authors not shown)
Abstract:
Axions were originally proposed to explain the strong-CP problem in QCD. Through the axion-photon coupling, the Sun could be a major source of axions, which could be measured in solid state detection experiments with enhancements due to coherent Primakoff-Bragg scattering. The Majorana Demonstrator experiment has searched for solar axions with a set of $^{76}$Ge-enriched high purity germanium dete…
▽ More
Axions were originally proposed to explain the strong-CP problem in QCD. Through the axion-photon coupling, the Sun could be a major source of axions, which could be measured in solid state detection experiments with enhancements due to coherent Primakoff-Bragg scattering. The Majorana Demonstrator experiment has searched for solar axions with a set of $^{76}$Ge-enriched high purity germanium detectors using a 33 kg-yr exposure collected between Jan. 2017 and Nov. 2019. A temporal-energy analysis gives a new limit on the axion-photon coupling as $g_{aγ}<1.45\times 10^{-9}$ GeV$^{-1}$ (95% C.I.) for axions with mass up to 100 eV/$c^2$. This improves laboratory-based limits between about 1 eV/$c^2$ and 100 eV/$c^2$.
△ Less
Submitted 22 August, 2022; v1 submitted 12 June, 2022;
originally announced June 2022.
-
Co-Training for Unsupervised Domain Adaptation of Semantic Segmentation Models
Authors:
Jose L. Gómez,
Gabriel Villalonga,
Antonio M. López
Abstract:
Semantic image segmentation is a central and challenging task in autonomous driving, addressed by training deep models. Since this training draws to a curse of human-based image labeling, using synthetic images with automatically generated labels together with unlabeled real-world images is a promising alternative. This implies to address an unsupervised domain adaptation (UDA) problem. In this pa…
▽ More
Semantic image segmentation is a central and challenging task in autonomous driving, addressed by training deep models. Since this training draws to a curse of human-based image labeling, using synthetic images with automatically generated labels together with unlabeled real-world images is a promising alternative. This implies to address an unsupervised domain adaptation (UDA) problem. In this paper, we propose a new co-training procedure for synth-to-real UDA of semantic segmentation models. It consists of a self-training stage, which provides two domain-adapted models, and a model collaboration loop for the mutual improvement of these two models. These models are then used to provide the final semantic segmentation labels (pseudo-labels) for the real-world images. The overall procedure treats the deep models as black boxes and drives their collaboration at the level of pseudo-labeled target images, i.e., neither modifying loss functions is required, nor explicit feature alignment. We test our proposal on standard synthetic and real-world datasets for on-board semantic segmentation. Our procedure shows improvements ranging from ~13 to ~26 mIoU points over baselines, so establishing new state-of-the-art results.
△ Less
Submitted 30 January, 2023; v1 submitted 31 May, 2022;
originally announced May 2022.
-
Experimental study of 13C(α,n)16O reactions in the Majorana Demonstrator calibration data
Authors:
MAJORANA Collaboration,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
K. H. Bhimani,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe
, et al. (33 additional authors not shown)
Abstract:
Neutron captures and delayed decays of reaction products are common sources of backgrounds in ultra-rare event searches. In this work, we studied $^{13}$C($α,n)^{16}$O reactions induced by $α$-particles emitted within the calibration sources of the \textsc{Majorana Demonstrator}. These sources are thorium-based calibration standards enclosed in carbon-rich materials. The reaction rate was estimate…
▽ More
Neutron captures and delayed decays of reaction products are common sources of backgrounds in ultra-rare event searches. In this work, we studied $^{13}$C($α,n)^{16}$O reactions induced by $α$-particles emitted within the calibration sources of the \textsc{Majorana Demonstrator}. These sources are thorium-based calibration standards enclosed in carbon-rich materials. The reaction rate was estimated by using the 6129-keV $γ$-rays emitted from the excited $^{16}$O states that are populated when the incoming $α$-particles exceed the reaction Q-value. Thanks to the excellent energy performance of the \textsc{Demonstrator}'s germanium detectors, these characteristic photons can be clearly observed in the calibration data. Facilitated by \textsc{Geant4} simulations, a comparison between the observed 6129-keV photon rates and predictions by a TALYS-based software was performed. The measurements and predictions were found to be consistent, albeit with large statistical uncertainties. This agreement provides support for background projections from ($α,n$)-reactions in future double-beta decay search efforts.
△ Less
Submitted 11 July, 2022; v1 submitted 27 March, 2022;
originally announced March 2022.
-
Search for charge nonconservation and Pauli exclusion principle violation with the Majorana Demonstrator
Authors:
MAJORANA Collaboration,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
K. H. Bhimani,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe
, et al. (33 additional authors not shown)
Abstract:
Charge conservation and the Pauli exclusion principle (PEP) result from fundamental symmetries in the Standard Model, and are typically taken as axiomatic. High-precision tests for small violations of these symmetries could point to new physics. In this work we consider three models for violation of these processes which would produce detectable ionization in the high-purity germanium detectors of…
▽ More
Charge conservation and the Pauli exclusion principle (PEP) result from fundamental symmetries in the Standard Model, and are typically taken as axiomatic. High-precision tests for small violations of these symmetries could point to new physics. In this work we consider three models for violation of these processes which would produce detectable ionization in the high-purity germanium detectors of the Majorana Demonstrator. Using a 37.5 kg-yr exposure, we report a new lower limit on the electron mean lifetime of $τ_e > 3.2 \times 10^{25}$ yr (90\% CL), the best result for this decay channel ($e \rightarrow ν_e \overline{ν_e} ν_e$ or more generally $e \rightarrow \mathrm{invisibles}$) in more than two decades. We also present searches for two types of violation of the PEP, setting new limits on the probability of two electrons forming a symmetric quantum state. Using our $^{228}$Th calibration data set, which introduces electrons new to the system through electron-positron pair production, we obtain a world-leading model-independent limit for a terrestrial experiment of $β^2/2 < 1.0 \times 10^{-3}$ (99.7\% CL). Our 37.5 kg-yr exposure is also used to search for a process where an electron in an atomic system spontaneously violates the PEP, resulting in a model-dependent upper limit of $β^2/2 < 1.0 \times 10^{-48}$ (90\% CL).
△ Less
Submitted 11 January, 2023; v1 submitted 3 March, 2022;
originally announced March 2022.
-
Search for Spontaneous Radiation from Wavefunction Collapse in the Majorana Demonstrator
Authors:
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe,
C. R. Haufe,
R. Henning
, et al. (29 additional authors not shown)
Abstract:
The Majorana Demonstrator neutrinoless double-beta decay experiment comprises a 44 kg (30 kg enriched in $^{76}\mathrm{Ge}$) array of $p$-type, point-contact germanium detectors. With its unprecedented energy resolution and ultralow backgrounds, Majorana also searches for rare event signatures from beyond standard model physics in the low energy region below 100 keV. In this Letter, we test the co…
▽ More
The Majorana Demonstrator neutrinoless double-beta decay experiment comprises a 44 kg (30 kg enriched in $^{76}\mathrm{Ge}$) array of $p$-type, point-contact germanium detectors. With its unprecedented energy resolution and ultralow backgrounds, Majorana also searches for rare event signatures from beyond standard model physics in the low energy region below 100 keV. In this Letter, we test the continuous spontaneous localization (CSL) model, one of the mathematically well-motivated wave function collapse models aimed at solving the long-standing unresolved quantum mechanical measurement problem. While the CSL predicts the existence of a detectable radiation signature in the x-ray domain, we find no evidence of such radiation in the 19--100 keV range in a 37.5 kg-y enriched germanium exposure collected between December 31, 2015, and November 27, 2019, with the Demonstrator. We explored both the non-mass-proportional (n-m-p) and the mass-proportional (m-p) versions of the CSL with two different assumptions: that only the quasifree electrons can emit the x-ray radiation and that the nucleus can coherently emit an amplified radiation. In all cases, we set the most stringent upper limit to date for the white CSL model on the collapse rate, $λ$, providing a factor of 40--100 improvement in sensitivity over comparable searches. Our limit is the most stringent for large parts of the allowed parameter space. If the result is interpreted in terms of the Diòsi-Penrose gravitational wave function collapse model, the lower bound with a 95% confidence level is almost an order of magnitude improvement over the previous best limit.
△ Less
Submitted 12 June, 2023; v1 submitted 2 February, 2022;
originally announced February 2022.
-
A Giant Arc on the Sky
Authors:
Alexia M. Lopez,
Roger G. Clowes,
Gerard M. Williger
Abstract:
We present the serendipitous discovery of a `Giant Arc on the Sky' at $z \sim 0.8$. The Giant Arc (GA) spans $\sim 1$ Gpc (proper size, present epoch), and appears to be almost symmetrical on the sky. It was discovered via intervening MgII absorbers in the spectra of background quasars, using the catalogues of Zhu \& Ménard. The use of MgII absorbers represents a new approach to the investigation…
▽ More
We present the serendipitous discovery of a `Giant Arc on the Sky' at $z \sim 0.8$. The Giant Arc (GA) spans $\sim 1$ Gpc (proper size, present epoch), and appears to be almost symmetrical on the sky. It was discovered via intervening MgII absorbers in the spectra of background quasars, using the catalogues of Zhu \& Ménard. The use of MgII absorbers represents a new approach to the investigation of large-scale structures (LSSs) at redshifts $0.45 \lesssim z \lesssim 2.25$. We present the observational properties of the GA, and we assess it statistically using methods based on: (i) single-linkage hierarchical clustering ($\sim 4.5σ$); (ii) the Cuzick-Edwards test ($\sim 3.0σ$); and (iii) power spectrum analysis ($\sim 4.8σ$). Each of these methods has distinctive attributes and powers, and we advise considering the evidence from the ensemble. We discuss our approaches to mitigating any {\it post-hoc} aspects of analysing significance after discovery. The overdensity of the GA is $δρ/ ρ\sim 1.3 \pm 0.3$. The GA is the newest and one of the largest of a steadily accumulating set of very large LSSs that may (cautiously) challenge the Cosmological Principle, upon which the `standard model' of cosmology is founded. Conceivably, the GA is the precursor of a structure like the Sloan Great Wall (but the GA is about twice the size), seen when the Universe was about half its present age.
△ Less
Submitted 2 August, 2022; v1 submitted 18 January, 2022;
originally announced January 2022.
-
The MAJORANA DEMONSTRATOR Readout Electronics System
Authors:
N. Abgrall,
M. Amman,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
P. J. Barton,
F. E. Bertrand,
K. H. Bhimani,
B. Bos,
A. W. Bradley,
T. H. Burritt,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
R. J. Cooper,
C. Cuesta,
J. A. Detwiler,
A. Drobizhev,
D. W. Edwins,
Yu. Efremenko
, et al. (54 additional authors not shown)
Abstract:
The MAJORANA DEMONSTRATOR comprises two arrays of high-purity germanium detectors constructed to search for neutrinoless double-beta decay in 76-Ge and other physics beyond the Standard Model. Its readout electronics were designed to have low electronic noise, and radioactive backgrounds were minimized by using low-mass components and low-radioactivity materials near the detectors. This paper prov…
▽ More
The MAJORANA DEMONSTRATOR comprises two arrays of high-purity germanium detectors constructed to search for neutrinoless double-beta decay in 76-Ge and other physics beyond the Standard Model. Its readout electronics were designed to have low electronic noise, and radioactive backgrounds were minimized by using low-mass components and low-radioactivity materials near the detectors. This paper provides a description of all components of the MAJORANA DEMONSTRATOR readout electronics, spanning the front-end electronics and internal cabling, back-end electronics, digitizer, and power supplies, along with the grounding scheme. The spectroscopic performance achieved with these readout electronics is also demonstrated.
△ Less
Submitted 23 February, 2022; v1 submitted 17 November, 2021;
originally announced November 2021.
-
Signatures of muonic activation in the Majorana Demonstrator
Authors:
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
T. R. Edwards,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe
, et al. (33 additional authors not shown)
Abstract:
Experiments searching for very rare processes such as neutrinoless double-beta decay require a detailed understanding of all sources of background. Signals from radioactive impurities present in construction and detector materials can be suppressed using a number of well-understood techniques. Background from in-situ cosmogenic interactions can be reduced by siting an experiment deep underground.…
▽ More
Experiments searching for very rare processes such as neutrinoless double-beta decay require a detailed understanding of all sources of background. Signals from radioactive impurities present in construction and detector materials can be suppressed using a number of well-understood techniques. Background from in-situ cosmogenic interactions can be reduced by siting an experiment deep underground. However, the next generation of such experiments have unprecedented sensitivity goals of 10$^{28}$ years half-life with background rates of 10$^{-5}$cts/(keV kg yr) in the region of interest. To achieve these goals, the remaining cosmogenic background must be well understood. In the work presented here, Majorana Demonstrator data is used to search for decay signatures of meta-stable germanium isotopes. Contributions to the region of interest in energy and time are estimated using simulations, and compared to Demonstrator data. Correlated time-delayed signals are used to identify decay signatures of isotopes produced in the germanium detectors. A good agreement between expected and measured rate is found and different simulation frameworks are used to estimate the uncertainties of the predictions. The simulation campaign is then extended to characterize the background for the LEGEND experiment, a proposed tonne-scale effort searching for neutrinoless double-beta decay in $^{76}$Ge.
△ Less
Submitted 27 October, 2021;
originally announced October 2021.
-
Co-training for Deep Object Detection: Comparing Single-modal and Multi-modal Approaches
Authors:
Jose L. Gómez,
Gabriel Villalonga,
Antonio M. López
Abstract:
Top-performing computer vision models are powered by convolutional neural networks (CNNs). Training an accurate CNN highly depends on both the raw sensor data and their associated ground truth (GT). Collecting such GT is usually done through human labeling, which is time-consuming and does not scale as we wish. This data labeling bottleneck may be intensified due to domain shifts among image senso…
▽ More
Top-performing computer vision models are powered by convolutional neural networks (CNNs). Training an accurate CNN highly depends on both the raw sensor data and their associated ground truth (GT). Collecting such GT is usually done through human labeling, which is time-consuming and does not scale as we wish. This data labeling bottleneck may be intensified due to domain shifts among image sensors, which could force per-sensor data labeling. In this paper, we focus on the use of co-training, a semi-supervised learning (SSL) method, for obtaining self-labeled object bounding boxes (BBs), i.e., the GT to train deep object detectors. In particular, we assess the goodness of multi-modal co-training by relying on two different views of an image, namely, appearance (RGB) and estimated depth (D). Moreover, we compare appearance-based single-modal co-training with multi-modal. Our results suggest that in a standard SSL setting (no domain shift, a few human-labeled data) and under virtual-to-real domain shift (many virtual-world labeled data, no human-labeled data) multi-modal co-training outperforms single-modal. In the latter case, by performing GAN-based domain translation both co-training modalities are on pair; at least, when using an off-the-shelf depth estimation model not specifically trained on the translated images.
△ Less
Submitted 23 April, 2021;
originally announced April 2021.
-
Monocular Depth Estimation through Virtual-world Supervision and Real-world SfM Self-Supervision
Authors:
Akhil Gurram,
Ahmet Faruk Tuna,
Fengyi Shen,
Onay Urfalioglu,
Antonio M. López
Abstract:
Depth information is essential for on-board perception in autonomous driving and driver assistance. Monocular depth estimation (MDE) is very appealing since it allows for appearance and depth being on direct pixelwise correspondence without further calibration. Best MDE models are based on Convolutional Neural Networks (CNNs) trained in a supervised manner, i.e., assuming pixelwise ground truth (G…
▽ More
Depth information is essential for on-board perception in autonomous driving and driver assistance. Monocular depth estimation (MDE) is very appealing since it allows for appearance and depth being on direct pixelwise correspondence without further calibration. Best MDE models are based on Convolutional Neural Networks (CNNs) trained in a supervised manner, i.e., assuming pixelwise ground truth (GT). Usually, this GT is acquired at training time through a calibrated multi-modal suite of sensors. However, also using only a monocular system at training time is cheaper and more scalable. This is possible by relying on structure-from-motion (SfM) principles to generate self-supervision. Nevertheless, problems of camouflaged objects, visibility changes, static-camera intervals, textureless areas, and scale ambiguity, diminish the usefulness of such self-supervision. In this paper, we perform monocular depth estimation by virtual-world supervision (MonoDEVS) and real-world SfM self-supervision. We compensate the SfM self-supervision limitations by leveraging virtual-world images with accurate semantic and depth supervision and addressing the virtual-to-real domain gap. Our MonoDEVSNet outperforms previous MDE CNNs trained on monocular and even stereo sequences.
△ Less
Submitted 3 June, 2022; v1 submitted 22 March, 2021;
originally announced March 2021.
-
Action-Based Representation Learning for Autonomous Driving
Authors:
Yi Xiao,
Felipe Codevilla,
Christopher Pal,
Antonio M. Lopez
Abstract:
Human drivers produce a vast amount of data which could, in principle, be used to improve autonomous driving systems. Unfortunately, seemingly straightforward approaches for creating end-to-end driving models that map sensor data directly into driving actions are problematic in terms of interpretability, and typically have significant difficulty dealing with spurious correlations. Alternatively, w…
▽ More
Human drivers produce a vast amount of data which could, in principle, be used to improve autonomous driving systems. Unfortunately, seemingly straightforward approaches for creating end-to-end driving models that map sensor data directly into driving actions are problematic in terms of interpretability, and typically have significant difficulty dealing with spurious correlations. Alternatively, we propose to use this kind of action-based driving data for learning representations. Our experiments show that an affordance-based driving model pre-trained with this approach can leverage a relatively small amount of weakly annotated imagery and outperform pure end-to-end driving models, while being more interpretable. Further, we demonstrate how this strategy outperforms previous methods based on learning inverse dynamics models as well as other methods based on heavy human supervision (ImageNet).
△ Less
Submitted 9 November, 2020; v1 submitted 21 August, 2020;
originally announced August 2020.
-
The Majorana Demonstrator's Search for Double-Beta Decay of $^{76}$Ge to Excited States of $^{76}$Se
Authors:
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
A. Drobizhev,
T. R. Edwards,
D. W. Edwins,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
T. Gilliss,
G. K. Giovanetti,
M. P. Green
, et al. (38 additional authors not shown)
Abstract:
The Majorana Demonstrator is a neutrinoless double-beta decay search consisting of a low-background modular array of high-purity germanium detectors, $\sim2/3$ of which are enriched to 88\% in $^{76}$Ge. The experiment is also searching for double-beta decay of $^{76}$Ge to excited states (e.s.) in $^{76}$Se. $^{76}$Ge can decay into three daughter states of $^{76}$Se, with clear event signatures…
▽ More
The Majorana Demonstrator is a neutrinoless double-beta decay search consisting of a low-background modular array of high-purity germanium detectors, $\sim2/3$ of which are enriched to 88\% in $^{76}$Ge. The experiment is also searching for double-beta decay of $^{76}$Ge to excited states (e.s.) in $^{76}$Se. $^{76}$Ge can decay into three daughter states of $^{76}$Se, with clear event signatures consisting of a $ββ$-decay followed by the prompt emission of one or two $γ$-rays. This results with high probability in multi-detector coincidences. The granularity of the Demonstrator detector array enables powerful discrimination of this event signature from backgrounds. Using 41.9~kg-y of isotopic exposure, the Demonstrator has set world leading limits for each e.s.\ decay of $^{76}$Ge, with 90\% CL lower half-life limits in the range of $(0.75-4.0)\times10^{24}$~y. In particular, for the $2ν$ transition to the first $0^+$ e.s.\ of $^{76}$Se, a lower half-life limit of $7.5\times10^{23}$~y at 90\% CL was achieved.
△ Less
Submitted 24 February, 2021; v1 submitted 13 August, 2020;
originally announced August 2020.
-
Co-training for On-board Deep Object Detection
Authors:
Gabriel Villalonga,
Antonio M. Lopez
Abstract:
Providing ground truth supervision to train visual models has been a bottleneck over the years, exacerbated by domain shifts which degenerate the performance of such models. This was the case when visual tasks relied on handcrafted features and shallow machine learning and, despite its unprecedented performance gains, the problem remains open within the deep learning paradigm due to its data-hungr…
▽ More
Providing ground truth supervision to train visual models has been a bottleneck over the years, exacerbated by domain shifts which degenerate the performance of such models. This was the case when visual tasks relied on handcrafted features and shallow machine learning and, despite its unprecedented performance gains, the problem remains open within the deep learning paradigm due to its data-hungry nature. Best performing deep vision-based object detectors are trained in a supervised manner by relying on human-labeled bounding boxes which localize class instances (i.e.objects) within the training images.Thus, object detection is one of such tasks for which human labeling is a major bottleneck. In this paper, we assess co-training as a semi-supervised learning method for self-labeling objects in unlabeled images, so reducing the human-labeling effort for develo** deep object detectors. Our study pays special attention to a scenario involving domain shift; in particular, when we have automatically generated virtual-world images with object bounding boxes and we have real-world images which are unlabeled. Moreover, we are particularly interested in using co-training for deep object detection in the context of driver assistance systems and/or self-driving vehicles. Thus, using well-established datasets and protocols for object detection in these application contexts, we will show how co-training is a paradigm worth to pursue for alleviating object labeling, working both alone and together with task-agnostic domain adaptation.
△ Less
Submitted 12 August, 2020;
originally announced August 2020.
-
Shadowing for codimension one sectional-Anosov flows
Authors:
A. Arbieto,
A. M. López,
Y. Sánchez
Abstract:
In hyperbolic dynamics, a well-known result is that every hyperbolic attracting set, have a finite pseudo-orbit tracing property (FPOTP). It's natural to wonder if this result is maintained in the sectional-hyperbolic dynamics; Komuro in [Lorenz attractors do not have the pseudo-orbit tracing property], provides a negative answer for this question, by proving that the geometric Lorenz Attractor do…
▽ More
In hyperbolic dynamics, a well-known result is that every hyperbolic attracting set, have a finite pseudo-orbit tracing property (FPOTP). It's natural to wonder if this result is maintained in the sectional-hyperbolic dynamics; Komuro in [Lorenz attractors do not have the pseudo-orbit tracing property], provides a negative answer for this question, by proving that the geometric Lorenz Attractor doesn't have a FPOTP. In this paper, we generalized the result of Komuro, we prove that every codimension one sectional-hyperbolic attractor set with a unique singularity Lorenz-like, which is of boundary-type, does not have FPOTP.
△ Less
Submitted 23 June, 2020;
originally announced June 2020.
-
$α$-event Characterization and Rejection in Point-Contact HPGe Detectors
Authors:
The MAJORANA Collaboration,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
E. Blalock,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
A. Drobizhev,
T. R. Edwards,
D. W. Edwins,
Yu. Efremenko,
S. R. Elliott,
T. Gilliss,
G. K. Giovanetti,
M. P. Green
, et al. (40 additional authors not shown)
Abstract:
P-type point contact (PPC) HPGe detectors are a leading technology for rare event searches due to their excellent energy resolution, low thresholds, and multi-site event rejection capabilities. We have characterized a PPC detector's response to $α$ particles incident on the sensitive passivated and p+ surfaces, a previously poorly-understood source of background. The detector studied is identical…
▽ More
P-type point contact (PPC) HPGe detectors are a leading technology for rare event searches due to their excellent energy resolution, low thresholds, and multi-site event rejection capabilities. We have characterized a PPC detector's response to $α$ particles incident on the sensitive passivated and p+ surfaces, a previously poorly-understood source of background. The detector studied is identical to those in the MAJORANA DEMONSTRATOR experiment, a search for neutrinoless double-beta decay ($0νββ$) in $^{76}$Ge. $α$ decays on most of the passivated surface exhibit significant energy loss due to charge trap**, with waveforms exhibiting a delayed charge recovery (DCR) signature caused by the slow collection of a fraction of the trapped charge. The DCR is found to be complementary to existing methods of $α$ identification, reliably identifying $α$ background events on the passivated surface of the detector. We demonstrate effective rejection of all surface $α$ events (to within statistical uncertainty) with a loss of only 0.2% of bulk events by combining the DCR discriminator with previously-used methods. The DCR discriminator has been used to reduce the background rate in the $0νββ$ region of interest window by an order of magnitude in the MAJORANA DEMONSTRATOR, and will be used in the upcoming LEGEND-200 experiment.
△ Less
Submitted 14 March, 2022; v1 submitted 23 June, 2020;
originally announced June 2020.
-
Distributed Learning and Inference with Compressed Images
Authors:
Sudeep Katakol,
Basem Elbarashy,
Luis Herranz,
Joost van de Weijer,
Antonio M. Lopez
Abstract:
Modern computer vision requires processing large amounts of data, both while training the model and/or during inference, once the model is deployed. Scenarios where images are captured and processed in physically separated locations are increasingly common (e.g. autonomous vehicles, cloud computing). In addition, many devices suffer from limited resources to store or transmit data (e.g. storage sp…
▽ More
Modern computer vision requires processing large amounts of data, both while training the model and/or during inference, once the model is deployed. Scenarios where images are captured and processed in physically separated locations are increasingly common (e.g. autonomous vehicles, cloud computing). In addition, many devices suffer from limited resources to store or transmit data (e.g. storage space, channel capacity). In these scenarios, lossy image compression plays a crucial role to effectively increase the number of images collected under such constraints. However, lossy compression entails some undesired degradation of the data that may harm the performance of the downstream analysis task at hand, since important semantic information may be lost in the process. Moreover, we may only have compressed images at training time but are able to use original images at inference time, or vice versa, and in such a case, the downstream model suffers from covariate shift. In this paper, we analyze this phenomenon, with a special focus on vision-based perception for autonomous driving as a paradigmatic scenario. We see that loss of semantic information and covariate shift do indeed exist, resulting in a drop in performance that depends on the compression rate. In order to address the problem, we propose dataset restoration, based on image restoration with generative adversarial networks (GANs). Our method is agnostic to both the particular image compression method and the downstream task; and has the advantage of not adding additional cost to the deployed models, which is particularly important in resource-limited devices. The presented experiments focus on semantic segmentation as a challenging use case, cover a broad range of compression rates and diverse datasets, and show how our method is able to significantly alleviate the negative effects of compression on the downstream visual task.
△ Less
Submitted 5 February, 2021; v1 submitted 22 April, 2020;
originally announced April 2020.
-
ADC Nonlinearity Correction for the MAJORANA DEMONSTRATOR
Authors:
N. Abgrall,
J. M. Allmond,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
C. M. Campbell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
H. L. Crawford,
C. Cuesta,
J. A. Detwiler,
A. Drobizhev,
D. W. Edwins,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
T. Gilliss
, et al. (42 additional authors not shown)
Abstract:
Imperfections in analog-to-digital conversion (ADC) cannot be ignored when signal digitization requirements demand both wide dynamic range and high resolution, as is the case for the Majorana Demonstrator 76Ge neutrinoless double beta decay search. Enabling the experiment's high-resolution spectral analysis and efficient pulse shape discrimination required careful measurement and correction of ADC…
▽ More
Imperfections in analog-to-digital conversion (ADC) cannot be ignored when signal digitization requirements demand both wide dynamic range and high resolution, as is the case for the Majorana Demonstrator 76Ge neutrinoless double beta decay search. Enabling the experiment's high-resolution spectral analysis and efficient pulse shape discrimination required careful measurement and correction of ADC nonlinearites. A simple measurement protocol was developed that did not require sophisticated equipment or lengthy data taking campaigns. A slope-dependent hysteresis was observed and characterized. A correction applied to digitized waveforms prior to signal processing reduced the differential and integral nonlinearites by an order of magnitude, eliminating these as dominant contributions to the systematic energy uncertainty at the double-beta decay Q value.
△ Less
Submitted 24 March, 2021; v1 submitted 4 March, 2020;
originally announced March 2020.
-
A Low Energy Rare Event Search with the Majorana Demonstrator
Authors:
MAJORANA Collaboration,
C. Wiseman,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
A. Drobizhev,
D. W. Edwins,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
T. Gilliss,
G. K. Giovanetti,
M. P. Green
, et al. (37 additional authors not shown)
Abstract:
The MAJORANA DEMONSTRATOR is sensitive to rare events near its energy threshold, including bosonic dark matter, solar axions, and lightly ionizing particles. In this analysis, a novel training set of low energy small-angle Compton scatter events is used to determine the efficiency of pulse shape analysis cuts, and we present updated bosonic dark matter and solar axion results from an 11.17 kg-y da…
▽ More
The MAJORANA DEMONSTRATOR is sensitive to rare events near its energy threshold, including bosonic dark matter, solar axions, and lightly ionizing particles. In this analysis, a novel training set of low energy small-angle Compton scatter events is used to determine the efficiency of pulse shape analysis cuts, and we present updated bosonic dark matter and solar axion results from an 11.17 kg-y dataset using a 5 keV analysis threshold.
△ Less
Submitted 12 December, 2019;
originally announced December 2019.
-
Results of the MAJORANA DEMONSTRATOR's Search for Double-Beta Decay of $^{76}$Ge to Excited States of $^{76}$Se
Authors:
I. S. Guinn,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P-H. Chu,
M. L. Clark,
C. Cuesta,
J. A. Detwiler,
A. Drobizhev,
D. W. Edwins,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
T. Gilliss,
G. K. Giovanetti,
M. P. Green,
J. Gruszko
, et al. (35 additional authors not shown)
Abstract:
The MAJORANA DEMONSTRATOR is searching for double-beta decay of $^{76}$Ge to excited states (E.S.) in $^{76}$Se using a modular array of high purity Germanium detectors. $^{76}$Ge can decay into three E.S.s of $^{76}$Se. The E.S. decays have a clear event signature consisting of a $ββ$-decay with the prompt emission of one or two $γ$-rays, resulting in with high probability in a multi-site event.…
▽ More
The MAJORANA DEMONSTRATOR is searching for double-beta decay of $^{76}$Ge to excited states (E.S.) in $^{76}$Se using a modular array of high purity Germanium detectors. $^{76}$Ge can decay into three E.S.s of $^{76}$Se. The E.S. decays have a clear event signature consisting of a $ββ$-decay with the prompt emission of one or two $γ$-rays, resulting in with high probability in a multi-site event. The granularity of the DEMONSTRATOR detector array enables powerful discrimination of this event signature from backgrounds. Using 21.3 kg-y of isotopic exposure, the DEMONSTRATOR has set world leading limits for each E.S. decay, with 90% CL lower half-life limits in the range of $(0.56-2.1)\cdot10^{24}$ y. In particular, for the $2ν$ transition to the first $0^+$ E.S. of $^{76}$Se, a lower half-life limit of $0.68\cdot10^{24}$ at 90% CL was achieved.
△ Less
Submitted 11 December, 2019;
originally announced December 2019.
-
Active Learning for Deep Detection Neural Networks
Authors:
Hamed H. Aghdam,
Abel Gonzalez-Garcia,
Joost van de Weijer,
Antonio M. López
Abstract:
The cost of drawing object bounding boxes (i.e. labeling) for millions of images is prohibitively high. For instance, labeling pedestrians in a regular urban image could take 35 seconds on average. Active learning aims to reduce the cost of labeling by selecting only those images that are informative to improve the detection network accuracy. In this paper, we propose a method to perform active le…
▽ More
The cost of drawing object bounding boxes (i.e. labeling) for millions of images is prohibitively high. For instance, labeling pedestrians in a regular urban image could take 35 seconds on average. Active learning aims to reduce the cost of labeling by selecting only those images that are informative to improve the detection network accuracy. In this paper, we propose a method to perform active learning of object detectors based on convolutional neural networks. We propose a new image-level scoring process to rank unlabeled images for their automatic selection, which clearly outperforms classical scores. The proposed method can be applied to videos and sets of still images. In the former case, temporal selection rules can complement our scoring process. As a relevant use case, we extensively study the performance of our method on the task of pedestrian detection. Overall, the experiments show that the proposed method performs better than random selection. Our codes are publicly available at www.gitlab.com/haghdam/deep_active_learning.
△ Less
Submitted 20 November, 2019;
originally announced November 2019.
-
Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models
Authors:
César Roberto de Souza,
Adrien Gaidon,
Yohann Cabon,
Naila Murray,
Antonio Manuel López
Abstract:
Deep video action recognition models have been highly successful in recent years but require large quantities of manually annotated data, which are expensive and laborious to obtain. In this work, we investigate the generation of synthetic training data for video action recognition, as synthetic data have been successfully used to supervise models for a variety of other computer vision tasks. We p…
▽ More
Deep video action recognition models have been highly successful in recent years but require large quantities of manually annotated data, which are expensive and laborious to obtain. In this work, we investigate the generation of synthetic training data for video action recognition, as synthetic data have been successfully used to supervise models for a variety of other computer vision tasks. We propose an interpretable parametric generative model of human action videos that relies on procedural generation, physics models and other components of modern game engines. With this model we generate a diverse, realistic, and physically plausible dataset of human action videos, called PHAV for "Procedural Human Action Videos". PHAV contains a total of 39,982 videos, with more than 1,000 examples for each of 35 action categories. Our video generation approach is not limited to existing motion capture sequences: 14 of these 35 categories are procedurally defined synthetic actions. In addition, each video is represented with 6 different data modalities, including RGB, optical flow and pixel-level semantic labels. These modalities are generated almost simultaneously using the Multiple Render Targets feature of modern GPUs. In order to leverage PHAV, we introduce a deep multi-task (i.e. that considers action classes from multiple datasets) representation learning architecture that is able to simultaneously learn from synthetic and real video datasets, even when their action categories differ. Our experiments on the UCF-101 and HMDB-51 benchmarks suggest that combining our large set of synthetic videos with small real-world datasets can boost recognition performance. Our approach also significantly outperforms video representations produced by fine-tuning state-of-the-art unsupervised generative models of videos.
△ Less
Submitted 12 October, 2019;
originally announced October 2019.
-
Intention Recognition of Pedestrians and Cyclists by 2D Pose Estimation
Authors:
Zhijie Fang,
Antonio M. López
Abstract:
Anticipating the intentions of vulnerable road users (VRUs) such as pedestrians and cyclists is critical for performing safe and comfortable driving maneuvers. This is the case for human driving and, thus, should be taken into account by systems providing any level of driving assistance, from advanced driver assistant systems (ADAS) to fully autonomous vehicles (AVs). In this paper, we show how th…
▽ More
Anticipating the intentions of vulnerable road users (VRUs) such as pedestrians and cyclists is critical for performing safe and comfortable driving maneuvers. This is the case for human driving and, thus, should be taken into account by systems providing any level of driving assistance, from advanced driver assistant systems (ADAS) to fully autonomous vehicles (AVs). In this paper, we show how the latest advances on monocular vision-based human pose estimation, i.e. those relying on deep Convolutional Neural Networks (CNNs), enable to recognize the intentions of such VRUs. In the case of cyclists, we assume that they follow traffic rules to indicate future maneuvers with arm signals. In the case of pedestrians, no indications can be assumed. Instead, we hypothesize that the walking pattern of a pedestrian allows to determine if he/she has the intention of crossing the road in the path of the ego-vehicle, so that the ego-vehicle must maneuver accordingly (e.g. slowing down or stop**). In this paper, we show how the same methodology can be used for recognizing pedestrians and cyclists' intentions. For pedestrians, we perform experiments on the JAAD dataset. For cyclists, we did not found an analogous dataset, thus, we created our own one by acquiring and annotating videos which we share with the research community. Overall, the proposed pipeline provides new state-of-the-art results on the intention recognition of VRUs.
△ Less
Submitted 9 October, 2019;
originally announced October 2019.
-
Slanted Stixels: A way to represent steep streets
Authors:
Daniel Hernandez-Juarez,
Lukas Schneider,
Pau Cebrian,
Antonio Espinosa,
David Vazquez,
Antonio M. Lopez,
Uwe Franke,
Marc Pollefeys,
Juan C. Moure
Abstract:
This work presents and evaluates a novel compact scene representation based on Stixels that infers geometric and semantic information. Our approach overcomes the previous rather restrictive geometric assumptions for Stixels by introducing a novel depth model to account for non-flat roads and slanted objects. Both semantic and depth cues are used jointly to infer the scene representation in a sound…
▽ More
This work presents and evaluates a novel compact scene representation based on Stixels that infers geometric and semantic information. Our approach overcomes the previous rather restrictive geometric assumptions for Stixels by introducing a novel depth model to account for non-flat roads and slanted objects. Both semantic and depth cues are used jointly to infer the scene representation in a sound global energy minimization formulation.
Furthermore, a novel approximation scheme is introduced in order to significantly reduce the computational complexity of the Stixel algorithm, and then achieve real-time computation capabilities. The idea is to first perform an over-segmentation of the image, discarding the unlikely Stixel cuts, and apply the algorithm only on the remaining Stixel cuts. This work presents a novel over-segmentation strategy based on a Fully Convolutional Network (FCN), which outperforms an approach based on using local extrema of the disparity map.
We evaluate the proposed methods in terms of semantic and geometric accuracy as well as run-time on four publicly available benchmark datasets. Our approach maintains accuracy on flat road scene datasets while improving substantially on a novel non-flat road dataset.
△ Less
Submitted 2 October, 2019;
originally announced October 2019.
-
Temporal Coherence for Active Learning in Videos
Authors:
Javad Zolfaghari Bengar,
Abel Gonzalez-Garcia,
Gabriel Villalonga,
Bogdan Raducanu,
Hamed H. Aghdam,
Mikhail Mozerov,
Antonio M. Lopez,
Joost van de Weijer
Abstract:
Autonomous driving systems require huge amounts of data to train. Manual annotation of this data is time-consuming and prohibitively expensive since it involves human resources. Therefore, active learning emerged as an alternative to ease this effort and to make data annotation more manageable. In this paper, we introduce a novel active learning approach for object detection in videos by exploitin…
▽ More
Autonomous driving systems require huge amounts of data to train. Manual annotation of this data is time-consuming and prohibitively expensive since it involves human resources. Therefore, active learning emerged as an alternative to ease this effort and to make data annotation more manageable. In this paper, we introduce a novel active learning approach for object detection in videos by exploiting temporal coherence. Our active learning criterion is based on the estimated number of errors in terms of false positives and false negatives. The detections obtained by the object detector are used to define the nodes of a graph and tracked forward and backward to temporally link the nodes. Minimizing an energy function defined on this graphical model provides estimates of both false positives and false negatives. Additionally, we introduce a synthetic video dataset, called SYNTHIA-AL, specially designed to evaluate active learning for video object detection in road scenes. Finally, we show that our approach outperforms active learning baselines tested on two datasets.
△ Less
Submitted 30 August, 2019;
originally announced August 2019.
-
Self-supervised Domain Adaptation for Computer Vision Tasks
Authors:
Jiaolong Xu,
Liang Xiao,
Antonio M. Lopez
Abstract:
Recent progress of self-supervised visual representation learning has achieved remarkable success on many challenging computer vision benchmarks. However, whether these techniques can be used for domain adaptation has not been explored. In this work, we propose a generic method for self-supervised domain adaptation, using object recognition and semantic segmentation of urban scenes as use cases. F…
▽ More
Recent progress of self-supervised visual representation learning has achieved remarkable success on many challenging computer vision benchmarks. However, whether these techniques can be used for domain adaptation has not been explored. In this work, we propose a generic method for self-supervised domain adaptation, using object recognition and semantic segmentation of urban scenes as use cases. Focusing on simple pretext/auxiliary tasks (e.g. image rotation prediction), we assess different learning strategies to improve domain adaptation effectiveness by self-supervision. Additionally, we propose two complementary strategies to further boost the domain adaptation accuracy on semantic segmentation within our method, consisting of prediction layer alignment and batch normalization calibration. The experimental results show adaptation levels comparable to most studied domain adaptation methods, thus, bringing self-supervision as a new alternative for reaching domain adaptation. The code is available at https://github.com/Jiaolong/self-supervised-da.
△ Less
Submitted 10 December, 2019; v1 submitted 25 July, 2019;
originally announced July 2019.
-
Multimodal End-to-End Autonomous Driving
Authors:
Yi Xiao,
Felipe Codevilla,
Akhil Gurram,
Onay Urfalioglu,
Antonio M. López
Abstract:
A crucial component of an autonomous vehicle (AV) is the artificial intelligence (AI) is able to drive towards a desired destination. Today, there are different paradigms addressing the development of AI drivers. On the one hand, we find modular pipelines, which divide the driving task into sub-tasks such as perception and maneuver planning and control. On the other hand, we find end-to-end drivin…
▽ More
A crucial component of an autonomous vehicle (AV) is the artificial intelligence (AI) is able to drive towards a desired destination. Today, there are different paradigms addressing the development of AI drivers. On the one hand, we find modular pipelines, which divide the driving task into sub-tasks such as perception and maneuver planning and control. On the other hand, we find end-to-end driving approaches that try to learn a direct map** from input raw sensor data to vehicle control signals. The later are relatively less studied, but are gaining popularity since they are less demanding in terms of sensor data annotation. This paper focuses on end-to-end autonomous driving. So far, most proposals relying on this paradigm assume RGB images as input sensor data. However, AVs will not be equipped only with cameras, but also with active sensors providing accurate depth information (e.g., LiDARs). Accordingly, this paper analyses whether combining RGB and depth modalities, i.e. using RGBD data, produces better end-to-end AI drivers than relying on a single modality. We consider multimodality based on early, mid and late fusion schemes, both in multisensory and single-sensor (monocular depth estimation) settings. Using the CARLA simulator and conditional imitation learning (CIL), we show how, indeed, early fusion multimodality outperforms single-modality.
△ Less
Submitted 25 October, 2020; v1 submitted 7 June, 2019;
originally announced June 2019.
-
Exploring the Limitations of Behavior Cloning for Autonomous Driving
Authors:
Felipe Codevilla,
Eder Santana,
Antonio M. López,
Adrien Gaidon
Abstract:
Driving requires reacting to a wide variety of complex environment conditions and agent behaviors. Explicitly modeling each possible scenario is unrealistic. In contrast, imitation learning can, in theory, leverage data from large fleets of human-driven cars. Behavior cloning in particular has been successfully used to learn simple visuomotor policies end-to-end, but scaling to the full spectrum o…
▽ More
Driving requires reacting to a wide variety of complex environment conditions and agent behaviors. Explicitly modeling each possible scenario is unrealistic. In contrast, imitation learning can, in theory, leverage data from large fleets of human-driven cars. Behavior cloning in particular has been successfully used to learn simple visuomotor policies end-to-end, but scaling to the full spectrum of driving behaviors remains an unsolved problem. In this paper, we propose a new benchmark to experimentally investigate the scalability and limitations of behavior cloning. We show that behavior cloning leads to state-of-the-art results, including in unseen environments, executing complex lateral and longitudinal maneuvers without these reactions being explicitly programmed. However, we confirm well-known limitations (due to dataset bias and overfitting), new generalization issues (due to dynamic objects and the lack of a causal model), and training instability requiring further research before behavior cloning can graduate to real-world driving. The code of the studied behavior cloning approaches can be found at https://github.com/felipecode/coiltraine .
△ Less
Submitted 18 April, 2019;
originally announced April 2019.
-
A Search for Neutrinoless Double-Beta Decay in $^{76}$Ge with 26 kg-yr of Exposure from the MAJORANA DEMONSTRATOR
Authors:
S. I. Alvis,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
V. Basu,
F. E. Bertrand,
B. Bos,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
C. Cuesta,
J. A. Detwiler,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
T. Gilliss,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe
, et al. (39 additional authors not shown)
Abstract:
The MAJORANA Collaboration is operating an array of high purity Ge detectors to search for the neutrinoless double-beta decay of $^{76}$Ge. The MAJORANA DEMONSTRATOR consists of 44.1 kg of Ge detectors (29.7 kg enriched to 88% in $^{76}$Ge) split between two modules constructed from ultra-clean materials. Both modules are contained in a low-background shield at the Sanford Underground Research Fac…
▽ More
The MAJORANA Collaboration is operating an array of high purity Ge detectors to search for the neutrinoless double-beta decay of $^{76}$Ge. The MAJORANA DEMONSTRATOR consists of 44.1 kg of Ge detectors (29.7 kg enriched to 88% in $^{76}$Ge) split between two modules constructed from ultra-clean materials. Both modules are contained in a low-background shield at the Sanford Underground Research Facility in Lead, South Dakota. We present updated results on the search for neutrinoless double-beta decay in $^{76}$Ge with $26.0\pm0.5$ kg-yr of enriched exposure. With the DEMONSTRATOR's unprecedented energy resolution of 2.53 keV FWHM at $Q_{ββ}$, we observe one event in the region of interest with 0.65 events expected from the estimated background, resulting in a lower limit on the $^{76}$Ge neutrinoless double-beta decay half-life of $2.7\times10^{25}$ yr (90% CL) with a median sensitivity of $4.8\times10^{25}$ yr (90% CL). Depending on the matrix elements used, a 90% CL upper limit on the effective Majorana neutrino mass in the range of 200-433 meV is obtained. The measured background in the low-background configurations is $11.9\pm2.0$ counts/(FWHM t yr).
△ Less
Submitted 6 February, 2019;
originally announced February 2019.
-
Multi-site event discrimination for the MAJORANA DEMONSTRATOR
Authors:
S. I. Alvis,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
B. Bos,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
C. Cuesta,
J. A. Detwiler,
H. Ejiri,
S. R. Elliott,
T. Gilliss,
G. K. Giovanetti,
M. P. Green,
J. Gruszko,
I. S. Guinn,
V. E. Guiseppe,
C. R. Haufe,
R. J. Hegedus,
L. Hehn
, et al. (38 additional authors not shown)
Abstract:
The MAJORANA DEMONSTRATOR is searching for neutrinoless double-beta decay in 76Ge using arrays of point-contact germanium detectors operating at the Sanford Underground Research Facility. Background results in the neutrinoless double-beta decay region of interest from data taken during construction, commissioning, and the start of full operations have been recently published. A pulse shape analysi…
▽ More
The MAJORANA DEMONSTRATOR is searching for neutrinoless double-beta decay in 76Ge using arrays of point-contact germanium detectors operating at the Sanford Underground Research Facility. Background results in the neutrinoless double-beta decay region of interest from data taken during construction, commissioning, and the start of full operations have been recently published. A pulse shape analysis cut applied to achieve this result, named AvsE, is described in this paper. This cut is developed to remove events whose waveforms are typical of multi-site energy deposits while retaining (90 +/- 3.5)% of single-site events. This pulse shape discrimination is based on the relationship between the maximum current and energy, and tuned using 228Th calibration source data. The efficiency uncertainty accounts for variation across detectors, energy, and time, as well as for the position distribution difference between calibration and $0νββ$ events, established using simulations.
△ Less
Submitted 16 January, 2019;
originally announced January 2019.
-
Recent results from the MAJORANA DEMONSTRATOR
Authors:
J. Myslik,
S. I. Alvis,
I. J. Arnquist,
F. T. Avignone III,
A. S. Barabash,
C. J. Barton,
F. E. Bertrand,
T. Bode,
B. Bos,
V. Brudanin,
M. Busch,
M. Buuck,
T. S. Caldwell,
Y-D. Chan,
C. D. Christofferson,
P. -H. Chu,
C. Cuesta,
J. A. Detwiler,
C. Dunagan,
Yu. Efremenko,
H. Ejiri,
S. R. Elliott,
T. Gilliss,
G. K. Giovanetti,
M. P. Green
, et al. (43 additional authors not shown)
Abstract:
The MAJORANA DEMONSTRATOR is an experiment constructed to search for neutrinoless double-beta decay in $^{76}$Ge and to demonstrate the feasibility to deploy a large-scale experiment in a phased and modular fashion. It consists of two modules of natural and $^{76}$Ge-enriched germanium detectors totalling 44.1 kg, operating at the 4850' level of the Sanford Underground Research Facility in Lead, S…
▽ More
The MAJORANA DEMONSTRATOR is an experiment constructed to search for neutrinoless double-beta decay in $^{76}$Ge and to demonstrate the feasibility to deploy a large-scale experiment in a phased and modular fashion. It consists of two modules of natural and $^{76}$Ge-enriched germanium detectors totalling 44.1 kg, operating at the 4850' level of the Sanford Underground Research Facility in Lead, South Dakota, USA. Commissioning of the experiment began in June 2015, followed by data production with the full detector array in August 2016. The ultra-low background and record energy resolution achieved by the MAJORANA DEMONSTRATOR enable a sensitive neutrinoless double-beta decay search, as well as additional searches for physics beyond the Standard Model. I will discuss the design elements that enable these searches, along with the latest results, focusing on the neutrinoless double-beta decay search. I will also discuss the current status and the future plans of the MAJORANA DEMONSTRATOR, as well as the plans for a future tonne-scale $^{76}$Ge experiment.
△ Less
Submitted 19 December, 2018;
originally announced December 2018.