-
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
Authors:
Julian Straub,
Daniel DeTone,
Tianwei Shen,
Nan Yang,
Chris Sweeney,
Richard Newcombe
Abstract:
The advent of wearable computers enables a new source of context for AI that is embedded in egocentric sensor data. This new egocentric data comes equipped with fine-grained 3D location information and thus presents the opportunity for a novel class of spatial foundation models that are rooted in 3D space. To measure progress on what we term Egocentric Foundation Models (EFMs) we establish EFM3D,…
▽ More
The advent of wearable computers enables a new source of context for AI that is embedded in egocentric sensor data. This new egocentric data comes equipped with fine-grained 3D location information and thus presents the opportunity for a novel class of spatial foundation models that are rooted in 3D space. To measure progress on what we term Egocentric Foundation Models (EFMs) we establish EFM3D, a benchmark with two core 3D egocentric perception tasks. EFM3D is the first benchmark for 3D object detection and surface regression on high quality annotated egocentric data of Project Aria. We propose Egocentric Voxel Lifting (EVL), a baseline for 3D EFMs. EVL leverages all available egocentric modalities and inherits foundational capabilities from 2D foundation models. This model, trained on a large simulated dataset, outperforms existing methods on the EFM3D benchmark.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
STT: Stateful Tracking with Transformers for Autonomous Driving
Authors:
Longlong **g,
Ruichi Yu,
Xu Chen,
Zhengli Zhao,
Shiwei Sheng,
Colin Graber,
Qi Chen,
Qinru Li,
Shangxuan Wu,
Han Deng,
Sang** Lee,
Chris Sweeney,
Qiurui He,
Wei-Chih Hung,
Tong He,
Xingyi Zhou,
Farshid Moussavi,
Zijian Guo,
Yin Zhou,
Mingxing Tan,
Weilong Yang,
Congcong Li
Abstract:
Tracking objects in three-dimensional space is critical for autonomous driving. To ensure safety while driving, the tracker must be able to reliably track objects across frames and accurately estimate their states such as velocity and acceleration in the present. Existing works frequently focus on the association task while either neglecting the model performance on state estimation or deploying c…
▽ More
Tracking objects in three-dimensional space is critical for autonomous driving. To ensure safety while driving, the tracker must be able to reliably track objects across frames and accurately estimate their states such as velocity and acceleration in the present. Existing works frequently focus on the association task while either neglecting the model performance on state estimation or deploying complex heuristics to predict the states. In this paper, we propose STT, a Stateful Tracking model built with Transformers, that can consistently track objects in the scenes while also predicting their states accurately. STT consumes rich appearance, geometry, and motion signals through long term history of detections and is jointly optimized for both data association and state estimation tasks. Since the standard tracking metrics like MOTA and MOTP do not capture the combined performance of the two tasks in the wider spectrum of object states, we extend them with new metrics called S-MOTA and MOTPS that address this limitation. STT achieves competitive real-time performance on the Waymo Open Dataset.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
EgoLifter: Open-world 3D Segmentation for Egocentric Perception
Authors:
Qiao Gu,
Zhaoyang Lv,
Duncan Frost,
Simon Green,
Julian Straub,
Chris Sweeney
Abstract:
In this paper we present EgoLifter, a novel system that can automatically segment scenes captured from egocentric sensors into a complete decomposition of individual 3D objects. The system is specifically designed for egocentric data where scenes contain hundreds of objects captured from natural (non-scanning) motion. EgoLifter adopts 3D Gaussians as the underlying representation of 3D scenes and…
▽ More
In this paper we present EgoLifter, a novel system that can automatically segment scenes captured from egocentric sensors into a complete decomposition of individual 3D objects. The system is specifically designed for egocentric data where scenes contain hundreds of objects captured from natural (non-scanning) motion. EgoLifter adopts 3D Gaussians as the underlying representation of 3D scenes and objects and uses segmentation masks from the Segment Anything Model (SAM) as weak supervision to learn flexible and promptable definitions of object instances free of any specific object taxonomy. To handle the challenge of dynamic objects in ego-centric videos, we design a transient prediction module that learns to filter out dynamic objects in the 3D reconstruction. The result is a fully automatic pipeline that is able to reconstruct 3D object instances as collections of 3D Gaussians that collectively compose the entire scene. We created a new benchmark on the Aria Digital Twin dataset that quantitatively demonstrates its state-of-the-art performance in open-world 3D segmentation from natural egocentric input. We run EgoLifter on various egocentric activity datasets which shows the promise of the method for 3D egocentric perception at scale.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Search for CP-violating Neutrino Non-Standard Interactions with the NOvA Experiment
Authors:
NOvA Collaboration,
M. A. Acero,
B. Acharya,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
N. Balashov,
P. Baldi,
B. A. Bambah,
A. Bat,
K. Bays,
R. Bernstein,
T. J. C. Bezerra,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles,
B. Brahma
, et al. (182 additional authors not shown)
Abstract:
This Letter reports a search for charge-parity (CP) symmetry violating non-standard interactions (NSI) of neutrinos with matter using the NOvA Experiment, and examines their effects on the determination of the standard oscillation parameters. Data from $ν_μ(\barν_μ)\rightarrowν_μ(\barν_μ)$ and $ν_μ(\barν_μ)\rightarrowν_{e}(\barν_{e})$ oscillation channels are used to measure the effect of the NSI…
▽ More
This Letter reports a search for charge-parity (CP) symmetry violating non-standard interactions (NSI) of neutrinos with matter using the NOvA Experiment, and examines their effects on the determination of the standard oscillation parameters. Data from $ν_μ(\barν_μ)\rightarrowν_μ(\barν_μ)$ and $ν_μ(\barν_μ)\rightarrowν_{e}(\barν_{e})$ oscillation channels are used to measure the effect of the NSI parameters $\varepsilon_{eμ}$ and $\varepsilon_{eτ}$. With 90% C.L. the magnitudes of the NSI couplings are constrained to be $|\varepsilon_{eμ}| \, \lesssim 0.3$ and $|\varepsilon_{eτ}| \, \lesssim 0.4$. A degeneracy at $|\varepsilon_{eτ}| \, \approx 1.8$ is reported, and we observe that the presence of NSI limits sensitivity to the standard CP phase $δ_{\tiny\text{CP}}$.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Aria Everyday Activities Dataset
Authors:
Zhaoyang Lv,
Nicholas Charron,
Pierre Moulon,
Alexander Gamino,
Cheng Peng,
Chris Sweeney,
Edward Miller,
Huixuan Tang,
Jeff Meissner,
**g Dong,
Kiran Somasundaram,
Luis Pesqueira,
Mark Schwesinger,
Omkar Parkhi,
Qiao Gu,
Renzo De Nardi,
Shangyi Cheng,
Steve Saarinen,
Vijay Baiyya,
Yuyang Zou,
Richard Newcombe,
Jakob Julian Engel,
Xiaqing Pan,
Carl Ren
Abstract:
We present Aria Everyday Activities (AEA) Dataset, an egocentric multimodal open dataset recorded using Project Aria glasses. AEA contains 143 daily activity sequences recorded by multiple wearers in five geographically diverse indoor locations. Each of the recording contains multimodal sensor data recorded through the Project Aria glasses. In addition, AEA provides machine perception data includi…
▽ More
We present Aria Everyday Activities (AEA) Dataset, an egocentric multimodal open dataset recorded using Project Aria glasses. AEA contains 143 daily activity sequences recorded by multiple wearers in five geographically diverse indoor locations. Each of the recording contains multimodal sensor data recorded through the Project Aria glasses. In addition, AEA provides machine perception data including high frequency globally aligned 3D trajectories, scene point cloud, per-frame 3D eye gaze vector and time aligned speech transcription. In this paper, we demonstrate a few exemplar research applications enabled by this dataset, including neural scene reconstruction and prompted segmentation. AEA is an open source dataset that can be downloaded from https://www.projectaria.com/datasets/aea/. We are also providing open-source implementations and examples of how to use the dataset in Project Aria Tools https://github.com/facebookresearch/projectaria_tools.
△ Less
Submitted 21 February, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
Expanding neutrino oscillation parameter measurements in NOvA using a Bayesian approach
Authors:
NOvA Collaboration,
M. A. Acero,
B. Acharya,
P. Adamson,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
N. Balashov,
P. Baldi,
B. A. Bambah,
A. Bat,
K. Bays,
R. Bernstein,
T. J. C. Bezerra,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles,
B. Brahma,
C. Bromberg
, et al. (174 additional authors not shown)
Abstract:
NOvA is a long-baseline neutrino oscillation experiment that measures oscillations in charged-current $ν_μ \rightarrow ν_μ$ (disappearance) and $ν_μ \rightarrow ν_{e}$ (appearance) channels, and their antineutrino counterparts, using neutrinos of energies around 2 GeV over a distance of 810 km. In this work we reanalyze the dataset first examined in our previous paper [Phys. Rev. D 106, 032004 (20…
▽ More
NOvA is a long-baseline neutrino oscillation experiment that measures oscillations in charged-current $ν_μ \rightarrow ν_μ$ (disappearance) and $ν_μ \rightarrow ν_{e}$ (appearance) channels, and their antineutrino counterparts, using neutrinos of energies around 2 GeV over a distance of 810 km. In this work we reanalyze the dataset first examined in our previous paper [Phys. Rev. D 106, 032004 (2022)] using an alternative statistical approach based on Bayesian Markov Chain Monte Carlo. We measure oscillation parameters consistent with the previous results. We also extend our inferences to include the first NOvA measurements of the reactor mixing angle $θ_{13}$ and the Jarlskog invariant. We use these results to quantify the strength of our inferences about CP violation, as well as to examine the effects of constraints from short-baseline measurements of $θ_{13}$ using antineutrinos from nuclear reactors when making NOvA measurements of $θ_{23}$. Our long-baseline measurement of $θ_{13}$ is also shown to be consistent with the reactor measurements, supporting the general applicability and robustness of the PMNS framework for neutrino oscillations.
△ Less
Submitted 27 May, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Project Aria: A New Tool for Egocentric Multi-Modal AI Research
Authors:
Jakob Engel,
Kiran Somasundaram,
Michael Goesele,
Albert Sun,
Alexander Gamino,
Andrew Turner,
Arjang Talattof,
Arnie Yuan,
Bilal Souti,
Brighid Meredith,
Cheng Peng,
Chris Sweeney,
Cole Wilson,
Dan Barnes,
Daniel DeTone,
David Caruso,
Derek Valleroy,
Dinesh Ginjupalli,
Duncan Frost,
Edward Miller,
Elias Mueggler,
Evgeniy Oleinik,
Fan Zhang,
Guruprasad Somasundaram,
Gustavo Solaira
, et al. (49 additional authors not shown)
Abstract:
Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, mul…
▽ More
Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, multi-modal data recording and streaming device with the goal to foster and accelerate research in this area. In this paper, we describe the Aria device hardware including its sensor configuration and the corresponding software tools that enable recording and processing of such data.
△ Less
Submitted 1 October, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
The Profiled Feldman-Cousins technique for confidence interval construction in the presence of nuisance parameters
Authors:
M. A. Acero,
B. Acharya,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
A. Bat,
K. Bays,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles
, et al. (196 additional authors not shown)
Abstract:
Measuring observables to constrain models using maximum-likelihood estimation is fundamental to many physics experiments. The Profiled Feldman-Cousins method described here is a potential solution to common challenges faced in constructing accurate confidence intervals: small datasets, bounded parameters, and the need to properly handle nuisance parameters. This method achieves more accurate frequ…
▽ More
Measuring observables to constrain models using maximum-likelihood estimation is fundamental to many physics experiments. The Profiled Feldman-Cousins method described here is a potential solution to common challenges faced in constructing accurate confidence intervals: small datasets, bounded parameters, and the need to properly handle nuisance parameters. This method achieves more accurate frequentist coverage than other methods in use, and is generally applicable to the problem of parameter estimation in neutrino oscillations and similar measurements. We describe an implementation of this method in the context of the NOvA experiment.
△ Less
Submitted 1 August, 2022; v1 submitted 28 July, 2022;
originally announced July 2022.
-
Measurement of the $ν_e-$Nucleus Charged-Current Double-Differential Cross Section at $\left< E_ν \right> = $ 2.4 GeV using NOvA
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
A. C. Booth,
R. Bowles,
B. Brahma,
C. Bromberg
, et al. (190 additional authors not shown)
Abstract:
The inclusive electron neutrino charged-current cross section is measured in the NOvA near detector using $8.02\times10^{20}$ protons-on-target (POT) in the NuMI beam. The sample of GeV electron neutrino interactions is the largest analyzed to date and is limited by $\simeq$ 17\% systematic rather than the $\simeq$ 7.4\% statistical uncertainties. The double-differential cross section in final-sta…
▽ More
The inclusive electron neutrino charged-current cross section is measured in the NOvA near detector using $8.02\times10^{20}$ protons-on-target (POT) in the NuMI beam. The sample of GeV electron neutrino interactions is the largest analyzed to date and is limited by $\simeq$ 17\% systematic rather than the $\simeq$ 7.4\% statistical uncertainties. The double-differential cross section in final-state electron energy and angle is presented for the first time, together with the single-differential dependence on $Q^{2}$ (squared four-momentum transfer) and energy, in the range 1 GeV $ \leq E_ν < $6 GeV. Detailed comparisons are made to the predictions of the GENIE, GiBUU, NEUT, and NuWro neutrino event generators. The data do not strongly favor a model over the others consistently across all three cross sections measured, though some models have especially good or poor agreement in the single differential cross section vs. $Q^{2}$.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation
Authors:
Gil Avraham,
Julian Straub,
Tianwei Shen,
Tsun-Yi Yang,
Hugo Germain,
Chris Sweeney,
Vasileios Balntas,
David Novotny,
Daniel DeTone,
Richard Newcombe
Abstract:
This paper presents a framework that combines traditional keypoint-based camera pose optimization with an invertible neural rendering mechanism. Our proposed 3D scene representation, Nerfels, is locally dense yet globally sparse. As opposed to existing invertible neural rendering systems which overfit a model to the entire scene, we adopt a feature-driven approach for representing scene-agnostic,…
▽ More
This paper presents a framework that combines traditional keypoint-based camera pose optimization with an invertible neural rendering mechanism. Our proposed 3D scene representation, Nerfels, is locally dense yet globally sparse. As opposed to existing invertible neural rendering systems which overfit a model to the entire scene, we adopt a feature-driven approach for representing scene-agnostic, local 3D patches with renderable codes. By modelling a scene only where local features are detected, our framework effectively generalizes to unseen local regions in the scene via an optimizable code conditioning mechanism in the neural renderer, all while maintaining the low memory footprint of a sparse 3D map representation. Our model can be incorporated to existing state-of-the-art hand-crafted and learned local feature pose estimators, yielding improved performance when evaluating on ScanNet for wide camera baseline scenarios.
△ Less
Submitted 4 June, 2022;
originally announced June 2022.
-
Self-supervised Neural Articulated Shape and Appearance Models
Authors:
Fangyin Wei,
Rohan Chabra,
Lingni Ma,
Christoph Lassner,
Michael Zollhöfer,
Szymon Rusinkiewicz,
Chris Sweeney,
Richard Newcombe,
Mira Slavcheva
Abstract:
Learning geometry, motion, and appearance priors of object classes is important for the solution of a large variety of computer vision problems. While the majority of approaches has focused on static objects, dynamic objects, especially with controllable articulation, are less explored. We propose a novel approach for learning a representation of the geometry, appearance, and motion of a class of…
▽ More
Learning geometry, motion, and appearance priors of object classes is important for the solution of a large variety of computer vision problems. While the majority of approaches has focused on static objects, dynamic objects, especially with controllable articulation, are less explored. We propose a novel approach for learning a representation of the geometry, appearance, and motion of a class of articulated objects given only a set of color images as input. In a self-supervised manner, our novel representation learns shape, appearance, and articulation codes that enable independent control of these semantic dimensions. Our model is trained end-to-end without requiring any articulation annotations. Experiments show that our approach performs well for different joint types, such as revolute and prismatic joints, as well as different combinations of these joints. Compared to state of the art that uses direct 3D supervision and does not output appearance, we recover more faithful geometry and appearance from 2D observations only. In addition, our representation enables a large variety of applications, such as few-shot reconstruction, the generation of novel articulations, and novel view-synthesis.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
LISA: Learning Implicit Shape and Appearance of Hands
Authors:
Enric Corona,
Tomas Hodan,
Minh Vo,
Francesc Moreno-Noguer,
Chris Sweeney,
Richard Newcombe,
Lingni Ma
Abstract:
This paper proposes a do-it-all neural model of human hands, named LISA. The model can capture accurate hand shape and appearance, generalize to arbitrary hand subjects, provide dense surface correspondences, be reconstructed from images in the wild and easily animated. We train LISA by minimizing the shape and appearance losses on a large set of multi-view RGB image sequences annotated with coars…
▽ More
This paper proposes a do-it-all neural model of human hands, named LISA. The model can capture accurate hand shape and appearance, generalize to arbitrary hand subjects, provide dense surface correspondences, be reconstructed from images in the wild and easily animated. We train LISA by minimizing the shape and appearance losses on a large set of multi-view RGB image sequences annotated with coarse 3D poses of the hand skeleton. For a 3D point in the hand local coordinate, our model predicts the color and the signed distance with respect to each hand bone independently, and then combines the per-bone predictions using predicted skinning weights. The shape, color and pose representations are disentangled by design, allowing to estimate or animate only selected parameters. We experimentally demonstrate that LISA can accurately reconstruct a dynamic hand from monocular or multi-view sequences, achieving a noticeably higher quality of reconstructed hand shapes compared to baseline approaches. Project page: https://www.iri.upc.edu/people/ecorona/lisa/.
△ Less
Submitted 4 April, 2022;
originally announced April 2022.
-
Repairing Group-Level Errors for DNNs Using Weighted Regularization
Authors:
Ziyuan Zhong,
Yuchi Tian,
Conor J. Sweeney,
Vicente Ordonez,
Baishakhi Ray
Abstract:
Deep Neural Networks (DNNs) have been widely used in software making decisions impacting people's lives. However, they have been found to exhibit severe erroneous behaviors that may lead to unfortunate outcomes. Previous work shows that such misbehaviors often occur due to class property violations rather than errors on a single image. Although methods for detecting such errors have been proposed,…
▽ More
Deep Neural Networks (DNNs) have been widely used in software making decisions impacting people's lives. However, they have been found to exhibit severe erroneous behaviors that may lead to unfortunate outcomes. Previous work shows that such misbehaviors often occur due to class property violations rather than errors on a single image. Although methods for detecting such errors have been proposed, fixing them has not been studied so far. Here, we propose a generic method called Weighted Regularization (WR) consisting of five concrete methods targeting the error-producing classes to fix the DNNs. In particular, it can repair confusion error and bias error of DNN models for both single-label and multi-label image classifications. A confusion error happens when a given DNN model tends to confuse between two classes. Each method in WR assigns more weights at a stage of DNN retraining or inference to mitigate the confusion between target pair. A bias error can be fixed similarly. We evaluate and compare the proposed methods along with baselines on six widely-used datasets and architecture combinations. The results suggest that WR methods have different trade-offs but under each setting at least one WR method can greatly reduce confusion/bias errors at a very limited cost of the overall performance.
△ Less
Submitted 4 April, 2022; v1 submitted 24 March, 2022;
originally announced March 2022.
-
NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning
Authors:
Tony Ng,
Hyo ** Kim,
Vincent Lee,
Daniel DeTone,
Tsun-Yi Yang,
Tianwei Shen,
Eddy Ilg,
Vassileios Balntas,
Krystian Mikolajczyk,
Chris Sweeney
Abstract:
In the light of recent analyses on privacy-concerning scene revelation from visual descriptors, we develop descriptors that conceal the input image content. In particular, we propose an adversarial learning framework for training visual descriptors that prevent image reconstruction, while maintaining the matching accuracy. We let a feature encoding network and image reconstruction network compete…
▽ More
In the light of recent analyses on privacy-concerning scene revelation from visual descriptors, we develop descriptors that conceal the input image content. In particular, we propose an adversarial learning framework for training visual descriptors that prevent image reconstruction, while maintaining the matching accuracy. We let a feature encoding network and image reconstruction network compete with each other, such that the feature encoder tries to impede the image reconstruction with its generated descriptors, while the reconstructor tries to recover the input image from the descriptors. The experimental results demonstrate that the visual descriptors obtained with our method significantly deteriorate the image reconstruction quality with minimal impact on correspondence matching and camera localization performance.
△ Less
Submitted 29 March, 2022; v1 submitted 23 December, 2021;
originally announced December 2021.
-
Measurement of the Double-Differential Muon-neutrino Charged-Current Inclusive Cross Section in the NOvA Near Detector
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
B. Behera,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles
, et al. (181 additional authors not shown)
Abstract:
We report cross-section measurements of the final-state muon kinematics for \numu charged-current interactions in the NOvA near detector using an accumulated 8.09$\times10^{20}$ protons-on-target (POT) in the NuMI beam. We present the results as a double-differential cross section in the observed outgoing muon energy and angle, as well as single-differential cross sections in the derived neutrino…
▽ More
We report cross-section measurements of the final-state muon kinematics for \numu charged-current interactions in the NOvA near detector using an accumulated 8.09$\times10^{20}$ protons-on-target (POT) in the NuMI beam. We present the results as a double-differential cross section in the observed outgoing muon energy and angle, as well as single-differential cross sections in the derived neutrino energy, $E_ν$, and square of the four-momentum transfer, $Q^2$. We compare the results to inclusive cross-section predictions from various neutrino event generators via $χ^2$ calculations using a covariance matrix that accounts for bin-to-bin correlations of systematic uncertainties. These comparisons show a clear discrepancy between the data and each of the tested predictions at forward muon angle and low $Q^2$, indicating a missing suppression of the cross section in current neutrino-nucleus scattering models.
△ Less
Submitted 18 July, 2023; v1 submitted 24 September, 2021;
originally announced September 2021.
-
ODAM: Object Detection, Association, and Map** using Posed RGB Video
Authors:
Kejie Li,
Daniel DeTone,
Steven Chen,
Minh Vo,
Ian Reid,
Hamid Rezatofighi,
Chris Sweeney,
Julian Straub,
Richard Newcombe
Abstract:
Localizing objects and estimating their extent in 3D is an important step towards high-level 3D scene understanding, which has many applications in Augmented Reality and Robotics. We present ODAM, a system for 3D Object Detection, Association, and Map** using posed RGB videos. The proposed system relies on a deep learning front-end to detect 3D objects from a given RGB frame and associate them t…
▽ More
Localizing objects and estimating their extent in 3D is an important step towards high-level 3D scene understanding, which has many applications in Augmented Reality and Robotics. We present ODAM, a system for 3D Object Detection, Association, and Map** using posed RGB videos. The proposed system relies on a deep learning front-end to detect 3D objects from a given RGB frame and associate them to a global object-based map using a graph neural network (GNN). Based on these frame-to-model associations, our back-end optimizes object bounding volumes, represented as super-quadrics, under multi-view geometry constraints and the object scale prior. We validate the proposed system on ScanNet where we show a significant improvement over existing RGB-only methods.
△ Less
Submitted 23 August, 2021;
originally announced August 2021.
-
An Improved Measurement of Neutrino Oscillation Parameters by the NOvA Experiment
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
D. Bhattarai,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg
, et al. (180 additional authors not shown)
Abstract:
We present new $ν_μ\rightarrowν_e$, $ν_μ\rightarrowν_μ$, $\overlineν_μ\rightarrow\overlineν_e$, and $\overlineν_μ\rightarrow\overlineν_μ$ oscillation measurements by the NOvA experiment, with a 50% increase in neutrino-mode beam exposure over the previously reported results. The additional data, combined with previously published neutrino and antineutrino data, are all analyzed using improved tech…
▽ More
We present new $ν_μ\rightarrowν_e$, $ν_μ\rightarrowν_μ$, $\overlineν_μ\rightarrow\overlineν_e$, and $\overlineν_μ\rightarrow\overlineν_μ$ oscillation measurements by the NOvA experiment, with a 50% increase in neutrino-mode beam exposure over the previously reported results. The additional data, combined with previously published neutrino and antineutrino data, are all analyzed using improved techniques and simulations. A joint fit to the $ν_e$, $ν_μ$, $\overlineν_e$, and $\overlineν_μ$ candidate samples within the 3-flavor neutrino oscillation framework continues to yield a best-fit point in the normal mass ordering and the upper octant of the $θ_{23}$ mixing angle, with $Δm^{2}_{32} = (2.41\pm0.07)\times 10^{-3}$ eV$^2$ and $\sin^2θ_{23} = 0.57^{+0.03}_{-0.04}$. The data disfavor combinations of oscillation parameters that give rise to a large asymmetry in the rates of $ν_e$ and $\overlineν_e$ appearance. This includes values of the CP-violating phase in the vicinity of $δ_\text{CP} = π/2$ which are excluded by $>3σ$ for the inverted mass ordering, and values around $δ_\text{CP} = 3π/2$ in the normal ordering which are disfavored at 2$σ$ confidence.
△ Less
Submitted 8 August, 2022; v1 submitted 18 August, 2021;
originally announced August 2021.
-
Extended search for supernova-like neutrinos in NOvA coincident with LIGO/Virgo detections
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg,
N. Buchanan
, et al. (178 additional authors not shown)
Abstract:
A search is performed for supernova-like neutrino interactions coincident with 76 gravitational wave events detected by the LIGO/Virgo Collaboration. For 40 of these events, full readout of the time around the gravitational wave is available from the NOvA Far Detector. For these events, we set limits on the fluence of the sum of all neutrino flavors of $F < 7(4)\times 10^{10}\mathrm{cm}^{-2}$ at 9…
▽ More
A search is performed for supernova-like neutrino interactions coincident with 76 gravitational wave events detected by the LIGO/Virgo Collaboration. For 40 of these events, full readout of the time around the gravitational wave is available from the NOvA Far Detector. For these events, we set limits on the fluence of the sum of all neutrino flavors of $F < 7(4)\times 10^{10}\mathrm{cm}^{-2}$ at 90% C.L. assuming energy and time distributions corresponding to the Garching supernova models with masses 9.6(27)$\mathrm{M}_\odot$. Under the hypothesis that any given gravitational wave event was caused by a supernova, this corresponds to a distance of $r > 29(50)$kpc at 90% C.L. Weaker limits are set for other gravitational wave events with partial Far Detector data and/or Near Detector data.
△ Less
Submitted 23 August, 2021; v1 submitted 10 June, 2021;
originally announced June 2021.
-
Search for active-sterile antineutrino mixing using neutral-current interactions with the NOvA experiment
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg,
N. Buchanan
, et al. (174 additional authors not shown)
Abstract:
This Letter reports results from the first long-baseline search for sterile antineutrinos mixing in an accelerator-based antineutrino-dominated beam. The rate of neutral-current interactions in the two NOvA detectors, at distances of 1 km and 810 km from the beam source, is analyzed using an exposure of $12.51\times10^{20}$ protons-on-target from the NuMI beam at Fermilab running in antineutrino m…
▽ More
This Letter reports results from the first long-baseline search for sterile antineutrinos mixing in an accelerator-based antineutrino-dominated beam. The rate of neutral-current interactions in the two NOvA detectors, at distances of 1 km and 810 km from the beam source, is analyzed using an exposure of $12.51\times10^{20}$ protons-on-target from the NuMI beam at Fermilab running in antineutrino mode. A total of $121$ of neutral-current candidates are observed at the Far Detector, compared to a prediction of $122\pm11$(stat.)$\pm15$(syst.) assuming mixing between three active flavors. No evidence for $\barν_μ\rightarrow\barν_{s}$ oscillation is observed. Interpreting this result within a 3+1 model, constraints are placed on the mixing angles $θ_{24} < 25^{\circ}$ and $θ_{34} < 32^{\circ}$ at the 90% C.L. for $0.05$eV$^{2} \leq Δm^{2}_{41} \leq 0.5$eV$^{2}$, the range of mass splittings that produces no significant oscillations at the Near Detector. These are the first 3+1 confidence limits set using long-baseline accelerator antineutrinos.
△ Less
Submitted 30 September, 2021; v1 submitted 8 June, 2021;
originally announced June 2021.
-
Seasonal Variation of Multiple-Muon Cosmic Ray Air Showers Observed in the NOvA Detector on the Surface
Authors:
M. A. Acero,
P. Adamson,
L. Aliaga,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair,
A. C. Booth,
R. Bowles,
C. Bromberg,
N. Buchanan
, et al. (172 additional authors not shown)
Abstract:
We report the rate of cosmic ray air showers with multiplicities exceeding 15 muon tracks recorded in the NOvA Far Detector between May 2016 and May 2018. The detector is located on the surface under an overburden of 3.6 meters water equivalent. We observe a seasonal dependence in the rate of multiple-muon showers, which varies in magnitude with multiplicity and zenith angle. During this period, t…
▽ More
We report the rate of cosmic ray air showers with multiplicities exceeding 15 muon tracks recorded in the NOvA Far Detector between May 2016 and May 2018. The detector is located on the surface under an overburden of 3.6 meters water equivalent. We observe a seasonal dependence in the rate of multiple-muon showers, which varies in magnitude with multiplicity and zenith angle. During this period, the effective atmospheric temperature and surface pressure ranged between 210 K to 230 K and 940mbar to 990mbar, respectively; the shower rates are anti-correlated with the variation in the effective temperature. The variations are about 30% larger for the highest multiplicities than the lowest multiplicities and 20% larger for showers near the horizon than vertical showers.
△ Less
Submitted 13 July, 2021; v1 submitted 9 May, 2021;
originally announced May 2021.
-
Scalable Scene Flow from Point Clouds in the Real World
Authors:
Philipp Jund,
Chris Sweeney,
Nichola Abdo,
Zhifeng Chen,
Jonathon Shlens
Abstract:
Autonomous vehicles operate in highly dynamic environments necessitating an accurate assessment of which aspects of a scene are moving and where they are moving to. A popular approach to 3D motion estimation, termed scene flow, is to employ 3D point cloud data from consecutive LiDAR scans, although such approaches have been limited by the small size of real-world, annotated LiDAR data. In this wor…
▽ More
Autonomous vehicles operate in highly dynamic environments necessitating an accurate assessment of which aspects of a scene are moving and where they are moving to. A popular approach to 3D motion estimation, termed scene flow, is to employ 3D point cloud data from consecutive LiDAR scans, although such approaches have been limited by the small size of real-world, annotated LiDAR data. In this work, we introduce a new large-scale dataset for scene flow estimation derived from corresponding tracked 3D objects, which is $\sim$1,000$\times$ larger than previous real-world datasets in terms of the number of annotated frames. We demonstrate how previous works were bounded based on the amount of real LiDAR data available, suggesting that larger datasets are required to achieve state-of-the-art predictive performance. Furthermore, we show how previous heuristics for operating on point clouds such as down-sampling heavily degrade performance, motivating a new class of models that are tractable on the full point cloud. To address this issue, we introduce the FastFlow3D architecture which provides real time inference on the full point cloud. Additionally, we design human-interpretable metrics that better capture real world aspects by accounting for ego-motion and providing breakdowns per object type. We hope that this dataset may provide new opportunities for develo** real world scene flow systems.
△ Less
Submitted 25 October, 2021; v1 submitted 1 March, 2021;
originally announced March 2021.
-
Billion-pixel X-ray camera (BiPC-X)
Authors:
Zhehui Wang,
Kaitlin Anagnost,
Cris W. Barnes,
D. M. Dattelbaum,
Eric R. Fossum,
Eldred Lee,
Jifeng Liu,
J. J. Ma,
W. Z. Meijer,
Wanyi Nie,
C. M. Sweeney,
Audrey C. Therrien,
Hsinhan Tsai,
Xin Que
Abstract:
The continuing improvement in quantum efficiency (above 90% for single visible photons), reduction in noise (below 1 electron per pixel), and shrink in pixel pitch (less than 1 micron) motivate billion-pixel X-ray cameras (BiPC-X) based on commercial CMOS imaging sensors. We describe BiPC-X designs and prototype construction based on flexible tiling of commercial CMOS imaging sensors with millions…
▽ More
The continuing improvement in quantum efficiency (above 90% for single visible photons), reduction in noise (below 1 electron per pixel), and shrink in pixel pitch (less than 1 micron) motivate billion-pixel X-ray cameras (BiPC-X) based on commercial CMOS imaging sensors. We describe BiPC-X designs and prototype construction based on flexible tiling of commercial CMOS imaging sensors with millions of pixels. Device models are given for direct detection of low energy X-rays ($<$ 10 keV) and indirect detection of higher energies using scintillators. Modified Birks's law is proposed for light-yield nonproportionality in scintillators as a function of X-ray energy. Single X-ray sensitivity and spatial resolution have been validated experimentally using laboratory X-ray source and the Argonne Advanced Photon Source. Possible applications include wide field-of-view (FOV) or large X-ray aperture measurements in high-temperature plasmas, the state-of-the-art synchrotron, X-ray Free Electron Laser (XFEL), and pulsed power facilities.
△ Less
Submitted 5 January, 2021;
originally announced January 2021.
-
Cosmic-ray transport and gamma-ray emission in M31
Authors:
Audrey Do,
Matthew Duong,
Alex McDaniel,
Collin O'Connor,
Stefano Profumo,
Justine Rafael,
Connor Sweeney,
Washington Vera III
Abstract:
We study the possibility that an extended cosmic-ray leptonic and/or hadronic halo is at the origin of the large-scale gamma-ray emission detected from the Andromeda Galaxy (M31). We consider a broad ensemble of non-homogeneous diffusion scenarios and of cosmic-ray injection sources. We find that cosmic-ray electrons and protons could be, and very likely are, responsible for part, or all, of the g…
▽ More
We study the possibility that an extended cosmic-ray leptonic and/or hadronic halo is at the origin of the large-scale gamma-ray emission detected from the Andromeda Galaxy (M31). We consider a broad ensemble of non-homogeneous diffusion scenarios and of cosmic-ray injection sources. We find that cosmic-ray electrons and protons could be, and very likely are, responsible for part, or all, of the gamma-ray emission from M31, including out to more than 100 kpc from the center of the galaxy. We also simulate possible emission from pulsars in M31, and consider the effect of regions of highly inefficient diffusion around cosmic-ray acceleration sites, as suggested by recent TeV halo observations with Cherenkov telescopes.
△ Less
Submitted 28 December, 2020;
originally announced December 2020.
-
Search for Slow Magnetic Monopoles with the NOvA Detector on the Surface
Authors:
NOvA Collaboration,
M. A. Acero,
P. Adamson,
L. Aliaga,
T. Alion,
V. Allakhverdian,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
S. Bending,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair
, et al. (174 additional authors not shown)
Abstract:
We report a search for a magnetic monopole component of the cosmic-ray flux in a 95-day exposure of the NOvA experiment's Far Detector, a 14 kt segmented liquid scintillator detector designed primarily to observe GeV-scale electron neutrinos. No events consistent with monopoles were observed, setting an upper limit on the flux of $2\times 10^{-14} \mathrm{cm^{-2}s^{-1}sr^{-1}}$ at 90% C.L. for mon…
▽ More
We report a search for a magnetic monopole component of the cosmic-ray flux in a 95-day exposure of the NOvA experiment's Far Detector, a 14 kt segmented liquid scintillator detector designed primarily to observe GeV-scale electron neutrinos. No events consistent with monopoles were observed, setting an upper limit on the flux of $2\times 10^{-14} \mathrm{cm^{-2}s^{-1}sr^{-1}}$ at 90% C.L. for monopole speed $6\times 10^{-4} < β< 5\times 10^{-3}$ and mass greater than $5\times 10^{8}$ GeV. Because of NOvA's small overburden of 3 meters-water equivalent, this constraint covers a previously unexplored low-mass region.
△ Less
Submitted 5 January, 2021; v1 submitted 10 September, 2020;
originally announced September 2020.
-
Reducing Drift in Structure From Motion Using Extended Features
Authors:
Aleksander Holynski,
David Geraghty,
Jan-Michael Frahm,
Chris Sweeney,
Richard Szeliski
Abstract:
Low-frequency long-range errors (drift) are an endemic problem in 3D structure from motion, and can often hamper reasonable reconstructions of the scene. In this paper, we present a method to dramatically reduce scale and positional drift by using extended structural features such as planes and vanishing points. Unlike traditional feature matches, our extended features are able to span non-overlap…
▽ More
Low-frequency long-range errors (drift) are an endemic problem in 3D structure from motion, and can often hamper reasonable reconstructions of the scene. In this paper, we present a method to dramatically reduce scale and positional drift by using extended structural features such as planes and vanishing points. Unlike traditional feature matches, our extended features are able to span non-overlap** input images, and hence provide long-range constraints on the scale and shape of the reconstruction. We add these features as additional constraints to a state-of-the-art global structure from motion algorithm and demonstrate that the added constraints enable the reconstruction of particularly drift-prone sequences such as long, low field-of-view videos without inertial measurements. Additionally, we provide an analysis of the drift-reducing capabilities of these constraints by evaluating on a synthetic dataset. Our structural features are able to significantly reduce drift for scenes that contain long-spanning man-made structures, such as aligned rows of windows or planar building facades.
△ Less
Submitted 13 October, 2020; v1 submitted 27 August, 2020;
originally announced August 2020.
-
Domain Adaptation of Learned Features for Visual Localization
Authors:
Sungyong Baik,
Hyo ** Kim,
Tianwei Shen,
Eddy Ilg,
Kyoung Mu Lee,
Chris Sweeney
Abstract:
We tackle the problem of visual localization under changing conditions, such as time of day, weather, and seasons. Recent learned local features based on deep neural networks have shown superior performance over classical hand-crafted local features. However, in a real-world scenario, there often exists a large domain gap between training and target images, which can significantly degrade the loca…
▽ More
We tackle the problem of visual localization under changing conditions, such as time of day, weather, and seasons. Recent learned local features based on deep neural networks have shown superior performance over classical hand-crafted local features. However, in a real-world scenario, there often exists a large domain gap between training and target images, which can significantly degrade the localization accuracy. While existing methods utilize a large amount of data to tackle the problem, we present a novel and practical approach, where only a few examples are needed to reduce the domain gap. In particular, we propose a few-shot domain adaptation framework for learned local features that deals with varying conditions in visual localization. The experimental results demonstrate the superior performance over baselines, while using a scarce number of training examples from the target domain.
△ Less
Submitted 21 August, 2020;
originally announced August 2020.
-
Adjusting Neutrino Interaction Models and Evaluating Uncertainties using NOvA Near Detector Data
Authors:
NOvA Collaboration,
M. A. Acero,
P. Adamson,
G. Agam,
L. Aliaga,
T. Alion,
V. Allakhverdian,
N. Anfimov,
A. Antoshkin,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
S. Bending,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
J. Blair
, et al. (170 additional authors not shown)
Abstract:
The two-detector design of the NOvA neutrino oscillation experiment, in which two functionally identical detectors are exposed to an intense neutrino beam, aids in canceling leading order effects of cross-section uncertainties. However, limited knowledge of neutrino interaction cross sections still gives rise to some of the largest systematic uncertainties in current oscillation measurements. We s…
▽ More
The two-detector design of the NOvA neutrino oscillation experiment, in which two functionally identical detectors are exposed to an intense neutrino beam, aids in canceling leading order effects of cross-section uncertainties. However, limited knowledge of neutrino interaction cross sections still gives rise to some of the largest systematic uncertainties in current oscillation measurements. We show contemporary models of neutrino interactions to be discrepant with data from NOvA, consistent with discrepancies seen in other experiments. Adjustments to neutrino interaction models in GENIE that improve agreement with our data are presented. We also describe systematic uncertainties on these models, including uncertainties on multi-nucleon interactions from a newly developed procedure using NOvA near detector data.
△ Less
Submitted 10 December, 2020; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Supernova neutrino detection in NOvA
Authors:
NOvA Collaboration,
M. A. Acero,
P. Adamson,
G. Agam,
L. Aliaga,
T. Alion,
V. Allakhverdian,
N. Anfimov,
A. Antoshkin,
E. Arrieta-Diaz,
L. Asquith,
A. Aurisano,
A. Back,
C. Backhouse,
M. Baird,
N. Balashov,
P. Baldi,
B. A. Bambah,
S. Bashar,
K. Bays,
S. Bending,
R. Bernstein,
V. Bhatnagar,
B. Bhuyan,
J. Bian
, et al. (177 additional authors not shown)
Abstract:
The NOvA long-baseline neutrino experiment uses a pair of large, segmented, liquid-scintillator calorimeters to study neutrino oscillations, using GeV-scale neutrinos from the Fermilab NuMI beam. These detectors are also sensitive to the flux of neutrinos which are emitted during a core-collapse supernova through inverse beta decay interactions on carbon at energies of…
▽ More
The NOvA long-baseline neutrino experiment uses a pair of large, segmented, liquid-scintillator calorimeters to study neutrino oscillations, using GeV-scale neutrinos from the Fermilab NuMI beam. These detectors are also sensitive to the flux of neutrinos which are emitted during a core-collapse supernova through inverse beta decay interactions on carbon at energies of $\mathcal{O}(10~\text{MeV})$. This signature provides a means to study the dominant mode of energy release for a core-collapse supernova occurring in our galaxy. We describe the data-driven software trigger system developed and employed by the NOvA experiment to identify and record neutrino data from nearby galactic supernovae. This technique has been used by NOvA to self-trigger on potential core-collapse supernovae in our galaxy, with an estimated sensitivity reaching out to 10~kpc distance while achieving a detection efficiency of 23\% to 49\% for supernovae from progenitor stars with masses of 9.6M$_\odot$ to 27M$_\odot$, respectively.
△ Less
Submitted 29 July, 2020; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Bayesian spatial extreme value analysis of maximum temperatures in County Dublin, Ireland
Authors:
John O'Sullivan,
Conor Sweeney,
Andrew C. Parnell
Abstract:
In this study, we begin a comprehensive characterisation of temperature extremes in Ireland for the period 1981-2010. We produce return levels of anomalies of daily maximum temperature extremes for an area over Ireland, for the 30-year period 1981-2010. We employ extreme value theory (EVT) to model the data using the generalised Pareto distribution (GPD) as part of a three-level Bayesian hierarchi…
▽ More
In this study, we begin a comprehensive characterisation of temperature extremes in Ireland for the period 1981-2010. We produce return levels of anomalies of daily maximum temperature extremes for an area over Ireland, for the 30-year period 1981-2010. We employ extreme value theory (EVT) to model the data using the generalised Pareto distribution (GPD) as part of a three-level Bayesian hierarchical model. We use predictive processes in order to solve the computationally difficult problem of modelling data over a very dense spatial field. To our knowledge, this is the first study to combine predictive processes and EVT in this manner. The model is fit using Markov chain Monte Carlo (MCMC) algorithms. Posterior parameter estimates and return level surfaces are produced, in addition to specific site analysis at synoptic stations, including Casement Aerodrome and Dublin Airport. Observational data from the period 2011-2018 is included in this site analysis to determine if there is evidence of a change in the observed extremes. An increase in the frequency of extreme anomalies, but not the severity, is observed for this period. We found that the frequency of observed extreme anomalies from 2011-2018 at the Casement Aerodrome and Phoenix Park synoptic stations exceed the upper bounds of the credible intervals from the model by 20% and 7% respectively.
△ Less
Submitted 16 June, 2019;
originally announced June 2019.
-
Structure from Motion for Panorama-Style Videos
Authors:
Chris Sweeney,
Aleksander Holynski,
Brian Curless,
Steve M Seitz
Abstract:
We present a novel Structure from Motion pipeline that is capable of reconstructing accurate camera poses for panorama-style video capture without prior camera intrinsic calibration. While panorama-style capture is common and convenient, previous reconstruction methods fail to obtain accurate reconstructions due to the rotation-dominant motion and small baseline between views. Our method is built…
▽ More
We present a novel Structure from Motion pipeline that is capable of reconstructing accurate camera poses for panorama-style video capture without prior camera intrinsic calibration. While panorama-style capture is common and convenient, previous reconstruction methods fail to obtain accurate reconstructions due to the rotation-dominant motion and small baseline between views. Our method is built on the assumption that the camera motion approximately corresponds to motion on a sphere, and we introduce three novel relative pose methods to estimate the fundamental matrix and camera distortion for spherical motion. These solvers are efficient and robust, and provide an excellent initialization for bundle adjustment. A soft prior on the camera poses is used to discourage large deviations from the spherical motion assumption when performing bundle adjustment, which allows cameras to remain properly constrained for optimization in the absence of well-triangulated 3D points. To validate the effectiveness of the proposed method we evaluate our approach on both synthetic and real-world data, and demonstrate that camera poses are accurate enough for multiview stereo.
△ Less
Submitted 8 June, 2019;
originally announced June 2019.
-
StereoDRNet: Dilated Residual Stereo Net
Authors:
Rohan Chabra,
Julian Straub,
Chris Sweeney,
Richard Newcombe,
Henry Fuchs
Abstract:
We propose a system that uses a convolution neural network (CNN) to estimate depth from a stereo pair followed by volumetric fusion of the predicted depth maps to produce a 3D reconstruction of a scene. Our proposed depth refinement architecture, predicts view-consistent disparity and occlusion maps that helps the fusion system to produce geometrically consistent reconstructions. We utilize 3D dil…
▽ More
We propose a system that uses a convolution neural network (CNN) to estimate depth from a stereo pair followed by volumetric fusion of the predicted depth maps to produce a 3D reconstruction of a scene. Our proposed depth refinement architecture, predicts view-consistent disparity and occlusion maps that helps the fusion system to produce geometrically consistent reconstructions. We utilize 3D dilated convolutions in our proposed cost filtering network that yields better filtering while almost halving the computational cost in comparison to state of the art cost filtering architectures.For feature extraction we use the Vortex Pooling architecture. The proposed method achieves state of the art results in KITTI 2012, KITTI 2015 and ETH 3D stereo benchmarks. Finally, we demonstrate that our system is able to produce high fidelity 3D scene reconstructions that outperforms the state of the art stereo system.
△ Less
Submitted 2 June, 2019; v1 submitted 3 April, 2019;
originally announced April 2019.
-
SlimNets: An Exploration of Deep Model Compression and Acceleration
Authors:
Ini Oguntola,
Subby Olubeko,
Christopher Sweeney
Abstract:
Deep neural networks have achieved increasingly accurate results on a wide variety of complex tasks. However, much of this improvement is due to the growing use and availability of computational resources (e.g use of GPUs, more layers, more parameters, etc). Most state-of-the-art deep networks, despite performing well, over-parameterize approximate functions and take a significant amount of time t…
▽ More
Deep neural networks have achieved increasingly accurate results on a wide variety of complex tasks. However, much of this improvement is due to the growing use and availability of computational resources (e.g use of GPUs, more layers, more parameters, etc). Most state-of-the-art deep networks, despite performing well, over-parameterize approximate functions and take a significant amount of time to train. With increased focus on deploying deep neural networks on resource constrained devices like smart phones, there has been a push to evaluate why these models are so resource hungry and how they can be made more efficient. This work evaluates and compares three distinct methods for deep model compression and acceleration: weight pruning, low rank factorization, and knowledge distillation. Comparisons on VGG nets trained on CIFAR10 show that each of the models on their own are effective, but that the true power lies in combining them. We show that by combining pruning and knowledge distillation methods we can create a compressed network 85 times smaller than the original, all while retaining 96% of the original model's accuracy.
△ Less
Submitted 1 August, 2018;
originally announced August 2018.
-
Continuous regional trace gas source attribution using a field-deployed dual frequency comb spectrometer
Authors:
Sean Coburn,
Caroline B. Alden,
Robert Wright,
Kevin Cossel,
Esther Baumann,
Gar-Wing Truong,
Fabrizio Giorgetta,
Colm Sweeney,
Nathan R. Newbury,
Kuldeep Prasad,
Ian Coddington,
Gregory B. Rieker
Abstract:
Identification and quantification of trace gas sources is a major challenge for understanding and regulating air quality and greenhouse gas emissions. Current approaches either provide continuous but localized monitoring, or quasi-instantaneous 'snapshot-in-time' regional monitoring. There is a need for emissions detection that provides both continuous and regional coverage, because sources and si…
▽ More
Identification and quantification of trace gas sources is a major challenge for understanding and regulating air quality and greenhouse gas emissions. Current approaches either provide continuous but localized monitoring, or quasi-instantaneous 'snapshot-in-time' regional monitoring. There is a need for emissions detection that provides both continuous and regional coverage, because sources and sinks can be episodic and spatially variable. We field deploy a dual frequency comb laser spectrometer for the first time, enabling an observing system that provides continuous detection of trace gas sources over multiple-square-kilometer regions. Field tests simulating methane emissions from oil and gas production demonstrate detection and quantification of a 1.6 g min^-1 source (approximate emissions from a small pneumatic valve) from a distance of 1 km, and the ability to discern two leaks among a field of many potential sources. The technology achieves the goal of detecting, quantifying, and attributing emissions sources continuously through time, over large areas, and at emissions rates ~1000x lower than current regional approaches. It therefore provides a useful tool for monitoring and mitigating undesirable sources and closes a major information gap in the atmospheric sciences.
△ Less
Submitted 21 November, 2017;
originally announced November 2017.
-
GraphMatch: Efficient Large-Scale Graph Construction for Structure from Motion
Authors:
Qiaodong Cui,
Victor Fragoso,
Chris Sweeney,
Pradeep Sen
Abstract:
We present GraphMatch, an approximate yet efficient method for building the matching graph for large-scale structure-from-motion (SfM) pipelines. Unlike modern SfM pipelines that use vocabulary (Voc.) trees to quickly build the matching graph and avoid a costly brute-force search of matching image pairs, GraphMatch does not require an expensive offline pre-processing phase to construct a Voc. tree…
▽ More
We present GraphMatch, an approximate yet efficient method for building the matching graph for large-scale structure-from-motion (SfM) pipelines. Unlike modern SfM pipelines that use vocabulary (Voc.) trees to quickly build the matching graph and avoid a costly brute-force search of matching image pairs, GraphMatch does not require an expensive offline pre-processing phase to construct a Voc. tree. Instead, GraphMatch leverages two priors that can predict which image pairs are likely to match, thereby making the matching process for SfM much more efficient. The first is a score computed from the distance between the Fisher vectors of any two images. The second prior is based on the graph distance between vertices in the underlying matching graph. GraphMatch combines these two priors into an iterative "sample-and-propagate" scheme similar to the PatchMatch algorithm. Its sampling stage uses Fisher similarity priors to guide the search for matching image pairs, while its propagation stage explores neighbors of matched pairs to find new ones with a high image similarity score. Our experiments show that GraphMatch finds the most image pairs as compared to competing, approximate methods while at the same time being the most efficient.
△ Less
Submitted 4 October, 2017;
originally announced October 2017.
-
ANSAC: Adaptive Non-minimal Sample and Consensus
Authors:
Victor Fragoso,
Chris Sweeney,
Pradeep Sen,
Matthew Turk
Abstract:
While RANSAC-based methods are robust to incorrect image correspondences (outliers), their hypothesis generators are not robust to correct image correspondences (inliers) with positional error (noise). This slows down their convergence because hypotheses drawn from a minimal set of noisy inliers can deviate significantly from the optimal model. This work addresses this problem by introducing ANSAC…
▽ More
While RANSAC-based methods are robust to incorrect image correspondences (outliers), their hypothesis generators are not robust to correct image correspondences (inliers) with positional error (noise). This slows down their convergence because hypotheses drawn from a minimal set of noisy inliers can deviate significantly from the optimal model. This work addresses this problem by introducing ANSAC, a RANSAC-based estimator that accounts for noise by adaptively using more than the minimal number of correspondences required to generate a hypothesis. ANSAC estimates the inlier ratio (the fraction of correct correspondences) of several ranked subsets of candidate correspondences and generates hypotheses from them. Its hypothesis-generation mechanism prioritizes the use of subsets with high inlier ratio to generate high-quality hypotheses. ANSAC uses an early termination criterion that keeps track of the inlier ratio history and terminates when it has not changed significantly for a period of time. The experiments show that ANSAC finds good homography and fundamental matrix estimates in a few iterations, consistently outperforming state-of-the-art methods.
△ Less
Submitted 27 September, 2017;
originally announced September 2017.
-
Large Scale SfM with the Distributed Camera Model
Authors:
Chris Sweeney,
Victor Fragoso,
Tobias Hollerer,
Matthew Turk
Abstract:
We introduce the distributed camera model, a novel model for Structure-from-Motion (SfM). This model describes image observations in terms of light rays with ray origins and directions rather than pixels. As such, the proposed model is capable of describing a single camera or multiple cameras simultaneously as the collection of all light rays observed. We show how the distributed camera model is a…
▽ More
We introduce the distributed camera model, a novel model for Structure-from-Motion (SfM). This model describes image observations in terms of light rays with ray origins and directions rather than pixels. As such, the proposed model is capable of describing a single camera or multiple cameras simultaneously as the collection of all light rays observed. We show how the distributed camera model is a generalization of the standard camera model and describe a general formulation and solution to the absolute camera pose problem that works for standard or distributed cameras. The proposed method computes a solution that is up to 8 times more efficient and robust to rotation singularities in comparison with gDLS. Finally, this method is used in an novel large-scale incremental SfM pipeline where distributed cameras are accurately and robustly merged together. This pipeline is a direct generalization of traditional incremental SfM; however, instead of incrementally adding one camera at a time to grow the reconstruction the reconstruction is grown by adding a distributed camera. Our pipeline produces highly accurate reconstructions efficiently by avoiding the need for many bundle adjustment iterations and is capable of computing a 3D model of Rome from over 15,000 images in just 22 minutes.
△ Less
Submitted 30 November, 2016; v1 submitted 13 July, 2016;
originally announced July 2016.
-
Frequency Comb-Based Remote Sensing of Greenhouse Gases over Kilometer Air Paths
Authors:
Gregory B. Rieker,
Fabrizio R. Giorgetta,
William C. Swann,
Jon Kofler,
Alex M. Zolot,
Laura C. Sinclair,
Esther Baumann,
Christopher Cromer,
Gabrielle Petron,
Colm Sweeney,
Pieter P. Tans,
Ian Coddington,
Nathan R. Newbury
Abstract:
We demonstrate coherent dual frequency-comb spectroscopy for detecting variations in greenhouse gases. High signal-to-noise spectra are acquired spanning 5990 to 6260 cm^-1 (1600 to 1670 nm) covering ~700 absorption features from CO2, CH4, H2O, HDO, and 13CO2, across a 2-km open-air path. The transmission of each frequency comb tooth is resolved, leading to spectra with <1 kHz frequency accuracy,…
▽ More
We demonstrate coherent dual frequency-comb spectroscopy for detecting variations in greenhouse gases. High signal-to-noise spectra are acquired spanning 5990 to 6260 cm^-1 (1600 to 1670 nm) covering ~700 absorption features from CO2, CH4, H2O, HDO, and 13CO2, across a 2-km open-air path. The transmission of each frequency comb tooth is resolved, leading to spectra with <1 kHz frequency accuracy, no instrument lineshape, and a 0.0033-cm^-1 point spacing. The fitted path-averaged concentrations and temperature yield dry-air mole fractions. These are compared with a point sensor under well-mixed conditions to evaluate current absorption models for real atmospheres. In heterogeneous conditions, time-resolved data demonstrate tracking of strong variations in mole fractions. A precision of <1 ppm for CO2 and <3 ppb for CH4 is achieved in 5 minutes in this initial demonstration. Future portable systems could support regional emissions monitoring and validation of the spectral databases critical to global satellite-based trace gas monitoring.
△ Less
Submitted 12 June, 2014;
originally announced June 2014.
-
Exciton Dynamics in Carbon Nanotubes: From the Luttinger Liquid to Harmonic Oscillators
Authors:
M. C. Sweeney,
J. D. Eaves
Abstract:
We show that the absorption spectrum in semiconducting nanotubes can be determined using the bosonization technique combined with mean-field theory and a harmonic approximation. Our results indicate that a multiple band semiconducting nanotube reduces to a system of weakly coupled harmonic oscillators. Additionally, the quasiparticle nature of the electron and hole that comprise an optical exciton…
▽ More
We show that the absorption spectrum in semiconducting nanotubes can be determined using the bosonization technique combined with mean-field theory and a harmonic approximation. Our results indicate that a multiple band semiconducting nanotube reduces to a system of weakly coupled harmonic oscillators. Additionally, the quasiparticle nature of the electron and hole that comprise an optical exciton emerges naturally from the bosonized model.
△ Less
Submitted 28 May, 2013;
originally announced May 2013.
-
Carrier Transport in Heterojunction Nanocrystals Under Strain
Authors:
Mark C. Sweeney,
Joel D. Eaves
Abstract:
We present a theory for carrier transport in semiconducting nanoscale heterostructures that emphasizes the effects of strain at the interface between two different crystal structures. An exactly solvable model shows that the interface region, or junction, acts as a scattering potential that facilitates charge separation but also supports bound interfacial states. As a case study, we model a Type-I…
▽ More
We present a theory for carrier transport in semiconducting nanoscale heterostructures that emphasizes the effects of strain at the interface between two different crystal structures. An exactly solvable model shows that the interface region, or junction, acts as a scattering potential that facilitates charge separation but also supports bound interfacial states. As a case study, we model a Type-II CdS/ZnSe heterostructure. After advancing a theory similar to that employed in model molecular conductance calculations, we calculate the electron and hole photocurrents and conductances, including non-linear effects, through the junction at steady-state.
△ Less
Submitted 11 October, 2011; v1 submitted 5 October, 2011;
originally announced October 2011.
-
Doubly Quantized Vortices in Bulk Ginzburg-Landau Superconductors
Authors:
Mark C. Sweeney,
Martin P. Gelfand
Abstract:
We have extended Brandt's method for accurate, efficient calculations within Ginzburg-Landau theory for periodic vortex lattices at arbitrary mean induction to lattices of "doubly quantized" vortices.
We have extended Brandt's method for accurate, efficient calculations within Ginzburg-Landau theory for periodic vortex lattices at arbitrary mean induction to lattices of "doubly quantized" vortices.
△ Less
Submitted 25 March, 2010;
originally announced March 2010.
-
Simple Vortex States in Films of Type-I Ginzburg-Landau Superconductor
Authors:
Mark C. Sweeney,
Martin P. Gelfand
Abstract:
Sufficiently thin films of type-I superconductor in a perpendicular magnetic field exhibit a triangular vortex lattice, while thick films develop an intermediate state. To elucidate what happens between these two regimes, precise numerical calculations have been made within Ginzburg-Landau theory at $κ=0.5$ and 0.25 for a variety of vortex lattice structures with one flux quantum per unit cell.…
▽ More
Sufficiently thin films of type-I superconductor in a perpendicular magnetic field exhibit a triangular vortex lattice, while thick films develop an intermediate state. To elucidate what happens between these two regimes, precise numerical calculations have been made within Ginzburg-Landau theory at $κ=0.5$ and 0.25 for a variety of vortex lattice structures with one flux quantum per unit cell. The phase diagram in the space of mean induction and film thickness includes a narrow wedge in which a square lattice is stable, surrounded by the domain of stability of the triangular lattice at thinner films/lower fields and, on the other side, rectangular lattices with continuously varying aspect ratio. The vortex lattice has an anomalously small shear modulus within and close to the square lattice phase.
△ Less
Submitted 2 March, 2010;
originally announced March 2010.