-
Stability, convergence, and pressure-robustness of numerical schemes for incompressible flows with hybrid velocity and pressure
Authors:
Lorenzo Botti,
Michele Botti,
Daniele Antonio Di Pietro,
Francesco Carlo Massa
Abstract:
In this work we study the stability, convergence, and pressure-robustness of discretization methods for incompressible flows with hybrid velocity and pressure. Specifically, focusing on the Stokes problem, we identify a set of assumptions that yield inf-sup stability as well as error estimates which distinguish the velocity- and pressure-related contributions to the error. We additionally identify the key properties under which the pressure-related contributions vanish in the estimate of the velocity, thus leading to pressure-robustness. Several examples of existing and new schemes that fit into the framework are provided, together with extensive numerical validation of the theoretical properties.
Submitted 19 April, 2024;
originally announced April 2024.
-
DINOv2: Learning Robust Visual Features without Supervision
Authors:
Maxime Oquab,
Timothée Darcet,
Théo Moutakanni,
Huy Vo,
Marc Szafraniec,
Vasil Khalidov,
Pierre Fernandez,
Daniel Haziza,
Francisco Massa,
Alaaeldin El-Nouby,
Mahmoud Assran,
Nicolas Ballas,
Wojciech Galuba,
Russell Howes,
Po-Yao Huang,
Shang-Wen Li,
Ishan Misra,
Michael Rabbat,
Vasu Sharma,
Gabriel Synnaeve,
Hu Xu,
Hervé Jegou,
Julien Mairal,
Patrick Labatut,
Armand Joulin
, et al. (1 additional author not shown)
Abstract:
The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could greatly simplify the use of images in any system by producing all-purpose visual features, i.e., features that work across image distributions and tasks without finetuning. This work shows that existing pretraining methods, especially self-supervised methods, can produce such features if trained on enough curated data from diverse sources. We revisit existing approaches and combine different techniques to scale our pretraining in terms of data and model size. Most of the technical contributions aim at accelerating and stabilizing the training at scale. In terms of data, we propose an automatic pipeline to build a dedicated, diverse, and curated image dataset instead of uncurated data, as typically done in the self-supervised literature. In terms of models, we train a ViT model (Dosovitskiy et al., 2020) with 1B parameters and distill it into a series of smaller models that surpass the best available all-purpose features, OpenCLIP (Ilharco et al., 2021) on most of the benchmarks at image and pixel levels.
Submitted 2 February, 2024; v1 submitted 14 April, 2023;
originally announced April 2023.
-
Hybrid Transformers for Music Source Separation
Authors:
Simon Rouard,
Francisco Massa,
Alexandre Défossez
Abstract:
A natural question arising in Music Source Separation (MSS) is whether long-range contextual information is useful, or whether local acoustic features are sufficient. In other fields, attention-based Transformers have shown their ability to integrate information over long sequences. In this work, we introduce Hybrid Transformer Demucs (HT Demucs), a hybrid temporal/spectral bi-U-Net based on Hybrid Demucs, where the innermost layers are replaced by a cross-domain Transformer Encoder, using self-attention within one domain, and cross-attention across domains. While it performs poorly when trained only on MUSDB, we show that it outperforms Hybrid Demucs (trained on the same data) by 0.45 dB of SDR when using 800 extra training songs. Using sparse attention kernels to extend its receptive field, and per-source fine-tuning, we achieve state-of-the-art results on MUSDB with extra training data, with 9.20 dB of SDR.
Submitted 15 November, 2022;
originally announced November 2022.
-
HHO methods for the incompressible Navier-Stokes and the incompressible Euler equations
Authors:
Lorenzo Botti,
Francesco Carlo Massa
Abstract:
We propose two Hybrid High-Order (HHO) methods for the incompressible Navier-Stokes equations and investigate their robustness with respect to the Reynolds number. While both methods rely on an HHO formulation of the viscous term, the pressure-velocity coupling is fundamentally different, up to the point that the two approaches can be considered antithetical. The first method is kinetic energy preserving, meaning that the skew-symmetric discretization of the convective term is guaranteed not to alter the kinetic energy balance. The approximated velocity fields exactly satisfy the divergence-free constraint and continuity of the normal component of the velocity is weakly enforced on the mesh skeleton, leading to H-div conformity. The second scheme relies on Godunov fluxes for pressure-velocity coupling: a Harten, Lax and van Leer (HLL) approximate Riemann solver designed for cell-centered formulations is adapted to hybrid face-centered formulations. The resulting numerical scheme is robust up to the inviscid limit, meaning that it can be applied for seeking approximate solutions of the incompressible Euler equations. The schemes are numerically validated by performing steady and unsteady two-dimensional test cases and evaluating the convergence rates on h-refined mesh sequences. In addition to standard benchmark flow problems, specifically conceived test cases are conducted for studying the error behaviour when approaching the inviscid limit.
Submitted 17 December, 2021;
originally announced December 2021.
-
Experimental quantum memristor
Authors:
Michele Spagnolo,
Joshua Morris,
Simone Piacentini,
Michael Antesberger,
Francesco Massa,
Francesco Ceccarelli,
Andrea Crespi,
Roberto Osellame,
Philip Walther
Abstract:
Quantum computer technology harnesses the features of quantum physics for revolutionizing information processing and computing. As such, quantum computers use physical quantum gates that process information unitarily, even though the final computing steps might be measurement-based or non-unitary. The applications of quantum computers cover diverse areas, reaching from well-known quantum algorithms to quantum machine learning and quantum neural networks. The last of these is of particular interest because it belongs to the promising field of artificial intelligence. However, quantum neural networks are technologically challenging as the underlying computation requires non-unitary operations for mimicking the behavior of neurons. A landmark development for classical neural networks was the realization of memory-resistors, or "memristors". These are passive circuit elements that keep a memory of their past states in the form of a resistive hysteresis and thus provide access to nonlinear gate operations. The quest for realising a quantum memristor led to a few proposals, all of which face limited technological practicality. Here we introduce and experimentally demonstrate a novel quantum-optical memristor that is based on integrated photonics and acts on single photons. We characterize its memristive behavior and underline the practical potential of our device by numerically simulating instances of quantum reservoir computing, where we predict an advantage in the use of our quantum memristor over classical architectures. Given recent progress in the realization of photonic circuits for neural network applications, our device could become a building block of immediate and near-term quantum neuromorphic architectures.
Submitted 17 May, 2021; v1 submitted 11 May, 2021;
originally announced May 2021.
-
Training data-efficient image transformers & distillation through attention
Authors:
Hugo Touvron,
Matthieu Cord,
Matthijs Douze,
Francisco Massa,
Alexandre Sablayrolles,
Hervé Jégou
Abstract:
Recently, neural networks purely based on attention were shown to address image understanding tasks such as image classification. However, these visual transformers are pre-trained with hundreds of millions of images using an expensive infrastructure, thereby limiting their adoption.
In this work, we produce a competitive convolution-free transformer by training on ImageNet only. We train them on a single computer in less than 3 days. Our reference vision transformer (86M parameters) achieves top-1 accuracy of 83.1% (single-crop evaluation) on ImageNet with no external data.
More importantly, we introduce a teacher-student strategy specific to transformers. It relies on a distillation token ensuring that the student learns from the teacher through attention. We show the interest of this token-based distillation, especially when using a convnet as a teacher. This leads us to report results competitive with convnets both on ImageNet (where we obtain up to 85.2% accuracy) and when transferring to other tasks. We share our code and models.
Submitted 15 January, 2021; v1 submitted 23 December, 2020;
originally announced December 2020.
-
End-to-End Object Detection with Transformers
Authors:
Nicolas Carion,
Francisco Massa,
Gabriel Synnaeve,
Nicolas Usunier,
Alexander Kirillov,
Sergey Zagoruyko
Abstract:
We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components like a non-maximum suppression procedure or anchor generation that explicitly encode our prior knowledge about the task. The main ingredients of the new framework, called DEtection TRansformer or DETR, are a set-based global loss that forces unique predictions via bipartite matching, and a transformer encoder-decoder architecture. Given a fixed small set of learned object queries, DETR reasons about the relations of the objects and the global image context to directly output the final set of predictions in parallel. The new model is conceptually simple and does not require a specialized library, unlike many other modern detectors. DETR demonstrates accuracy and run-time performance on par with the well-established and highly-optimized Faster RCNN baseline on the challenging COCO object detection dataset. Moreover, DETR can be easily generalized to produce panoptic segmentation in a unified manner. We show that it significantly outperforms competitive baselines. Training code and pretrained models are available at https://github.com/facebookresearch/detr.
Submitted 28 May, 2020; v1 submitted 26 May, 2020;
originally announced May 2020.
-
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Authors:
Adam Paszke,
Sam Gross,
Francisco Massa,
Adam Lerer,
James Bradbury,
Gregory Chanan,
Trevor Killeen,
Zeming Lin,
Natalia Gimelshein,
Luca Antiga,
Alban Desmaison,
Andreas Köpf,
Edward Yang,
Zach DeVito,
Martin Raison,
Alykhan Tejani,
Sasank Chilamkurthy,
Benoit Steiner,
Lu Fang,
Junjie Bai,
Soumith Chintala
Abstract:
Deep learning frameworks have often focused on either usability or speed, but not both. PyTorch is a machine learning library that shows that these two goals are in fact compatible: it provides an imperative and Pythonic programming style that supports code as a model, makes debugging easy and is consistent with other popular scientific computing libraries, while remaining efficient and supporting hardware accelerators such as GPUs.
In this paper, we detail the principles that drove the implementation of PyTorch and how they are reflected in its architecture. We emphasize that every aspect of PyTorch is a regular Python program under the full control of its user. We also explain how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance.
We demonstrate the efficiency of individual subsystems, as well as the overall speed of PyTorch on several common benchmarks.
Submitted 3 December, 2019;
originally announced December 2019.
-
MLPerf Inference Benchmark
Authors:
Vijay Janapa Reddi,
Christine Cheng,
David Kanter,
Peter Mattson,
Guenther Schmuelling,
Carole-Jean Wu,
Brian Anderson,
Maximilien Breughe,
Mark Charlebois,
William Chou,
Ramesh Chukka,
Cody Coleman,
Sam Davis,
Pan Deng,
Greg Diamos,
Jared Duke,
Dave Fick,
J. Scott Gardner,
Itay Hubara,
Sachin Idgunji,
Thomas B. Jablin,
Jeff Jiao,
Tom St. John,
Pankaj Kanwar,
David Lee
, et al. (22 additional authors not shown)
Abstract:
Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devices to data-center solutions. Fueling the hardware are a dozen or more software frameworks and libraries. The myriad combinations of ML hardware and ML software make assessing ML-system performance in an architecture-neutral, representative, and reproducible manner challenging. There is a clear need for industry-wide standard ML benchmarking and evaluation criteria. MLPerf Inference answers that call. In this paper, we present our benchmarking method for evaluating ML inference systems. Driven by more than 30 organizations as well as more than 200 ML engineers and practitioners, MLPerf prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures. The first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities. The submissions attest to the benchmark's flexibility and adaptability.
Submitted 9 May, 2020; v1 submitted 6 November, 2019;
originally announced November 2019.
-
Experimental Semi-quantum Key Distribution With Classical Users
Authors:
Francesco Massa,
Preeti Yadav,
Amir Moqanaki,
Walter O. Krawec,
Paulo Mateus,
Nikola Paunković,
André Souto,
Philip Walther
Abstract:
Quantum key distribution, which allows two distant parties to share an unconditionally secure cryptographic key, promises to play an important role in the future of communication. For this reason, the technique has attracted many theoretical and experimental efforts, thus becoming one of the most prominent quantum technologies of the last decades. The security of the key relies on quantum mechanics and therefore requires the users to be capable of performing quantum operations, such as state preparation or measurements in multiple bases. A natural question is whether and to what extent these requirements can be relaxed and the quantum capabilities of the users reduced. Here we demonstrate a novel quantum key distribution scheme, where users are fully classical. In our protocol, the quantum operations are performed by an untrusted third party acting as a server, which gives the users access to a superimposed single photon, and the key exchange is achieved via interaction-free measurements on the shared state. We also provide a full security proof of the protocol by computing the secret key rate in the realistic scenario of finite resources, as well as under practical experimental conditions of imperfect photon source and detectors. Our approach deepens the understanding of the fundamental principles underlying quantum key distribution and, at the same time, opens up new interesting possibilities for quantum cryptography networks.
Submitted 18 September, 2022; v1 submitted 5 August, 2019;
originally announced August 2019.
-
Novel Single-mode Narrow-band Photon Source of High Brightness for Hybrid Quantum Systems
Authors:
Amir Moqanaki,
Francesco Massa,
Philip Walther
Abstract:
Cavity-enhanced spontaneous parametric down-conversion (SPDC) is capable of efficient generation of single photons with suitable spectral properties for interfacing with atoms. However, despite the remarkable progress of this technique, multi-mode longitudinal emission remains a major drawback. Here we demonstrate a bright source of single photons that overcomes this limitation by a novel mode-selection technique based on the introduction of an additional birefringent element to the cavity. This enables us to tune the double resonance condition independently of the phase matching, and thus to achieve single-mode operation without mode filters. Our source emits single-frequency-mode photons at 852 nm, which is compatible with the Cs D2 line, with a bandwidth of 10.9 MHz and a photon-pair generation rate exceeding 47 kHz at 10 mW of pump power, while maintaining a low $g^{(2)}(0) = 0.13$. The efficiency of our source is further underlined by measuring a four-photon generation rate of 37 Hz at 20 mW of pump power. This brightness opens up a variety of new applications reaching from hybrid light-matter interactions to optical quantum information tasks based on long temporal coherence.
Submitted 8 March, 2019; v1 submitted 4 October, 2018;
originally announced October 2018.
-
Experimental two-way communication with one photon
Authors:
Francesco Massa,
Amir Moqanaki,
Ämin Baumeler,
Flavio Del Santo,
Joshua A. Kettlewell,
Borivoje Dakic,
Philip Walther
Abstract:
Superposition of two or more states is one of the fundamental concepts of quantum mechanics and provides the basis for several advantages quantum information processing offers. In this work, we experimentally demonstrate that quantum superposition permits two-way communication between two distant parties that can exchange only one particle once, an impossible task in classical physics. This is achieved by preparing a single photon in a coherent superposition of the two parties' locations. Furthermore, we show that this concept allows the parties to perform secure quantum communication, where the transmitted bits and even the direction of communication remain private. These important features can lead to the development of new quantum communication schemes, which are simultaneously secure and resource-efficient.
Submitted 19 February, 2019; v1 submitted 14 February, 2018;
originally announced February 2018.
-
Experimental entanglement of temporal order
Authors:
Giulia Rubino,
Lee A. Rozema,
Francesco Massa,
Mateus Araújo,
Magdalena Zych,
Časlav Brukner,
Philip Walther
Abstract:
The study of causal relations has recently been applied to the quantum realm, leading to the discovery that not all physical processes have a definite causal structure. While indefinite causal processes have previously been experimentally shown, these proofs relied on the quantum description of the experiments. Yet, the same experimental data could also be compatible with definite causal structures within different descriptions. Here, we present the first demonstration of indefinite temporal order outside of quantum formalism. We show that our experimental outcomes are incompatible with a class of generalised probabilistic theories satisfying the assumptions of locality and definite temporal order. To this end, we derive physical constraints (in the form of a Bell-like inequality) on experimental outcomes within such a class of theories. We then experimentally invalidate these theories by violating the inequality using entangled temporal order. This provides experimental evidence that there exist correlations in nature which are incompatible with the assumptions of locality and definite temporal order.
Submitted 29 December, 2021; v1 submitted 19 December, 2017;
originally announced December 2017.
-
Frame Interpolation with Multi-Scale Deep Loss Functions and Generative Adversarial Networks
Authors:
Joost van Amersfoort,
Wenzhe Shi,
Alejandro Acosta,
Francisco Massa,
Johannes Totz,
Zehan Wang,
Jose Caballero
Abstract:
Frame interpolation attempts to synthesise frames given one or more consecutive video frames. In recent years, deep learning approaches, and notably convolutional neural networks, have succeeded at tackling low- and high-level computer vision problems including frame interpolation. These techniques often tackle two problems, namely algorithm efficiency and reconstruction quality. In this paper, we present a multi-scale generative adversarial network for frame interpolation (FIGAN). To maximise the efficiency of our network, we propose a novel multi-scale residual estimation module where the predicted flow and synthesised frame are constructed in a coarse-to-fine fashion. To improve the quality of synthesised intermediate video frames, our network is jointly supervised at different levels with a perceptual loss function that consists of an adversarial and two content losses. We evaluate the proposed approach using a collection of 60fps videos from YouTube-8m. Our results improve the state-of-the-art accuracy and provide subjective visual quality comparable to the best performing interpolation method at a 47x faster runtime.
Submitted 26 February, 2019; v1 submitted 16 November, 2017;
originally announced November 2017.
-
Lyapunov functions for a non-linear model of the X-ray bursting of the microquasar GRS 1915+105
Authors:
A. Ardito,
P. Ricciardi,
E. Massaro,
T. Mineo,
F. Massa
Abstract:
This paper introduces a biparametric family of Lyapunov functions for a non-linear mathematical model based on the FitzHugh-Nagumo equations able to reproduce some main features of the X-ray bursting behaviour exhibited by the microquasar GRS 1915+105. These functions are useful to investigate the properties of equilibrium points and allow us to demonstrate a theorem on the global stability. The transition between bursting and stable behaviour is also analyzed.
Submitted 10 May, 2017;
originally announced May 2017.
-
Gravitationally induced phase shift on a single photon
Authors:
Christopher Hilweg,
Francesco Massa,
Denis Martynov,
Nergis Mavalvala,
Piotr T. Chrusciel,
Philip Walther
Abstract:
The effect of the Earth's gravitational potential on a quantum wave function has only been observed for massive particles. In this paper we present a scheme to measure a gravitationally induced phase shift on a single photon travelling in a coherent superposition along different paths of an optical fiber interferometer. To create a measurable signal for the interaction between the static gravitational potential and the wave function of the photon, we propose a variant of a conventional Mach-Zehnder interferometer. We show that the predicted relative phase difference of $10^{-5}$ radians is measurable even in the presence of fiber noise, provided additional stabilization techniques are implemented for each arm of a large-scale fiber interferometer. Effects arising from the rotation of the Earth and the material properties of the fibers are analysed. We conclude that optical fiber interferometry is a feasible way to measure the gravitationally induced phase shift on a single-photon wave function, and thus provides a means to corroborate the equivalence of the energy of the photon and its effective gravitational mass.
Submitted 12 December, 2016;
originally announced December 2016.
-
Crafting a multi-task CNN for viewpoint estimation
Authors:
Francisco Massa,
Renaud Marlet,
Mathieu Aubry
Abstract:
Convolutional Neural Networks (CNNs) were recently shown to provide state-of-the-art results for object category viewpoint estimation. However, different ways of formulating this problem have been proposed and the competing approaches have been explored with very different design choices. This paper presents a comparison of these approaches in a unified setting as well as a detailed analysis of the key factors that impact performance. We then present a new joint training method with the detection task and demonstrate its benefit. We also highlight the superiority of classification approaches over regression approaches, quantify the benefits of deeper architectures and extended training data, and demonstrate that synthetic data is beneficial even when using ImageNet training data. By combining all these elements, we demonstrate an improvement of approximately 5% mAVP over previous state-of-the-art results on the Pascal3D+ dataset. In particular, for the most challenging 24-view classification task, we improve the results from 31.1% to 36.1% mAVP.
Submitted 13 September, 2016;
originally announced September 2016.
-
Time properties of the rho-class burst of the microquasar GRS 1915+105 observed with BeppoSAX in April 1999
Authors:
T. Mineo,
F. Massa,
E. Massaro,
A. D'Ai
Abstract:
We present a temporal analysis of a BeppoSAX observation of GRS 1915+105 performed on April 13, 1999 when the source was in the rho class, which is characterised by quasi-regular bursting activity. The aim of the present work is to confirm and extend the validity of the results obtained with a BeppoSAX observation performed in October 2000 on the recurrence time of the burst and on the hard X-ray delay. We divided the entire data set into several series, each corresponding to a satellite orbit, and performed the Fourier and wavelet analysis and the limit cycle mapping technique using the count rate and the average energy as independent variables. We found that the count rates correlate with the recurrence time of bursts and with hard X-ray delay, confirming the results previously obtained. In this observation, however, the recurrence times are distributed along two parallel branches with a constant difference of 5.2+/-0.5 s.
Submitted 13 January, 2016;
originally announced January 2016.
-
Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views
Authors:
Francisco Massa,
Bryan Russell,
Mathieu Aubry
Abstract:
This paper presents an end-to-end convolutional neural network (CNN) for 2D-3D exemplar detection. We demonstrate that the ability to adapt the features of natural images to better align with those of CAD rendered views is critical to the success of our technique. We show that the adaptation can be learned by compositing rendered views of textured object models on natural images. Our approach can be naturally incorporated into a CNN detection pipeline and extends the accuracy and speed benefits from recent advances in deep learning to 2D-3D exemplar detection. We applied our method to two tasks: instance detection, where we evaluated on the IKEA dataset, and object category detection, where we out-perform Aubry et al. for "chair" detection on a subset of the Pascal VOC dataset.
Submitted 18 April, 2016; v1 submitted 8 December, 2015;
originally announced December 2015.
-
Dynamical moments reveal a topological quantum transition in a photonic quantum walk
Authors:
Filippo Cardano,
Maria Maffei,
Francesco Massa,
Bruno Piccirillo,
Corrado de Lisio,
Giulio De Filippis,
Vittorio Cataudella,
Enrico Santamato,
Lorenzo Marrucci
Abstract:
Many phenomena in solid-state physics can be understood in terms of their topological properties. Recently, controlled protocols of quantum walks are proving to be effective simulators of such phenomena. Here we report the realization of a photonic quantum walk showing both the trivial and the non-trivial topologies associated with chiral symmetry in one-dimensional periodic systems, as in the Su-Schrieffer-Heeger model of polyacetylene. We find that the probability distribution moments of the walker position after many steps behave differently in the two topological phases and can be used as direct indicators of the quantum transition: while varying a control parameter, these moments exhibit a slope discontinuity at the transition point, and remain constant in the non-trivial phase. Extending this approach to higher dimensions, different topological classes, and other typologies of quantum phases may offer new general instruments for investigating quantum transitions in such complex systems.
Submitted 7 July, 2015;
originally announced July 2015.
-
Convolutional Neural Networks for joint object detection and pose estimation: A comparative study
Authors:
Francisco Massa,
Mathieu Aubry,
Renaud Marlet
Abstract:
In this paper we study the application of convolutional neural networks for jointly detecting objects depicted in still images and estimating their 3D pose. We identify different feature representations of oriented objects, and energies that lead a network to learn these representations. The choice of the representation is crucial since the pose of an object has a natural, continuous structure while its category is a discrete variable. We evaluate the different approaches on the joint object detection and pose estimation task of the Pascal3D+ benchmark using Average Viewpoint Precision. We show that a classification approach on discretized viewpoints achieves state-of-the-art performance for joint object detection and pose estimation, and significantly outperforms existing baselines on this benchmark.
Submitted 28 February, 2015; v1 submitted 22 December, 2014;
originally announced December 2014.
-
Quantum walks and wavepacket dynamics on a lattice with twisted photons
Authors:
Filippo Cardano,
Francesco Massa,
Hammam Qassim,
Ebrahim Karimi,
Sergei Slussarenko,
Domenico Paparo,
Corrado de Lisio,
Fabio Sciarrino,
Enrico Santamato,
Robert W. Boyd,
Lorenzo Marrucci
Abstract:
The "quantum walk" has emerged recently as a paradigmatic process for the dynamic simulation of complex quantum systems, entanglement production and quantum computation. Hitherto, photonic implementations of quantum walks have mainly been based on multi-path interferometric schemes in real space. Here, we report the experimental realization of a discrete quantum walk taking place in the orbital angular momentum space of light, both for a single photon and for two simultaneous photons. In contrast to previous implementations, the whole process develops in a single light beam, with no need of interferometers; it requires optical resources scaling linearly with the number of steps; and it allows flexible control of input and output superposition states. Exploiting the latter property, we explored the system band structure in momentum space and the associated spin-orbit topological features by simulating the quantum dynamics of Gaussian wavepackets. Our demonstration introduces a novel versatile photonic platform for quantum simulations.
Submitted 22 April, 2015; v1 submitted 21 July, 2014;
originally announced July 2014.
-
Non-linear oscillator models for the X-ray bursting of the microquasar GRS 1915+105
Authors:
E. Massaro,
A. Ardito,
P. Ricciardi,
F. Massa,
T. Mineo,
A. D'Ai'
Abstract:
The microquasar GRS 1915+105 exhibits a large variety of characteristic states, according to its luminosity, spectral state, and variability. The most interesting one is the so-called rho-state, whose light curve shows recurrent bursts. This paper presents a model based on FitzHugh-Nagumo equations containing two variables: x, linked to the source photon luminosity L detected by the MECS, and y, related to the mean photon energy. We aim at providing a simple mathematical framework composed of non-linear differential equations useful to predict the observed light curve and the energy lags for the rho-state and possibly other classes of the source. We studied the equilibrium state and the stability conditions of this system, which includes one external parameter, J, that can be considered a function of the disk accretion rate. Our work is based on observations performed with the MECS on board BeppoSAX when the source was in rho and nu mode, respectively. The evolution of the mean count rate and photon energy was derived from a study of the trajectories in the count rate - photon energy plane. Assuming J constant, we found a solution that reproduces the x profile of the rho class bursts and, unexpectedly, we found that y exhibited a time modulation similar to that of the mean energy. Moreover, assuming a slowly modulated J, solutions for x quite similar to those observed in the nu class light curves are reproduced. According to these results, the outer mass accretion rate is probably responsible for the state transitions, but within the rho-class it is constant. This finding strengthens the heuristic meaning of the non-linear model and suggests a simple relation between the variables x and y. However, how a system of dynamical equations can be derived from the complex mathematical apparatus of accretion disks remains to be further explored.
Submitted 29 April, 2014;
originally announced April 2014.
-
Photonic quantum walk in a single beam with twisted light
Authors:
Filippo Cardano,
Francesco Massa,
Ebrahim Karimi,
Sergei Slussarenko,
Domenico Paparo,
Corrado de Lisio,
Fabio Sciarrino,
Enrico Santamato,
Lorenzo Marrucci
Abstract:
Inspired by the classical phenomenon of random walk, the concept of quantum walk has emerged recently as a powerful platform for the dynamical simulation of complex quantum systems, entanglement production and universal quantum computation. Such a wide perspective motivates a renewed search for efficient, scalable and stable implementations of this quantum process. Photonic approaches have hitherto mainly focused on multi-path schemes, requiring interferometric stability and a number of optical elements that scales quadratically with the number of steps. Here we report the experimental realization of a quantum walk taking place in the orbital angular momentum space of light, both for a single photon and for two simultaneous indistinguishable photons. The whole process develops in a single light beam, with no need for interferometers, and requires optical resources scaling linearly with the number of steps. Our demonstration introduces a novel versatile photonic platform for implementing quantum simulations, based on exploiting the transverse modes of a single light beam as quantum degrees of freedom.
Submitted 19 March, 2014;
originally announced March 2014.
-
The complex time behaviour of the microquasar GRS 1915+105 in the ρ-class observed with BeppoSAX. III: The hard X-ray delay and limit cycle mapping
Authors:
F. Massa,
E. Massaro,
T. Mineo,
A. D'Aì,
M. Feroci,
P. Casella,
T. Belloni
Abstract:
The microquasar GRS1915+105 was observed by BeppoSAX in October 2000 for about ten days while the source was in ρ-mode, which is characterized by a quasi-regular type I bursting activity. This paper presents a systematic analysis of the delay of the hard and soft X-ray emission at the burst peaks. The lag, also apparent from the comparison of the [1.7-3.4] keV light curves with those in the [6.8-10.2] keV range, is evaluated and studied as a function of time, spectral parameters, and flux. We apply the limit cycle mapping technique, using as independent variables the count rate and the mean photon rate. The results using this technique were also cross-checked using a more standard approach with the cross-correlation methods. Data are organized in runs, each relative to a continuous observation interval. The detected hard-soft delay changes in the course of the pointing from about 3 s to about 10 s and presents a clear correlation with the baseline count rate.
Submitted 1 July, 2013; v1 submitted 7 June, 2013;
originally announced June 2013.
-
The complex behaviour of the microquasar GRS 1915+105 in the rho class observed with BeppoSAX. II: Time-resolved spectral analysis
Authors:
T. Mineo,
E. Massaro,
A. D'Ai,
F. Massa,
M. Feroci,
G. Ventura,
P. Casella,
C. Ferrigno,
T. Belloni
Abstract:
BeppoSAX observed GRS 1915+105 in October 2000 with a long pointing lasting about ten days. During this observation, the source was mainly in the rho class characterized by bursts with a recurrence time of between 40 and 100 s. We identify five segments in the burst structure and accumulate the average spectra of these segments during each satellite orbit. We present a detailed spectral analysis aimed at determining variations that occur during the burst and understanding the physical process that produces them. We compare MECS, HPGSPC, and PDS spectra with several models. Under the assumption that a single model is able to fit all spectra, we find that the combination of a multi-temperature black-body disk and a hybrid corona is able to give a consistent physical explanation of the source behaviour. Our measured variations in KT_el, tau, KT_in, and R_in appear to be either correlated or anti-correlated with the count rate in the energy range 1.6-10 keV. The strongest variations are detected along the burst segments: almost all parameters exhibit significant variations in the segments that have the highest fluxes (pulse) with the exception of R_in, which varies continuously and reaches a maximum just before the peak. The flux of the multi-temperature disk strongly increases in the pulse and simultaneously the corona contribution is significantly reduced. The disk luminosity increases in the pulse and the R_in-T_in correlation can be most successfully interpreted in terms of the slim disk model. In addition, the reduction in the corona luminosity at the bursts might represent the condensation of the corona onto the disk.
Submitted 17 November, 2011; v1 submitted 24 October, 2011;
originally announced October 2011.
-
The complex behaviour of the microquasar GRS 1915+105 in the rho class observed with BeppoSAX. I: Timing analysis
Authors:
E. Massaro,
G. Ventura,
F. Massa,
M. Feroci,
T. Mineo,
G. Cusumano,
P. Casella,
T. Belloni
Abstract:
GRS 1915+105 was observed by BeppoSAX for about 10 days in October 2000. For about 80% of the time, the source was in the variability class ρ, characterised by a series of recurrent bursts. We describe the results of the timing analysis performed on the MECS (1.6-10 keV) and PDS (15-100 keV) data. The X-ray count rate from GRS 1915+105 showed an increasing trend with different characteristics in the various energy bands. Fourier and wavelet analyses detect a variation in the recurrence time of the bursts, from 45-50 s to about 75 s, which appears well correlated with the count rate. From the power distribution of peaks in Fourier periodograms and wavelet spectra, we distinguished between the regular and irregular variability modes of the ρ class, which are related to variations in the count rate in the 3-10 keV range. We identified two components in the burst structure: the slow leading trail, and the pulse, superimposed on a rather stable level. We found that the change in the recurrence time of the regular mode is caused by the slow leading trails, while the duration of the pulse phase remains far more stable. The evolution in the mean count rates shows that the time behaviour of both the leading trail and the baseline level are very similar to those observed in the 1.6-3 and 15-100 keV ranges, while that of the pulse follows the peak number. These differences in the time behaviour and count rates at different energies indicate that the process responsible for the pulses must produce the strongest emission between 3 and 10 keV, while that associated with both the leading trail and the baseline dominates at lower and higher energies.
Submitted 25 January, 2010;
originally announced January 2010.
-
Cherenkov Flashes and Fluorescence Flares on Telescopes: New lights on UHECR Spectroscopy while unveiling Neutrinos Astronomy
Authors:
D. Fargion,
P. Oliva,
F. Massa,
G. Moreno
Abstract:
Cherenkov Telescopes (such as Magic, Hess and Veritas), while pointing horizontally, should also reveal the fluorescence flare tails of nearby down-going air-showers. Such air-showers, born at higher (tens of km) altitudes, are growing and extending up to lowest atmospheres (EeVs) or up to higher (few km) altitudes (PeVs). Vice versa, as it has been foreseen and only recently observed, the opposite takes place. Fluorescence Telescopes made for UHECR detection may be blazed by inclined Cherenkov lights. The geomagnetic splitting may tag the energy as well as the inclined shower footprint as seen in a recent peculiar event in AUGER. Additional stereoscopic detection may define the event origination distance and its consequent primary composition, extending our understanding of UHECR composition, while unveiling a novel tau Neutrino Astronomy.
Submitted 31 October, 2007; v1 submitted 20 October, 2007;
originally announced October 2007.
-
A Novel Approach for an Integrated Straw tube-Microstrip Detector
Authors:
E. Basile,
F. Bellucci,
L. Benussi,
M. Bertani,
S. Bianco,
M. A. Caponero,
D. Colonna,
F. Di Falco,
F. L. Fabbri,
F. Felli,
M. Giardoni,
A. La Monaca,
G. Mensitieri,
B. Ortenzi,
M. Pallotta,
A. Paolozzi,
L. Passamonti,
D. Pierluigi,
C. Pucci,
A. Russo,
G. Saviano,
F. Massa,
F. Casali,
M. Bettuzzi,
D. Bianconi,
F. Baruffaldi
, et al. (1 additional authors not shown)
Abstract:
We report on a novel concept for an integrated silicon microstrip and straw tube detector, where integration is accomplished by a straw module with straws not subjected to mechanical tension in a Rohacell® lattice and carbon fiber reinforced plastic shell. Results on mechanical and test beam performance are also reported.
Submitted 28 December, 2005;
originally announced December 2005.
-
Two- and Three-Dimensional Reconstruction and Analysis of the Straw Tubes Tomography in the BTeV Experiment
Authors:
E. Basile,
F. Bellucci,
L. Benussi,
M. Bertani,
S. Bianco,
M. A. Caponero,
D. Colonna,
F. Di Falco,
F. L. Fabbri,
F. Felli,
M. Giardoni,
A. La Monaca,
F. Massa,
G. Mensitieri,
B. Ortenzi,
M. Pallotta,
A. Paolozzi,
L. Passamonti,
D. Pierluigi,
C. Pucci,
A. Russo,
G. Saviano,
F. Casali,
M. Bettuzzi,
D. Bianconi
Abstract:
A check of the eccentricity of the aluminised kapton straw tubes used in the BTeV experiment is accomplished using X-ray tomography of the section of tube modules. Two- and three-dimensional images of the single tubes and of the modules are reconstructed and analysed. Preliminary results show that a precision better than 40 μm can be reached on the measurement of the straw radii.
Submitted 28 December, 2005;
originally announced December 2005.
-
Micrometric Position Monitoring Using Fiber Bragg Grating Sensors in Silicon Detectors
Authors:
E. Basile,
F. Bellucci,
L. Benussi,
M. Bertani,
S. Bianco,
M. A. Caponero,
D. Colonna,
F. Di Falco,
F. L. Fabbri,
F. Felli,
M. Giardoni,
A. La Monaca,
F. Massa,
G. Mensitieri,
B. Ortenzi,
M. Pallotta,
A. Paolozzi,
L. Passamonti,
D. Pierluigi,
C. Pucci,
A. Russo,
G. Saviano
Abstract:
We show R&D results including long-term stability, resolution, radiation hardness and characterization of Fiber Grating sensors used to monitor structure deformation, repositioning and surveying of silicon detectors in High Energy Physics.
Submitted 28 December, 2005;
originally announced December 2005.