Skip to main content

Showing 1–50 of 61 results for author: Murino, V

.
  1. arXiv:2310.02201  [pdf, other

    cs.CV

    Learnable Data Augmentation for One-Shot Unsupervised Domain Adaptation

    Authors: Julio Ivan Davila Carrazco, Pietro Morerio, Alessio Del Bue, Vittorio Murino

    Abstract: This paper presents a classification framework based on learnable data augmentation to tackle the One-Shot Unsupervised Domain Adaptation (OS-UDA) problem. OS-UDA is the most challenging setting in Domain Adaptation, as only one single unlabeled target sample is assumed to be available for model adaptation. Driven by such single sample, our method LearnAug-UDA learns how to augment source data, ma… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted to The 34th British Machine Vision Conference (BMVC 2023)

  2. arXiv:2308.08303  [pdf, other

    cs.CV

    Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: Objects are crucial for understanding human-object interactions. By identifying the relevant objects, one can also predict potential future interactions or actions that may occur with these objects. In this paper, we study the problem of Short-Term Object interaction anticipation (STA) and propose NAOGAT (Next-Active-Object Guided Anticipation Transformer), a multi-modal end-to-end transformer net… ▽ More

    Submitted 5 October, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted in WACV'24

  3. arXiv:2305.16066  [pdf, other

    cs.CV

    Guided Attention for Next Active Object @ EGO4D STA Challenge

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: In this technical report, we describe the Guided-Attention mechanism based solution for the short-term anticipation (STA) challenge for the EGO4D challenge. It combines the object detections, and the spatiotemporal features extracted from video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of ST… ▽ More

    Submitted 4 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Winner of CVPR@2023 Ego4D STA challenge. arXiv admin note: substantial text overlap with arXiv:2305.12953

  4. arXiv:2305.12953  [pdf, other

    cs.CV

    Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: Short-term action anticipation (STA) in first-person videos is a challenging task that involves understanding the next active object interactions and predicting future actions. Existing action anticipation methods have primarily focused on utilizing features extracted from video clips, but often overlooked the importance of objects and their interactions. To this end, we propose a novel approach t… ▽ More

    Submitted 23 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE ICIP 2023, see project page here : https://sanketsans.github.io/guided-attention-egocentric.html

  5. arXiv:2305.04628  [pdf, other

    cs.CV

    Target-driven One-Shot Unsupervised Domain Adaptation

    Authors: Julio Ivan Davila Carrazco, Suvarna Kishorkumar Kadam, Pietro Morerio, Alessio Del Bue, Vittorio Murino

    Abstract: In this paper, we introduce a novel framework for the challenging problem of One-Shot Unsupervised Domain Adaptation (OSUDA), which aims to adapt to a target domain with only a single unlabeled target sample. Unlike existing approaches that rely on large labeled source and unlabeled target data, our Target-driven One-Shot UDA (TOS-UDA) approach employs a learnable augmentation strategy guided by t… ▽ More

    Submitted 17 July, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to 22nd International Conference on IMAGE ANALYSIS AND PROCESSING (ICIAP) 2023

    Journal ref: 22nd International Conference on IMAGE ANALYSIS AND PROCESSING (ICIAP) 2023

  6. arXiv:2304.07374  [pdf, other

    cs.CV

    Continual Source-Free Unsupervised Domain Adaptation

    Authors: Waqar Ahmed, Pietro Morerio, Vittorio Murino

    Abstract: Existing Source-free Unsupervised Domain Adaptation (SUDA) approaches inherently exhibit catastrophic forgetting. Typically, models trained on a labeled source domain and adapted to unlabeled target data improve performance on the target while drop** performance on the source, which is not available during adaptation. In this study, our goal is to cope with the challenging problem of SUDA in a c… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted at International Conference on Image Analysis and Processing, 2023

  7. arXiv:2302.06358  [pdf, other

    cs.CV

    Anticipating Next Active Objects for Egocentric Videos

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: This paper addresses the problem of anticipating the next-active-object location in the future, for a given egocentric video clip where the contact might happen, before any action takes place. The problem is considerably hard, as we aim at estimating the position of such objects in a scenario where the observed clip and the action segment are separated by the so-called ``time to contact'' (TTC) se… ▽ More

    Submitted 1 May, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE ACCESS, this paper carries the Manuscript DOI: 10.1109/ACCESS.2024.3395282. The complete peer-reviewed version is available via this DOI, while the arXiv version is a post-author manuscript without peer-review

  8. arXiv:2207.12842  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation for Video Transformers in Action Recognition

    Authors: Victor G. Turrisi da Costa, Giacomo Zara, Paolo Rota, Thiago Oliveira-Santos, Nicu Sebe, Vittorio Murino, Elisa Ricci

    Abstract: Over the last few years, Unsupervised Domain Adaptation (UDA) techniques have acquired remarkable importance and popularity in computer vision. However, when compared to the extensive literature available for images, the field of videos is still relatively unexplored. On the other hand, the performance of a model in action recognition is heavily affected by domain shift. In this paper, we propose… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted at ICPR 2022

  9. arXiv:2104.09191  [pdf, other

    cs.CV

    Compact CNN Structure Learning by Knowledge Distillation

    Authors: Waqar Ahmed, Andrea Zunino, Pietro Morerio, Vittorio Murino

    Abstract: The concept of compressing deep Convolutional Neural Networks (CNNs) is essential to use limited computation, power, and memory resources on embedded devices. However, existing methods achieve this objective at the cost of a drop in inference accuracy in computer vision tasks. To address such a drawback, we propose a framework that leverages knowledge distillation along with customizable block-wis… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: This paper has been accepted to ICPR 2020

  10. arXiv:2103.15973  [pdf, other

    cs.CV

    Adaptive Pseudo-Label Refinement by Negative Ensemble Learning for Source-Free Unsupervised Domain Adaptation

    Authors: Waqar Ahmed, Pietro Morerio, Vittorio Murino

    Abstract: The majority of existing Unsupervised Domain Adaptation (UDA) methods presumes source and target domain data to be simultaneously available during training. Such an assumption may not hold in practice, as source data is often inaccessible (e.g., due to privacy reasons). On the contrary, a pre-trained source model is always considered to be available, even though performing poorly on target due to… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

  11. arXiv:2103.12437  [pdf, other

    cs.CV

    Learning without Seeing nor Knowing: Towards Open Zero-Shot Learning

    Authors: Federico Marmoreo, Julio Ivan Davila Carrazco, Vittorio Murino, Jacopo Cavazza

    Abstract: In Generalized Zero-Shot Learning (GZSL), unseen categories (for which no visual data are available at training time) can be predicted by leveraging their class embeddings (e.g., a list of attributes describing them) together with a complementary pool of seen classes (paired with both visual data and class embeddings). Despite GZSL is arguably challenging, we posit that knowing in advance the clas… ▽ More

    Submitted 14 September, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  12. arXiv:2102.03266  [pdf, other

    cs.CV

    Transductive Zero-Shot Learning by Decoupled Feature Generation

    Authors: Federico Marmoreo, Jacopo Cavazza, Vittorio Murino

    Abstract: In this paper, we address zero-shot learning (ZSL), the problem of recognizing categories for which no labeled visual data are available during training. We focus on the transductive setting, in which unlabelled visual data from unseen classes is available. State-of-the-art paradigms in ZSL typically exploit generative adversarial networks to synthesize visual features from semantic attributes. We… ▽ More

    Submitted 14 September, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Published at the IEEE/CVF Winter Conference on Computer Vision (WACV) 2021

  13. arXiv:2010.09557  [pdf, other

    cs.CV

    A Versatile Crack Inspection Portable System based on Classifier Ensemble and Controlled Illumination

    Authors: Milind G. Padalkar, Carlos Beltrán-González, Matteo Bustreo, Alessio Del Bue, Vittorio Murino

    Abstract: This paper presents a novel setup for automatic visual inspection of cracks in ceramic tile as well as studies the effect of various classifiers and height-varying illumination conditions for this task. The intuition behind this setup is that cracks can be better visualized under specific lighting conditions than others. Our setup, which is designed for field work with constraints in its maximum d… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted in ICPR 2020

  14. arXiv:2010.07906  [pdf, other

    cs.MS cs.LG

    DSLib: An open source library for the dominant set clustering method

    Authors: Sebastiano Vascon, Samuel Rota Bulò, Vittorio Murino, Marcello Pelillo

    Abstract: DSLib is an open-source implementation of the Dominant Set (DS) clustering algorithm written entirely in Matlab. The DS method is a graph-based clustering technique rooted in the evolutionary game theory that starts gaining lots of interest in the computer science community. Thanks to its duality with game theory and its strict relation to the notion of maximal clique, has been explored in several… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  15. arXiv:2005.04813  [pdf, other

    cs.CV cs.LG eess.IV

    The Visual Social Distancing Problem

    Authors: Marco Cristani, Alessio Del Bue, Vittorio Murino, Francesco Setti, Alessandro Vinciarelli

    Abstract: One of the main and most effective measures to contain the recent viral outbreak is the maintenance of the so-called Social Distancing (SD). To comply with this constraint, workplaces, public institutions, transports and schools will likely adopt restrictions over the minimum inter-personal distance between people. Given this actual scenario, it is crucial to massively measure the compliance to su… ▽ More

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: 9 pages, 5 figures. All the authors equally contributed to this manuscript and they are listed by alphabetical order. Under submission

  16. arXiv:2004.09374  [pdf, other

    cs.CV

    Complex-Object Visual Inspection via Multiple Lighting Configurations

    Authors: Maya Aghaei, Matteo Bustreo, Pietro Morerio, Nicolo Carissimi, Alessio Del Bue, Vittorio Murino

    Abstract: The design of an automatic visual inspection system is usually performed in two stages. While the first stage consists in selecting the most suitable hardware setup for highlighting most effectively the defects on the surface to be inspected, the second stage concerns the development of algorithmic solutions to exploit the potentials offered by the collected data. In this paper, first, we presen… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: 8 pages, 7 figures, submitted to ICPR2020

  17. arXiv:2004.08270  [pdf, other

    cs.CV

    Weakly Supervised Geodesic Segmentation of Egyptian Mummy CT Scans

    Authors: Avik Hati, Matteo Bustreo, Diego Sona, Vittorio Murino, Alessio Del Bue

    Abstract: In this paper, we tackle the task of automatically analyzing 3D volumetric scans obtained from computed tomography (CT) devices. In particular, we address a particular task for which data is very limited: the segmentation of ancient Egyptian mummies CT scans. We aim at digitally unwrap** the mummy and identify different segments such as body, bandages and jewelry. The problem is complex because… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

  18. arXiv:2003.06498  [pdf, other

    cs.CV

    Explainable Deep Classification Models for Domain Generalization

    Authors: Andrea Zunino, Sarah Adel Bargal, Riccardo Volpi, Mehrnoosh Sameki, Jianming Zhang, Stan Sclaroff, Vittorio Murino, Kate Saenko

    Abstract: Conventionally, AI models are thought to trade off explainability for lower accuracy. We develop a training strategy that not only leads to a more explainable AI system for object classification, but as a consequence, suffers no perceptible accuracy degradation. Explanations are defined as regions of visual evidence upon which a deep classification network makes a decision. This is represented in… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

  19. arXiv:2003.06430  [pdf, other

    cs.CV

    Learning Unbiased Representations via Mutual Information Backpropagation

    Authors: Ruggero Ragonesi, Riccardo Volpi, Jacopo Cavazza, Vittorio Murino

    Abstract: We are interested in learning data-driven representations that can generalize well, even when trained on inherently biased data. In particular, we face the case where some attributes (bias) of the data, if learned by the model, can severely compromise its generalization properties. We tackle this problem through the lens of information theory, leveraging recent findings for a differentiable estima… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: Code publicly available at https://github.com/rugrag/learn-unbiased

  20. arXiv:2002.05046  [pdf, other

    cs.CV

    Intra-Camera Supervised Person Re-Identification

    Authors: ** Zhu, Xiatian Zhu, Minxian Li, Pietro Morerio, Vittorio Murino, Shaogang Gong

    Abstract: Existing person re-identification (re-id) methods mostly exploit a large set of cross-camera identity labelled training data. This requires a tedious data collection and annotation process, leading to poor scalability in practical re-id applications. On the other hand unsupervised re-id methods do not need identity label information, but they usually suffer from much inferior and insufficient mode… ▽ More

    Submitted 16 January, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Accepted to IJCV

  21. arXiv:2001.02950  [pdf, other

    cs.CV

    Generative Pseudo-label Refinement for Unsupervised Domain Adaptation

    Authors: Pietro Morerio, Riccardo Volpi, Ruggero Ragonesi, Vittorio Murino

    Abstract: We investigate and characterize the inherent resilience of conditional Generative Adversarial Networks (cGANs) against noise in their conditioning labels, and exploit this fact in the context of Unsupervised Domain Adaptation (UDA). In UDA, a classifier trained on the labelled source set can be used to infer pseudo-labels on the unlabelled target set. However, this will result in a significant amo… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

  22. arXiv:1912.10982  [pdf, other

    cs.CV

    DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition

    Authors: Nuno C. Garcia, Sarah Adel Bargal, Vitaly Ablavsky, Pietro Morerio, Vittorio Murino, Stan Sclaroff

    Abstract: In this work, we address the problem of learning an ensemble of specialist networks using multimodal data, while considering the realistic and challenging scenario of possible missing modalities at test time. Our goal is to leverage the complementary information of multiple modalities to the benefit of the ensemble and each individual network. We introduce a novel Distillation Multiple Choice Lear… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

  23. Aggregation Signature for Small Object Tracking

    Authors: Chunlei Liu, Wenrui Ding, **yu Yang, Vittorio Murino, Baochang Zhang, Jungong Han, Guodong Guo

    Abstract: Small object tracking becomes an increasingly important task, which however has been largely unexplored in computer vision. The great challenges stem from the facts that: 1) small objects show extreme vague and variable appearances, and 2) they tend to be lost easier as compared to normal-sized ones due to the shaking of lens. In this paper, we propose a novel aggregation signature suitable for sm… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: IEEE Transactions on Image Processing, 2019

  24. arXiv:1910.10035  [pdf, other

    eess.IV cs.CV

    Scanner Invariant Multiple Sclerosis Lesion Segmentation from MRI

    Authors: Shahab Aslani, Vittorio Murino, Michael Dayan, Roger Tam, Diego Sona, Ghassan Hamarneh

    Abstract: This paper presents a simple and effective generalization method for magnetic resonance imaging (MRI) segmentation when data is collected from multiple MRI scanning sites and as a consequence is affected by (site-)domain shifts. We propose to integrate a traditional encoder-decoder network with a regularization network. This added network includes an auxiliary loss term which is responsible for th… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

  25. arXiv:1908.10359  [pdf, other

    cs.CV

    Unsupervised Domain-Adaptive Person Re-identification Based on Attributes

    Authors: ** Zhu, Pietro Morerio, Vittorio Murino

    Abstract: Pedestrian attributes, e.g., hair length, clothes type and color, locally describe the semantic appearance of a person. Training person re-identification (ReID) algorithms under the supervision of such attributes have proven to be effective in extracting local features which are important for ReID. Unlike person identity, attributes are consistent across different domains (or datasets). However, m… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: 5 pages, accepted by ICIP2019

  26. arXiv:1908.10344  [pdf, other

    cs.CV

    Intra-Camera Supervised Person Re-Identification: A New Benchmark

    Authors: ** Zhu, Xiatian Zhu, Minxian Li, Vittorio Murino, Shaogang Gong

    Abstract: Existing person re-identification (re-id) methods rely mostly on a large set of inter-camera identity labelled training data, requiring a tedious data collection and annotation process therefore leading to poor scalability in practical re-id applications. To overcome this fundamental limitation, we consider person re-identification without inter-camera identity association but only with identity l… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: 9 pages, 3 figures, accepted by ICCV Workshop on Real-World Recognition from Low-Quality Images and Videos, 2019

  27. arXiv:1904.07933  [pdf, other

    cs.CV cs.SD eess.AS

    Audio-Visual Model Distillation Using Acoustic Images

    Authors: Andrés F. Pérez, Valentina Sanguineti, Pietro Morerio, Vittorio Murino

    Abstract: In this paper, we investigate how to learn rich and robust feature representations for audio classification from visual data and acoustic images, a novel audio data modality. Former models learn audio representations from raw signals or spectral data acquired by a single microphone, with remarkable results in classification and retrieval. However, such representations are not so robust towards var… ▽ More

    Submitted 11 February, 2020; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: Accepted at WACV 2020; supplementary material at page 11; code available at https://github.com/afperezm/acoustic-images-distillation

  28. arXiv:1903.11900  [pdf, other

    cs.LG cs.CV stat.ML

    Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets

    Authors: Riccardo Volpi, Vittorio Murino

    Abstract: We are concerned with the vulnerability of computer vision models to distributional shifts. We formulate a combinatorial optimization problem that allows evaluating the regions in the image space where a given model is more vulnerable, in terms of image transformations applied to the input, and face it with standard search algorithms. We further embed this idea in a training procedure, where we de… ▽ More

    Submitted 20 August, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: ICCV 2019 (camera ready)

  29. arXiv:1902.01395  [pdf

    q-bio.NC cs.LG stat.ML

    Comparison of brain connectomes using geodesic distance on manifold:a twin study

    Authors: A. Yamin, M. Dayan, L. Squarcina, P. Brambilla, V. Murino, V. Diwadkar, D. Sona

    Abstract: fMRI is a unique non-invasive approach for understanding the functional organization of the human brain, and task-based fMRI promotes identification of functionally relevant brain regions associated with a given task. Here, we use fMRI (using the Poffenberger Paradigm) data collected in mono- and dizygotic twin pairs to propose a novel approach for assessing similarity in functional networks. In p… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

    Comments: Paper is accepted for presentation in ISBI 2019. Camera ready has been submitted on 15 Jan 2019

  30. arXiv:1812.02626  [pdf, other

    cs.CV

    Guided Zoom: Questioning Network Evidence for Fine-grained Classification

    Authors: Sarah Adel Bargal, Andrea Zunino, Vitali Petsiuk, Jianming Zhang, Kate Saenko, Vittorio Murino, Stan Sclaroff

    Abstract: We propose Guided Zoom, an approach that utilizes spatial grounding of a model's decision to make more informed predictions. It does so by making sure the model has "the right reasons" for a prediction, defined as reasons that are coherent with those used to make similar correct decisions at training time. The reason/evidence upon which a deep convolutional neural network makes a prediction is def… ▽ More

    Submitted 23 March, 2020; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: BMVC 2019 Camera Ready Version

  31. Multi-branch Convolutional Neural Network for Multiple Sclerosis Lesion Segmentation

    Authors: Shahab Aslani, Michael Dayan, Loredana Storelli, Massimo Filippi, Vittorio Murino, Maria A Rocca, Diego Sona

    Abstract: In this paper, we present an automated approach for segmenting multiple sclerosis (MS) lesions from multi-modal brain magnetic resonance images. Our method is based on a deep end-to-end 2D convolutional neural network (CNN) for slice-based segmentation of 3D volumetric data. The proposed CNN includes a multi-branch downsampling path, which enables the network to encode information from multiple mo… ▽ More

    Submitted 8 April, 2019; v1 submitted 7 November, 2018; originally announced November 2018.

    Comments: This paper has been accepted for publication in NeuroImage

  32. Learning with privileged information via adversarial discriminative modality distillation

    Authors: Nuno C. Garcia, Pietro Morerio, Vittorio Murino

    Abstract: Heterogeneous data modalities can provide complementary cues for several tasks, usually leading to more robust algorithms and better performance. However, while training data can be accurately collected to include a variety of sensory modalities, it is often the case that not all of them are available in real life (testing) scenarios, where a model has to be deployed. This raises the challenge of… ▽ More

    Submitted 26 July, 2019; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  33. arXiv:1806.07110  [pdf, other

    cs.CV

    Modality Distillation with Multiple Stream Networks for Action Recognition

    Authors: Nuno Garcia, Pietro Morerio, Vittorio Murino

    Abstract: Diverse input data modalities can provide complementary cues for several tasks, usually leading to more robust algorithms and better performance. However, while a (training) dataset could be accurately designed to include a variety of sensory inputs, it is often the case that not all modalities could be available in real life (testing) scenarios, where a model has to be deployed. This raises the c… ▽ More

    Submitted 29 October, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted at ECCV 2018; Supp. material at p.16; code available

  34. arXiv:1805.12018  [pdf, other

    cs.CV

    Generalizing to Unseen Domains via Adversarial Data Augmentation

    Authors: Riccardo Volpi, Hongseok Namkoong, Ozan Sener, John Duchi, Vittorio Murino, Silvio Savarese

    Abstract: We are concerned with learning models that generalize well to different \emph{unseen} domains. We consider a worst-case formulation over data distributions that are near the source domain in the feature space. Only using training data from a single source distribution, we propose an iterative procedure that augments the dataset with examples from a fictitious target domain that is "hard" under the… ▽ More

    Submitted 6 November, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted to NIPS 2018 (camera ready)

  35. arXiv:1805.09092  [pdf, other

    cs.CV

    Excitation Dropout: Encouraging Plasticity in Deep Neural Networks

    Authors: Andrea Zunino, Sarah Adel Bargal, Pietro Morerio, Jianming Zhang, Stan Sclaroff, Vittorio Murino

    Abstract: We propose a guided dropout regularizer for deep networks based on the evidence of a network prediction defined as the firing of neurons in specific paths. In this work, we utilize the evidence at each neuron to determine the probability of dropout, rather than drop** out neurons uniformly at random as in standard dropout. In essence, we dropout with higher probability those neurons which contri… ▽ More

    Submitted 21 January, 2021; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: This work is published in the International Journal of Computer Vision (IJCV) in 2021

  36. arXiv:1711.10290  [pdf, other

    cs.CV

    Scalable and Compact 3D Action Recognition with Approximated RBF Kernel Machines

    Authors: Jacopo Cavazza, Pietro Morerio, Vittorio Murino

    Abstract: Despite the recent deep learning (DL) revolution, kernel machines still remain powerful methods for action recognition. DL has brought the use of large datasets and this is typically a problem for kernel approaches, which are not scaling up efficiently due to kernel Gram matrices. Nevertheless, kernel methods are still attractive and more generally applicable since they can equally manage differen… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

  37. arXiv:1711.10288  [pdf, other

    cs.CV

    Minimal-Entropy Correlation Alignment for Unsupervised Deep Domain Adaptation

    Authors: Pietro Morerio, Jacopo Cavazza, Vittorio Murino

    Abstract: In this work, we face the problem of unsupervised domain adaptation with a novel deep learning approach which leverages on our finding that entropy minimization is induced by the optimal alignment of second order statistics between source and target domains. We formally demonstrate this hypothesis and, aiming at achieving an optimal alignment in practical cases, we adopt a more principled strategy… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

  38. arXiv:1711.08561  [pdf, other

    cs.CV

    Adversarial Feature Augmentation for Unsupervised Domain Adaptation

    Authors: Riccardo Volpi, Pietro Morerio, Silvio Savarese, Vittorio Murino

    Abstract: Recent works showed that Generative Adversarial Networks (GANs) can be successfully applied in unsupervised domain adaptation, where, given a labeled source dataset and an unlabeled target dataset, the goal is to train powerful classifiers for the target samples. In particular, it was shown that a GAN objective function can be used to learn target features indistinguishable from the source ones. I… ▽ More

    Submitted 4 May, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Accepted to CVPR 2018

  39. arXiv:1711.06778  [pdf, other

    cs.CV

    Excitation Backprop for RNNs

    Authors: Sarah Adel Bargal, Andrea Zunino, Donghyun Kim, Jianming Zhang, Vittorio Murino, Stan Sclaroff

    Abstract: Deep models are state-of-the-art for many vision tasks including video action recognition and video captioning. Models are trained to caption or classify activity in videos, but little is known about the evidence used to make such decisions. Grounding decisions made by deep networks has been studied in spatial visual content, giving more insight into model predictions for images. However, such stu… ▽ More

    Submitted 8 March, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: CVPR 2018 Camera Ready Version

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018

  40. arXiv:1710.05092  [pdf, other

    cs.LG stat.ML

    Dropout as a Low-Rank Regularizer for Matrix Factorization

    Authors: Jacopo Cavazza, Pietro Morerio, Benjamin Haeffele, Connor Lane, Vittorio Murino, Rene Vidal

    Abstract: Regularization for matrix factorization (MF) and approximation problems has been carried out in many different ways. Due to its popularity in deep learning, dropout has been applied also for this class of problems. Despite its solid empirical performance, the theoretical properties of dropout as a regularizer remain quite elusive for this class of problems. In this paper, we present a theoretical… ▽ More

    Submitted 13 October, 2017; originally announced October 2017.

  41. arXiv:1710.03487  [pdf, other

    cs.LG stat.ML

    An Analysis of Dropout for Matrix Factorization

    Authors: Jacopo Cavazza, Connor Lane, Benjamin D. Haeffele, Vittorio Murino, René Vidal

    Abstract: Dropout is a simple yet effective algorithm for regularizing neural networks by randomly drop** out units through Bernoulli multiplicative noise, and for some restricted problem classes, such as linear or logistic regression, several theoretical studies have demonstrated the equivalence between dropout and a fully deterministic optimization problem with data-dependent Tikhonov regularization. Th… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

  42. arXiv:1709.01695  [pdf, other

    cs.CV

    A Compact Kernel Approximation for 3D Action Recognition

    Authors: Jacopo Cavazza, Pietro Morerio, Vittorio Murino

    Abstract: 3D action recognition was shown to benefit from a covariance representation of the input data (joint 3D positions). A kernel machine feed with such feature is an effective paradigm for 3D action recognition, yielding state-of-the-art results. Yet, the whole framework is affected by the well-known scalability issue. In fact, in general, the kernel function has to be evaluated for all pairs of insta… ▽ More

    Submitted 4 October, 2017; v1 submitted 6 September, 2017; originally announced September 2017.

    Comments: Best paper award special mention at the 19th edition of the GIRPR International Conference on Image Analysis and Processing (ICIAP) 2017

  43. arXiv:1708.01846  [pdf, other

    cs.CV

    Manifold Constrained Low-Rank Decomposition

    Authors: Chen Chen, Baochang Zhang, Alessio Del Bue, Vittorio Murino

    Abstract: Low-rank decomposition (LRD) is a state-of-the-art method for visual data reconstruction and modelling. However, it is a very challenging problem when the image data contains significant occlusion, noise, illumination variation, and misalignment from rotation or viewpoint changes. We leverage the specific structure of data in order to improve the performance of LRD when the data are not ideal. To… ▽ More

    Submitted 6 August, 2017; originally announced August 2017.

  44. What Will I Do Next? The Intention from Motion Experiment

    Authors: Andrea Zunino, Jacopo Cavazza, Atesh Koul, Andrea Cavallo, Cristina Becchio, Vittorio Murino

    Abstract: In computer vision, video-based approaches have been widely explored for the early classification and the prediction of actions or activities. However, it remains unclear whether this modality (as compared to 3D kinematics) can still be reliable for the prediction of human intentions, defined as the overarching goal embedded in an action sequence. Since the same action can be performed with differ… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops

  45. When Kernel Methods meet Feature Learning: Log-Covariance Network for Action Recognition from Skeletal Data

    Authors: Jacopo Cavazza, Pietro Morerio, Vittorio Murino

    Abstract: Human action recognition from skeletal data is a hot research topic and important in many open domain applications of computer vision, thanks to recently introduced 3D sensors. In the literature, naive methods simply transfer off-the-shelf techniques from video to the skeletal representation. However, the current state-of-the-art is contended between to different paradigms: kernel-based methods an… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 2017 IEEE Computer Vision and Pattern Recognition (CVPR) Workshops

  46. arXiv:1706.03112  [pdf, other

    cs.CV

    Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks

    Authors: Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

    Abstract: Person re-identification is an open and challenging problem in computer vision. Existing approaches have concentrated on either designing the best feature representation or learning optimal matching metrics in a static setting where the number of cameras are fixed in a network. Most approaches have neglected the dynamic and open world nature of the re-identification problem, where a new camera may… ▽ More

    Submitted 9 June, 2017; originally announced June 2017.

    Comments: CVPR 2017 Spotlight

  47. arXiv:1705.08180  [pdf, other

    cs.CV

    Correlation Alignment by Riemannian Metric for Domain Adaptation

    Authors: Pietro Morerio, Vittorio Murino

    Abstract: Domain adaptation techniques address the problem of reducing the sensitivity of machine learning methods to the so-called domain shift, namely the difference between source (training) and target (test) data distributions. In particular, unsupervised domain adaptation assumes no labels are available in the target domain. To this end, aligning second order statistics (covariances) of target and sour… ▽ More

    Submitted 23 May, 2017; originally announced May 2017.

  48. arXiv:1703.06229  [pdf, other

    cs.NE cs.LG stat.ML

    Curriculum Dropout

    Authors: Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, Vittorio Murino

    Abstract: Dropout is a very effective way of regularizing neural networks. Stochastically "drop** out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. Besides, Dropout can be interpreted as an approximate model aggregation technique, where an exponential number of smaller networks are averaged in o… ▽ More

    Submitted 3 August, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

    Comments: Accepted at ICCV (International Conference on Computer Vision) 2017

  49. arXiv:1701.02898  [pdf, other

    cs.CV q-bio.NC

    Modeling Retinal Ganglion Cell Population Activity with Restricted Boltzmann Machines

    Authors: Matteo Zanotto, Riccardo Volpi, Alessandro Maccione, Luca Berdondini, Diego Sona, Vittorio Murino

    Abstract: The retina is a complex nervous system which encodes visual stimuli before higher order processing occurs in the visual cortex. In this study we evaluated whether information about the stimuli received by the retina can be retrieved from the firing rate distribution of Retinal Ganglion Cells (RGCs), exploiting High-Density 64x64 MEA technology. To this end, we modeled the RGC population activity u… ▽ More

    Submitted 17 January, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

  50. arXiv:1609.09251  [pdf, other

    cs.CV

    Kernel Methods on Approximate Infinite-Dimensional Covariance Operators for Image Classification

    Authors: Hà Quang Minh, Marco San Biagio, Loris Bazzani, Vittorio Murino

    Abstract: This paper presents a novel framework for visual object recognition using infinite-dimensional covariance operators of input features in the paradigm of kernel methods on infinite-dimensional Riemannian manifolds. Our formulation provides in particular a rich representation of image features by exploiting their non-linear correlations. Theoretically, we provide a finite-dimensional approximation o… ▽ More

    Submitted 29 September, 2016; originally announced September 2016.

    Comments: 18 double-column pages