Skip to main content

Showing 1–44 of 44 results for author: Tommasi, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01820  [pdf, other

    cs.CV cs.AI

    Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning

    Authors: Leonardo Iurada, Marco Ciccone, Tatiana Tommasi

    Abstract: Recent advances in neural network pruning have shown how it is possible to reduce the computational costs and memory demands of deep learning models before training. We focus on this framework and propose a new pruning at initialization algorithm that leverages the Neural Tangent Kernel (NTK) theory to align the training dynamics of the sparse network with that of the dense one. Specifically, we s… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted CVPR 2024 - https://iurada.github.io/PX

  2. arXiv:2403.19826   

    cs.AI

    Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation

    Authors: Qitian Ma, Shyam Nanda Rai, Carlo Masone, Tatiana Tommasi

    Abstract: In the domain of computer vision, semantic segmentation emerges as a fundamental application within machine learning, wherein individual pixels of an image are classified into distinct semantic categories. This task transcends traditional accuracy metrics by incorporating uncertainty quantification, a critical measure for assessing the reliability of each segmentation prediction. Such quantificati… ▽ More

    Submitted 8 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Premature Submission: accidentally submitted before it was ready

  3. arXiv:2312.11417  [pdf, other

    cs.CV

    PolyDiff: Generating 3D Polygonal Meshes with Diffusion Models

    Authors: Antonio Alliegro, Yawar Siddiqui, Tatiana Tommasi, Matthias Nießner

    Abstract: We introduce PolyDiff, the first diffusion-based approach capable of directly generating realistic and diverse 3D polygonal meshes. In contrast to methods that use alternate 3D shape representations (e.g. implicit representations), our approach is a discrete denoising diffusion probabilistic model that operates natively on the polygonal mesh data structure. This enables learning of both the geomet… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2311.15475  [pdf, other

    cs.CV cs.LG

    MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers

    Authors: Yawar Siddiqui, Antonio Alliegro, Alexey Artemov, Tatiana Tommasi, Daniele Sirigatti, Vladislav Rosov, Angela Dai, Matthias Nießner

    Abstract: We introduce MeshGPT, a new approach for generating triangle meshes that reflects the compactness typical of artist-created meshes, in contrast to dense triangle meshes extracted by iso-surfacing methods from neural fields. Inspired by recent advances in powerful large language models, we adopt a sequence-based approach to autoregressively generate triangle meshes as sequences of triangles. We fir… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: Project Page: https://nihalsid.github.io/mesh-gpt/, Video: https://youtu.be/UV90O1_69_o

  5. arXiv:2311.01885  [pdf, other

    cs.LG cs.RO

    Domain Randomization via Entropy Maximization

    Authors: Gabriele Tiboni, Pascal Klink, Jan Peters, Tatiana Tommasi, Carlo D'Eramo, Georgia Chalvatzaki

    Abstract: Varying dynamics parameters in simulation is a popular Domain Randomization (DR) approach for overcoming the reality gap in Reinforcement Learning (RL). Nevertheless, DR heavily hinges on the choice of the sampling distribution of the dynamics parameters, since high variability is crucial to regularize the agent's behavior but notoriously leads to overly conservative policies when randomizing exce… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Published as a conference paper at ICLR 2024. Project website at https://gabrieletiboni.github.io/doraemon/

  6. arXiv:2310.03388  [pdf, other

    cs.CV cs.LG

    OpenPatch: a 3D patchwork for Out-Of-Distribution detection

    Authors: Paolo Rabino, Antonio Alliegro, Francesco Cappio Borlino, Tatiana Tommasi

    Abstract: Moving deep learning models from the laboratory setting to the open world entails preparing them to handle unforeseen conditions. In several applications the occurrence of novel classes during deployment poses a significant threat, thus it is crucial to effectively detect them. Ideally, this skill should be used when needed without requiring any further computational training effort at every new t… ▽ More

    Submitted 23 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

  7. arXiv:2308.07123  [pdf, other

    cs.CV

    An Outlook into the Future of Egocentric Vision

    Authors: Chiara Plizzari, Gabriele Goletto, Antonino Furnari, Siddhant Bansal, Francesco Ragusa, Giovanni Maria Farinella, Dima Damen, Tatiana Tommasi

    Abstract: What will the future be? We wonder! In this survey, we explore the gap between current research in egocentric vision and the ever-anticipated future, where wearable computing, with outward facing cameras and digital overlays, is expected to be integrated in our every day lives. To understand this gap, the article starts by envisaging the future through character-based stories, showcasing through e… ▽ More

    Submitted 7 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: We invite comments, suggestions and corrections here: https://openreview.net/forum?id=V3974SUk1w

  8. arXiv:2307.06179  [pdf, other

    cs.CV

    Large Class Separation is not what you need for Relational Reasoning-based OOD Detection

    Authors: Lorenzo Li Lu, Giulia D'Ascenzi, Francesco Cappio Borlino, Tatiana Tommasi

    Abstract: Standard recognition approaches are unable to deal with novel categories at test time. Their overconfidence on the known classes makes the predictions unreliable for safety-critical applications such as healthcare or autonomous driving. Out-Of-Distribution (OOD) detection methods provide a solution by identifying semantic novelty. Most of these methods leverage a learning stage on the known data,… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted for publication at ICIAP 2023

  9. arXiv:2303.14411  [pdf, other

    cs.CV cs.AI

    Fairness meets Cross-Domain Learning: a new perspective on Models and Metrics

    Authors: Leonardo Iurada, Silvia Bucci, Timothy M. Hospedales, Tatiana Tommasi

    Abstract: Deep learning-based recognition systems are deployed at scale for several real-world applications that inevitably involve our social life. Although being of great support when making complex decisions, they might capture spurious data correlations and leverage sensitive attributes (e.g. age, gender, ethnicity). How to factor out this information while kee** a high prediction performance is a tas… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

  10. arXiv:2303.04136  [pdf, other

    cs.RO cs.LG

    Domain Randomization for Robust, Affordable and Effective Closed-loop Control of Soft Robots

    Authors: Gabriele Tiboni, Andrea Protopapa, Tatiana Tommasi, Giuseppe Averta

    Abstract: Soft robots are gaining popularity thanks to their intrinsic safety to contacts and adaptability. However, the potentially infinite number of Degrees of Freedom makes their modeling a daunting task, and in many cases only an approximated description is available. This challenge makes reinforcement learning (RL) based approaches inefficient when deployed on a realistic scenario, due to the large do… ▽ More

    Submitted 25 January, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: Presented as conference paper at IEEE/RSJ IROS 2023, Detroit, USA. Project website at https://andreaprotopapa.github.io/dr-soro/

  11. arXiv:2211.06930  [pdf, other

    cs.RO cs.CV

    PaintNet: Unstructured Multi-Path Learning from 3D Point Clouds for Robotic Spray Painting

    Authors: Gabriele Tiboni, Raffaello Camoriano, Tatiana Tommasi

    Abstract: Popular industrial robotic problems such as spray painting and welding require (i) conditioning on free-shape 3D objects and (ii) planning of multiple trajectories to solve the task. Yet, existing solutions make strong assumptions on the form of input surfaces and the nature of output paths, resulting in limited approaches unable to cope with real-data variability. By leveraging on recent advances… ▽ More

    Submitted 6 December, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: Presented as conference paper at IEEE/RSJ IROS 2023, Detroit, USA. Project website at https://gabrieletiboni.github.io/paintnet

  12. arXiv:2207.11554  [pdf, other

    cs.CV

    3DOS: Towards 3D Open Set Learning -- Benchmarking and Understanding Semantic Novelty Detection on Point Clouds

    Authors: Antonio Alliegro, Francesco Cappio Borlino, Tatiana Tommasi

    Abstract: In recent years there has been significant progress in the field of 3D learning on classification, detection and segmentation problems. The vast majority of the existing studies focus on canonical closed-set conditions, neglecting the intrinsic open nature of the real-world. This limits the abilities of robots and autonomous systems involved in safety-critical applications that require managing no… ▽ More

    Submitted 17 January, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

    Comments: Accepted by NeurIPS 2022 Datasets and Benchmarks Track. Code: https://github.com/antoalli/3D_OS

  13. arXiv:2207.08699  [pdf, other

    cs.CV

    Semantic Novelty Detection via Relational Reasoning

    Authors: Francesco Cappio Borlino, Silvia Bucci, Tatiana Tommasi

    Abstract: Semantic novelty detection aims at discovering unknown categories in the test data. This task is particularly relevant in safety-critical applications, such as autonomous driving or healthcare, where it is crucial to recognize unknown objects at deployment time and issue a warning to the user accordingly. Despite the impressive advancements of deep learning research, existing models still need a f… ▽ More

    Submitted 2 September, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  14. arXiv:2206.14661  [pdf, other

    cs.RO cs.LG

    Online vs. Offline Adaptive Domain Randomization Benchmark

    Authors: Gabriele Tiboni, Karol Arndt, Giuseppe Averta, Ville Kyrki, Tatiana Tommasi

    Abstract: Physics simulators have shown great promise for conveniently learning reinforcement learning policies in safe, unconstrained environments. However, transferring the acquired knowledge to the real world can be challenging due to the reality gap. To this end, several methods have been recently proposed to automatically tune simulator parameters with posterior distributions given real data, for use w… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 15 pages, 6 figures

  15. arXiv:2205.14420  [pdf, other

    cs.LG cs.AI cs.NE

    Fault-Aware Design and Training to Enhance DNNs Reliability with Zero-Overhead

    Authors: Niccolò Cavagnero, Fernando Dos Santos, Marco Ciccone, Giuseppe Averta, Tatiana Tommasi, Paolo Rech

    Abstract: Deep Neural Networks (DNNs) enable a wide series of technological advancements, ranging from clinical imaging, to predictive industrial maintenance and autonomous driving. However, recent findings indicate that transient hardware faults may corrupt the models prediction dramatically. For instance, the radiation-induced misprediction probability can be so high to impede a safe deployment of DNNs mo… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

    Comments: 7 pages, 6 figures

  16. arXiv:2203.09257  [pdf, other

    cs.CV cs.RO

    Contrastive Learning for Cross-Domain Open World Recognition

    Authors: Francesco Cappio Borlino, Silvia Bucci, Tatiana Tommasi

    Abstract: The ability to evolve is fundamental for any valuable autonomous agent whose knowledge cannot remain limited to that injected by the manufacturer. Consider for example a home assistant robot: it should be able to incrementally learn new object categories when requested, but also to recognize the same objects in different environments (rooms) and poses (hand-held/on the floor/above furniture), whil… ▽ More

    Submitted 2 September, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: This work has been accepted for publication at IROS 2022

  17. arXiv:2203.05585  [pdf, other

    cs.RO

    End-to-End Learning to Grasp via Sampling from Object Point Clouds

    Authors: Antonio Alliegro, Martin Rudorfer, Fabio Frattin, Aleš Leonardis, Tatiana Tommasi

    Abstract: The ability to grasp objects is an essential skill that enables many robotic manipulation tasks. Recent works have studied point cloud-based methods for object gras** by starting from simulated datasets and have shown promising performance in real-world scenarios. Nevertheless, many of them still rely on ad-hoc geometric heuristics to generate grasp candidates, which fail to generalize to object… ▽ More

    Submitted 6 June, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: 8 pages, under review for RA-L/IROS 2022

  18. arXiv:2107.02067  [pdf, other

    cs.CV

    Distance-based Hyperspherical Classification for Multi-source Open-Set Domain Adaptation

    Authors: Silvia Bucci, Francesco Cappio Borlino, Barbara Caputo, Tatiana Tommasi

    Abstract: Vision systems trained in closed-world scenarios fail when presented with new environmental conditions, new data distributions, and novel classes at deployment time. How to move towards open-world learning is a long-standing research question. The existing solutions mainly focus on specific aspects of the problem (single domain Open-Set, multi-domain Closed-Set), or propose complex strategies whic… ▽ More

    Submitted 20 October, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: accepted at WACV 2022

  19. arXiv:2106.03496  [pdf, other

    cs.CV

    Self-Supervision & Meta-Learning for One-Shot Unsupervised Cross-Domain Detection

    Authors: F. Cappio Borlino, S. Polizzotto, B. Caputo, T. Tommasi

    Abstract: Deep detection approaches are powerful in controlled conditions, but appear brittle and fail when source models are used off-the-shelf on unseen domains. Most of the existing works on domain adaptation simplify the setting and access jointly both a large source dataset and a sizable amount of target samples. However this scenario is unrealistic in many practical cases as when monitoring image feed… ▽ More

    Submitted 1 September, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted for Publication at Computer Vision and Image Understanding (CVIU) Journal

  20. arXiv:2106.02498  [pdf, other

    cs.AI

    Towards Fairness Certification in Artificial Intelligence

    Authors: Tatiana Tommasi, Silvia Bucci, Barbara Caputo, Pietro Asinari

    Abstract: Thanks to the great progress of machine learning in the last years, several Artificial Intelligence (AI) techniques have been increasingly moving from the controlled research laboratory settings to our everyday life. AI is clearly supportive in many decision-making scenarios, but when it comes to sensitive areas such as health care, hiring policies, education, banking or justice, with major impact… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  21. arXiv:2103.16671  [pdf, other

    cs.CV

    Denoise and Contrast for Category Agnostic Shape Completion

    Authors: Antonio Alliegro, Diego Valsesia, Giulia Fracastoro, Enrico Magli, Tatiana Tommasi

    Abstract: In this paper, we present a deep learning model that exploits the power of self-supervision to perform 3D point cloud completion, estimating the missing part and a context region around it. Local and global information are encoded in a combined embedding. A denoising pretext task provides the network with the needed local cues, decoupled from the high-level semantics and naturally shared over mult… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021

  22. arXiv:2103.14672  [pdf, other

    cs.CV cs.RO

    Multi-Modal RGB-D Scene Recognition Across Domains

    Authors: Andrea Ferreri, Silvia Bucci, Tatiana Tommasi

    Abstract: Scene recognition is one of the basic problems in computer vision research with extensive applications in robotics. When available, depth images provide helpful geometric cues that complement the RGB texture information and help to identify discriminative scene image features. Depth sensing technology developed fast in the last years and a great variety of 3D cameras have been introduced, each wit… ▽ More

    Submitted 7 September, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted at Deep Multi-Task Learning in Computer Vision (DeepMTL) workshop, ICCV 2021

  23. arXiv:2101.09060  [pdf, other

    cs.CV cs.LG

    Rethinking Domain Generalization Baselines

    Authors: Francesco Cappio Borlino, Antonio D'Innocente, Tatiana Tommasi

    Abstract: Despite being very powerful in standard learning settings, deep learning models can be extremely brittle when deployed in scenarios different from those on which they were trained. Domain generalization methods investigate this problem and data augmentation strategies have shown to be helpful tools to increase data variability, supporting model robustness across domains. In our work we focus on st… ▽ More

    Submitted 27 January, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted at ICPR 2020

  24. arXiv:2007.12368  [pdf, other

    cs.CV

    Self-Supervised Learning Across Domains

    Authors: Silvia Bucci, Antonio D'Innocente, Yujun Liao, Fabio Maria Carlucci, Barbara Caputo, Tatiana Tommasi

    Abstract: Human adaptability relies crucially on learning and merging knowledge from both supervised and unsupervised tasks: the parents point out few important concepts, but then the children fill in the gaps on their own. This is particularly effective, because supervised learning can never be exhaustive and thus learning autonomously allows to discover invariances and regularities that help to generalize… ▽ More

    Submitted 31 March, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: Accepted at IEEE T-PAMI

  25. arXiv:2007.12360  [pdf, other

    cs.CV

    On the Effectiveness of Image Rotation for Open Set Domain Adaptation

    Authors: Silvia Bucci, Mohammad Reza Loghmani, Tatiana Tommasi

    Abstract: Open Set Domain Adaptation (OSDA) bridges the domain gap between a labeled source domain and an unlabeled target domain, while also rejecting target classes that are not present in the source. To avoid negative transfer, OSDA can be tackled by first separating the known/unknown target samples and then aligning known target samples with the source data. We propose a novel method to addresses both t… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: accepted at ECCV 2020

  26. arXiv:2005.11610  [pdf, other

    cs.CV

    One-Shot Unsupervised Cross-Domain Detection

    Authors: Antonio D'Innocente, Francesco Cappio Borlino, Silvia Bucci, Barbara Caputo, Tatiana Tommasi

    Abstract: Despite impressive progress in object detection over the last years, it is still an open challenge to reliably detect objects across visual domains. Although the topic has attracted attention recently, current approaches all rely on the ability to access a sizable amount of target data for use at training time. This is a heavy assumption, as often it is not possible to anticipate the domain where… ▽ More

    Submitted 23 May, 2020; originally announced May 2020.

  27. arXiv:2005.10589  [pdf, other

    cs.CV physics.med-ph

    Bridging the gap between Natural and Medical Images through Deep Colorization

    Authors: Lia Morra, Luca Piano, Fabrizio Lamberti, Tatiana Tommasi

    Abstract: Deep learning has thrived by training on large-scale datasets. However, in many applications, as for medical image diagnosis, getting massive amount of data is still prohibitive due to privacy, lack of acquisition homogeneity and annotation cost. In this scenario, transfer learning from natural image collections is a standard practice that attempts to tackle shape, texture and color discrepancies… ▽ More

    Submitted 19 October, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: accepted for publication at ICPR2020

  28. arXiv:2004.07392  [pdf, other

    cs.CV

    Joint Supervised and Self-Supervised Learning for 3D Real-World Challenges

    Authors: Antonio Alliegro, Davide Boscaini, Tatiana Tommasi

    Abstract: Point cloud processing and 3D shape understanding are very challenging tasks for which deep learning techniques have demonstrated great potentials. Still further progresses are essential to allow artificial intelligent agents to interact with the real world, where the amount of annotated data may be limited and integrating new sources of knowledge becomes crucial to support autonomous learning. He… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

  29. arXiv:1910.03915  [pdf, other

    cs.CV

    Learning to Generalize One Sample at a Time with Self-Supervision

    Authors: Antonio D'Innocente, Silvia Bucci, Barbara Caputo, Tatiana Tommasi

    Abstract: Although deep networks have significantly increased the performance of visual recognition methods, it is still challenging to achieve the robustness across visual domains that is necessary for real-world applications. To tackle this issue, research on domain adaptation and generalization has flourished over the last decade. An important aspect to consider when assessing the work done in the litera… ▽ More

    Submitted 11 October, 2019; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Submitted to ICCV 2019

  30. arXiv:1906.05199  [pdf, other

    cs.CV

    Tackling Partial Domain Adaptation with Self-Supervision

    Authors: Silvia Bucci, Antonio D'Innocente, Tatiana Tommasi

    Abstract: Domain adaptation approaches have shown promising results in reducing the marginal distribution difference among visual domains. They allow to train reliable models that work over datasets of different nature (photos, paintings etc), but they still struggle when the domains do not share an identical label space. In the partial domain adaptation setting, where the target covers only a subset of the… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

  31. arXiv:1903.06864  [pdf, other

    cs.CV cs.LG

    Domain Generalization by Solving Jigsaw Puzzles

    Authors: Fabio Maria Carlucci, Antonio D'Innocente, Silvia Bucci, Barbara Caputo, Tatiana Tommasi

    Abstract: Human adaptability relies crucially on the ability to learn and merge knowledge both from supervised and unsupervised learning: the parents point out few important concepts, but then the children fill in the gaps on their own. This is particularly effective, because supervised learning can never be exhaustive and thus learning autonomously allows to discover invariances and regularities that help… ▽ More

    Submitted 14 April, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: Accepted at CVPR 2019 (oral)

  32. arXiv:1808.01102  [pdf, other

    cs.CV

    Hallucinating Agnostic Images to Generalize Across Domains

    Authors: Fabio M. Carlucci, Paolo Russo, Tatiana Tommasi, Barbara Caputo

    Abstract: The ability to generalize across visual domains is crucial for the robustness of artificial recognition systems. Although many training sources may be available in real contexts, the access to even unlabeled target samples cannot be taken for granted, which makes standard unsupervised domain adaptation methods inapplicable in the wild. In this work we investigate how to exploit multiple sources by… ▽ More

    Submitted 10 July, 2019; v1 submitted 3 August, 2018; originally announced August 2018.

  33. arXiv:1802.08833  [pdf, other

    cs.CV cs.RO

    Adaptive Deep Learning through Visual Domain Localization

    Authors: Gabriele Angeletti, Barbara Caputo, Tatiana Tommasi

    Abstract: A commercial robot, trained by its manufacturer to recognize a predefined number and type of objects, might be used in many settings, that will in general differ in their illumination conditions, background, type and degree of clutter, and so on. Recent computer vision works tackle this generalization issue through domain adaptation methods, assuming as source the visual domain where the system is… ▽ More

    Submitted 24 February, 2018; originally announced February 2018.

    Comments: Accepted at ICRA 2018

  34. arXiv:1705.08824  [pdf, other

    cs.CV

    From source to target and back: symmetric bi-directional adaptive GAN

    Authors: Paolo Russo, Fabio Maria Carlucci, Tatiana Tommasi, Barbara Caputo

    Abstract: The effectiveness of generative adversarial approaches in producing images according to a specific style or visual domain has recently opened new directions to solve the unsupervised domain adaptation problem. It has been shown that source labeled images can be modified to mimic target samples making it possible to train directly a classifier in the target domain, despite the original lack of anno… ▽ More

    Submitted 29 November, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

  35. arXiv:1705.07795  [pdf, other

    cs.LG math.OC stat.ML

    Training Deep Networks without Learning Rates Through Coin Betting

    Authors: Francesco Orabona, Tatiana Tommasi

    Abstract: Deep learning methods achieve state-of-the-art performance in many application scenarios. Yet, these methods require a significant amount of hyperparameters tuning in order to achieve the best results. In particular, tuning the learning rates in the stochastic optimization process is still one of the main bottlenecks. In this paper, we propose a new stochastic gradient descent procedure for deep n… ▽ More

    Submitted 4 November, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: Camera-ready version for NIPS 2017

  36. arXiv:1702.08513  [pdf, other

    cs.CV cs.DB cs.LG cs.RO

    Learning Deep Visual Object Models From Noisy Web Data: How to Make it Work

    Authors: Nizar Massouh, Francesca Babiloni, Tatiana Tommasi, Jay Young, Nick Hawes, Barbara Caputo

    Abstract: Deep networks thrive when trained on large scale data collections. This has given ImageNet a central role in the development of deep architectures for visual object classification. However, ImageNet was created during a specific period in time, and as such it is prone to aging, as well as dataset bias issues. Moving beyond fixed training datasets will lead to more robust visual systems, especially… ▽ More

    Submitted 28 February, 2017; originally announced February 2017.

    Comments: 8 pages, 7 figures, 3 tables

    Journal ref: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  37. arXiv:1611.00393  [pdf, other

    cs.CV

    Combining Multiple Cues for Visual Madlibs Question Answering

    Authors: Tatiana Tommasi, Arun Mallya, Bryan Plummer, Svetlana Lazebnik, Alexander C. Berg, Tamara L. Berg

    Abstract: This paper presents an approach for answering fill-in-the-blank multiple choice questions from the Visual Madlibs dataset. Instead of generic and commonly used representations trained on the ImageNet classification task, our approach employs a combination of networks trained for specialized tasks such as scene recognition, person activity classification, and attribute prediction. We also present a… ▽ More

    Submitted 7 February, 2018; v1 submitted 1 November, 2016; originally announced November 2016.

    Comments: submitted to IJCV -- under review

  38. arXiv:1608.03410  [pdf, other

    cs.CV

    Solving Visual Madlibs with Multiple Cues

    Authors: Tatiana Tommasi, Arun Mallya, Bryan Plummer, Svetlana Lazebnik, Alexander C. Berg, Tamara L. Berg

    Abstract: This paper focuses on answering fill-in-the-blank style multiple choice questions from the Visual Madlibs dataset. Previous approaches to Visual Question Answering (VQA) have mainly used generic image features from networks trained on the ImageNet dataset, despite the wide scope of questions. In contrast, our approach employs features derived from networks trained for specialized tasks of scene cl… ▽ More

    Submitted 11 August, 2016; originally announced August 2016.

    Comments: accepted at BMVC 2016

  39. arXiv:1607.06144  [pdf, other

    cs.CV

    Learning the Roots of Visual Domain Shift

    Authors: Tatiana Tommasi, Martina Lanzi, Paolo Russo, Barbara Caputo

    Abstract: In this paper we focus on the spatial nature of visual domain shift, attempting to learn where domain adaptation originates in each given image of the source and target set. We borrow concepts and techniques from the CNN visualization literature, and learn domainnes maps able to localize the degree of domain specificity in images. We derive from these maps features related to different domainnes l… ▽ More

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: Extended Abstract

  40. arXiv:1510.01544  [pdf, other

    cs.CV

    Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks

    Authors: Efstratios Gavves, Thomas Mensink, Tatiana Tommasi, Cees G. M. Snoek, Tinne Tuytelaars

    Abstract: How can we reuse existing knowledge, in the form of available datasets, when solving a new and apparently unrelated target task from a set of unlabeled data? In this work we make a first contribution to answer this question in the context of image classification. We frame this quest as an active learning problem and use zero-shot classifiers to guide the learning process by linking the new task to… ▽ More

    Submitted 6 October, 2015; originally announced October 2015.

  41. arXiv:1505.01257  [pdf

    cs.CV

    A Deeper Look at Dataset Bias

    Authors: Tatiana Tommasi, Novi Patricia, Barbara Caputo, Tinne Tuytelaars

    Abstract: The presence of a bias in each image data collection has recently attracted a lot of attention in the computer vision community showing the limits in generalization of any learning method trained on a specific dataset. At the same time, with the rapid development of deep learning architectures, the activation values of Convolutional Neural Networks (CNN) are emerging as reliable and robust image d… ▽ More

    Submitted 6 May, 2015; originally announced May 2015.

  42. arXiv:1411.4491  [pdf, other

    cs.CV cs.LG

    Joint cross-domain classification and subspace learning for unsupervised adaptation

    Authors: Basura Fernando, Tatiana Tommasi, Tinne Tuytelaars

    Abstract: Domain adaptation aims at adapting the knowledge acquired on a source domain to a new different but related target domain. Several approaches have beenproposed for classification tasks in the unsupervised scenario, where no labeled target data are available. Most of the attention has been dedicated to searching a new domain-invariant representation, leaving the definition of the prediction functio… ▽ More

    Submitted 28 April, 2015; v1 submitted 17 November, 2014; originally announced November 2014.

    Comments: Paper is under consideration at Pattern Recognition Letters

  43. arXiv:1409.7556  [pdf, other

    cs.CV

    Location Recognition Over Large Time Lags

    Authors: Basura Fernando, Tatiana Tommasi, Tinne Tuytelaars

    Abstract: Would it be possible to automatically associate ancient pictures to modern ones and create fancy cultural heritage city maps? We introduce here the task of recognizing the location depicted in an old photo given modern annotated images collected from the Internet. We present an extensive analysis on different features, looking for the most discriminative and most robust to the image variability in… ▽ More

    Submitted 25 May, 2015; v1 submitted 26 September, 2014; originally announced September 2014.

  44. arXiv:1402.5923  [pdf

    cs.CV

    A Testbed for Cross-Dataset Analysis

    Authors: Tatiana Tommasi, Tinne Tuytelaars, Barbara Caputo

    Abstract: Since its beginning visual recognition research has tried to capture the huge variability of the visual world in several image collections. The number of available datasets is still progressively growing together with the amount of samples per object category. However, this trend does not correspond directly to an increasing in the generalization capabilities of the developed recognition systems.… ▽ More

    Submitted 24 February, 2014; originally announced February 2014.

    Report number: December 2013, Technical Report: KUL/ESAT/PSI/1304