Showing 1–50 of 127 results for author: Di, L

  1. arXiv:2406.13424  [pdf, other]

    cs.CV cs.LG

    Towards a multimodal framework for remote sensing image change retrieval and captioning

    Authors: Roger Ferrod, Luigi Di Caro, Dino Ienco

    Abstract: Recently, there has been increasing interest in multimodal applications that integrate text with other modalities, such as images, audio and video, to facilitate natural language interactions with multimodal AI systems. While applications involving standard modalities have been extensively explored, there is still a lack of investigation into specific data modalities such as remote sensing (RS) da… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.11840  [pdf, other]

    cs.CV

    LLaNA: Large Language and NeRF Assistant

    Authors: Andrea Amaduzzi, Pierluigi Zama Ramirez, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated an excellent understanding of images and 3D data. However, both modalities have shortcomings in holistically capturing the appearance and geometry of objects. Meanwhile, Neural Radiance Fields (NeRFs), which encode information within the weights of a simple Multi-Layer Perceptron (MLP), have emerged as an increasingly widespread modality t… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review. Project page: https://andreamaduzzi.github.io/llana/

  3. arXiv:2405.12731  [pdf, other]

    cs.SE

    From Today's Code to Tomorrow's Symphony: The AI Transformation of Developer's Routine by 2030

    Authors: Matteo Ciniselli, Niccolò Puccinelli, Ketai Qiu, Luca Di Grazia

    Abstract: In the rapidly evolving landscape of software engineering, the integration of Artificial Intelligence (AI) into the Software Development Life-Cycle (SDLC) heralds a transformative era for developers. Recently, we have assisted to a pivotal shift towards AI-assisted programming, exemplified by tools like GitHub Copilot and OpenAI's ChatGPT, which have become a crucial element for coding, debugging,… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  4. arXiv:2405.07387  [pdf, other]

    cs.LG

    Semantic Loss Functions for Neuro-Symbolic Structured Prediction

    Authors: Kareem Ahmed, Stefano Teso, Paolo Morettin, Luca Di Liello, Pierfrancesco Ardino, Jacopo Gobbi, Yitao Liang, Eric Wang, Kai-Wei Chang, Andrea Passerini, Guy Van den Broeck

    Abstract: Structured output prediction problems are ubiquitous in machine learning. The prominent approach leverages neural networks as powerful feature extractors, otherwise assuming the independence of the outputs. These outputs, however, jointly encode an object, e.g. a path in a graph, and are therefore related through the structure underlying the output space. We discuss the semantic loss, which inject… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Preprint of Ch. 22 "Semantic Loss Functions for Neuro-Symbolic Structured Prediction" in "Compendium of Neurosymbolic Artificial Intelligence", https://ebooks.iospress.nl/ISBN/978-1-64368-406-2. arXiv admin note: substantial text overlap with arXiv:2201.11250, arXiv:2007.13197

  5. arXiv:2405.05828  [pdf, other]

    cs.RO cs.CV

    MAD-ICP: It Is All About Matching Data -- Robust and Informed LiDAR Odometry

    Authors: Simone Ferrari, Luca Di Giammarino, Leonardo Brizi, Giorgio Grisetti

    Abstract: LiDAR odometry is the task of estimating the ego-motion of the sensor from sequential laser scans. This problem has been addressed by the community for more than two decades, and many effective solutions are available nowadays. Most of these systems implicitly rely on assumptions about the operating environment, the sensor used, and motion pattern. When these assumptions are violated, several well… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: https://github.com/rvp-group/mad-icp

  6. arXiv:2405.01085  [pdf, other]

    cs.CV

    Single Image Super-Resolution Based on Global-Local Information Synergy

    Authors: Nianzu Qiao, Lamei Di, Changyin Sun

    Abstract: Although several image super-resolution solutions exist, they still face many challenges. CNN-based algorithms, despite the reduction in computational complexity, still need to improve their accuracy. While Transformer-based algorithms have higher accuracy, their ultra-high computational complexity makes them difficult to be accepted in practical applications. To overcome the existing challenges,… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  7. arXiv:2405.01083  [pdf, other]

    cs.CV

    MCMS: Multi-Category Information and Multi-Scale Stripe Attention for Blind Motion Deblurring

    Authors: Nianzu Qiao, Lamei Di, Changyin Sun

    Abstract: Deep learning-based motion deblurring techniques have advanced significantly in recent years. This class of techniques, however, does not carefully examine the inherent flaws in blurry images. For instance, low edge and structural information are traits of blurry images. The high-frequency component of blurry images is edge information, and the low-frequency component is structure information. A b… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  8. arXiv:2404.16558  [pdf, other]

    cs.CV cs.AI cs.RO

    DeepKalPose: An Enhanced Deep-Learning Kalman Filter for Temporally Consistent Monocular Vehicle Pose Estimation

    Authors: Leandro Di Bella, Yangxintong Lyu, Adrian Munteanu

    Abstract: This paper presents DeepKalPose, a novel approach for enhancing temporal consistency in monocular vehicle pose estimation applied on video through a deep-learning-based Kalman Filter. By integrating a Bi-directional Kalman filter strategy utilizing forward and backward time-series processing, combined with a learnable motion model to represent complex motion patterns, our method significantly impr… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 4 pages, 3 Figures, published to IET Electronic Letters

    Journal ref: Electronics Letters (ISSN: 00135194), year: 2024, volume: 60, number: 8, start page: ?

  9. arXiv:2404.11322  [pdf, other]

    cs.CV cs.RO

    VBR: A Vision Benchmark in Rome

    Authors: Leonardo Brizi, Emanuele Giacomini, Luca Di Giammarino, Simone Ferrari, Omar Salem, Lorenzo De Rebotti, Giorgio Grisetti

    Abstract: This paper presents a vision and perception research dataset collected in Rome, featuring RGB data, 3D point clouds, IMU, and GPS data. We introduce a new benchmark targeting visual odometry and SLAM, to advance the research in autonomous robotics and computer vision. This work complements existing datasets by simultaneously addressing several issues, such as environment diversity, motion patterns… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted at IEEE ICRA 2024 Website: https://rvp-group.net/datasets/slam.html

  10. arXiv:2404.08245  [pdf, other]

    cs.DC stat.CO

    A Distributed Approach for Persistent Homology Computation on a Large Scale

    Authors: Riccardo Ceccaroni, Lorenzo Di Rocco, Umberto Ferraro Petrillo, Pierpaolo Brutti

    Abstract: Persistent homology (PH) is a powerful mathematical method to automatically extract relevant insights from images, such as those obtained by high-resolution imaging devices like electron microscopes or new-generation telescopes. However, the application of this method comes at a very high computational cost, that is bound to explode more because new imaging devices generate an ever-growing amount… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  11. arXiv:2404.07993  [pdf, other]

    cs.CV

    Connecting NeRFs, Images, and Text

    Authors: Francesco Ballerini, Pierluigi Zama Ramirez, Roberto Mirabella, Samuele Salti, Luigi Di Stefano

    Abstract: Neural Radiance Fields (NeRFs) have emerged as a standard framework for representing 3D scenes and objects, introducing a novel data type for information exchange and storage. Concurrently, significant progress has been made in multimodal representation learning for text and image data. This paper explores a novel research direction that aims to connect the NeRF modality with other modalities, sim… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPRW-INRV 2024

  12. arXiv:2404.05133  [pdf, other]

    cs.CL

    EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection

    Authors: Francesca Grasso, Stefano Locci, Giovanni Siragusa, Luigi Di Caro

    Abstract: Anthropogenic ecological crisis constitutes a significant challenge that all within the academy must urgently face, including the Natural Language Processing (NLP) community. While recent years have seen increasing work revolving around climate-centric discourse, crucial environmental and ecological topics outside of climate change remain largely unaddressed, despite their prominent importance. Ma… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  13. arXiv:2404.03743  [pdf, other]

    cs.CV

    Test Time Training for Industrial Anomaly Segmentation

    Authors: Alex Costanzino, Pierluigi Zama Ramirez, Mirko Del Moro, Agostino Aiezzo, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

    Abstract: Anomaly Detection and Segmentation (AD&S) is crucial for industrial quality control. While existing methods excel in generating anomaly scores for each pixel, practical applications require producing a binary segmentation to identify anomalies. Due to the absence of labeled anomalies in many real scenarios, standard practices binarize these maps based on some statistics derived from a validation s… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted at VAND 2.0, CVPRW 2024

  14. PyTy: Repairing Static Type Errors in Python

    Authors: Yiu Wai Chow, Luca Di Grazia, Michael Pradel

    Abstract: Gradual typing enables developers to annotate types of their own choosing, offering a flexible middle ground between no type annotations and a fully statically typed language. As more and more code bases get type-annotated, static type checkers detect an increasingly large number of type errors. Unfortunately, fixing these errors requires manual effort, hampering the adoption of gradual typing in… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Journal ref: ICSE 2024

  15. arXiv:2312.13277  [pdf, other]

    cs.CV

    Deep Learning on 3D Neural Fields

    Authors: Pierluigi Zama Ramirez, Luca De Luigi, Daniele Sirocchi, Adriano Cardace, Riccardo Spezialetti, Francesco Ballerini, Samuele Salti, Luigi Di Stefano

    Abstract: In recent years, Neural Fields (NFs) have emerged as an effective tool for encoding diverse continuous signals such as images, videos, audio, and 3D shapes. When applied to 3D data, NFs offer a solution to the fragmentation and limitations associated with prevalent discrete representations. However, given that NFs are essentially neural networks, it remains unclear whether and how they can be seam… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Extended version of the paper "Deep Learning on Implicit Neural Representations of Shapes" that was presented at ICLR 2023. arXiv admin note: text overlap with arXiv:2302.05438

  16. arXiv:2312.04521  [pdf, other]

    cs.CV

    Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping

    Authors: Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti, Luigi Di Stefano

    Abstract: The paper explores the industrial multimodal Anomaly Detection (AD) task, which exploits point clouds and RGB images to localize anomalies. We introduce a novel light and fast framework that learns to map features from one modality to the other on nominal samples. At test time, anomalies are detected by pinpointing inconsistencies between observed and mapped features. Extensive experiments show th… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  17. arXiv:2311.03197  [pdf, other]

    eess.SY cs.LG

    Stable Linear Subspace Identification: A Machine Learning Approach

    Authors: Loris Di Natale, Muhammad Zakwan, Bratislav Svetozarevic, Philipp Heer, Giancarlo Ferrari-Trecate, Colin N. Jones

    Abstract: Machine Learning (ML) and linear System Identification (SI) have been historically developed independently. In this paper, we leverage well-established ML tools - especially the automatic differentiation framework - to introduce SIMBa, a family of discrete linear multi-step-ahead state-space SI methods using backpropagation. SIMBa relies on a novel Linear-Matrix-Inequality-based free parametrizati… ▽ More

    Submitted 26 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at ECC 2024

  18. arXiv:2310.01140  [pdf, other]

    cs.CV

    Neural Processing of Tri-Plane Hybrid Neural Fields

    Authors: Adriano Cardace, Pierluigi Zama Ramirez, Francesco Ballerini, Allan Zhou, Samuele Salti, Luigi Di Stefano

    Abstract: Driven by the appealing properties of neural fields for storing and communicating 3D data, the problem of directly processing them to address tasks such as classification and part segmentation has emerged and has been investigated in recent works. Early approaches employ neural fields parameterized by shared networks trained on the whole dataset, achieving good task performance but sacrificing rec… ▽ More

    Submitted 30 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR 2024

  19. arXiv:2310.00758  [pdf, other]

    eess.SY cs.LG

    Data-driven adaptive building thermal controller tuning with constraints: A primal-dual contextual Bayesian optimization approach

    Authors: Wenjie Xu, Bratislav Svetozarevic, Loris Di Natale, Philipp Heer, Colin N Jones

    Abstract: We study the problem of tuning the parameters of a room temperature controller to minimize its energy consumption, subject to the constraint that the daily cumulative thermal discomfort of the occupants is below a given threshold. We formulate it as an online constrained black-box optimization problem where, on each day, we observe some relevant environmental context and adaptively select the cont… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  20. arXiv:2309.08272  [pdf, other]

    cs.CL cs.IR

    Structural Self-Supervised Objectives for Transformers

    Authors: Luca Di Liello

    Abstract: This thesis focuses on improving the pre-training of natural language models using unsupervised raw data to make them more efficient and aligned with downstream applications. In the first part, we introduce three alternative pre-training objectives to BERT's Masked Language Modeling (MLM), namely Random Token Substitution (RTS), Cluster-based Random Token Substitution (C-RTS), and Swapped Langua… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Ph.D. Thesis

  21. arXiv:2309.07917  [pdf, other]

    cs.CV

    Looking at words and points with attention: a benchmark for text-to-shape coherence

    Authors: Andrea Amaduzzi, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

    Abstract: While text-conditional 3D object generation and manipulation have seen rapid progress, the evaluation of coherence between generated 3D shapes and input textual descriptions lacks a clear benchmark. The reason is twofold: a) the low quality of the textual descriptions in the only publicly available dataset of text-shape pairs; b) the limited effectiveness of the metrics used to quantitatively asse… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: ICCV 2023 Workshop "AI for 3D Content Creation", Project page: https://cvlab-unibo.github.io/CrossCoherence-Web/, 26 pages

  22. arXiv:2309.07874  [pdf, other]

    cs.RO

    Ca$^2$Lib: Simple and Accurate LiDAR-RGB Calibration using Small Common Markers

    Authors: Emanuele Giacomini, Leonardo Brizi, Luca Di Giammarino, Omar Salem, Patrizio Perugini, Giorgio Grisetti

    Abstract: In many fields of robotics, knowing the relative position and orientation between two sensors is a mandatory precondition to operate with multiple sensing modalities. In this context, the pair LiDAR-RGB cameras offer complementary features: LiDARs yield sparse high quality range measurements, while RGB cameras provide a dense color measurement of the environment. Existing techniques often rely eit… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 7 pages, 10 figures

  23. arXiv:2309.01206  [pdf]

    cs.RO

    Comparative Safety Performance of Autonomous- and Human Drivers: A Real-World Case Study of the Waymo One Service

    Authors: Luigi Di Lillo, Tilia Gode, Xilin Zhou, Margherita Atzei, Ruoshu Chen, Trent Victor

    Abstract: This study compares the safety of autonomous- and human drivers. It finds that the Waymo One autonomous service is significantly safer towards other road users than human drivers are, as measured via collision causation. The result is determined by comparing Waymo's third party liability insurance claims data with mileage- and zip-code-calibrated Swiss Re (human driver) private passenger vehicle b… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  24. arXiv:2308.01050  [pdf, other]

    cs.RO cs.AI cs.LG

    A Counterfactual Safety Margin Perspective on the Scoring of Autonomous Vehicles' Riskiness

    Authors: Alessandro Zanardi, Andrea Censi, Margherita Atzei, Luigi Di Lillo, Emilio Frazzoli

    Abstract: Autonomous Vehicles (AVs) promise a range of societal advantages, including broader access to mobility, reduced road accidents, and enhanced transportation efficiency. However, evaluating the risks linked to AVs is complex due to limited historical data and the swift progression of technology. This paper presents a data-driven framework for assessing the risk of different AVs' behaviors in various… ▽ More

    Submitted 28 November, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: updated experiments

  25. arXiv:2307.15052  [pdf, other]

    cs.CV

    Learning Depth Estimation for Transparent and Mirror Surfaces

    Authors: Alex Costanzino, Pierluigi Zama Ramirez, Matteo Poggi, Fabio Tosi, Stefano Mattoccia, Luigi Di Stefano

    Abstract: Inferring the depth of transparent or mirror (ToM) surfaces represents a hard challenge for either sensors, algorithms, or deep networks. We propose a simple pipeline for learning to estimate depth properly for such surfaces with neural networks, without requiring any ground-truth annotation. We unveil how to obtain reliable pseudo labels by in-painting ToM objects in images and processing them wi… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV 2023. Project Page: https://cvlab-unibo.github.io/Depth4ToM

  26. arXiv:2307.09776  [pdf, other]

    cs.LO cs.FL cs.PL eess.SY

    LTL Synthesis on Infinite-State Arenas defined by Programs

    Authors: Shaun Azzopardi, Nir Piterman, Gerardo Schneider, Luca di Stefano

    Abstract: This paper deals with the problem of automatically and correctly controlling infinite-state reactive programs to achieve LTL goals. Applications include adapting a program to new requirements, or to repair bugs discovered in the original specification or program code. Existing approaches are able to solve this problem for safety and some reachability properties, but require an a priori template of… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  27. arXiv:2306.05801  [pdf, other]

    cs.AI

    Strategies to exploit XAI to improve classification systems

    Authors: Andrea Apicella, Luca Di Lorenzo, Francesco Isgrò, Andrea Pollastro, Roberto Prevete

    Abstract: Explainable Artificial Intelligence (XAI) aims to provide insights into the decision-making process of AI models, allowing users to understand their results beyond their decisions. A significant goal of XAI is to improve the performance of AI models by providing explanations for their decision-making processes. However, most XAI literature focuses on how to explain an AI system, while less attenti… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: This work has been accepted to be presented to The 1st World Conference on eXplainable Artificial Intelligence (xAI 2023), July 26-28, 2023 - Lisboa, Portugal

  28. arXiv:2305.15358  [pdf, other]

    cs.CL cs.LG

    Context-Aware Transformer Pre-Training for Answer Sentence Selection

    Authors: Luca Di Liello, Siddhant Garg, Alessandro Moschitti

    Abstract: Answer Sentence Selection (AS2) is a core component for building an accurate Question Answering pipeline. AS2 models rank a set of candidate sentences based on how likely they answer a given question. The state of the art in AS2 exploits pre-trained transformers by transferring them on large annotated datasets, while using local contextual information around the candidate sentence. In this paper,… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

  29. arXiv:2304.10448  [pdf, other]

    cs.CV

    ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects

    Authors: Marco Toschi, Riccardo De Matteo, Riccardo Spezialetti, Daniele De Gregorio, Luigi Di Stefano, Samuele Salti

    Abstract: In this paper, we focus on the problem of rendering novel views from a Neural Radiance Field (NeRF) under unobserved light conditions. To this end, we introduce a novel dataset, dubbed ReNe (Relighting NeRF), framing real world objects under one-light-at-time (OLAT) conditions, annotated with accurate ground-truth camera and light poses. Our acquisition pipeline leverages two robotic arms holding,… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR 2023 as a highlight

  30. arXiv:2304.02991  [pdf, other]

    cs.CV

    Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation

    Authors: Adriano Cardace, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

    Abstract: 3D semantic segmentation is a critical task in many real-world applications, such as autonomous driving, robotics, and mixed reality. However, the task is extremely challenging due to ambiguities coming from the unstructured, sparse, and uncolored nature of the 3D point clouds. A possible solution is to combine the 3D information with others coming from sensors featuring a different modality, such… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted at the CVPR2023 Workshop on Autonomous Driving (WAD)

  31. arXiv:2303.16878  [pdf, other]

    cs.CV cs.RO

    Photometric LiDAR and RGB-D Bundle Adjustment

    Authors: Luca Di Giammarino, Emanuele Giacomini, Leonardo Brizi, Omar Salem, Giorgio Grisetti

    Abstract: The joint optimization of the sensor trajectory and 3D map is a crucial characteristic of Simultaneous Localization and Mapping (SLAM) systems. To achieve this, the gold standard is Bundle Adjustment (BA). Modern 3D LiDARs now retain higher resolutions that enable the creation of point cloud images resembling those taken by conventional cameras. Nevertheless, the typical effective global refinemen… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: 11 pages, 9 figures

  32. arXiv:2303.15356  [pdf, other]

    physics.soc-ph cs.SI stat.ME

    Hypergraphx: a library for higher-order network analysis

    Authors: Quintino Francesco Lotito, Martina Contisciani, Caterina De Bacco, Leonardo Di Gaetano, Luca Gallo, Alberto Montresor, Federico Musciotto, Nicolò Ruggeri, Federico Battiston

    Abstract: From social to biological systems, many real-world systems are characterized by higher-order, non-dyadic interactions. Such systems are conveniently described by hypergraphs, where hyperedges encode interactions among an arbitrary number of units. Here, we present an open-source python library, hypergraphx (HGX), providing a comprehensive collection of algorithms and functions for the analysis of… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Journal ref: Journal of Complex Networks, Volume 11, Issue 3, June 2023

  33. arXiv:2303.07312  [pdf, other]

    cs.RO

    Enhancing LiDAR performance: Robust De-skewing Exclusively Relying on Range Measurements

    Authors: Omar Salem, Emanuele Giacomini, Leonardo Brizi, Luca Di Giammarino, Giorgio Grisetti

    Abstract: Most commercially available Light Detection and Ranging (LiDAR)s measure the distances along a 2D section of the environment by sequentially sampling the free range along directions centered at the sensor's origin. When the sensor moves during the acquisition, the measured ranges are affected by a phenomenon known as "skewing", which appears as a distortion in the acquired scan. Skewing potentiall… ▽ More

    Submitted 16 October, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: 6 pages, 5 figures

  34. arXiv:2302.05438  [pdf, other]

    cs.CV

    Deep Learning on Implicit Neural Representations of Shapes

    Authors: Luca De Luigi, Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

    Abstract: Implicit Neural Representations (INRs) have emerged in the last few years as a powerful tool to encode continuously a variety of different signals like images, videos, audio and 3D shapes. When applied to 3D shapes, INRs allow to overcome the fragmentation and shortcomings of the popular discrete representations used so far. Yet, considering that INRs consist in neural networks, it is not clear wh… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Accepted at ICLR 2023

  35. arXiv:2301.11310  [pdf, other]

    cs.CV

    Learning Good Features to Transfer Across Tasks and Domains

    Authors: Pierluigi Zama Ramirez, Adriano Cardace, Luca De Luigi, Alessio Tonioni, Samuele Salti, Luigi Di Stefano

    Abstract: Availability of labelled data is the major obstacle to the deployment of deep learning algorithms for computer vision tasks in new domains. The fact that many frameworks adopted to solve different tasks share the same architecture suggests that there should be a way of reusing the knowledge learned in a specific setting to solve novel tasks with limited or no additional supervision. In this work,… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: Extended version of the paper "Learning Across Tasks and Domains" presented at ICCV 2019. Accepted at TPAMI

  36. arXiv:2301.08245  [pdf, other]

    cs.CV

    Booster: a Benchmark for Depth from Images of Specular and Transparent Surfaces

    Authors: Pierluigi Zama Ramirez, Alex Costanzino, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

    Abstract: Estimating depth from images nowadays yields outstanding results, both in terms of in-domain accuracy and generalization. However, we identify two main challenges that remain open in this field: dealing with non-Lambertian materials and effectively processing high-resolution images. Purposely, we propose a novel dataset that includes accurate and dense ground-truth labels at high resolution, featu… ▽ More

    Submitted 30 January, 2024; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: Extension of the paper "Open Challenges in Deep Stereo: the Booster Dataset" presented at CVPR 2022. Accepted at TPAMI

  37. arXiv:2212.12380  [pdf, other]

    cs.LG cs.AI eess.SY

    Towards Scalable Physically Consistent Neural Networks: an Application to Data-driven Multi-zone Thermal Building Models

    Authors: Loris Di Natale, Bratislav Svetozarevic, Philipp Heer, Colin Neil Jones

    Abstract: With more and more data being collected, data-driven modeling methods have been gaining in popularity in recent years. While physically sound, classical gray-box models are often cumbersome to identify and scale, and their accuracy might be hindered by their limited expressiveness. On the other hand, classical black-box methods, typically relying on Neural Networks (NNs) nowadays, often achieve im… ▽ More

    Submitted 4 April, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: Accepted in Applied Energy

  38. arXiv:2211.16691  [pdf, other]

    cs.LG cs.AI

    Computationally Efficient Reinforcement Learning: Targeted Exploration leveraging Simple Rules

    Authors: Loris Di Natale, Bratislav Svetozarevic, Philipp Heer, Colin N. Jones

    Abstract: Model-free Reinforcement Learning (RL) generally suffers from poor sample complexity, mostly due to the need to exhaustively explore the state-action space to find well-performing policies. On the other hand, we postulate that expert knowledge of the system often allows us to design simple rules we expect good policies to follow at all times. In this work, we hence propose a simple yet effective m… ▽ More

    Submitted 12 September, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to CDC 2023

  39. arXiv:2211.13762  [pdf, other]

    cs.CV

    ScanNeRF: a Scalable Benchmark for Neural Radiance Fields

    Authors: Luca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi, Luigi Di Stefano

    Abstract: In this paper, we propose the first-ever real benchmark thought for evaluating Neural Radiance Fields (NeRFs) and, in general, Neural Rendering (NR) frameworks. We design and implement an effective pipeline for scanning real objects in quantity and effortlessly. Our scan station is built with less than 500$ hardware budget and can collect roughly 4000 images of a scanned object in just 5 minutes.… ▽ More

    Submitted 20 December, 2022; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: WACV 2023. The first three authors contributed equally. Project page: https://eyecan-ai.github.io/scannerf/

  40. arXiv:2211.06130  [pdf, other]

    cs.LG

    Physically Consistent Neural ODEs for Learning Multi-Physics Systems

    Authors: Muhammad Zakwan, Loris Di Natale, Bratislav Svetozarevic, Philipp Heer, Colin N. Jones, Giancarlo Ferrari Trecate

    Abstract: Despite the immense success of neural networks in modeling system dynamics from data, they often remain physics-agnostic black boxes. In the particular case of physical systems, they might consequently make physically inconsistent predictions, which makes them unreliable in practice. In this paper, we leverage the framework of Irreversible port-Hamiltonian Systems (IPHS), which can describe most m… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: First two authors contributed equally. Submitted to IFAC 2023

  41. arXiv:2211.05035  [pdf, ps, other]

    cs.CL cs.LG

    Combining Contrastive Learning and Knowledge Graph Embeddings to develop medical word embeddings for the Italian language

    Authors: Denys Amore Bondarenko, Roger Ferrod, Luigi Di Caro

    Abstract: Word embeddings play a significant role in today's Natural Language Processing tasks and applications. While pre-trained models may be directly employed and integrated into existing pipelines, they are often fine-tuned to better fit with specific languages or domains. In this paper, we attempt to improve available embeddings in the uncovered niche of the Italian medical domain through the combinat… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  42. arXiv:2210.13536  [pdf, other]

    cs.CL

    Effective Pre-Training Objectives for Transformer-based Autoencoders

    Authors: Luca Di Liello, Matteo Gabburo, Alessandro Moschitti

    Abstract: In this paper, we study trade-offs between efficiency, cost and accuracy when pre-training Transformer encoders with different pre-training objectives. For this purpose, we analyze features of common objectives and combine them to create new effective pre-training approaches. Specifically, we designed light token generators based on a straightforward statistical approach, which can replace ELECTRA…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022 Findings

  43. arXiv:2210.08226  [pdf, other]

    cs.CV

    Self-Distillation for Unsupervised 3D Domain Adaptation

    Authors: Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

    Abstract: Point cloud classification is a popular task in 3D vision. However, previous works usually assume that point clouds at test time are obtained with the same procedure or sensor as those at training time. Unsupervised Domain Adaptation (UDA), instead, breaks this assumption and tries to solve the task on an unlabeled target domain, leveraging only a supervised source domain. For point cloud class…

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: WACV 2023, Project Page: https://cvlab-unibo.github.io/FeatureDistillation/

  44. arXiv:2209.00648  [pdf, other]

    cs.CV

    Cross-Spectral Neural Radiance Fields

    Authors: Matteo Poggi, Pierluigi Zama Ramirez, Fabio Tosi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

    Abstract: We propose X-NeRF, a novel method to learn a Cross-Spectral scene representation given images captured from cameras with different light spectrum sensitivity, based on the Neural Radiance Fields formulation. X-NeRF optimizes camera poses across spectra during training and exploits Normalized Cross-Device Coordinates (NXDC) to render images of different modalities from arbitrary viewpoints, which a…

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 3DV 2022. Project page: https://cvlab-unibo.github.io/xnerf-web/

  45. arXiv:2207.06355  [pdf, other]

    stat.ML cs.LG

    Contextual Decision Trees

    Authors: Tommaso Aldinucci, Enrico Civitelli, Leonardo di Gangi, Alessandro Sestini

    Abstract: Focusing on Random Forests, we propose a multi-armed contextual bandit recommendation framework for feature-based selection of a single shallow tree of the learned ensemble. The trained system, which works on top of the Random Forest, dynamically identifies a base predictor that is responsible for providing the final output. In this way, we obtain local interpretations by observing the rules of th…

    Submitted 13 July, 2022; originally announced July 2022.

  46. HiPE: Hierarchical Initialization for Pose Graphs

    Authors: Tiziano Guadagnino, Luca Di Giammarino, Giorgio Grisetti

    Abstract: Pose graph optimization is a non-convex optimization problem encountered in many areas of robotics perception. Its convergence to an accurate solution is conditioned by two factors: the non-linearity of the cost function in use and the initial configuration of the pose variables. In this paper, we present HiPE, a novel hierarchical algorithm for pose graph initialization. Our approach exploits a c…

    Submitted 4 July, 2022; originally announced July 2022.

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 1, pp. 287-294, Jan. 2022

  47. arXiv:2206.07047  [pdf, other]

    cs.CV

    RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation

    Authors: Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

    Abstract: We address the problem of registering synchronized color (RGB) and multi-spectral (MS) images featuring very different resolution by solving stereo matching correspondences. Purposely, we introduce a novel RGB-MS dataset framing 13 different scenes in indoor environments and providing a total of 34 image pairs annotated with semi-dense, high-resolution ground-truth labels in the form of disparity…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: CVPR 2022, New Orleans. Project page: https://cvlab-unibo.github.io/rgb-ms-web/

  48. arXiv:2206.05194  [pdf, other]

    cs.CV cs.LG

    Learning the Space of Deep Models

    Authors: Gianluca Berardi, Luca De Luigi, Samuele Salti, Luigi Di Stefano

    Abstract: Embedding of large but redundant data, such as images or text, in a hierarchy of lower-dimensional spaces is one of the key features of representation learning approaches, which nowadays provide state-of-the-art solutions to problems once believed hard or impossible to solve. In this work, in a plot twist with a strong meta aftertaste, we show how trained deep models are as redundant as the data t…

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted at ICPR2022

  49. arXiv:2206.04671  [pdf, other]

    cs.CV

    Open Challenges in Deep Stereo: the Booster Dataset

    Authors: Pierluigi Zama Ramirez, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

    Abstract: We present a novel high-resolution and challenging stereo dataset framing indoor scenes annotated with dense and accurate ground-truth disparities. Peculiar to our dataset is the presence of several specular and transparent surfaces, i.e. the main causes of failures for state-of-the-art stereo networks. Our acquisition pipeline leverages a novel deep space-time stereo framework which allows for ea…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: CVPR 2022, New Orleans. Project page: https://cvlab-unibo.github.io/booster-web/

  50. arXiv:2205.15703  [pdf, other]

    eess.SY cs.LG

    Lessons Learned from Data-Driven Building Control Experiments: Contrasting Gaussian Process-based MPC, Bilevel DeePC, and Deep Reinforcement Learning

    Authors: Loris Di Natale, Yingzhao Lian, Emilio T. Maddalena, Jicheng Shi, Colin N. Jones

    Abstract: This manuscript offers the perspective of experimentalists on a number of modern data-driven techniques: model predictive control relying on Gaussian processes, adaptive data-driven control based on behavioral theory, and deep reinforcement learning. These techniques are compared in terms of data requirements, ease of use, computational burden, and robustness in the context of real-world applicati…

    Submitted 31 May, 2022; originally announced May 2022.