
Showing 1–15 of 15 results for author: Pizzati, F

  1. arXiv:2406.14563

    cs.CL cs.AI cs.LG

    Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

    Authors: Hasan Abed Al Kader Hammoud, Umberto Michieli, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem, Mete Ozay

    Abstract: Merging Large Language Models (LLMs) is a cost-effective technique for combining multiple expert LLMs into a single versatile model, retaining the expertise of the original ones. However, current approaches often overlook the importance of safety alignment during merging, leading to highly misaligned models. This work investigates the effects of model merging on alignment. We evaluate several popu…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2405.08597

    cs.LG

    Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Aaron Purewal, Csaba Botos, Fabro Steibel, Fazel Keshtkar, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Imperial, Juan Arturo Nolazco, Lori Landay, Matthew Jackson, Phillip H. S. Torr, Trevor Darrell, Yong Lee, Jakob Foerster

    Abstract: Applications of Generative AI (Gen AI) are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about the potential risks of the technology, and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This reg…

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Extension of arXiv:2404.17047

  3. arXiv:2404.17047

    cs.LG

    Near to Mid-term Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob Foerster

    Abstract: In the next few years, applications of Generative AI are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about potential risks and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation i…

    Submitted 24 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML'24 as a position paper

  4. arXiv:2404.08031

    cs.CV cs.AI cs.LG

    Latent Guard: a Safety Framework for Text-to-image Generation

    Authors: Runtao Liu, Ashkan Khakzar, Jindong Gu, Qifeng Chen, Philip Torr, Fabio Pizzati

    Abstract: With the ability to generate high-quality images, text-to-image (T2I) models can be exploited for creating inappropriate content. To prevent misuse, existing safety measures are either based on text blacklists, which can be easily circumvented, or harmful content classification, requiring large datasets for training and offering low flexibility. Hence, we propose Latent Guard, a framework designed…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: under review

  5. arXiv:2403.13808

    cs.CV cs.AI cs.LG

    On Pretraining Data Diversity for Self-Supervised Learning

    Authors: Hasan Abed Al Kader Hammoud, Tuhin Das, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem

    Abstract: We explore the impact of training with more diverse datasets, characterized by the number of unique samples, on the performance of self-supervised learning (SSL) under a fixed computational budget. Our findings consistently demonstrate that increasing pretraining data diversity enhances SSL performance, albeit only when the distribution distance to the downstream data is minimal. Notably, even wit…

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Under review

  6. arXiv:2402.01832

    cs.CV cs.AI cs.LG

    SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

    Authors: Hasan Abed Al Kader Hammoud, Hani Itani, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem

    Abstract: We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic text-image pairs, significantly departing from previous methods relying on real data. Leveraging recent text-to-image (TTI) generative networks and large language models (LLM), we are able to generate synthetic datasets of images and corresponding captions at any scale, with no human intervention. With trainin…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Under review

  7. arXiv:2311.17060

    cs.CV cs.GR

    Material Palette: Extraction of Materials from a Single Image

    Authors: Ivan Lopes, Fabio Pizzati, Raoul de Charette

    Abstract: In this paper, we propose a method to extract physically-based rendering (PBR) materials from a single real-world image. We do so in two steps: first, we map regions of the image to material concepts using a diffusion model, which allows the sampling of texture images resembling each material in the scene. Second, we benefit from a separate network to decompose the generated textures into Spatiall…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 8 pages, 11 figures, 2 tables. Webpage https://astra-vision.github.io/MaterialPalette/

  8. arXiv:2111.13681

    cs.CV cs.AI cs.LG cs.RO

    ManiFest: Manifold Deformation for Few-shot Image Translation

    Authors: Fabio Pizzati, Jean-François Lalonde, Raoul de Charette

    Abstract: Most image-to-image translation methods require a large number of training images, which restricts their applicability. We instead propose ManiFest: a framework for few-shot image translation that learns a context-aware representation of a target domain from a few images only. To enforce feature consistency, our framework learns a style manifold between source and proxy anchor domains (assumed to…

    Submitted 20 July, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  9. arXiv:2109.04468

    cs.CV cs.AI cs.LG cs.RO

    Leveraging Local Domains for Image-to-Image Translation

    Authors: Anthony Dell'Eva, Fabio Pizzati, Massimo Bertozzi, Raoul de Charette

    Abstract: Image-to-image (i2i) networks struggle to capture local changes because they do not affect the global scene structure. For example, translating from highway scenes to offroad, i2i networks easily focus on global color features but ignore obvious traits for humans like the absence of lane markings. In this paper, we leverage human knowledge about spatial domain characteristics which we refer to as…

    Submitted 14 February, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: VISAPP 2022 Best Paper Award

  10. arXiv:2107.14229

    cs.CV cs.AI cs.LG cs.RO

    Physics-informed Guided Disentanglement in Generative Networks

    Authors: Fabio Pizzati, Pietro Cerri, Raoul de Charette

    Abstract: Image-to-image translation (i2i) networks suffer from entanglement effects in the presence of physics-related phenomena in the target domain (such as occlusions, fog, etc.), lowering altogether the translation quality, controllability and variability. In this paper, we propose a general framework to disentangle visual traits in target images. Primarily, we build upon a collection of simple physics models, gu…

    Submitted 27 April, 2023; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: TPAMI 2023. Code: https://github.com/astra-vision/GuidedDisent

  11. arXiv:2103.06879

    cs.CV cs.AI cs.LG

    CoMoGAN: continuous model-guided image-to-image translation

    Authors: Fabio Pizzati, Pietro Cerri, Raoul de Charette

    Abstract: CoMoGAN is a continuous GAN relying on the unsupervised reorganization of the target data on a functional manifold. To that matter, we introduce a new Functional Instance Normalization layer and residual mechanism, which together disentangle image content from position on target manifold. We rely on naive physics-inspired models to guide the training while allowing private model/translations featu…

    Submitted 29 June, 2022; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: CVPR 2021 oral

  12. arXiv:2004.01071

    cs.CV cs.LG eess.IV

    Model-based occlusion disentanglement for image-to-image translation

    Authors: Fabio Pizzati, Pietro Cerri, Raoul de Charette

    Abstract: Image-to-image translation is affected by entanglement phenomena, which may occur in case of target data encompassing occlusions such as raindrops, dirt, etc. Our unsupervised model-based learning disentangles scene and occlusions, while benefiting from an adversarial pipeline to regress physical parameters of the occlusion model. The experiments demonstrate our method is able to handle varying ty…

    Submitted 20 July, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: ECCV 2020

  13. arXiv:1910.10563

    cs.CV cs.LG

    Domain Bridge for Unpaired Image-to-Image Translation and Unsupervised Domain Adaptation

    Authors: Fabio Pizzati, Raoul de Charette, Michela Zaccaria, Pietro Cerri

    Abstract: Image-to-image translation architectures may have limited effectiveness in some circumstances. For example, while generating rainy scenarios, they may fail to model typical traits of rain such as water drops, and this ultimately impacts the realism of the synthetic images. With our method, called domain bridge, web-crawled data are exploited to reduce the domain gap, leading to the inclusion of previously ign…

    Submitted 14 March, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: WACV 20 camera ready

  14. arXiv:1907.01294

    cs.CV

    Lane Detection and Classification using Cascaded CNNs

    Authors: Fabio Pizzati, Marco Allodi, Alejandro Barrera, Fernando García

    Abstract: Lane detection is extremely important for autonomous vehicles. For this reason, many approaches use lane boundary information to locate the vehicle inside the street, or to integrate GPS-based localization. As in many other computer-vision-based tasks, convolutional neural networks (CNNs) represent the state-of-the-art technology to identify lane boundaries. However, the position of the lane boundar…

    Submitted 17 July, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: Presented at Eurocast 2019

  15. arXiv:1905.00941

    cs.CV cs.LG cs.RO

    Enhanced free space detection in multiple lanes based on single CNN with scene identification

    Authors: Fabio Pizzati, Fernando García

    Abstract: Many systems for autonomous vehicles' navigation rely on lane detection. Traditional algorithms usually estimate only the position of the lanes on the road, but an autonomous control system may also need to know if a lane marking can be crossed or not, and what portion of space inside the lane is free from obstacles, to make safer control decisions. On the other hand, free space detection algorith…

    Submitted 6 May, 2019; v1 submitted 2 May, 2019; originally announced May 2019.

    Comments: Will appear in the 2019 IEEE Intelligent Vehicles Symposium (IV 2019)

    Journal ref: 2019 IEEE Intelligent Vehicles Symposium (IV)