Skip to main content

Showing 1–6 of 6 results for author: Peruzzo, E

.
  1. arXiv:2404.07990  [pdf, other

    cs.CV cs.AI

    OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

    Authors: Moreno D'IncĂ , Elia Peruzzo, Massimiliano Mancini, Dejia Xu, Vidit Goel, Xingqian Xu, Zhangyang Wang, Humphrey Shi, Nicu Sebe

    Abstract: Text-to-image generative models are becoming increasingly popular and accessible to the general public. As these models see large-scale deployments, it is necessary to deeply investigate their safety and fairness to not disseminate and perpetuate any kind of biases. However, existing works focus on detecting closed sets of biases defined a priori, limiting the studies to well-known concepts. In th… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Highlight - Code: https://github.com/Picsart-AI-Research/OpenBias

  2. arXiv:2401.02473  [pdf, other

    cs.CV

    VASE: Object-Centric Appearance and Shape Manipulation of Real Videos

    Authors: Elia Peruzzo, Vidit Goel, Dejia Xu, Xingqian Xu, Yifan Jiang, Zhangyang Wang, Humphrey Shi, Nicu Sebe

    Abstract: Recently, several works tackled the video editing task fostered by the success of large-scale text-to-image generative models. However, most of these methods holistically edit the frame using the text, exploiting the prior given by foundation diffusion models and focusing on improving the temporal consistency across frames. In this work, we introduce a framework that is object-centric and is desig… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Project Page https://helia95.github.io/vase-website/

  3. arXiv:2312.01800  [pdf, other

    cs.CV

    Collaborative Neural Painting

    Authors: Nicola Dall'Asen, Willi Menapace, Elia Peruzzo, Enver Sangineto, Yiming Wang, Elisa Ricci

    Abstract: The process of painting fosters creativity and rational planning. However, existing generative AI mostly focuses on producing visually pleasant artworks, without emphasizing the painting process. We introduce a novel task, Collaborative Neural Painting (CNP), to facilitate collaborative art painting generation between humans and machines. Given any number of user-input brushstrokes as the context… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Submitted to Computer Vision and Image Understanding, project website at https://fodark.github.io/collaborative-neural-painting/

  4. Interactive Neural Painting

    Authors: Elia Peruzzo, Willi Menapace, Vidit Goel, Federica Arrigoni, Hao Tang, Xingqian Xu, Arman Chopikyan, Nikita Orlov, Yuxiao Hu, Humphrey Shi, Nicu Sebe, Elisa Ricci

    Abstract: In the last few years, Neural Painting (NP) techniques became capable of producing extremely realistic artworks. This paper advances the state of the art in this emerging research domain by proposing the first approach for Interactive NP. Considering a setting where a user looks at a scene and tries to reproduce it on a painting, our objective is to develop a computational framework to assist the… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: This is a preprint version of the paper to appear at Computer Vision and Image Understanding (CVIU). The final journal version will be available at https://www.sciencedirect.com/science/article/pii/S1077314223001583

    Journal ref: 10.1016/j.cviu.2023.103778

  5. arXiv:2303.17546  [pdf, other

    cs.CV cs.AI cs.LG

    PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor

    Authors: Vidit Goel, Elia Peruzzo, Yifan Jiang, Dejia Xu, Xingqian Xu, Nicu Sebe, Trevor Darrell, Zhangyang Wang, Humphrey Shi

    Abstract: Generative image editing has recently witnessed extremely fast-paced growth. Some works use high-level conditioning such as text, while others use low-level conditioning. Nevertheless, most of them lack fine-grained control over the properties of the different objects present in the image, i.e. object-level image editing. In this work, we tackle the task by perceiving the images as an amalgamation… ▽ More

    Submitted 8 April, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR 2024, Project page https://vidit98.github.io/publication/conference-paper/pair_diff.html

  6. arXiv:2206.04636  [pdf, other

    cs.CV cs.LG

    Spatial Entropy as an Inductive Bias for Vision Transformers

    Authors: Elia Peruzzo, Enver Sangineto, Yahui Liu, Marco De Nadai, Wei Bi, Bruno Lepri, Nicu Sebe

    Abstract: Recent work on Vision Transformers (VTs) showed that introducing a local inductive bias in the VT architecture helps reducing the number of samples necessary for training. However, the architecture modifications lead to a loss of generality of the Transformer backbone, partially contradicting the push towards the development of uniform architectures, shared, e.g., by both the Computer Vision and t… ▽ More

    Submitted 14 March, 2023; v1 submitted 9 June, 2022; originally announced June 2022.