Skip to main content

Showing 1–30 of 30 results for author: Ferrer, C C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07029  [pdf, other

    cs.LG

    Fairness-Aware Meta-Learning via Nash Bargaining

    Authors: Yi Zeng, Xuelin Yang, Li Chen, Cristian Canton Ferrer, Ming **, Michael I. Jordan, Ruoxi Jia

    Abstract: To address issues of group-level fairness in machine learning, it is natural to adjust model parameters based on specific fairness objectives over a sensitive-attributed validation set. Such an adjustment procedure can be cast within a meta-learning framework. However, naive integration of fairness goals via meta-learning can cause hypergradient conflicts for subgroups, resulting in unstable conve… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2401.16247  [pdf, other

    cs.CL cs.CY

    Towards Red Teaming in Multimodal and Multilingual Translation

    Authors: Christophe Ropers, David Dale, Prangthip Hansanti, Gabriel Mejia Gonzalez, Ivan Evtimov, Corinne Wong, Christophe Touret, Kristina Pereyra, Seohyun Sonia Kim, Cristian Canton Ferrer, Pierre Andrews, Marta R. Costa-jussà

    Abstract: Assessing performance in Natural Language Processing is becoming increasingly complex. One particular challenge is the potential for evaluation datasets to overlap with training data, either directly or indirectly, which can lead to skewed results and overestimation of model performance. As a consequence, human evaluation is gaining increasing interest as a means to assess the performance and reli… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.05187

    ACM Class: I.2.7

  3. arXiv:2310.15848  [pdf, other

    cs.LG cs.CV

    On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms

    Authors: Surbhi Mittal, Kartik Thakral, Richa Singh, Mayank Vatsa, Tamar Glaser, Cristian Canton Ferrer, Tal Hassner

    Abstract: Artificial Intelligence (AI) has made its way into various scientific fields, providing astonishing improvements over existing algorithms for a wide variety of tasks. In recent years, there have been severe concerns over the trustworthiness of AI technologies. The scientific community has focused on the development of trustworthy AI algorithms. However, machine and deep learning algorithms, popula… ▽ More

    Submitted 24 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: corrected typos

  4. arXiv:2309.15251  [pdf, other

    cs.CV cs.AI

    VPA: Fully Test-Time Visual Prompt Adaptation

    Authors: Jiachen Sun, Mark Ibrahim, Melissa Hall, Ivan Evtimov, Z. Morley Mao, Cristian Canton Ferrer, Caner Hazirbas

    Abstract: Textual prompt tuning has demonstrated significant performance improvements in adapting natural language processing models to a variety of downstream tasks by treating hand-engineered prompts as trainable parameters. Inspired by the success of textual prompting, several studies have investigated the efficacy of visual prompt tuning. In this work, we present Visual Prompt Adaptation (VPA), the firs… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  5. arXiv:2308.12950  [pdf, other

    cs.CL

    Code Llama: Open Foundation Models for Code

    Authors: Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, **gyu Liu, Romain Sauvestre, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom , et al. (1 additional authors not shown)

    Abstract: We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following ability for programming tasks. We provide multiple flavors to cover a wide range of applications: foundation models (Code Llama), Python specializations (Code Llama… ▽ More

    Submitted 31 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  6. arXiv:2307.09288  [pdf, other

    cs.CL cs.AI

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    Authors: Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini , et al. (43 additional authors not shown)

    Abstract: In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be… ▽ More

    Submitted 19 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  7. arXiv:2306.11710  [pdf, other

    cs.CV

    Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis

    Authors: Maxim Maximov, Tim Meinhardt, Ismail Elezi, Zoe Papakipos, Caner Hazirbas, Cristian Canton Ferrer, Laura Leal-Taixé

    Abstract: The advent of data-driven technology solutions is accompanied by an increasing concern with data privacy. This is of particular importance for human-centered image recognition tasks, such as pedestrian detection, re-identification, and tracking. To highlight the importance of privacy issues and motivate future research, we motivate and introduce the Pedestrian Dataset De-Identification (PDI) task.… ▽ More

    Submitted 22 June, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  8. arXiv:2303.04838  [pdf, other

    cs.CV cs.AI cs.CL cs.CY

    The Casual Conversations v2 Dataset

    Authors: Bilal Porgali, Vítor Albiero, Jordan Ryda, Cristian Canton Ferrer, Caner Hazirbas

    Abstract: This paper introduces a new large consent-driven dataset aimed at assisting in the evaluation of algorithmic bias and robustness of computer vision and audio speech models in regards to 11 attributes that are self-provided or labeled by trained annotators. The dataset includes 26,467 videos of 5,567 unique paid participants, with an average of almost 5 videos per person, recorded in Brazil, India,… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  9. arXiv:2212.04825  [pdf, other

    cs.CV

    A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others

    Authors: Zhiheng Li, Ivan Evtimov, Albert Gordo, Caner Hazirbas, Tal Hassner, Cristian Canton Ferrer, Chenliang Xu, Mark Ibrahim

    Abstract: Machine learning models have been found to learn shortcuts -- unintended decision rules that are unable to generalize -- undermining models' reliability. Previous works address this problem under the tenuous assumption that only a single shortcut exists in the training data. Real-world images are rife with multiple visual cues from background to texture. Key to advancing the reliability of vision… ▽ More

    Submitted 21 March, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: CVPR 2023. Code is available at https://github.com/facebookresearch/Whac-A-Mole

  10. arXiv:2211.05809  [pdf, other

    cs.CV cs.AI cs.CL cs.CY

    Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness

    Authors: Caner Hazirbas, Ye** Bang, Tiezheng Yu, Parisa Assar, Bilal Porgali, Vítor Albiero, Stefan Hermanek, Jacqueline Pan, Emily McReynolds, Miranda Bogen, Pascale Fung, Cristian Canton Ferrer

    Abstract: Develo** robust and fair AI systems require datasets with comprehensive set of labels that can help ensure the validity and legitimacy of relevant measurements. Recent efforts, therefore, focus on collecting person-related datasets that have carefully selected labels, including sensitive characteristics, and consent forms in place to use those attributes for model testing and development. Respon… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  11. arXiv:2203.17260  [pdf, other

    cs.CV cs.LG

    Generating High Fidelity Data from Low-density Regions using Diffusion Models

    Authors: Vikash Sehwag, Caner Hazirbas, Albert Gordo, Firat Ozgenel, Cristian Canton Ferrer

    Abstract: Our work focuses on addressing sample deficiency from low-density regions of data manifold in common image datasets. We leverage diffusion process based generative models to synthesize novel images from low-density regions. We observe that uniform sampling from diffusion models predominantly samples from high-density regions of the data manifold. Therefore, we modify the sampling process to guide… ▽ More

    Submitted 26 June, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 (fixed some discrepancies in notation - v2)

  12. arXiv:2202.04007  [pdf, other

    cs.CV cs.LG

    Results and findings of the 2021 Image Similarity Challenge

    Authors: Zoë Papakipos, Giorgos Tolias, Tomas Jenicek, Ed Pizzi, Shuhei Yokoo, Wenhao Wang, Yifan Sun, Weipu Zhang, Yi Yang, Sanjay Addicam, Sergio Manuel Papadakis, Cristian Canton Ferrer, Ondrej Chum, Matthijs Douze

    Abstract: The 2021 Image Similarity Challenge introduced a dataset to serve as a new benchmark to evaluate recent image copy detection methods. There were 200 participants to the competition. This paper presents a quantitative and qualitative analysis of the top submissions. It appears that the most difficult image transformations involve either severe image crops or hiding into unrelated images, combined w… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  13. arXiv:2106.09672  [pdf, other

    cs.CV

    The 2021 Image Similarity Dataset and Challenge

    Authors: Matthijs Douze, Giorgos Tolias, Ed Pizzi, Zoë Papakipos, Lowik Chanussot, Filip Radenovic, Tomas Jenicek, Maxim Maximov, Laura Leal-Taixé, Ismail Elezi, Ondřej Chum, Cristian Canton Ferrer

    Abstract: This paper introduces a new benchmark for large-scale image similarity detection. This benchmark is used for the Image Similarity Challenge at NeurIPS'21 (ISC2021). The goal is to determine whether a query image is a modified copy of any image in a reference corpus of size 1~million. The benchmark features a variety of image transformations such as automated transformations, hand-crafted image edi… ▽ More

    Submitted 21 February, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

  14. arXiv:2106.09222  [pdf, other

    stat.ML cs.CR cs.CV cs.LG

    Localized Uncertainty Attacks

    Authors: Ousmane Amadou Dia, Theofanis Karaletsos, Caner Hazirbas, Cristian Canton Ferrer, Ilknur Kaynar Kabul, Erik Meijer

    Abstract: The susceptibility of deep learning models to adversarial perturbations has stirred renewed attention in adversarial examples resulting in a number of attacks. However, most of these attacks fail to encompass a large spectrum of adversarial perturbations that are imperceptible to humans. In this paper, we present localized uncertainty attacks, a novel class of threat models against deterministic a… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: CVPR 2021 Workshop on Adversarial Machine Learning in Computer Vision

  15. arXiv:2104.02821  [pdf, other

    cs.CV cs.AI cs.LG

    Towards Measuring Fairness in AI: the Casual Conversations Dataset

    Authors: Caner Hazirbas, Joanna Bitton, Brian Dolhansky, Jacqueline Pan, Albert Gordo, Cristian Canton Ferrer

    Abstract: This paper introduces a novel dataset to help researchers evaluate their computer vision and audio models for accuracy across a diverse set of age, genders, apparent skin tones and ambient lighting conditions. Our dataset is composed of 3,011 subjects and contains over 45,000 videos, with an average of 15 videos per person. The videos were recorded in multiple U.S. states with a diverse set of adu… ▽ More

    Submitted 3 November, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

  16. arXiv:2011.12902  [pdf, other

    cs.CV cs.AI cs.CR

    Adversarial Evaluation of Multimodal Models under Realistic Gray Box Assumption

    Authors: Ivan Evtimov, Russel Howes, Brian Dolhansky, Hamed Firooz, Cristian Canton Ferrer

    Abstract: This work examines the vulnerability of multimodal (image + text) models to adversarial threats similar to those discussed in previous literature on unimodal (image- or text-only) models. We introduce realistic assumptions of partial model knowledge and access, and discuss how these assumptions differ from the standard "black-box"/"white-box" dichotomy common in current literature on adversarial a… ▽ More

    Submitted 9 June, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  17. arXiv:2011.09957  [pdf, other

    cs.CV

    Adversarial Threats to DeepFake Detection: A Practical Perspective

    Authors: Paarth Neekhara, Brian Dolhansky, Joanna Bitton, Cristian Canton Ferrer

    Abstract: Facially manipulated images and videos or DeepFakes can be used maliciously to fuel misinformation or defame individuals. Therefore, detecting DeepFakes is crucial to increase the credibility of social media platforms and other media sharing web sites. State-of-the art DeepFake detection techniques rely on neural network based classification models which are known to be vulnerable to adversarial e… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

  18. arXiv:2011.09473  [pdf, other

    cs.CV

    Adversarial collision attacks on image hashing functions

    Authors: Brian Dolhansky, Cristian Canton Ferrer

    Abstract: Hashing images with a perceptual algorithm is a common approach to solving duplicate image detection problems. However, perceptual image hashing algorithms are differentiable, and are thus vulnerable to gradient-based adversarial attacks. We demonstrate that not only is it possible to modify an image to produce an unrelated hash, but an exact image hash collision between a source and target image… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

  19. arXiv:2009.10311  [pdf, other

    cs.SI cs.AI

    Preserving Integrity in Online Social Networks

    Authors: Alon Halevy, Cristian Canton Ferrer, Hao Ma, Umut Ozertem, Patrick Pantel, Marzieh Saeidi, Fabrizio Silvestri, Ves Stoyanov

    Abstract: Online social networks provide a platform for sharing information and free expression. However, these networks are also used for malicious purposes, such as distributing misinformation and hate speech, selling illegal drugs, and coordinating sex trafficking or child exploitation. This paper surveys the state of the art in kee** online platforms and their users safe from such harm, also known as… ▽ More

    Submitted 25 September, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

  20. arXiv:2006.07397  [pdf, other

    cs.CV cs.LG

    The DeepFake Detection Challenge (DFDC) Dataset

    Authors: Brian Dolhansky, Joanna Bitton, Ben Pflaum, Jikuo Lu, Russ Howes, Menglin Wang, Cristian Canton Ferrer

    Abstract: Deepfakes are a recent off-the-shelf manipulation technique that allows anyone to swap two identities in a single video. In addition to Deepfakes, a variety of GAN-based face swap** methods have also been published with accompanying code. To counter this emerging threat, we have constructed an extremely large face swap video dataset to enable the training of detection models, and organized the a… ▽ More

    Submitted 27 October, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  21. arXiv:1912.06895  [pdf, ps, other

    cs.CV cs.CR cs.LG eess.IV

    Deep Poisoning: Towards Robust Image Data Sharing against Visual Disclosure

    Authors: Hao Guo, Brian Dolhansky, Eric Hsin, Phong Dinh, Cristian Canton Ferrer, Song Wang

    Abstract: Due to respectively limited training data, different entities addressing the same vision task based on certain sensitive images may not train a robust deep network. This paper introduces a new vision task where various entities share task-specific image data to enlarge each other's training data volume without visually disclosing sensitive contents (e.g. illegal images). Then, we present a new str… ▽ More

    Submitted 8 November, 2020; v1 submitted 14 December, 2019; originally announced December 2019.

  22. arXiv:1910.08854  [pdf, other

    cs.CV cs.CY

    The Deepfake Detection Challenge (DFDC) Preview Dataset

    Authors: Brian Dolhansky, Russ Howes, Ben Pflaum, Nicole Baram, Cristian Canton Ferrer

    Abstract: In this paper, we introduce a preview of the Deepfakes Detection Challenge (DFDC) dataset consisting of 5K videos featuring two facial modification algorithms. A data collection campaign has been carried out where participating actors have entered into an agreement to the use and manipulation of their likenesses in our creation of the dataset. Diversity in several axes (gender, skin-tone, age, etc… ▽ More

    Submitted 23 October, 2019; v1 submitted 19 October, 2019; originally announced October 2019.

  23. arXiv:1910.02334  [pdf, other

    cs.MM cs.CL cs.CV

    Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation

    Authors: Benet Oriol Sabat, Cristian Canton Ferrer, Xavier Giro-i-Nieto

    Abstract: This work addresses the challenge of hate speech detection in Internet memes, and attempts using visual information to automatically detect hate speech, unlike any previous work of our knowledge. Memes are pixel-based multimedia documents that contain photos or illustrations together with phrases which, when combined, usually adopt a funny meaning. However, hate memes are also used to spread hate… ▽ More

    Submitted 5 October, 2019; originally announced October 2019.

    Comments: AI for Social Good Workshop at NeurIPS 2019 (short paper)

  24. arXiv:1712.03999  [pdf, other

    cs.CV cs.LG stat.ML

    Eye In-Painting with Exemplar Generative Adversarial Networks

    Authors: Brian Dolhansky, Cristian Canton Ferrer

    Abstract: This paper introduces a novel approach to in-painting where the identity of the object to remove or change is preserved and accounted for at inference time: Exemplar GANs (ExGANs). ExGANs are a type of conditional GAN that utilize exemplar information to produce high-quality, personalized in painting results. We propose using exemplar information in the form of a reference image of the region to i… ▽ More

    Submitted 11 December, 2017; originally announced December 2017.

  25. arXiv:1709.05775  [pdf, other

    cs.CV

    Social Style Characterization from Egocentric Photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Cristian Canton Ferrer, Petia Radeva

    Abstract: This paper proposes a system for automatic social pattern characterization using a wearable photo-camera. The proposed pipeline consists of three major steps. First, detection of people with whom the camera wearer interacts and, second, categorization of the detected social interactions into formal and informal. These two steps act at event-level where each potential social event is modeled as a m… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: International Conference on Computer Vision (ICCV). Workshop on Egocentric Percetion, Interaction and Computing

  26. arXiv:1709.01424  [pdf, other

    cs.CV

    Towards social pattern characterization in egocentric photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Cristian Canton Ferrer, Petia Radeva

    Abstract: Following the increasingly popular trend of social interaction analysis in egocentric vision, this manuscript presents a comprehensive study for automatic social pattern characterization of a wearable photo-camera user, by relying on the visual analysis of egocentric photo-streams. The proposed framework consists of three major steps. The first step is to detect social interactions of the user whe… ▽ More

    Submitted 9 January, 2018; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: 42 pages, 14 figures. Submitted to Elsevier, Computer Vision and Image Understanding (Under Review)

  27. arXiv:1707.04092  [pdf, other

    cs.CV cs.AI cs.MM

    Disentangling Motion, Foreground and Background Features in Videos

    Authors: Xunyu Lin, Victor Campos, Xavier Giro-i-Nieto, Jordi Torres, Cristian Canton Ferrer

    Abstract: This paper introduces an unsupervised framework to extract semantically rich features for video representation. Inspired by how the human visual system groups objects based on motion cues, we propose a deep convolutional neural network that disentangles motion, foreground and background information. The proposed architecture consists of a 3D convolutional feature encoder for blocks of 16 frames, w… ▽ More

    Submitted 17 July, 2017; v1 submitted 13 July, 2017; originally announced July 2017.

    Comments: Poster presented at the CVPR 2017 Workshop Brave New Ideas for Motion Representations in Videos

  28. arXiv:1701.01081  [pdf, other

    cs.CV

    SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

    Authors: Junting Pan, Cristian Canton Ferrer, Kevin McGuinness, Noel E. O'Connor, Jordi Torres, Elisa Sayrol, Xavier Giro-i-Nieto

    Abstract: We introduce SalGAN, a deep convolutional neural network for visual saliency prediction trained with adversarial examples. The first stage of the network consists of a generator model whose weights are learned by back-propagation computed from a binary cross entropy (BCE) loss over downsampled versions of the saliency maps. The resulting prediction is processed by a discriminator network trained t… ▽ More

    Submitted 1 July, 2018; v1 submitted 4 January, 2017; originally announced January 2017.

    Comments: Submitted for review to Computer Vision and Image Understanding (CVIU)

  29. arXiv:1608.01041  [pdf, other

    cs.CV

    Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution

    Authors: Emad Barsoum, Cha Zhang, Cristian Canton Ferrer, Zhengyou Zhang

    Abstract: Crowd sourcing has become a widely adopted scheme to collect ground truth labels. However, it is a well-known problem that these labels can be very noisy. In this paper, we demonstrate how to learn a deep convolutional neural network (DCNN) from noisy labels, using facial expression recognition as an example. More specifically, we have 10 taggers to label each input image, and compare four differe… ▽ More

    Submitted 23 September, 2016; v1 submitted 2 August, 2016; originally announced August 2016.

    Comments: Submitted to ICMI 2016

  30. arXiv:1604.07866  [pdf, other

    cs.LG cs.CV

    Learning by tracking: Siamese CNN for robust target association

    Authors: Laura Leal-Taixé, Cristian Canton Ferrer, Konrad Schindler

    Abstract: This paper introduces a novel approach to the task of data association within the context of pedestrian tracking, by introducing a two-stage learning scheme to match pairs of detections. First, a Siamese convolutional neural network (CNN) is trained to learn descriptors encoding local spatio-temporal structures between the two input image patches, aggregating pixel values and optical flow informat… ▽ More

    Submitted 4 August, 2016; v1 submitted 26 April, 2016; originally announced April 2016.

    Journal ref: Computer Vision and Pattern Recognition Conference Workshops (CVPRW). DeepVision: Deep Learning for Computer Vision. 2016