Skip to main content

Showing 1–13 of 13 results for author: Kijak, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.13936  [pdf, other

    cs.CL cs.CV

    Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning

    Authors: Antoine Chaffin, Ewa Kijak, Vincent Claveau

    Abstract: Training image captioning models using teacher forcing results in very generic samples, whereas more distinctive captions can be very useful in retrieval applications or to produce alternative texts describing images for accessibility. Reinforcement Learning (RL) allows to use cross-modal retrieval similarity score between the generated caption and the input image as reward to guide the training,… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  2. arXiv:2311.05538  [pdf, other

    cs.LG cs.CV

    Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond Examples

    Authors: Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

    Abstract: Mixup refers to interpolation-based data augmentation, originally motivated as a way to go beyond empirical risk minimization (ERM). Its extensions mostly focus on the definition of interpolation and the space (input or feature) where it takes place, while the augmentation process itself is less studied. In most methods, the number of generated examples is limited to the mini-batch size and the nu… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023. arXiv admin note: substantial text overlap with arXiv:2206.14868

  3. arXiv:2311.02502  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.RO

    MAAIP: Multi-Agent Adversarial Interaction Priors for imitation from fighting demonstrations for physics-based characters

    Authors: Mohamed Younes, Ewa Kijak, Richard Kulpa, Simon Malinowski, Franck Multon

    Abstract: Simulating realistic interaction and motions for physics-based characters is of great interest for interactive applications, and automatic secondary character animation in the movie and video game industries. Recent works in reinforcement learning have proposed impressive results for single character simulation, especially the ones that use imitation learning based techniques. However, imitating m… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: SCA'23, Supplementary video: https://youtu.be/wQfIiw_rQ3w

    MSC Class: 68U99 ACM Class: I.3.8; I.3.m

    Journal ref: ACM SIGGRAPH / Eurographics Symposium on Computer Animation (SCA), August 4-6, 2023, Los Angeles, CA, USA

  4. arXiv:2306.14840  [pdf, other

    eess.IV cs.CV

    Building Flyweight FLIM-based CNNs with Adaptive Decoding for Object Detection

    Authors: Leonardo de Melo Joao, Azael de Melo e Sousa, Bianca Martins dos Santos, Silvio Jamil Ferzoli Guimaraes, Jancarlo Ferreira Gomes, Ewa Kijak, Alexandre Xavier Falcao

    Abstract: State-of-the-art (SOTA) object detection methods have succeeded in several applications at the price of relying on heavyweight neural networks, which makes them inefficient and inviable for many applications with computational resource constraints. This work presents a method to build a Convolutional Neural Network (CNN) layer by layer for object detection from user-drawn markers on discriminative… ▽ More

    Submitted 5 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

  5. arXiv:2206.14868  [pdf, other

    cs.LG cs.CV

    Teach me how to Interpolate a Myriad of Embeddings

    Authors: Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

    Abstract: Mixup refers to interpolation-based data augmentation, originally motivated as a way to go beyond empirical risk minimization (ERM). Yet, its extensions focus on the definition of interpolation and the space where it takes place, while the augmentation itself is less studied: For a mini-batch of size $m$, most methods interpolate between $m$ pairs with a single scalar interpolation factor $λ$. I… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

  6. Which Discriminator for Cooperative Text Generation?

    Authors: Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, Ewa Kijak, Vincent Claveau

    Abstract: Language models generate texts by successively predicting probability distributions for next tokens given past ones. A growing field of interest tries to leverage external information in the decoding process so that the generated texts have desired properties, such as being more natural, non toxic, faithful, or having a specific writing style. A solution is to use a classifier at each generation s… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 6 pages, 2 figures, accepted to SIGIR 2022

  7. arXiv:2201.12320  [pdf, other

    cs.LG cs.CL

    Generative Cooperative Networks for Natural Language Generation

    Authors: Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski

    Abstract: Generative Adversarial Networks (GANs) have known a tremendous success for many continuous generation tasks, especially in the field of image generation. However, for discrete outputs such as language, optimizing GANs remains an open problem with many instabilities, as no gradient can be properly back-propagated from the discriminator output to the generator parameters. An alternative is to learn… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  8. arXiv:2110.13016  [pdf, other

    cs.CL cs.IR

    Generating artificial texts as substitution or complement of training data

    Authors: Vincent Claveau, Antoine Chaffin, Ewa Kijak

    Abstract: The quality of artificially generated texts has considerably improved with the advent of transformers. The question of using these models to generate learning data for supervised learning tasks naturally arises. In this article, this question is explored under 3 aspects: (i) are artificial data an efficient complement? (ii) can they replace the original data when those are not available or cannot… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 8 pages

  9. arXiv:2109.13582  [pdf, other

    cs.CL

    PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding

    Authors: Antoine Chaffin, Vincent Claveau, Ewa Kijak

    Abstract: Large language models (LM) based on Transformers allow to generate plausible long texts. In this paper, we explore how this generation can be further controlled at decoding time to satisfy certain constraints (e.g. being non-toxic, conveying certain emotions, using a specific writing style, etc.) without fine-tuning the LM. Precisely, we formalize constrained generation as a tree exploration proce… ▽ More

    Submitted 4 May, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: 15 pages, 5 tables, 7 figures, accepted to NAACL 2022

  10. arXiv:2106.04990  [pdf, other

    cs.LG cs.CV

    It Takes Two to Tango: Mixup for Deep Metric Learning

    Authors: Shashanka Venkataramanan, Bill Psomas, Ewa Kijak, Laurent Amsaleg, Konstantinos Karantzalos, Yannis Avrithis

    Abstract: Metric learning involves learning a discriminative representation such that embeddings of similar classes are encouraged to be close, while embeddings of dissimilar classes are pushed far apart. State-of-the-art methods focus mostly on sophisticated loss functions or mining strategies. On the one hand, metric learning losses consider two or more examples at a time. On the other hand, modern data a… ▽ More

    Submitted 28 February, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR 2022

  11. arXiv:2103.15375  [pdf, other

    cs.CV

    AlignMixup: Improving Representations By Interpolating Aligned Features

    Authors: Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

    Abstract: Mixup is a powerful data augmentation method that interpolates between two or more examples in the input or feature space and between the corresponding target labels. Many recent mixup methods focus on cutting and pasting two or more objects into one image, which is more about efficient processing than interpolation. However, how to best interpolate images is not well defined. In this sense, mixup… ▽ More

    Submitted 25 March, 2022; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2022

  12. arXiv:2011.04971  [pdf, other

    cs.CV

    Detecting Human-Object Interaction with Mixed Supervision

    Authors: Suresh Kirthi Kumaraswamy, Miao**g Shi, Ewa Kijak

    Abstract: Human object interaction (HOI) detection is an important task in image understanding and reasoning. It is in a form of HOI triplet <human; verb; object>, requiring bounding boxes for human and object, and action between them for the task completion. In other words, this task requires strong supervision for training that is however hard to procure. A natural solution to overcome this is to pursue w… ▽ More

    Submitted 12 November, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: WACV 2021 - camera ready

  13. arXiv:1704.03755  [pdf, other

    cs.CV cs.IR

    Unsupervised part learning for visual recognition

    Authors: Ronan Sicre, Yannis Avrithis, Ewa Kijak, Frederic Jurie

    Abstract: Part-based image classification aims at representing categories by small sets of learned discriminative parts, upon which an image representation is built. Considered as a promising avenue a decade ago, this direction has been neglected since the advent of deep neural networks. In this context, this paper brings two contributions: first, it shows that despite the recent success of end-to-end holis… ▽ More

    Submitted 12 April, 2017; originally announced April 2017.