Skip to main content

Showing 1–5 of 5 results for author: Giró-i-Nieto, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2303.05007  [pdf, other

    cs.CR cs.CV cs.MM cs.SD eess.AS

    Towards Robust Image-in-Audio Deep Steganography

    Authors: Jaume Ros, Margarita Geleta, Jordi Pons, Xavier Giro-i-Nieto

    Abstract: The field of steganography has experienced a surge of interest due to the recent advancements in AI-powered techniques, particularly in the context of multimodal setups that enable the concealment of signals within signals of a different nature. The primary objectives of all steganographic methods are to achieve perceptual transparency, robustness, and large embedding capacity - which often presen… ▽ More

    Submitted 14 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 8 pages, 5 figures, 2 tables

    MSC Class: 68T99 ACM Class: I.4.9; I.2.m

  2. arXiv:2209.03027  [pdf, other

    cs.CV cs.AI eess.IV

    SIRA: Relightable Avatars from a Single Image

    Authors: Pol Caselles, Eduard Ramon, Jaime Garcia, Xavier Giro-i-Nieto, Francesc Moreno-Noguer, Gil Triginer

    Abstract: Recovering the geometry of a human head from a single image, while factorizing the materials and illumination is a severely ill-posed problem that requires prior information to be solved. Methods based on 3D Morphable Models (3DMM), and their combination with differentiable renderers, have shown promising results. However, the expressiveness of 3DMMs is limited, and they typically yield over-smoot… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  3. arXiv:2106.09814  [pdf, other

    cs.MM cs.SD eess.AS

    PixInWav: Residual Steganography for Hiding Pixels in Audio

    Authors: Margarita Geleta, Cristina Punti, Kevin McGuinness, Jordi Pons, Cristian Canton, Xavier Giro-i-Nieto

    Abstract: Steganography comprises the mechanics of hiding data in a host media that may be publicly available. While previous works focused on unimodal setups (e.g., hiding images in images, or hiding audio in audio), PixInWav targets the multimodal case of hiding images in audio. To this end, we propose a novel residual architecture operating on top of short-time discrete cosine transform (STDCT) audio spe… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Extended abstract presented in CVPR 2021 Women in Computer Vision Workshop

  4. arXiv:1908.08856  [pdf, other

    eess.IV cs.CV cs.LG

    Assessing Knee OA Severity with CNN attention-based end-to-end architectures

    Authors: Marc Górriz, Joseph Antony, Kevin McGuinness, Xavier Giró-i-Nieto, Noel E. O'Connor

    Abstract: This work proposes a novel end-to-end convolutional neural network (CNN) architecture to automatically quantify the severity of knee osteoarthritis (OA) using X-Ray images, which incorporates trainable attention modules acting as unsupervised fine-grained detectors of the region of interest (ROI). The proposed attention modules can be applied at different levels and scales across any CNN pipeline… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Proceedings of the 2nd International Conference on Medical Imaging with Deep Learning

    Journal ref: Proceedings of The 2nd International Conference on Medical Imaging with Deep Learning, PMLR 102:197-214, 2019

  5. arXiv:1801.02200  [pdf, other

    cs.IR cs.CV cs.SD eess.AS

    Cross-modal Embeddings for Video and Audio Retrieval

    Authors: Didac Surís, Amanda Duarte, Amaia Salvador, Jordi Torres, Xavier Giró-i-Nieto

    Abstract: The increasing amount of online videos brings several opportunities for training self-supervised neural networks. The creation of large scale datasets of videos such as the YouTube-8M allows us to deal with this large amount of data in manageable way. In this work, we find new ways of exploiting this dataset by taking advantage of the multi-modal information it provides. By means of a neural netwo… ▽ More

    Submitted 7 January, 2018; originally announced January 2018.

    Comments: 6 pages, 3 figures