Skip to main content

Showing 1–7 of 7 results for author: Santos, S F d

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01375  [pdf, other

    cs.CV

    TransferAttn: Transferable-guided Attention Is All You Need for Video Domain Adaptation

    Authors: André Sacilotti, Samuel Felipe dos Santos, Nicu Sebe, Jurandy Almeida

    Abstract: Unsupervised domain adaptation (UDA) in videos is a challenging task that remains not well explored compared to image-based UDA techniques. Although vision transformers (ViT) achieve state-of-the-art performance in many computer vision tasks, their use in video domain adaptation has still been little explored. Our key idea is to use the transformer layers as a feature encoder and incorporate spati… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2309.11464  [pdf, other

    cs.CV

    Budget-Aware Pruning: Handling Multiple Domains with Less Parameters

    Authors: Samuel Felipe dos Santos, Rodrigo Berriel, Thiago Oliveira-Santos, Nicu Sebe, Jurandy Almeida

    Abstract: Deep learning has achieved state-of-the-art performance on several computer vision tasks and domains. Nevertheless, it still has a high computational cost and demands a significant amount of parameters. Such requirements hinder the use in resource-limited environments and demand both software and hardware optimization. Another limitation is that deep models are usually specialized into a single do… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2210.08101

  3. arXiv:2309.11417   

    cs.CV

    CNNs for JPEGs: A Study in Computational Cost

    Authors: Samuel Felipe dos Santos, Nicu Sebe, Jurandy Almeida

    Abstract: Convolutional neural networks (CNNs) have achieved astonishing advances over the past decade, defining state-of-the-art in several computer vision tasks. CNNs are capable of learning robust representations of the data directly from the RGB pixels. However, most image data are usually available in compressed format, from which the JPEG is the most widely used due to transmission and storage purpose… ▽ More

    Submitted 22 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: A previous version of this work had already been submitted to ArXiv and is available at arXiv:2012.14426. Instead of maintaining two different submissions, we decided to submit a replacement for the previous submission

  4. Budget-Aware Pruning for Multi-Domain Learning

    Authors: Samuel Felipe dos Santos, Rodrigo Berriel, Thiago Oliveira-Santos, Nicu Sebe, Jurandy Almeida

    Abstract: Deep learning has achieved state-of-the-art performance on several computer vision tasks and domains. Nevertheless, it still has a high computational cost and demands a significant amount of parameters. Such requirements hinder the use in resource-limited environments and demand both software and hardware optimization. Another limitation is that deep models are usually specialized into a single do… ▽ More

    Submitted 16 September, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Journal ref: 22nd International Conference on Image Analysis and Processing (ICIAP'23), 2023, pp. 477-489

  5. Less is More: Accelerating Faster Neural Networks Straight from JPEG

    Authors: Samuel Felipe dos Santos, Jurandy Almeida

    Abstract: Most image data available are often stored in a compressed format, from which JPEG is the most widespread. To feed this data on a convolutional neural network (CNN), a preliminary decoding process is required to obtain RGB pixels, demanding a high computational load and memory usage. For this reason, the design of CNNs for processing JPEG compressed data has gained attention in recent years. In mo… ▽ More

    Submitted 24 August, 2022; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.14426

    Journal ref: in 2021 25th Iberoamerican Congress on Pattern Recognition (CIARP), 2021, pp. 237-247

  6. arXiv:2012.14426  [pdf, other

    cs.CV

    CNNs for JPEGs: A Study in Computational Cost

    Authors: Samuel Felipe dos Santos, Nicu Sebe, Jurandy Almeida

    Abstract: Convolutional neural networks (CNNs) have achieved astonishing advances over the past decade, defining state-of-the-art in several computer vision tasks. CNNs are capable of learning robust representations of the data directly from the RGB pixels. However, most image data are usually available in compressed format, from which the JPEG is the most widely used due to transmission and storage purpose… ▽ More

    Submitted 22 September, 2023; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.13726

  7. Faster and Accurate Compressed Video Action Recognition Straight from the Frequency Domain

    Authors: Samuel Felipe dos Santos, Jurandy Almeida

    Abstract: Human action recognition has become one of the most active field of research in computer vision due to its wide range of applications, like surveillance, medical, industrial environments, smart homes, among others. Recently, deep learning has been successfully used to learn powerful and interpretable features for recognizing human actions in videos. Most of the existing deep learning approaches ha… ▽ More

    Submitted 26 December, 2020; originally announced December 2020.

    Journal ref: in 2020 33rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2020, pp. 62-68