Showing 1–8 of 8 results for author: Estevam, V

  1. Leveraging Model Fusion for Improved License Plate Recognition

    Authors: Rayson Laroca, Luiz A. Zanlorensi, Valter Estevam, Rodrigo Minetto, David Menotti

    Abstract: License Plate Recognition (LPR) plays a critical role in various applications, such as toll collection, parking management, and traffic law enforcement. Although LPR has witnessed significant advancements through the development of deep learning, there has been a noticeable lack of studies exploring the potential improvements in results by fusing the outputs from multiple recognition models. This…

    Submitted 5 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted for presentation at the Iberoamerican Congress on Pattern Recognition (CIARP) 2023

  2. Do We Train on Test Data? The Impact of Near-Duplicates on License Plate Recognition

    Authors: Rayson Laroca, Valter Estevam, Alceu S. Britto Jr., Rodrigo Minetto, David Menotti

    Abstract: This work draws attention to the large fraction of near-duplicates in the training and test sets of datasets widely adopted in License Plate Recognition (LPR) research. These duplicates refer to images that, although different, show the same license plate. Our experiments, conducted on the two most popular datasets in the field, show a substantial decrease in recognition rate when six well-known m…

    Submitted 4 August, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2023

  3. Global Semantic Descriptors for Zero-Shot Action Recognition

    Authors: Valter Estevam, Rayson Laroca, Helio Pedrini, David Menotti

    Abstract: The success of Zero-shot Action Recognition (ZSAR) methods is intrinsically related to the nature of semantic side information used to transfer knowledge, although this aspect has not been primarily investigated in the literature. This work introduces a new ZSAR method based on the relationships of actions-objects and actions-descriptive sentences. We demonstrate that representing all object class…

    Submitted 24 September, 2022; originally announced September 2022.

    Journal ref: IEEE Signal Processing Letters, vol. 29, pp. 1843-1847, 2022

  4. A First Look at Dataset Bias in License Plate Recognition

    Authors: Rayson Laroca, Marcelo Santos, Valter Estevam, Eduardo Luz, David Menotti

    Abstract: Public datasets have played a key role in advancing the state of the art in License Plate Recognition (LPR). Although dataset bias has been recognized as a severe problem in the computer vision community, it has been largely overlooked in the LPR literature. LPR models are usually trained and evaluated separately on each dataset. In this scenario, they have often proven robust in the dataset they…

    Submitted 30 December, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  5. On the Cross-dataset Generalization in License Plate Recognition

    Authors: Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam, David Menotti

    Abstract: Automatic License Plate Recognition (ALPR) systems have shown remarkable performance on license plates (LPs) from multiple regions due to advances in deep learning and the increasing availability of datasets. The evaluation of deep ALPR systems is usually done within each dataset; therefore, it is questionable if such results are a reliable indicator of generalization ability. In this paper, we pr…

    Submitted 21 December, 2022; v1 submitted 1 January, 2022; originally announced January 2022.

    Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP) 2022

  6. Tell me what you see: A zero-shot action recognition method based on natural language descriptions

    Authors: Valter Estevam, Rayson Laroca, David Menotti, Helio Pedrini

    Abstract: This paper presents a novel approach to Zero-Shot Action Recognition. Recent works have explored the detection and classification of objects to obtain semantic information from videos with remarkable performance. Inspired by them, we propose using video captioning methods to extract semantic information about objects, scenes, humans, and their relationships. To the best of our knowledge, this is t…

    Submitted 11 September, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: Published at Multimedia Tools and Applications

  7. Dense Video Captioning Using Unsupervised Semantic Information

    arXiv:2112.08455 [cs.CV]

    Authors: Valter Estevam, Rayson Laroca, Helio Pedrini, David Menotti

    Abstract: We introduce a method to learn unsupervised semantic visual information based on the premise that complex events (e.g., minutes) can be decomposed into simpler events (e.g., a few seconds), and that these simple events are shared across several complex events. We split a long video into short frame sequences to extract their latent representation with three-dimensional convolutional neural network…

    Submitted 15 December, 2021; originally announced December 2021.

  8. Zero-Shot Action Recognition in Videos: A Survey

    Authors: Valter Estevam, Helio Pedrini, David Menotti

    Abstract: Zero-Shot Action Recognition has attracted attention in the last years and many approaches have been proposed for recognition of objects, events and actions in images and videos. There is a demand for methods that can classify instances from classes that are not present in the training of models, especially in the complex problem of automatic video understanding, since collecting, annotating and l…

    Submitted 17 November, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: Preprint