Skip to main content

Showing 1–26 of 26 results for author: Laroca, R

.
  1. Leveraging Model Fusion for Improved License Plate Recognition

    Authors: Rayson Laroca, Luiz A. Zanlorensi, Valter Estevam, Rodrigo Minetto, David Menotti

    Abstract: License Plate Recognition (LPR) plays a critical role in various applications, such as toll collection, parking management, and traffic law enforcement. Although LPR has witnessed significant advancements through the development of deep learning, there has been a noticeable lack of studies exploring the potential improvements in results by fusing the outputs from multiple recognition models. This… ▽ More

    Submitted 5 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted for presentation at the Iberoamerican Congress on Pattern Recognition (CIARP) 2023

  2. Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers

    Authors: Valfride Nascimento, Rayson Laroca, Jorge de A. Lambert, William Robson Schwartz, David Menotti

    Abstract: Recent years have seen significant developments in the field of License Plate Recognition (LPR) through the integration of deep learning techniques and the increasing availability of training data. Nevertheless, reconstructing license plates (LPs) from low-resolution (LR) surveillance footage remains challenging. To address this issue, we introduce a Single-Image Super-Resolution (SISR) approach t… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Journal ref: Computers & Graphics, vol. 113, pp. 69-76, 2023

  3. Do We Train on Test Data? The Impact of Near-Duplicates on License Plate Recognition

    Authors: Rayson Laroca, Valter Estevam, Alceu S. Britto Jr., Rodrigo Minetto, David Menotti

    Abstract: This work draws attention to the large fraction of near-duplicates in the training and test sets of datasets widely adopted in License Plate Recognition (LPR) research. These duplicates refer to images that, although different, show the same license plate. Our experiments, conducted on the two most popular datasets in the field, show a substantial decrease in recognition rate when six well-known m… ▽ More

    Submitted 4 August, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2023

  4. DACov: A Deeper Analysis of Data Augmentation on the Computed Tomography Segmentation Problem

    Authors: Bruno A. Krinski, Daniel V. Ruiz, Rayson Laroca, Eduardo Todt

    Abstract: Due to the COVID-19 global pandemic, computer-assisted diagnoses of medical images have gained much attention, and robust methods of semantic segmentation of Computed Tomography (CT) images have become highly desirable. In this work, we present a deeper analysis of how data augmentation techniques improve segmentation performance on this problem. We evaluate 20 traditional augmentation techniques… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Journal ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 2023

  5. Combining Attention Module and Pixel Shuffle for License Plate Super-Resolution

    Authors: Valfride Nascimento, Rayson Laroca, Jorge de A. Lambert, William Robson Schwartz, David Menotti

    Abstract: The License Plate Recognition (LPR) field has made impressive advances in the last decade due to novel deep learning approaches combined with the increased availability of training data. However, it still has some open issues, especially when the data come from low-resolution (LR) and low-quality images/videos, as in surveillance systems. This work focuses on license plate (LP) reconstruction in L… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  6. Face Super-Resolution Using Stochastic Differential Equations

    Authors: Marcelo dos Santos, Rayson Laroca, Rafael O. Ribeiro, João Neves, Hugo Proença, David Menotti

    Abstract: Diffusion models have proven effective for various applications such as images, audio and graph generation. Other important applications are image super-resolution and the solution of inverse problems. More recently, some works have used stochastic differential equations (SDEs) to generalize diffusion models to continuous time. In this work, we introduce SDEs to generate super-resolution face imag… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  7. Global Semantic Descriptors for Zero-Shot Action Recognition

    Authors: Valter Estevam, Rayson Laroca, Helio Pedrini, David Menotti

    Abstract: The success of Zero-shot Action Recognition (ZSAR) methods is intrinsically related to the nature of semantic side information used to transfer knowledge, although this aspect has not been primarily investigated in the literature. This work introduces a new ZSAR method based on the relationships of actions-objects and actions-descriptive sentences. We demonstrate that representing all object class… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

    Journal ref: IEEE Signal Processing Letters, vol. 29, pp. 1843-1847, 2022

  8. A First Look at Dataset Bias in License Plate Recognition

    Authors: Rayson Laroca, Marcelo Santos, Valter Estevam, Eduardo Luz, David Menotti

    Abstract: Public datasets have played a key role in advancing the state of the art in License Plate Recognition (LPR). Although dataset bias has been recognized as a severe problem in the computer vision community, it has been largely overlooked in the LPR literature. LPR models are usually trained and evaluated separately on each dataset. In this scenario, they have often proven robust in the dataset they… ▽ More

    Submitted 30 December, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  9. Image-based Automatic Dial Meter Reading in Unconstrained Scenarios

    Authors: Gabriel Salomon, Rayson Laroca, David Menotti

    Abstract: The replacement of analog meters with smart meters is costly, laborious, and far from complete in develo** countries. The Energy Company of Parana (Copel) (Brazil) performs more than 4 million meter readings (almost entirely of non-smart devices) per month, and we estimate that 850 thousand of them are from dial meters. Therefore, an image-based automatic reading system can reduce human errors,… ▽ More

    Submitted 23 October, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Journal ref: Measurement, vol. 204, p. 112025, 2022

  10. On the Cross-dataset Generalization in License Plate Recognition

    Authors: Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam, David Menotti

    Abstract: Automatic License Plate Recognition (ALPR) systems have shown remarkable performance on license plates (LPs) from multiple regions due to advances in deep learning and the increasing availability of datasets. The evaluation of deep ALPR systems is usually done within each dataset; therefore, it is questionable if such results are a reliable indicator of generalization ability. In this paper, we pr… ▽ More

    Submitted 21 December, 2022; v1 submitted 1 January, 2022; originally announced January 2022.

    Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP) 2022

  11. Tell me what you see: A zero-shot action recognition method based on natural language descriptions

    Authors: Valter Estevam, Rayson Laroca, David Menotti, Helio Pedrini

    Abstract: This paper presents a novel approach to Zero-Shot Action Recognition. Recent works have explored the detection and classification of objects to obtain semantic information from videos with remarkable performance. Inspired by them, we propose using video captioning methods to extract semantic information about objects, scenes, humans, and their relationships. To the best of our knowledge, this is t… ▽ More

    Submitted 11 September, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: Published at Multimedia Tools and Applications

  12. arXiv:2112.08455  [pdf, other

    cs.CV

    Dense Video Captioning Using Unsupervised Semantic Information

    Authors: Valter Estevam, Rayson Laroca, Helio Pedrini, David Menotti

    Abstract: We introduce a method to learn unsupervised semantic visual information based on the premise that complex events (e.g., minutes) can be decomposed into simpler events (e.g., a few seconds), and that these simple events are shared across several complex events. We split a long video into short frame sequences to extract their latent representation with three-dimensional convolutional neural network… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  13. A New Periocular Dataset Collected by Mobile Devices in Unconstrained Scenarios

    Authors: Luiz A. Zanlorensi, Rayson Laroca, Diego R. Lucio, Lucas R. Santos, Alceu S. Britto Jr., David Menotti

    Abstract: Recently, ocular biometrics in unconstrained environments using images obtained at visible wavelength have gained the researchers' attention, especially with images captured by mobile devices. Periocular recognition has been demonstrated to be an alternative when the iris trait is not available due to occlusions or low image resolution. However, the periocular trait does not have the high uniquene… ▽ More

    Submitted 14 November, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Journal ref: Scientific Reports, vol. 12, p. 17989, 2022

  14. arXiv:2010.16307  [pdf, other

    cs.CV

    Automatic Counting and Identification of Train Wagons Based on Computer Vision and Deep Learning

    Authors: Rayson Laroca, Alessander Cidral Boslooper, David Menotti

    Abstract: In this work, we present a robust and efficient solution for counting and identifying train wagons using computer vision and deep learning. The proposed solution is cost-effective and can easily replace solutions based on radiofrequency identification (RFID), which are known to have high installation and maintenance costs. According to our experiments, our two-stage methodology achieves impressive… ▽ More

    Submitted 30 October, 2020; originally announced October 2020.

    Comments: An article about the proposed system has been published in the October 2020 issue of Railway Gazette International, the leading business journal for the worldwide rail industry

  15. Towards Image-based Automatic Meter Reading in Unconstrained Scenarios: A Robust and Efficient Approach

    Authors: Rayson Laroca, Alessandra B. Araujo, Luiz A. Zanlorensi, Eduardo C. de Almeida, David Menotti

    Abstract: Existing approaches for image-based Automatic Meter Reading (AMR) have been evaluated on images captured in well-controlled scenarios. However, real-world meter reading presents unconstrained scenarios that are way more challenging due to dirt, various lighting conditions, scale variations, in-plane and out-of-plane rotations, among other factors. In this work, we present an end-to-end approach fo… ▽ More

    Submitted 12 May, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

    Journal ref: IEEE Access, vol. 9, pp. 67569-67584, 2021

  16. Deep Learning for Image-based Automatic Dial Meter Reading: Dataset and Baselines

    Authors: Gabriel Salomon, Rayson Laroca, David Menotti

    Abstract: Smart meters enable remote and automatic electricity, water and gas consumption reading and are being widely deployed in developed countries. Nonetheless, there is still a huge number of non-smart meters in operation. Image-based Automatic Meter Reading (AMR) focuses on dealing with this type of meter readings. We estimate that the Energy Company of Paraná (Copel), in Brazil, performs more than 85… ▽ More

    Submitted 8 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: Accepted for presentation at the 2020 International Joint Conference on Neural Networks (IJCNN)

  17. Ocular Recognition Databases and Competitions: A Survey

    Authors: Luiz A. Zanlorensi, Rayson Laroca, Eduardo Luz, Alceu S. Britto Jr., Luiz S. Oliveira, David Menotti

    Abstract: The use of the iris and periocular region as biometric traits has been extensively investigated, mainly due to the singularity of the iris features and the use of the periocular region when the image resolution is not sufficient to extract iris information. In addition to providing information about an individual's identity, features extracted from these traits can also be explored to obtain other… ▽ More

    Submitted 4 February, 2022; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: Artificial Intelligence Review, vol. 55, pp. 129-180, 2022

  18. Vehicle-Rear: A New Dataset to Explore Feature Fusion for Vehicle Identification Using Convolutional Neural Networks

    Authors: Icaro O. de Oliveira, Rayson Laroca, David Menotti, Keiko V. O. Fonseca, Rodrigo Minetto

    Abstract: This work addresses the problem of vehicle identification through non-overlap** cameras. As our main contribution, we introduce a novel dataset for vehicle identification, called Vehicle-Rear, that contains more than three hours of high-resolution videos, with accurate information about the make, model, color and year of nearly 3,000 vehicles, in addition to the position and identification of th… ▽ More

    Submitted 25 July, 2021; v1 submitted 13 November, 2019; originally announced November 2019.

    Journal ref: IEEE Access, vol. 9, pp. 101065-101077, 2021

  19. An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detector

    Authors: Rayson Laroca, Luiz A. Zanlorensi, Gabriel R. Gonçalves, Eduardo Todt, William Robson Schwartz, David Menotti

    Abstract: This paper presents an efficient and layout-independent Automatic License Plate Recognition (ALPR) system based on the state-of-the-art YOLO object detector that contains a unified approach for license plate (LP) detection and layout classification to improve the recognition results using post-processing rules. The system is conceived by evaluating and optimizing different models, aiming at achiev… ▽ More

    Submitted 9 March, 2021; v1 submitted 4 September, 2019; originally announced September 2019.

    Journal ref: IET Intelligent Transport Systems, vol. 15, no. 4, pp. 483-503, 2021

  20. Simultaneous Iris and Periocular Region Detection Using Coarse Annotations

    Authors: Diego R. Lucio, Rayson Laroca, Luiz A. Zanlorensi, Gladston Moreira, David Menotti

    Abstract: In this work, we propose to detect the iris and periocular regions simultaneously using coarse annotations and two well-known object detectors: YOLOv2 and Faster R-CNN. We believe coarse annotations can be used in recognition systems based on the iris and periocular regions, given the much smaller engineering effort required to manually annotate the training images. We manually made coarse annotat… ▽ More

    Submitted 31 July, 2019; originally announced August 2019.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2019

  21. Convolutional Neural Networks for Automatic Meter Reading

    Authors: Rayson Laroca, Victor Barroso, Matheus A. Diniz, Gabriel R. Gonçalves, William Robson Schwartz, David Menotti

    Abstract: In this paper, we tackle Automatic Meter Reading (AMR) by leveraging the high capability of Convolutional Neural Networks (CNNs). We design a two-stage approach that employs the Fast-YOLO object detector for counter detection and evaluates three different CNN-based approaches for counter recognition. In the AMR literature, most datasets are not available to the research community since the images… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Journal ref: Journal of Electronic Imaging 28(1), 013023 (5 February 2019)

  22. Robust Iris Segmentation Based on Fully Convolutional Networks and Generative Adversarial Networks

    Authors: Cides S. Bezerra, Rayson Laroca, Diego R. Lucio, Evair Severo, Lucas F. Oliveira, Alceu S. Britto Jr., David Menotti

    Abstract: The iris can be considered as one of the most important biometric traits due to its high degree of uniqueness. Iris-based biometrics applications depend mainly on the iris segmentation whose suitability is not robust for different environments such as near-infrared (NIR) and visible (VIS) ones. In this paper, two approaches for robust iris segmentation based on Fully Convolutional Networks (FCNs)… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2018

  23. The Impact of Preprocessing on Deep Representations for Iris Recognition on Unconstrained Environments

    Authors: Luiz A. Zanlorensi, Eduardo Luz, Rayson Laroca, Alceu S. Britto Jr., Luiz S. Oliveira, David Menotti

    Abstract: The use of iris as a biometric trait is widely used because of its high level of distinction and uniqueness. Nowadays, one of the major research challenges relies on the recognition of iris images obtained in visible spectrum under unconstrained environments. In this scenario, the acquired iris are affected by capture distance, rotation, blur, motion blur, low contrast and specular reflection, cre… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2018

  24. Fully Convolutional Networks and Generative Adversarial Networks Applied to Sclera Segmentation

    Authors: Diego R. Lucio, Rayson Laroca, Evair Severo, Alceu S. Britto Jr., David Menotti

    Abstract: Due to the world's demand for security systems, biometrics can be seen as an important topic of research in computer vision. One of the biometric forms that has been gaining attention is the recognition based on sclera. The initial and paramount step for performing this type of recognition is the segmentation of the region of interest, i.e. the sclera. In this context, two approaches for such task… ▽ More

    Submitted 9 July, 2018; v1 submitted 22 June, 2018; originally announced June 2018.

    Comments: Accepted for presentation at the IEEE International Conference on Biometrics: Theory, Applications, and Systems (BTAS) 2018

  25. A Benchmark for Iris Location and a Deep Learning Detector Evaluation

    Authors: Evair Severo, Rayson Laroca, Cides S. Bezerra, Luiz A. Zanlorensi, Daniel Weingaertner, Gladston Moreira, David Menotti

    Abstract: The iris is considered as the biometric trait with the highest unique probability. The iris location is an important task for biometrics systems, affecting directly the results obtained in specific applications such as iris recognition, spoofing and contact lenses detection, among others. This work defines the iris location problem as the delimitation of the smallest squared window that encompasse… ▽ More

    Submitted 30 April, 2018; v1 submitted 3 March, 2018; originally announced March 2018.

    Comments: Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2018

  26. A Robust Real-Time Automatic License Plate Recognition Based on the YOLO Detector

    Authors: Rayson Laroca, Evair Severo, Luiz A. Zanlorensi, Luiz S. Oliveira, Gabriel Resende Gonçalves, William Robson Schwartz, David Menotti

    Abstract: Automatic License Plate Recognition (ALPR) has been a frequent topic of research due to many practical applications. However, many of the current solutions are still not robust in real-world situations, commonly depending on many constraints. This paper presents a robust and efficient ALPR system based on the state-of-the-art YOLO object detector. The Convolutional Neural Networks (CNNs) are train… ▽ More

    Submitted 28 April, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2018