Showing 1–43 of 43 results for author: Menotti, D

  1. A Multilevel Strategy to Improve People Tracking in a Real-World Scenario

    Authors: Cristiano B. de Oliveira, Joao C. Neves, Rafael O. Ribeiro, David Menotti

    Abstract: The Palácio do Planalto, office of the President of Brazil, was invaded by protesters on January 8, 2023. Surveillance videos taken from inside the building were subsequently released by the Brazilian Supreme Court for public scrutiny. We used segments of such footage to create the UFPR-Planalto801 dataset for people tracking and re-identification in a real-world scenario. This dataset consists of…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP) 2024

    Journal ref: Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, 2024

  2. arXiv:2404.10378 [pdf, other]

    cs.CV cs.AI cs.CY cs.LG

    Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data

    Authors: Ivan DeAndres-Tame, Ruben Tolosana, Pietro Melzi, Ruben Vera-Rodriguez, Minchul Kim, Christian Rathgeb, Xiaoming Liu, Aythami Morales, Julian Fierrez, Javier Ortega-Garcia, Zhizhou Zhong, Yuge Huang, Yuxi Mi, Shouhong Ding, Shuigeng Zhou, Shuai He, Lingzhi Fu, Heng Cong, Rongyu Zhang, Zhihong Xiao, Evgeny Smirnov, Anton Pimenov, Aleksei Grigorev, Denis Timoshenko, Kaleb Mesfin Asfaw, et al. (33 additional authors not shown)

    Abstract: Synthetic data is gaining increasing relevance for training machine learning models. This is mainly motivated due to several factors such as the lack of real data and intra-class variability, time and errors produced in manual labeling, and in some cases privacy concerns, among others. This paper presents an overview of the 2nd edition of the Face Recognition Challenge in the Era of Synthetic Data…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.10476

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRw 2024)

  3. arXiv:2404.04580 [pdf, other]

    cs.CV

    SDFR: Synthetic Data for Face Recognition Competition

    Authors: Hatef Otroshi Shahreza, Christophe Ecabert, Anjith George, Alexander Unnervik, Sébastien Marcel, Nicolò Di Domenico, Guido Borghi, Davide Maltoni, Fadi Boutros, Julia Vogel, Naser Damer, Ángela Sánchez-Pérez, Enrique Mas-Candela, Jorge Calvo-Zaragoza, Bernardo Biesseck, Pedro Vidal, Roger Granada, David Menotti, Ivan DeAndres-Tame, Simone Maurizio La Cava, Sara Concas, Pietro Melzi, Ruben Tolosana, Ruben Vera-Rodriguez, Gianpaolo Perelli, et al. (3 additional authors not shown)

    Abstract: Large-scale face recognition datasets are collected by crawling the Internet and without individuals' consent, raising legal, ethical, and privacy concerns. With the recent advances in generative models, recently several works proposed generating synthetic face recognition datasets to mitigate concerns in web-crawled face recognition datasets. This paper presents the summary of the Synthetic Data…

    Submitted 9 April, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: The 18th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2024)

  4. arXiv:2311.10476 [pdf, other]

    cs.CV

    FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data

    Authors: Pietro Melzi, Ruben Tolosana, Ruben Vera-Rodriguez, Minchul Kim, Christian Rathgeb, Xiaoming Liu, Ivan DeAndres-Tame, Aythami Morales, Julian Fierrez, Javier Ortega-Garcia, Weisong Zhao, Xiangyu Zhu, Zheyu Yan, Xiao-Yu Zhang, Jinlin Wu, Zhen Lei, Suvidha Tripathi, Mahak Kothari, Md Haider Zama, Debayan Deb, Bernardo Biesseck, Pedro Vidal, Roger Granada, Guilherme Fickel, Gustavo Führ, et al. (22 additional authors not shown)

    Abstract: Despite the widespread adoption of face recognition technology around the world, and its remarkable performance on current benchmarks, there are still several challenges that must be covered in more detail. This paper offers an overview of the Face Recognition Challenge in the Era of Synthetic Data (FRCSyn) organized at WACV 2024. This is the first international challenge aiming to explore the use…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 10 pages, 1 figure, WACV 2024 Workshops

  5. Leveraging Model Fusion for Improved License Plate Recognition

    Authors: Rayson Laroca, Luiz A. Zanlorensi, Valter Estevam, Rodrigo Minetto, David Menotti

    Abstract: License Plate Recognition (LPR) plays a critical role in various applications, such as toll collection, parking management, and traffic law enforcement. Although LPR has witnessed significant advancements through the development of deep learning, there has been a noticeable lack of studies exploring the potential improvements in results by fusing the outputs from multiple recognition models. This…

    Submitted 5 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted for presentation at the Iberoamerican Congress on Pattern Recognition (CIARP) 2023

  6. Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers

    Authors: Valfride Nascimento, Rayson Laroca, Jorge de A. Lambert, William Robson Schwartz, David Menotti

    Abstract: Recent years have seen significant developments in the field of License Plate Recognition (LPR) through the integration of deep learning techniques and the increasing availability of training data. Nevertheless, reconstructing license plates (LPs) from low-resolution (LR) surveillance footage remains challenging. To address this issue, we introduce a Single-Image Super-Resolution (SISR) approach t…

    Submitted 26 May, 2023; originally announced May 2023.

    Journal ref: Computers & Graphics, vol. 113, pp. 69-76, 2023

  7. Do We Train on Test Data? The Impact of Near-Duplicates on License Plate Recognition

    Authors: Rayson Laroca, Valter Estevam, Alceu S. Britto Jr., Rodrigo Minetto, David Menotti

    Abstract: This work draws attention to the large fraction of near-duplicates in the training and test sets of datasets widely adopted in License Plate Recognition (LPR) research. These duplicates refer to images that, although different, show the same license plate. Our experiments, conducted on the two most popular datasets in the field, show a substantial decrease in recognition rate when six well-known m…

    Submitted 4 August, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2023

  8. Combining Attention Module and Pixel Shuffle for License Plate Super-Resolution

    Authors: Valfride Nascimento, Rayson Laroca, Jorge de A. Lambert, William Robson Schwartz, David Menotti

    Abstract: The License Plate Recognition (LPR) field has made impressive advances in the last decade due to novel deep learning approaches combined with the increased availability of training data. However, it still has some open issues, especially when the data come from low-resolution (LR) and low-quality images/videos, as in surveillance systems. This work focuses on license plate (LP) reconstruction in L…

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  9. Face Super-Resolution Using Stochastic Differential Equations

    Authors: Marcelo dos Santos, Rayson Laroca, Rafael O. Ribeiro, João Neves, Hugo Proença, David Menotti

    Abstract: Diffusion models have proven effective for various applications such as images, audio and graph generation. Other important applications are image super-resolution and the solution of inverse problems. More recently, some works have used stochastic differential equations (SDEs) to generalize diffusion models to continuous time. In this work, we introduce SDEs to generate super-resolution face imag…

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  10. Global Semantic Descriptors for Zero-Shot Action Recognition

    Authors: Valter Estevam, Rayson Laroca, Helio Pedrini, David Menotti

    Abstract: The success of Zero-shot Action Recognition (ZSAR) methods is intrinsically related to the nature of semantic side information used to transfer knowledge, although this aspect has not been primarily investigated in the literature. This work introduces a new ZSAR method based on the relationships of actions-objects and actions-descriptive sentences. We demonstrate that representing all object class…

    Submitted 24 September, 2022; originally announced September 2022.

    Journal ref: IEEE Signal Processing Letters, vol. 29, pp. 1843-1847, 2022

  11. A First Look at Dataset Bias in License Plate Recognition

    Authors: Rayson Laroca, Marcelo Santos, Valter Estevam, Eduardo Luz, David Menotti

    Abstract: Public datasets have played a key role in advancing the state of the art in License Plate Recognition (LPR). Although dataset bias has been recognized as a severe problem in the computer vision community, it has been largely overlooked in the LPR literature. LPR models are usually trained and evaluated separately on each dataset. In this scenario, they have often proven robust in the dataset they…

    Submitted 30 December, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  12. arXiv:2208.02760 [pdf, other]

    cs.CV cs.LG

    OCFR 2022: Competition on Occluded Face Recognition From Synthetically Generated Structure-Aware Occlusions

    Authors: Pedro C. Neto, Fadi Boutros, Joao Ribeiro Pinto, Naser Damer, Ana F. Sequeira, Jaime S. Cardoso, Messaoud Bengherabi, Abderaouf Bousnat, Sana Boucheta, Nesrine Hebbadj, Mustafa Ekrem Erakın, Uğur Demir, Hazım Kemal Ekenel, Pedro Beber de Queiroz Vidal, David Menotti

    Abstract: This work summarizes the IJCB Occluded Face Recognition Competition 2022 (IJCB-OCFR-2022) embraced by the 2022 International Joint Conference on Biometrics (IJCB 2022). OCFR-2022 attracted a total of 3 participating teams, from academia. Eventually, six valid submissions were submitted and then evaluated by the organizers. The competition was held to address the challenge of face recognition in th…

    Submitted 15 August, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: Accepted at International Joint Conference on Biometrics 2022

  13. Federated Learning Enables Big Data for Rare Cancer Boundary Detection

    Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer, et al. (254 additional authors not shown)

    Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc…

    Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

  14. arXiv:2204.07456 [pdf, other]

    cs.CV

    ORCNet: A context-based network to simultaneously segment the ocular region components

    Authors: Diego Rafael Lucio, Luiz A. Zanlorensi, Yandre Maldonado e Gomes da Costa, David Menotti

    Abstract: Accurate extraction of the Region of Interest is critical for successful ocular region-based biometrics. In this direction, we propose a new context-based segmentation approach, entitled Ocular Region Context Network (ORCNet), introducing a specific loss function, i.e., the Punish Context Loss (PC-Loss). The PC-Loss punishes the segmentation losses of a network by using a percentage difference valu…

    Submitted 15 April, 2022; originally announced April 2022.

  15. Image-based Automatic Dial Meter Reading in Unconstrained Scenarios

    Authors: Gabriel Salomon, Rayson Laroca, David Menotti

    Abstract: The replacement of analog meters with smart meters is costly, laborious, and far from complete in developing countries. The Energy Company of Parana (Copel) (Brazil) performs more than 4 million meter readings (almost entirely of non-smart devices) per month, and we estimate that 850 thousand of them are from dial meters. Therefore, an image-based automatic reading system can reduce human errors,…

    Submitted 23 October, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Journal ref: Measurement, vol. 204, p. 112025, 2022

  16. On the Cross-dataset Generalization in License Plate Recognition

    Authors: Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam, David Menotti

    Abstract: Automatic License Plate Recognition (ALPR) systems have shown remarkable performance on license plates (LPs) from multiple regions due to advances in deep learning and the increasing availability of datasets. The evaluation of deep ALPR systems is usually done within each dataset; therefore, it is questionable if such results are a reliable indicator of generalization ability. In this paper, we pr…

    Submitted 21 December, 2022; v1 submitted 1 January, 2022; originally announced January 2022.

    Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP) 2022

  17. Tell me what you see: A zero-shot action recognition method based on natural language descriptions

    Authors: Valter Estevam, Rayson Laroca, David Menotti, Helio Pedrini

    Abstract: This paper presents a novel approach to Zero-Shot Action Recognition. Recent works have explored the detection and classification of objects to obtain semantic information from videos with remarkable performance. Inspired by them, we propose using video captioning methods to extract semantic information about objects, scenes, humans, and their relationships. To the best of our knowledge, this is t…

    Submitted 11 September, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: Published at Multimedia Tools and Applications

  18. arXiv:2112.08455 [pdf, other]

    cs.CV

    Dense Video Captioning Using Unsupervised Semantic Information

    Authors: Valter Estevam, Rayson Laroca, Helio Pedrini, David Menotti

    Abstract: We introduce a method to learn unsupervised semantic visual information based on the premise that complex events (e.g., minutes) can be decomposed into simpler events (e.g., a few seconds), and that these simple events are shared across several complex events. We split a long video into short frame sequences to extract their latent representation with three-dimensional convolutional neural network…

    Submitted 15 December, 2021; originally announced December 2021.

  19. arXiv:2109.05524 [pdf, other]

    cs.CV

    A Decidability-Based Loss Function

    Authors: Pedro Silva, Gladston Moreira, Vander Freitas, Rodrigo Silva, David Menotti, Eduardo Luz

    Abstract: Nowadays, deep learning is the standard approach for a wide range of problems, including biometrics, such as face recognition and speech recognition, etc. Biometric problems often use deep learning models to extract features from images, also known as embeddings. Moreover, the loss function used during training strongly influences the quality of the generated embeddings. In this work, a loss funct…

    Submitted 11 February, 2022; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: 23 pages, 7 figures

  20. arXiv:2109.03625 [pdf, other]

    q-bio.GN cs.CE

    Computational methods for differentially expressed gene analysis from RNA-Seq: an overview

    Authors: Juliana Costa-Silva, Douglas S. Domingues, David Menotti, Mariangela Hungria, Fabricio M Lopes

    Abstract: The analysis of differential gene expression from RNA-Seq data has become a standard for several research areas mainly involving bioinformatics. The steps for the computational analysis of these data include many data types and file formats, and a wide variety of computational tools that can be applied alone or together as pipelines. This paper presents a review of differential expression analysis…

    Submitted 8 September, 2021; originally announced September 2021.

  21. Open-set Face Recognition for Small Galleries Using Siamese Networks

    Authors: Gabriel Salomon, Alceu Britto, Rafael H. Vareto, William R. Schwartz, David Menotti

    Abstract: Face recognition has been one of the most relevant and explored fields of Biometrics. In real-world applications, face recognition methods usually must deal with scenarios where not all probe individuals were seen during the training phase (open-set scenarios). Therefore, open-set face recognition is a subject of increasing interest as it deals with identifying individuals in a space where not all…

    Submitted 14 May, 2021; originally announced May 2021.

    Journal ref: 2020 International Conference on Systems, Signals and Image Processing (IWSSIP), 2020, pp. 161-166

  22. A New Periocular Dataset Collected by Mobile Devices in Unconstrained Scenarios

    Authors: Luiz A. Zanlorensi, Rayson Laroca, Diego R. Lucio, Lucas R. Santos, Alceu S. Britto Jr., David Menotti

    Abstract: Recently, ocular biometrics in unconstrained environments using images obtained at visible wavelength have gained the researchers' attention, especially with images captured by mobile devices. Periocular recognition has been demonstrated to be an alternative when the iris trait is not available due to occlusions or low image resolution. However, the periocular trait does not have the high uniquene…

    Submitted 14 November, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Journal ref: Scientific Reports, vol. 12, p. 17989, 2022

  23. arXiv:2010.16307 [pdf, other]

    cs.CV

    Automatic Counting and Identification of Train Wagons Based on Computer Vision and Deep Learning

    Authors: Rayson Laroca, Alessander Cidral Boslooper, David Menotti

    Abstract: In this work, we present a robust and efficient solution for counting and identifying train wagons using computer vision and deep learning. The proposed solution is cost-effective and can easily replace solutions based on radiofrequency identification (RFID), which are known to have high installation and maintenance costs. According to our experiments, our two-stage methodology achieves impressive…

    Submitted 30 October, 2020; originally announced October 2020.

    Comments: An article about the proposed system has been published in the October 2020 issue of Railway Gazette International, the leading business journal for the worldwide rail industry

  24. Towards Image-based Automatic Meter Reading in Unconstrained Scenarios: A Robust and Efficient Approach

    Authors: Rayson Laroca, Alessandra B. Araujo, Luiz A. Zanlorensi, Eduardo C. de Almeida, David Menotti

    Abstract: Existing approaches for image-based Automatic Meter Reading (AMR) have been evaluated on images captured in well-controlled scenarios. However, real-world meter reading presents unconstrained scenarios that are way more challenging due to dirt, various lighting conditions, scale variations, in-plane and out-of-plane rotations, among other factors. In this work, we present an end-to-end approach fo…

    Submitted 12 May, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

    Journal ref: IEEE Access, vol. 9, pp. 67569-67584, 2021

  25. Deep Learning for Image-based Automatic Dial Meter Reading: Dataset and Baselines

    Authors: Gabriel Salomon, Rayson Laroca, David Menotti

    Abstract: Smart meters enable remote and automatic electricity, water and gas consumption reading and are being widely deployed in developed countries. Nonetheless, there is still a huge number of non-smart meters in operation. Image-based Automatic Meter Reading (AMR) focuses on dealing with this type of meter readings. We estimate that the Energy Company of Paraná (Copel), in Brazil, performs more than 85…

    Submitted 8 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: Accepted for presentation at the 2020 International Joint Conference on Neural Networks (IJCNN)

  26. arXiv:2004.05717 [pdf, other]

    eess.IV cs.CV cs.LG

    Towards an Effective and Efficient Deep Learning Model for COVID-19 Patterns Detection in X-ray Images

    Authors: Eduardo Luz, Pedro Lopes Silva, Rodrigo Silva, Ludmila Silva, Gladston Moreira, David Menotti

    Abstract: Confronting the pandemic of COVID-19 is nowadays one of the most prominent challenges of the human species. A key factor in slowing down the virus propagation is the rapid diagnosis and isolation of infected patients. The standard method for COVID-19 identification, the Reverse transcription polymerase chain reaction method, is time-consuming and in short supply due to the pandemic. Thus, researc…

    Submitted 24 April, 2021; v1 submitted 12 April, 2020; originally announced April 2020.

    Comments: This is a preprint of an article published in Research on Biomedical Engineering. The final authenticated version is available online at https://doi.org/10.1007/s42600-021-00151-6

  27. arXiv:2003.00833 [pdf, other]

    cs.CV

    CNN Hyperparameter tuning applied to Iris Liveness Detection

    Authors: Gabriela Y. Kimura, Diego R. Lucio, Alceu S. Britto Jr., David Menotti

    Abstract: The iris pattern has significantly improved the biometric recognition field due to its high level of stability and uniqueness. Such physical feature has played an important role in security and other related areas. However, presentation attacks, also known as spoofing techniques, can be used to bypass the biometric system with artifacts such as printed images, artificial eyes, and textured contact…

    Submitted 12 February, 2020; originally announced March 2020.

    Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP 2020)

  28. arXiv:2002.03985 [pdf, other]

    cs.CV

    Unconstrained Periocular Recognition: Using Generative Deep Learning Frameworks for Attribute Normalization

    Authors: Luiz A. Zanlorensi, Hugo Proença, David Menotti

    Abstract: Ocular biometric systems working in unconstrained environments usually face the problem of small within-class compactness caused by the multiple factors that jointly degrade the quality of the obtained data. In this work, we propose an attribute normalization strategy based on deep learning generative frameworks, that reduces the variability of the samples used in pairwise comparisons, without red…

    Submitted 10 February, 2020; originally announced February 2020.

  29. Ocular Recognition Databases and Competitions: A Survey

    Authors: Luiz A. Zanlorensi, Rayson Laroca, Eduardo Luz, Alceu S. Britto Jr., Luiz S. Oliveira, David Menotti

    Abstract: The use of the iris and periocular region as biometric traits has been extensively investigated, mainly due to the singularity of the iris features and the use of the periocular region when the image resolution is not sufficient to extract iris information. In addition to providing information about an individual's identity, features extracted from these traits can also be explored to obtain other…

    Submitted 4 February, 2022; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: Artificial Intelligence Review, vol. 55, pp. 129-180, 2022

  30. Deep Representations for Cross-spectral Ocular Biometrics

    Authors: Luiz A. Zanlorensi, Diego R. Lucio, Alceu S. Britto Jr., Hugo Proença, David Menotti

    Abstract: One of the major challenges in ocular biometrics is the cross-spectral scenario, i.e., how to match images acquired in different wavelengths (typically visible (VIS) against near-infrared (NIR)). This article designs and extensively evaluates cross-spectral ocular verification methods, for both the closed and open-world settings, using well known deep learning representations based on the iris and…

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: This paper is a postprint of a paper submitted to and accepted for publication in IET Biometrics and is subject to Institution of Engineering and Technology Copyright. The copy of the record is available at the IET Digital Library

  31. Vehicle-Rear: A New Dataset to Explore Feature Fusion for Vehicle Identification Using Convolutional Neural Networks

    Authors: Icaro O. de Oliveira, Rayson Laroca, David Menotti, Keiko V. O. Fonseca, Rodrigo Minetto

    Abstract: This work addresses the problem of vehicle identification through non-overlapping cameras. As our main contribution, we introduce a novel dataset for vehicle identification, called Vehicle-Rear, that contains more than three hours of high-resolution videos, with accurate information about the make, model, color and year of nearly 3,000 vehicles, in addition to the position and identification of th…

    Submitted 25 July, 2021; v1 submitted 13 November, 2019; originally announced November 2019.

    Journal ref: IEEE Access, vol. 9, pp. 101065-101077, 2021

  32. Zero-Shot Action Recognition in Videos: A Survey

    Authors: Valter Estevam, Helio Pedrini, David Menotti

    Abstract: Zero-Shot Action Recognition has attracted attention in the last years and many approaches have been proposed for recognition of objects, events and actions in images and videos. There is a demand for methods that can classify instances from classes that are not present in the training of models, especially in the complex problem of automatic video understanding, since collecting, annotating and l…

    Submitted 17 November, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: Preprint

  33. An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detector

    Authors: Rayson Laroca, Luiz A. Zanlorensi, Gabriel R. Gonçalves, Eduardo Todt, William Robson Schwartz, David Menotti

    Abstract: This paper presents an efficient and layout-independent Automatic License Plate Recognition (ALPR) system based on the state-of-the-art YOLO object detector that contains a unified approach for license plate (LP) detection and layout classification to improve the recognition results using post-processing rules. The system is conceived by evaluating and optimizing different models, aiming at achiev…

    Submitted 9 March, 2021; v1 submitted 4 September, 2019; originally announced September 2019.

    Journal ref: IET Intelligent Transport Systems, vol. 15, no. 4, pp. 483-503, 2021

  34. Simultaneous Iris and Periocular Region Detection Using Coarse Annotations

    Authors: Diego R. Lucio, Rayson Laroca, Luiz A. Zanlorensi, Gladston Moreira, David Menotti

    Abstract: In this work, we propose to detect the iris and periocular regions simultaneously using coarse annotations and two well-known object detectors: YOLOv2 and Faster R-CNN. We believe coarse annotations can be used in recognition systems based on the iris and periocular regions, given the much smaller engineering effort required to manually annotate the training images. We manually made coarse annotat…

    Submitted 31 July, 2019; originally announced August 2019.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2019

  35. Convolutional Neural Networks for Automatic Meter Reading

    Authors: Rayson Laroca, Victor Barroso, Matheus A. Diniz, Gabriel R. Gonçalves, William Robson Schwartz, David Menotti

    Abstract: In this paper, we tackle Automatic Meter Reading (AMR) by leveraging the high capability of Convolutional Neural Networks (CNNs). We design a two-stage approach that employs the Fast-YOLO object detector for counter detection and evaluates three different CNN-based approaches for counter recognition. In the AMR literature, most datasets are not available to the research community since the images…

    Submitted 25 February, 2019; originally announced February 2019.

    Journal ref: Journal of Electronic Imaging 28(1), 013023 (5 February 2019)

  36. Robust Iris Segmentation Based on Fully Convolutional Networks and Generative Adversarial Networks

    Authors: Cides S. Bezerra, Rayson Laroca, Diego R. Lucio, Evair Severo, Lucas F. Oliveira, Alceu S. Britto Jr., David Menotti

    Abstract: The iris can be considered as one of the most important biometric traits due to its high degree of uniqueness. Iris-based biometrics applications depend mainly on the iris segmentation whose suitability is not robust for different environments such as near-infrared (NIR) and visible (VIS) ones. In this paper, two approaches for robust iris segmentation based on Fully Convolutional Networks (FCNs)…

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2018

  37. The Impact of Preprocessing on Deep Representations for Iris Recognition on Unconstrained Environments

    Authors: Luiz A. Zanlorensi, Eduardo Luz, Rayson Laroca, Alceu S. Britto Jr., Luiz S. Oliveira, David Menotti

    Abstract: The use of iris as a biometric trait is widely used because of its high level of distinction and uniqueness. Nowadays, one of the major research challenges relies on the recognition of iris images obtained in visible spectrum under unconstrained environments. In this scenario, the acquired iris are affected by capture distance, rotation, blur, motion blur, low contrast and specular reflection, cre…

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2018

  38. Fully Convolutional Networks and Generative Adversarial Networks Applied to Sclera Segmentation

    Authors: Diego R. Lucio, Rayson Laroca, Evair Severo, Alceu S. Britto Jr., David Menotti

    Abstract: Due to the world's demand for security systems, biometrics can be seen as an important topic of research in computer vision. One of the biometric forms that has been gaining attention is the recognition based on sclera. The initial and paramount step for performing this type of recognition is the segmentation of the region of interest, i.e. the sclera. In this context, two approaches for such task…

    Submitted 9 July, 2018; v1 submitted 22 June, 2018; originally announced June 2018.

    Comments: Accepted for presentation at the IEEE International Conference on Biometrics: Theory, Applications, and Systems (BTAS) 2018

  39. A Benchmark for Iris Location and a Deep Learning Detector Evaluation

    Authors: Evair Severo, Rayson Laroca, Cides S. Bezerra, Luiz A. Zanlorensi, Daniel Weingaertner, Gladston Moreira, David Menotti

    Abstract: The iris is considered as the biometric trait with the highest unique probability. The iris location is an important task for biometrics systems, affecting directly the results obtained in specific applications such as iris recognition, spoofing and contact lenses detection, among others. This work defines the iris location problem as the delimitation of the smallest squared window that encompasse…

    Submitted 30 April, 2018; v1 submitted 3 March, 2018; originally announced March 2018.

    Comments: Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2018

  40. A Robust Real-Time Automatic License Plate Recognition Based on the YOLO Detector

    Authors: Rayson Laroca, Evair Severo, Luiz A. Zanlorensi, Luiz S. Oliveira, Gabriel Resende Gonçalves, William Robson Schwartz, David Menotti

    Abstract: Automatic License Plate Recognition (ALPR) has been a frequent topic of research due to many practical applications. However, many of the current solutions are still not robust in real-world situations, commonly depending on many constraints. This paper presents a robust and efficient ALPR system based on the state-of-the-art YOLO object detector. The Convolutional Neural Networks (CNNs) are train…

    Submitted 28 April, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2018

  41. Benchmark for License Plate Character Segmentation

    Authors: Gabriel Resende Gonçalves, Sirlene Pio Gomes da Silva, David Menotti, William Robson Schwartz

    Abstract: Automatic License Plate Recognition (ALPR) has been the focus of much research in the past years. In general, ALPR is divided into the following problems: detection of on-track vehicles, license plate detection, segmentation of license plate characters and optical character recognition (OCR). Even though commercial solutions are available for controlled acquisition conditions, e.g., the entrance…

    Submitted 31 October, 2016; v1 submitted 11 July, 2016; originally announced July 2016.

    Comments: 32 pages, single column

    Journal ref: J. Electron. Imaging. 25(5), 053034 (Oct 24, 2016)

  42. Deep Representations for Iris, Face, and Fingerprint Spoofing Detection

    Authors: David Menotti, Giovani Chiachia, Allan Pinto, William Robson Schwartz, Helio Pedrini, Alexandre Xavier Falcao, Anderson Rocha

    Abstract: Biometrics systems have significantly improved person identification and authentication, playing an important role in personal, national, and global security. However, these systems might be deceived (or "spoofed") and, despite the recent advances in spoofing detection, current solutions often rely on domain knowledge, specific biometric reading systems, and attack types. We assume a very limited…

    Submitted 29 January, 2015; v1 submitted 8 October, 2014; originally announced October 2014.

    Comments: Pre-print of article that will appear in the IEEE Transactions on Information Forensics and Security (T.IFS), Special Issue on Biometric Spoofing and Countermeasures, vol 10, n. 4, April 2015

  43. Brazilian License Plate Detection Using Histogram of Oriented Gradients and Sliding Windows

    Authors: R. F. Prates, G. Cámara-Chávez, William R. Schwartz, D. Menotti

    Abstract: Due to the increasing need for automatic traffic monitoring, vehicle license plate detection is of high interest to perform automatic toll collection, traffic law enforcement, parking lot access control, among others. In this paper, a sliding window approach based on Histogram of Oriented Gradients (HOG) features is used for Brazilian license plate detection. This approach consists in scanning t…

    Submitted 9 January, 2014; originally announced January 2014.

    Journal ref: International Journal of Computer Science & Information Technology (IJCSIT) Vol 5, No 6, pp. 39-52, December 2013