Skip to main content

Showing 1–42 of 42 results for author: Avila, S

Searching in archive cs. Search in all archives.
.
  1. Gender Bias Detection in Court Decisions: A Brazilian Case Study

    Authors: Raysa Benatti, Fabiana Severi, Sandra Avila, Esther Luna Colombini

    Abstract: Data derived from the realm of the social sciences is often produced in digital text form, which motivates its use as a source for natural language processing methods. Researchers and practitioners have developed and relied on artificial intelligence techniques to collect, process, and analyze documents in the legal field, especially for tasks such as text summarization and classification. While i… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 27 pages; 2 figures; 6 tables. To appear in the proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT 24), June 3 to 6, 2024, Rio de Janeiro, Brazil

  2. arXiv:2405.20420  [pdf, other

    cs.LG cs.CV

    Back to the Basics on Predicting Transfer Performance

    Authors: Levy Chaves, Eduardo Valle, Alceu Bissoto, Sandra Avila

    Abstract: In the evolving landscape of deep learning, selecting the best pre-trained models from a growing number of choices is a challenge. Transferability scorers propose alleviating this scenario, but their recent proliferation, ironically, poses the challenge of their own assessment. In this work, we propose both robust benchmark guidelines for transferability scorers, and a well-founded technique to co… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 15 pages, 3 figures, 2 tables

  3. arXiv:2403.01183  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery

    Authors: Pedro H. V. Valois, João Macedo, Leo S. F. Ribeiro, Jefersson A. dos Santos, Sandra Avila

    Abstract: Crime in the 21st century is split into a virtual and real world. However, the former has become a global menace to people's well-being and security in the latter. The challenges it presents must be faced with unified global cooperation, and we must rely more than ever on automated yet trustworthy tools to combat the ever-growing nature of online offenses. Over 10 million child sexual abuse report… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 13 pages, 5 figures, 4 tables. Under review

  4. arXiv:2402.03353  [pdf

    q-fin.ST cs.LG math.FA math.NA

    Tweet Influence on Market Trends: Analyzing the Impact of Social Media Sentiment on Biotech Stocks

    Authors: C. Sarai R. Avila

    Abstract: This study investigates the relationship between tweet sentiment across diverse categories: news, company opinions, CEO opinions, competitor opinions, and stock market behavior in the biotechnology sector, with a focus on understanding the impact of social media discourse on investor sentiment and decision-making processes. We analyzed historical stock market data for ten of the largest and most i… ▽ More

    Submitted 26 January, 2024; originally announced February 2024.

    Comments: This submission includes 51 pages and 24 figures

    MSC Class: 62P05; 91G70; 62H30; 91B84; 68T05 ACM Class: I.2.7; I.2.6; K.4.1; A.0; J.1

  5. arXiv:2310.13683  [pdf, other

    cs.LG

    CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages

    Authors: Gabriel Oliveira dos Santos, Diego A. B. Moreira, Alef Iury Ferreira, Jhessica Silva, Luiz Pereira, Pedro Bueno, Thiago Sousa, Helena Maia, Nádia Da Silva, Esther Colombini, Helio Pedrini, Sandra Avila

    Abstract: This work introduces CAPIVARA, a cost-efficient framework designed to enhance the performance of multilingual CLIP models in low-resource languages. While CLIP has excelled in zero-shot vision-language tasks, the resource-intensive nature of model training remains challenging. Many datasets lack linguistic diversity, featuring solely English descriptions for images. CAPIVARA addresses this by augm… ▽ More

    Submitted 23 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

  6. Assessing the Generalizability of Deep Neural Networks-Based Models for Black Skin Lesions

    Authors: Luana Barros, Levy Chaves, Sandra Avila

    Abstract: Melanoma is the most severe type of skin cancer due to its ability to cause metastasis. It is more common in black people, often affecting acral regions: palms, soles, and nails. Deep neural networks have shown tremendous potential for improving clinical care and skin cancer diagnosis. Nevertheless, prevailing studies predominantly rely on datasets of white skin tones, neglecting to report diagnos… ▽ More

    Submitted 25 January, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: 18 pages, 3 figures, 7 tables. Accepted at CIARP 2023

  7. arXiv:2308.07444  [pdf, other

    cs.CV cs.AI

    The Performance of Transferability Metrics does not Translate to Medical Tasks

    Authors: Levy Chaves, Alceu Bissoto, Eduardo Valle, Sandra Avila

    Abstract: Transfer learning boosts the performance of medical image analysis by enabling deep learning (DL) on small datasets through the knowledge acquired from large ones. As the number of DL architectures explodes, exhaustively attempting all candidates becomes unfeasible, motivating cheaper alternatives for choosing them. Transferability scoring methods emerge as an enticing solution, allowing to effici… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 10 pages, 3 figures. Accepted at the DART workshop @ MICCAI 2023

  8. arXiv:2308.05595  [pdf, other

    cs.CV

    Test-Time Selection for Robust Skin Lesion Analysis

    Authors: Alceu Bissoto, Catarina Barata, Eduardo Valle, Sandra Avila

    Abstract: Skin lesion analysis models are biased by artifacts placed during image acquisition, which influence model predictions despite carrying no clinical information. Solutions that address this problem by regularizing models to prevent learning those spurious features achieve only partial success, and existing test-time debiasing techniques are inappropriate for skin lesion analysis due to either makin… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: Accepted at ISIC Workshop @ MICCAI 2023

  9. arXiv:2307.01753  [pdf, other

    astro-ph.CO cs.LG physics.comp-ph physics.data-an

    Local primordial non-Gaussianity from the large-scale clustering of photometric DESI luminous red galaxies

    Authors: Mehdi Rezaie, Ashley J. Ross, Hee-Jong Seo, Hui Kong, Anna Porredon, Lado Samushia, Edmond Chaussidon, Alex Krolewski, Arnaud de Mattia, Florian Beutler, Jessica Nicole Aguilar, Steven Ahlen, Shadab Alam, Santiago Avila, Benedict Bahr-Kalus, Jose Bermejo-Climent, David Brooks, Todd Claybaugh, Shaun Cole, Kyle Dawson, Axel de la Macorra, Peter Doel, Andreu Font-Ribera, Jaime E. Forero-Romero, Satya Gontcho A Gontcho , et al. (24 additional authors not shown)

    Abstract: We use angular clustering of luminous red galaxies from the Dark Energy Spectroscopic Instrument (DESI) imaging surveys to constrain the local primordial non-Gaussianity parameter $\fnl$. Our sample comprises over 12 million targets, covering 14,000 square degrees of the sky, with redshifts in the range $0.2< z < 1.35$. We identify Galactic extinction, survey depth, and astronomical seeing as the… ▽ More

    Submitted 25 June, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 21 pages, 17 figures, 7 tables (Appendix excluded). Published in MNRAS

  10. arXiv:2305.05807  [pdf, other

    cs.CV cs.AI cs.LG

    Even Small Correlation and Diversity Shifts Pose Dataset-Bias Issues

    Authors: Alceu Bissoto, Catarina Barata, Eduardo Valle, Sandra Avila

    Abstract: Distribution shifts are common in real-world datasets and can affect the performance and reliability of deep learning models. In this paper, we study two types of distribution shifts: diversity shifts, which occur when test samples exhibit patterns unseen during training, and correlation shifts, which occur when test data present a different correlation between seen invariant and spurious features… ▽ More

    Submitted 21 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Paper under consideration at Pattern Recognition Letters

  11. arXiv:2211.00498  [pdf, other

    cs.CY

    Should I disclose my dataset? Caveats between reproducibility and individual data rights

    Authors: Raysa M. Benatti, Camila M. L. Villarroel, Sandra Avila, Esther L. Colombini, Fabiana C. Severi

    Abstract: Natural language processing techniques have helped domain experts solve legal problems. Digital availability of court documents increases possibilities for researchers, who can access them as a source for building datasets -- whose disclosure is aligned with good reproducibility practices in computational research. Large and digitized court systems, such as the Brazilian one, are prone to be explo… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 10 pages, 2 figures. To be published in the 4th Workshop on Natural Legal Language Processing (NLLP 2022), co-located with the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

    ACM Class: K.4.1; K.5.0

  12. arXiv:2208.09756  [pdf, other

    cs.CV cs.AI

    Artifact-Based Domain Generalization of Skin Lesion Models

    Authors: Alceu Bissoto, Catarina Barata, Eduardo Valle, Sandra Avila

    Abstract: Deep Learning failure cases are abundant, particularly in the medical area. Recent studies in out-of-distribution generalization have advanced considerably on well-controlled synthetic datasets, but they do not represent medical imaging contexts. We propose a pipeline that relies on artifacts annotation to enable generalization evaluation and debiasing for the challenging skin lesion analysis cont… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

    Comments: Accepted to the ISIC Skin Image Analysis Workshop @ ECCV 2022

  13. arXiv:2206.00356  [pdf, other

    eess.IV cs.CV cs.LG

    A Survey on Deep Learning for Skin Lesion Segmentation

    Authors: Zahra Mirikharaji, Kumar Abhishek, Alceu Bissoto, Catarina Barata, Sandra Avila, Eduardo Valle, M. Emre Celebi, Ghassan Hamarneh

    Abstract: Skin cancer is a major public health problem that could benefit from computer-aided diagnosis to reduce the burden of this common disease. Skin lesion segmentation from images is an important step toward achieving this goal. However, the presence of natural and artificial artifacts (e.g., hair and air bubbles), intrinsic factors (e.g., lesion shape and contrast), and variations in image acquisitio… ▽ More

    Submitted 20 June, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Published in Medical Image Analysis (2023); 55 pages, 10 figures; Mirikharaji and Abhishek: Joint first authors; Celebi and Hamarneh: Joint senior authors

    Journal ref: Medical Image Analysis (2023): 102863

  14. arXiv:2204.14110  [pdf, other

    cs.CV cs.CY

    Seeing without Looking: Analysis Pipeline for Child Sexual Abuse Datasets

    Authors: Camila Laranjeira, João Macedo, Sandra Avila, Jefersson A. dos Santos

    Abstract: The online sharing and viewing of Child Sexual Abuse Material (CSAM) are growing fast, such that human experts can no longer handle the manual inspection. However, the automatic classification of CSAM is a challenging field of research, largely due to the inaccessibility of target data that is - and should forever be - private and in sole possession of law enforcement agencies. To aid researchers… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: FAccT 2022 - 5th Conference on Fairness, Accountability and Transparency

    MSC Class: 68U99 ACM Class: J.4

  15. arXiv:2110.00881  [pdf, other

    cs.CV cs.LG

    Weakly Supervised Attention-based Models Using Activation Maps for Citrus Mite and Insect Pest Classification

    Authors: Edson Bollis, Helena Maia, Helio Pedrini, Sandra Avila

    Abstract: Citrus juices and fruits are commodities with great economic potential in the international market, but productivity losses caused by mites and other pests are still far from being a good mark. Despite the integrated pest mechanical aspect, only a few works on automatic classification have handled images with orange mite characteristics, which means tiny and noisy regions of interest. On the compu… ▽ More

    Submitted 2 October, 2021; originally announced October 2021.

    Comments: 18 pages, 9 figures, 5 tables. Paper under review

  16. arXiv:2109.13701  [pdf, other

    cs.CV cs.CL

    CIDEr-R: Robust Consensus-based Image Description Evaluation

    Authors: Gabriel Oliveira dos Santos, Esther Luna Colombini, Sandra Avila

    Abstract: This paper shows that CIDEr-D, a traditional evaluation metric for image description, does not work properly on datasets where the number of words in the sentence is significantly greater than those in the MS COCO Captions dataset. We also show that CIDEr-D has performance hampered by the lack of multiple reference sentences and high variance of sentence length. To bypass this problem, we introduc… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: Paper accepted to the 7th Workshop on Noisy User-generated Text (W-NUT). 10 pages, 4 figures, 3 tables

  17. arXiv:2106.09229  [pdf, other

    cs.CV

    An Evaluation of Self-Supervised Pre-Training for Skin-Lesion Analysis

    Authors: Levy Chaves, Alceu Bissoto, Eduardo Valle, Sandra Avila

    Abstract: Self-supervised pre-training appears as an advantageous alternative to supervised pre-trained for transfer learning. By synthesizing annotations on pretext tasks, self-supervision allows to pre-train models on large amounts of pseudo-labels before fine-tuning them on the target task. In this work, we assess self-supervision for the diagnosis of skin lesions, comparing three self-supervised pipelin… ▽ More

    Submitted 20 August, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: 18 pages, 3 figures. Accepted at Seventh ISIC Skin Image Analysis Workshop @ECCV 2022

  18. arXiv:2104.10603  [pdf, other

    eess.IV cs.CV

    GAN-Based Data Augmentation and Anonymization for Skin-Lesion Analysis: A Critical Review

    Authors: Alceu Bissoto, Eduardo Valle, Sandra Avila

    Abstract: Despite the growing availability of high-quality public datasets, the lack of training samples is still one of the main challenges of deep-learning for skin lesion analysis. Generative Adversarial Networks (GANs) appear as an enticing alternative to alleviate the issue, by synthesizing samples indistinguishable from real images, with a plethora of works employing them for medical applications. Nev… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted to the ISIC Skin Image Analysis Workshop @ CVPR 2021

  19. arXiv:2104.02472  [pdf, other

    cs.LG eess.IV eess.SP

    Depth Evaluation for Metal Surface Defects by Eddy Current Testing using Deep Residual Convolutional Neural Networks

    Authors: Tian Meng, Yang Tao, Ziqi Chen, Jorge R. Salas Avila, Qiaoye Ran, Yuchun Shao, Ruochen Huang, Yuedong Xie, Qian Zhao, Zhijie Zhang, Hujun Yin, Anthony J. Peyton, Wuliang Yin

    Abstract: Eddy current testing (ECT) is an effective technique in the evaluation of the depth of metal surface defects. However, in practice, the evaluation primarily relies on the experience of an operator and is often carried out by manual inspection. In this paper, we address the challenges of automatic depth evaluation of metal surface defects by virtual of state-of-the-art deep learning (DL) techniques… ▽ More

    Submitted 8 March, 2021; originally announced April 2021.

  20. arXiv:2103.11474  [pdf, other

    cs.CV cs.CL

    #PraCegoVer: A Large Dataset for Image Captioning in Portuguese

    Authors: Gabriel Oliveira dos Santos, Esther Luna Colombini, Sandra Avila

    Abstract: Automatically describing images using natural sentences is an important task to support visually impaired people's inclusion onto the Internet. It is still a big challenge that requires understanding the relation of the objects present in the image and their attributes and actions they are involved in. Then, visual interpretation methods are needed, but linguistic models are also necessary to verb… ▽ More

    Submitted 27 October, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: 23 pages, 21 figures, 2 tables

  21. arXiv:2009.12856  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG

    Machine Learning for Searching the Dark Energy Survey for Trans-Neptunian Objects

    Authors: B. Henghes, O. Lahav, D. W. Gerdes, E. Lin, R. Morgan, T. M. C. Abbott, M. Aguena, S. Allam, J. Annis, S. Avila, E. Bertin, D. Brooks, D. L. Burke, A. CarneroRosell, M. CarrascoKind, J. Carretero, C. Conselice, M. Costanzi, L. N. da Costa, J. DeVicente, S. Desai, H. T. Diehl, P. Doel, S. Everett, I. Ferrero , et al. (34 additional authors not shown)

    Abstract: In this paper we investigate how implementing machine learning could improve the efficiency of the search for Trans-Neptunian Objects (TNOs) within Dark Energy Survey (DES) data when used alongside orbit fitting. The discovery of multiple TNOs that appear to show a similarity in their orbital parameters has led to the suggestion that one or more undetected planets, an as yet undiscovered "Planet 9… ▽ More

    Submitted 10 December, 2020; v1 submitted 27 September, 2020; originally announced September 2020.

    Comments: Published in PASP, 16 pages, 6 figures

    Journal ref: PASP 133 014501 (2021)

  22. arXiv:2004.13856  [pdf, other

    cs.CV

    Less is More: Sample Selection and Label Conditioning Improve Skin Lesion Segmentation

    Authors: Vinicius Ribeiro, Sandra Avila, Eduardo Valle

    Abstract: Segmenting skin lesions images is relevant both for itself and for assisting in lesion classification, but suffers from the challenge in obtaining annotated data. In this work, we show that segmentation may improve with less data, by selecting the training samples with best inter-annotator agreement, and conditioning the ground-truth masks to remove excessive detail. We perform an exhaustive exper… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: Accepted to the ISIC Skin Image Analysis Workshop @ CVPR 2020

  23. arXiv:2004.11457  [pdf, other

    cs.CV

    Debiasing Skin Lesion Datasets and Models? Not So Fast

    Authors: Alceu Bissoto, Eduardo Valle, Sandra Avila

    Abstract: Data-driven models are now deployed in a plethora of real-world applications - including automated diagnosis - but models learned from data risk learning biases from that same data. When models learn spurious correlations not found in real-world situations, their deployment for critical tasks, such as medical decisions, can be catastrophic. In this work we address this issue for skin-lesion classi… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: Accepted to the ISIC Skin Image Analysis Workshop @ CVPR 2020

  24. arXiv:2004.11252  [pdf, other

    cs.CV cs.LG

    Weakly Supervised Learning Guided by Activation Map** Applied to a Novel Citrus Pest Benchmark

    Authors: Edson Bollis, Helio Pedrini, Sandra Avila

    Abstract: Pests and diseases are relevant factors for production losses in agriculture and, therefore, promote a huge investment in the prevention and detection of its causative agents. In many countries, Integrated Pest Management is the most widely used process to prevent and mitigate the damages caused by pests and diseases in citrus crops. However, its results are credited by humans who visually inspect… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: Accepted to The 1st International Workshop on Agriculture-Vision Workshop - CVPR 2020

  25. arXiv:1910.13076  [pdf, other

    cs.CV cs.LG

    The Six Fronts of the Generative Adversarial Networks

    Authors: Alceu Bissoto, Eduardo Valle, Sandra Avila

    Abstract: Generative Adversarial Networks fostered a newfound interest in generative models, resulting in a swelling wave of new works that new-coming researchers may find formidable to surf. In this paper, we intend to help those researchers, by splitting that incoming wave into six "fronts": Architectural Contributions, Conditional Techniques, Normalization and Constraint Contributions, Loss Functions, Im… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

  26. Grape detection, segmentation and tracking using deep neural networks and three-dimensional association

    Authors: Thiago T. Santos, Leonardo L. de Souza, Andreza A. dos Santos, Sandra Avila

    Abstract: Agricultural applications such as yield prediction, precision agriculture and automated harvesting need systems able to infer the crop state from low-cost sensing devices. Proximal sensing using affordable cameras combined with computer vision has seen a promising alternative, strengthened after the advent of convolutional neural networks (CNNs) as an alternative for challenging pattern recognitio… ▽ More

    Submitted 7 February, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Journal ref: Computers and Electronics in Agriculture, 170, 105-247 (2020)

  27. arXiv:1906.02415  [pdf, other

    cs.CV

    Handling Inter-Annotator Agreement for Automated Skin Lesion Segmentation

    Authors: Vinicius Ribeiro, Sandra Avila, Eduardo Valle

    Abstract: In this work, we explore the issue of the inter-annotator agreement for training and evaluating automated segmentation of skin lesions. We explore what different degrees of agreement represent, and how they affect different use cases for segmentation. We also evaluate how conditioning the ground truths using different (but very simple) algorithms may help to enhance agreement and may be appropriat… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: 10 pages, 5 images

  28. arXiv:1904.12724  [pdf, other

    cs.CV

    Solo or Ensemble? Choosing a CNN Architecture for Melanoma Classification

    Authors: Fábio Perez, Sandra Avila, Eduardo Valle

    Abstract: Convolutional neural networks (CNNs) deliver exceptional results for computer vision, including medical image analysis. With the growing number of available architectures, picking one over another is far from obvious. Existing art suggests that, when performing transfer learning, the performance of CNN architectures on ImageNet correlates strongly with their performance on target tasks. We evaluat… ▽ More

    Submitted 29 April, 2019; originally announced April 2019.

    Comments: ISIC Skin Image Analysis Workshop @ CVPR 2019

  29. arXiv:1904.08910  [pdf, other

    cs.CV

    Combating the Elsagate phenomenon: Deep learning architectures for disturbing cartoons

    Authors: Akari Ishikawa, Edson Bollis, Sandra Avila

    Abstract: Watching cartoons can be useful for children's intellectual, social and emotional development. However, the most popular video sharing platform today provides many videos with Elsagate content. Elsagate is a phenomenon that depicts childhood characters in disturbing circumstances (e.g., gore, toilet humor, drinking urine, stealing). Even with this threat easily available for children, there is no… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: 6 pages, 5 figures, 2 tables. Paper accepted at 7th IAPR/IEEE International Workshop on Biometrics and Forensics (IWBF)

  30. arXiv:1904.08818  [pdf, other

    cs.CV

    (De)Constructing Bias on Skin Lesion Datasets

    Authors: Alceu Bissoto, Michel Fornaciali, Eduardo Valle, Sandra Avila

    Abstract: Melanoma is the deadliest form of skin cancer. Automated skin lesion analysis plays an important role for early detection. Nowadays, the ISIC Archive and the Atlas of Dermoscopy dataset are the most employed skin lesion sources to benchmark deep-learning based tools. However, all datasets contain biases, often unintentional, due to how they were acquired and annotated. Those biases distort the per… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: 9 pages, 6 figures. Paper accepted at 2019 ISIC Skin Image Anaylsis Workshop @ IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

  31. Skin Lesion Synthesis with Generative Adversarial Networks

    Authors: Alceu Bissoto, Fábio Perez, Eduardo Valle, Sandra Avila

    Abstract: Skin cancer is by far the most common type of cancer. Early detection is the key to increase the chances for successful treatment significantly. Currently, Deep Neural Networks are the state-of-the-art results on automated skin cancer classification. To push the results further, we need to address the lack of annotated data, which is expensive and require much effort from specialists. To bypass th… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

    Comments: Conference: ISIC Skin Image Analysis Workshop and Challenge @ MICCAI 2018

  32. Data Augmentation for Skin Lesion Analysis

    Authors: Fábio Perez, Cristina Vasconcelos, Sandra Avila, Eduardo Valle

    Abstract: Deep learning models show remarkable results in automated skin lesion analysis. However, these models demand considerable amounts of data, while the availability of annotated skin lesion images is often limited. Data augmentation can expand the training dataset by transforming input images. In this work, we investigate the impact of 13 data augmentation scenarios for melanoma classification traine… ▽ More

    Submitted 5 September, 2018; originally announced September 2018.

    Comments: 8 pages, 3 figures, to be presented on ISIC Skin Image Analysis Workshop

  33. arXiv:1808.08480  [pdf, ps, other

    cs.CV

    Deep-Learning Ensembles for Skin-Lesion Segmentation, Analysis, Classification: RECOD Titans at ISIC Challenge 2018

    Authors: Alceu Bissoto, Fábio Perez, Vinícius Ribeiro, Michel Fornaciali, Sandra Avila, Eduardo Valle

    Abstract: This extended abstract describes the participation of RECOD Titans in parts 1 to 3 of the ISIC Challenge 2018 "Skin Lesion Analysis Towards Melanoma Detection" (MICCAI 2018). Although our team has a long experience with melanoma classification and moderate experience with lesion segmentation, the ISIC Challenge 2018 was the very first time we worked on lesion attribute detection. For each task we… ▽ More

    Submitted 25 August, 2018; originally announced August 2018.

  34. arXiv:1711.00441  [pdf, other

    cs.CV

    Data, Depth, and Design: Learning Reliable Models for Skin Lesion Analysis

    Authors: Eduardo Valle, Michel Fornaciali, Afonso Menegola, Julia Tavares, Flávia Vasques Bittencourt, Lin Tzy Li, Sandra Avila

    Abstract: Deep learning fostered a leap ahead in automated skin lesion analysis in the last two years. Those models are expensive to train and difficult to parameterize. Objective: We investigate methodological issues for designing and evaluating deep learning models for skin lesion analysis. We explore 10 choices faced by researchers: use of transfer learning, model architecture, train dataset, image resol… ▽ More

    Submitted 17 August, 2019; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 12 pages, 6 figures, 3 tables. Article accepted at Neurocomputing

  35. Knowledge Transfer for Melanoma Screening with Deep Learning

    Authors: Afonso Menegola, Michel Fornaciali, Ramon Pires, Flávia Vasques Bittencourt, Sandra Avila, Eduardo Valle

    Abstract: Knowledge transfer impacts the performance of deep learning -- the state of the art for image classification tasks, including automated melanoma screening. Deep learning's greed for large amounts of training data poses a challenge for medical tasks, which we can alleviate by recycling knowledge from models trained on different tasks, in a scheme called transfer learning. Although much of the best… ▽ More

    Submitted 21 March, 2017; originally announced March 2017.

    Comments: 4 pages

  36. arXiv:1703.04819  [pdf, other

    cs.CV

    RECOD Titans at ISIC Challenge 2017

    Authors: Afonso Menegola, Julia Tavares, Michel Fornaciali, Lin Tzy Li, Sandra Avila, Eduardo Valle

    Abstract: This extended abstract describes the participation of RECOD Titans in parts 1 and 3 of the ISIC Challenge 2017 "Skin Lesion Analysis Towards Melanoma Detection" (ISBI 2017). Although our team has a long experience with melanoma classification, the ISIC Challenge 2017 was the very first time we worked on skin-lesion segmentation. For part 1 (segmentation), our final submission used four of our mode… ▽ More

    Submitted 14 March, 2017; originally announced March 2017.

    Comments: 5 pages

  37. arXiv:1609.01228  [pdf, ps, other

    cs.CV

    Towards Automated Melanoma Screening: Exploring Transfer Learning Schemes

    Authors: Afonso Menegola, Michel Fornaciali, Ramon Pires, Sandra Avila, Eduardo Valle

    Abstract: Deep learning is the current bet for image classification. Its greed for huge amounts of annotated data limits its usage in medical imaging context. In this scenario transfer learning appears as a prominent solution. In this report we aim to clarify how transfer learning schemes may influence classification results. We are particularly focused in the automated melanoma screening problem, a case of… ▽ More

    Submitted 5 September, 2016; originally announced September 2016.

  38. A Mid-level Video Representation based on Binary Descriptors: A Case Study for Pornography Detection

    Authors: Carlos Caetano, Sandra Avila, William Robson Schwartz, Silvio Jamil F. Guimarães, Arnaldo de A. Araújo

    Abstract: With the growing amount of inappropriate content on the Internet, such as pornography, arises the need to detect and filter such material. The reason for this is given by the fact that such content is often prohibited in certain environments (e.g., schools and workplaces) or for certain publics (e.g., children). In recent years, many works have been mainly focused on detecting pornographic images… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

    Comments: Manuscript accepted at Elsevier Neurocomputing

  39. Deep Neural Networks Under Stress

    Authors: Micael Carvalho, Matthieu Cord, Sandra Avila, Nicolas Thome, Eduardo Valle

    Abstract: In recent years, deep architectures have been used for transfer learning with state-of-the-art performance in many datasets. The properties of their features remain, however, largely unstudied under the transfer perspective. In this work, we present an extensive analysis of the resiliency of feature vectors extracted from deep models, with special focus on the trade-off between performance and com… ▽ More

    Submitted 23 May, 2016; v1 submitted 11 May, 2016; originally announced May 2016.

    Comments: This article corresponds to the accepted version at IEEE ICIP 2016. We will link the DOI as soon as it is available

  40. arXiv:1604.04024  [pdf, other

    cs.CV

    Towards Automated Melanoma Screening: Proper Computer Vision & Reliable Results

    Authors: Michel Fornaciali, Micael Carvalho, Flávia Vasques Bittencourt, Sandra Avila, Eduardo Valle

    Abstract: In this paper we survey, analyze and criticize current art on automated melanoma screening, reimplementing a baseline technique, and proposing two novel ones. Melanoma, although highly curable when detected early, ends as one of the most dangerous types of cancer, due to delayed diagnosis and treatment. Its incidence is soaring, much faster than the number of trained professionals able to diagnose… ▽ More

    Submitted 6 May, 2016; v1 submitted 13 April, 2016; originally announced April 2016.

    Comments: Minor corrections on State of the Art and Conclusion

  41. arXiv:1511.06704  [pdf, other

    cs.CV

    Semantic Diversity versus Visual Diversity in Visual Dictionaries

    Authors: Otávio A. B. Penatti, Sandra Avila, Eduardo Valle, Ricardo da S. Torres

    Abstract: Visual dictionaries are a critical component for image classification/retrieval systems based on the bag-of-visual-words (BoVW) model. Dictionaries are usually learned without supervision from a training set of images sampled from the collection of interest. However, for large, general-purpose, dynamic image collections (e.g., the Web), obtaining a representative sample in terms of semantic concep… ▽ More

    Submitted 20 November, 2015; originally announced November 2015.

  42. arXiv:1101.2427  [pdf

    cs.CV cs.SI

    Content-Based Filtering for Video Sharing Social Networks

    Authors: Eduardo Valle, Sandra de Avila, Antonio da Luz Jr., Fillipe de Souza, Marcelo Coelho, Arnaldo Araújo

    Abstract: In this paper we compare the use of several features in the task of content filtering for video social networks, a very challenging task, not only because the unwanted content is related to very high-level semantic concepts (e.g., pornography, violence, etc.) but also because videos from social networks are extremely assorted, preventing the use of constrained a priori information. We propose a si… ▽ More

    Submitted 12 January, 2011; originally announced January 2011.

    ACM Class: I.5.4; I.4.8