Showing 1–22 of 22 results for author: Goncalves, S

Searching in archive cs.
  1. arXiv:2407.06124  [pdf, other]

    cs.LG cs.CV

    Structured Generations: Using Hierarchical Clusters to guide Diffusion Models

    Authors: Jorge da Silva Goncalves, Laura Manduchi, Moritz Vandenhirtz, Julia Vogt

    Abstract: This paper introduces Diffuse-TreeVAE, a deep generative model that integrates hierarchical clustering into the framework of Denoising Diffusion Probabilistic Models (DDPMs). The proposed approach generates new images by sampling from a root embedding of a learned latent tree VAE-based structure; it then propagates through hierarchical paths and utilizes a second-stage DDPM to refine and generate…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 8 pages, 7 figures, Structured Probabilistic Inference & Generative Modeling workshop of ICML 2024

  2. arXiv:2407.02626  [pdf]

    cs.DB

    The text2term tool to map free-text descriptions of biomedical terms to ontologies

    Authors: Rafael S. Gonçalves, Jason Payne, Amelia Tan, Carmen Benitez, Jamie Haddock, Robert Gentleman

    Abstract: There is an ongoing need for scalable tools to aid researchers in both retrospective and prospective standardization of discrete entity types -- such as disease names, cell types or chemicals -- that are used in metadata associated with biomedical data. When metadata are not well-structured or precise, the associated data are harder to find and are often burdensome to reuse, analyze or integrate w…

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2311.17771  [pdf, other]

    cs.CL

    Supervising the Centroid Baseline for Extractive Multi-Document Summarization

    Authors: Simão Gonçalves, Gonçalo Correia, Diogo Pernes, Afonso Mendes

    Abstract: The centroid method is a simple approach for extractive multi-document summarization and many improvements to its pipeline have been proposed. We further refine it by adding a beam search process to the sentence selection and also a centroid estimation attention model that leads to improved results. We demonstrate this in several multi-document summarization datasets, including in a multilingual s…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted at "The 4th New Frontiers in Summarization (with LLMs) Workshop"

  4. arXiv:2304.08457  [pdf, other]

    physics.soc-ph cs.SI physics.data-an

    Deep Learning Criminal Networks

    Authors: Haroldo V. Ribeiro, Diego D. Lopes, Arthur A. B. Pessa, Alvaro F. Martins, Bruno R. da Cunha, Sebastian Goncalves, Ervin K. Lenzi, Quentin S. Hanley, Matjaz Perc

    Abstract: Recent advances in deep learning methods have enabled researchers to develop and apply algorithms for the analysis and modeling of complex networks. These advances have sparked a surge of interest at the interface between network science and machine learning. Despite this, the use of machine learning methods to investigate criminal networks remains surprisingly scarce. Here, we explore the potenti…

    Submitted 4 June, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 14 two-column pages, 5 figures

    Journal ref: Chaos, Solitons & Fractals 172, 113579 (2023)

  5. An interpretable machine learning system for colorectal cancer diagnosis from pathology slides

    Authors: Pedro C. Neto, Diana Montezuma, Sara P. Oliveira, Domingos Oliveira, João Fraga, Ana Monteiro, João Monteiro, Liliana Ribeiro, Sofia Gonçalves, Stefan Reinhard, Inti Zlobec, Isabel M. Pinto, Jaime S. Cardoso

    Abstract: Considering the profound transformation affecting pathology practice, we aimed to develop a scalable artificial intelligence (AI) system to diagnose colorectal cancer from whole-slide images (WSI). For this, we propose a deep learning (DL) system that learns from weak labels, a sampling strategy that reduces the number of training samples by a factor of six without compromising performance, an app…

    Submitted 30 April, 2024; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: Accepted at npj Precision Oncology. Available at: https://www.nature.com/articles/s41698-024-00539-4

    Journal ref: npj Precis. Onc. 8, 56 (2024)

  6. arXiv:2209.03171  [pdf, other]

    physics.soc-ph cs.LG cs.SI stat.ML

    Machine Learning Partners in Criminal Networks

    Authors: Diego D. Lopes, Bruno R. da Cunha, Alvaro F. Martins, Sebastian Goncalves, Ervin K. Lenzi, Quentin S. Hanley, Matjaz Perc, Haroldo V. Ribeiro

    Abstract: Recent research has shown that criminal networks have complex organizational structures, but whether this can be used to predict static and dynamic properties of criminal networks remains little explored. Here, by combining graph representation learning and machine learning methods, we show that structural properties of political corruption, police intelligence, and money laundering networks can b…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 10 pages, 4 figures, supplementary information; accepted for publication in Scientific Reports

    Journal ref: Sci. Rep. 12, 15746 (2022)

  7. arXiv:2111.12209  [pdf]

    cs.NI eess.SP

    Sistema de sensoriamento sem fio aplicavel a deteccao de incendios florestais (Wireless sensing system applicable to forest fire detection)

    Authors: Lucas Santos Goncalves, Celso Barbosa Carvalho

    Abstract: In this research work, a hardware and software system is developed that uses wireless sensors to monitor environmental variables such as temperature, gas concentration and luminosity, in order to detect the existence of forest fires. LoRa technology was used for wireless sensor networks with communication range that can reach on average up to 5 km in urban areas and 10 km in rural areas. The develop…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: in Portuguese

  8. arXiv:2105.12118  [pdf, other]

    cs.ET physics.comp-ph physics.optics

    Solving the One-dimensional Distance Geometry Problem by Optical Computing

    Authors: S. B. Hengeveld, N. Rubiano da Silva, D. S. Gonçalves, P. H. Souto Ribeiro, A. Mucherino

    Abstract: The distance geometry problem belongs to a class of hard problems in classical computation that can be understood in terms of a set of inputs processed according to a given transformation, and for which the number of possible outcomes grows exponentially with the number of inputs. It is conjectured that quantum computing schemes can solve problems belonging to this class in a time that grows only at a…

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: 8 pages, 1 figure

  9. A new algorithm for the $^K$DMDGP subclass of Distance Geometry Problems

    Authors: Douglas S. Goncalves, Carlile Lavor, Leo Liberti, Michael Souza

    Abstract: The fundamental inverse problem in distance geometry is the one of finding positions from inter-point distances. The Discretizable Molecular Distance Geometry Problem (DMDGP) is a subclass of the Distance Geometry Problem (DGP) whose search space can be discretized and represented by a binary tree, which can be explored by a Branch-and-Prune (BP) algorithm. It turns out that this combinatorial sea…

    Submitted 11 September, 2020; originally announced September 2020.

    Comments: This is a full version of the extended abstract accepted at CTW2020

  10. Use of OWL and Semantic Web Technologies at Pinterest

    Authors: Rafael S. Gonçalves, Matthew Horridge, Rui Li, Yu Liu, Mark A. Musen, Csongor I. Nyulas, Evelyn Obamos, Dhananjay Shrouty, David Temple

    Abstract: Pinterest is a popular Web application that has over 250 million active users. It is a visual discovery engine for finding ideas for recipes, fashion, weddings, home decoration, and much more. In the last year, the company adopted Semantic Web technologies to create a knowledge graph that aims to represent the vast amount of content and users on Pinterest, to help both content recommendation and a…

    Submitted 3 July, 2019; originally announced July 2019.

  11. The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata that Describe Scientific Experiments

    Authors: Rafael S. Gonçalves, Martin J. O'Connor, Marcos Martínez-Romero, Attila L. Egyedi, Debra Willrett, John Graybeal, Mark A. Musen

    Abstract: The Center for Expanded Data Annotation and Retrieval (CEDAR) aims to revolutionize the way that metadata describing scientific experiments are authored. The software we have developed--the CEDAR Workbench--is a suite of Web-based tools and REST APIs that allows users to construct metadata templates, to fill in templates to generate high-quality metadata, and to share and manage these resources. T…

    Submitted 15 May, 2019; originally announced May 2019.

  12. Aligning Biomedical Metadata with Ontologies Using Clustering and Embeddings

    Authors: Rafael S. Gonçalves, Maulik R. Kamdar, Mark A. Musen

    Abstract: The metadata about scientific experiments published in online repositories have been shown to suffer from a high degree of representational heterogeneity---there are often many ways to represent the same type of information, such as a geographical location via its latitude and longitude. To harness the potential that metadata have for discovering scientific data, it is crucial that they be represe…

    Submitted 16 May, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

  13. WebProtégé: A Cloud-Based Ontology Editor

    Authors: Matthew Horridge, Rafael S. Gonçalves, Csongor I. Nyulas, Tania Tudorache, Mark A. Musen

    Abstract: We present WebProtégé, a tool to develop ontologies represented in the Web Ontology Language (OWL). WebProtégé is a cloud-based application that allows users to collaboratively edit OWL ontologies, and it is available for use at https://webprotege.stanford.edu. WebProtégé currently hosts more than 68,000 OWL ontology projects and has over 50,000 user accounts. In this paper, we detail the main ne…

    Submitted 5 March, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

  14. arXiv:1810.05962  [pdf, ps, other]

    physics.soc-ph cs.SI

    Empirical determination of the optimum attack for fragmentation of modular networks

    Authors: Carolina de Abreu, Bruno Requião da Cunha, Sebastián Gonçalves

    Abstract: All possible removals of $n=5$ nodes from networks of size $N=100$ are performed in order to find the optimal set of nodes which fragments the original network into the smallest largest connected component. The resulting attacks are ordered according to the size of the largest connected component and compared with the state-of-the-art methods of network attacks. We chose attacks of size $5$ on rel…

    Submitted 13 October, 2018; originally announced October 2018.

    Comments: 14 pages, 6 figures

  15. The variable quality of metadata about biological samples used in biomedical experiments

    Authors: Rafael S. Gonçalves, Mark A. Musen

    Abstract: We present an analytical study of the quality of metadata about samples used in biomedical experiments. The metadata under analysis are stored in two well-known databases: BioSample---a repository managed by the National Center for Biotechnology Information (NCBI), and BioSamples---a repository managed by the European Bioinformatics Institute (EBI). We tested whether 11.4M sample metadata records…

    Submitted 18 January, 2019; v1 submitted 17 August, 2018; originally announced August 2018.

    Comments: arXiv admin note: text overlap with arXiv:1708.01286

  16. arXiv:1803.00985  [pdf, other]

    cs.CL

    Hybrid Model For Word Prediction Using Naive Bayes and Latent Information

    Authors: Henrique X. Goulart, Mauro D. L. Tosi, Daniel Soares Gonçalves, Rodrigo F. Maia, Guilherme A. Wachs-Lopes

    Abstract: Historically, the Natural Language Processing area has been given a great deal of attention by many researchers. One of the main motivations behind this interest is related to the word prediction problem, which states that given a set of words in a sentence, one can recommend the next word. In the literature, this problem is solved by methods based on syntactic or semantic analysis. Solely, each of these analysi…

    Submitted 2 March, 2018; originally announced March 2018.

  17. arXiv:1708.01286  [pdf]

    cs.DB

    Metadata in the BioSample Online Repository are Impaired by Numerous Anomalies

    Authors: Rafael S. Gonçalves, Martin J. O'Connor, Marcos Martínez-Romero, John Graybeal, Mark A. Musen

    Abstract: The metadata about scientific experiments are crucial for finding, reproducing, and reusing the data that the metadata describe. We present a study of the quality of the metadata stored in BioSample--a repository of metadata about samples used in biomedical experiments managed by the U.S. National Center for Biotechnology Information (NCBI). We tested whether 6.6 million BioSample metadata…

    Submitted 3 August, 2017; originally announced August 2017.

  18. arXiv:1608.02619  [pdf, other]

    physics.soc-ph cs.SI

    Performance of attack strategies on modular networks

    Authors: Bruno Requião da Cunha, Sebastián Gonçalves

    Abstract: Vulnerabilities of complex networks have become a trending topic in complex systems recently due to their real-world applications. Most real networks tend to be very fragile to high betweenness adaptive attacks. However, recent contributions have shown the importance of interconnected nodes in the integrity of networks and module-based attacks have appeared promising when compared to traditional malici…

    Submitted 8 August, 2016; originally announced August 2016.

    Comments: 14 pages, 4 figures, pre-print

  19. arXiv:1504.06177  [pdf, ps, other]

    cs.OH

    State of the Art of the Intra-Task Dynamic Voltage and Frequency Scaling Technique

    Authors: Rawlinson S. Gonçalves, Raimundo da Silva Barreto

    Abstract: In recent years there has been an increasing use of embedded systems because of advances in technology, the reduction of the costs of electronic equipment and mainly the popularity of mobile devices. Many of these systems implement low power consumption policies to extend their autonomy, usually because they have a reduced amount of resources and the great majority of them use electric power from…

    Submitted 23 April, 2015; originally announced April 2015.

    Comments: 94 pages, in Portuguese

  20. arXiv:1502.00353  [pdf, other]

    physics.soc-ph cond-mat.stat-mech cs.SI

    Complex networks vulnerability to module-based attacks

    Authors: Bruno Requião da Cunha, Juan Carlos González-Avella, Sebastián Gonçalves

    Abstract: In the multidisciplinary field of Network Science, optimization of procedures for efficiently breaking complex networks is attracting much attention from practical points of view. In this contribution we present a module-based method to efficiently break complex networks. The procedure first identifies the communities in which the network can be represented, then it deletes the nodes (edges) that…

    Submitted 1 February, 2015; originally announced February 2015.

    Comments: 8 pages, 8 figures

  21. arXiv:1208.2609  [pdf, ps, other]

    physics.soc-ph cs.SI nlin.AO

    Epidemics scenarios in the "Romantic network"

    Authors: Alexsandro M. Carvalho, Sebastian Goncalves

    Abstract: The structure of sexual contacts, its contacts network and its temporal interactions, play an important role in the spread of sexually transmitted infections. Unfortunately, that kind of data is very hard to obtain. One of the few exceptions is the "Romantic network" which is a complete structure of a real sexual network of a high school. In terms of topology, unlike other sexual networks classifi…

    Submitted 9 August, 2012; originally announced August 2012.

    Comments: 9 pages text, plus references, and 10 figures (with subfigures) Epidemic simulations on a small real network

    MSC Class: 81T80; 91D30; 92D25

  22. arXiv:1201.1572  [pdf, ps, other]

    physics.soc-ph cs.SI

    A dynamical model for competing opinions

    Authors: S. R. Souza, S. Goncalves

    Abstract: We propose an opinion model based on agents located at the vertices of a regular lattice. Each agent has an independent opinion (among an arbitrary, but fixed, number of choices) and its own degree of conviction. The latter changes every time it interacts with another agent who has a different opinion. The dynamics leads to size distributions of clusters (made up of agents which have the same opin…

    Submitted 7 January, 2012; originally announced January 2012.

    Journal ref: Physical Review E 85, 056103 (2012)