A shortest-path based clustering algorithm for joint human-machine analysis of complex datasets
Authors:
Diego Ulisse Pizzagalli,
Santiago Fernandez Gonzalez,
Rolf Krause
Abstract:
Clustering is a technique for the analysis of datasets obtained by empirical studies in several disciplines with a major application for biomedical research. Essentially, clustering algorithms are executed by machines aiming at finding groups of related points in a dataset. However, the result of grou** depends on both metrics for point-to-point similarity and rules for point-to-group associatio…
▽ More
Clustering is a technique for the analysis of datasets obtained by empirical studies in several disciplines with a major application for biomedical research. Essentially, clustering algorithms are executed by machines aiming at finding groups of related points in a dataset. However, the result of grou** depends on both metrics for point-to-point similarity and rules for point-to-group association. Indeed, non-appropriate metrics and rules can lead to undesirable clustering artifacts. This is especially relevant for datasets, where groups with heterogeneous structures co-exist. In this work, we propose an algorithm that achieves clustering by exploring the paths between points. This allows both, to evaluate the properties of the path (such as gaps, density variations, etc.), and expressing the preference for certain paths. Moreover, our algorithm supports the integration of existing knowledge about admissible and non-admissible clusters by training a path classifier. We demonstrate the accuracy of the proposed method on challenging datasets including points from synthetic shapes in publicly available benchmarks and microscopy data.
△ Less
Submitted 31 December, 2018;
originally announced December 2018.
La fauna de mamíferos fósiles del depósito paleontológico "El Abrón" (nivel ix), Pinar del Río, Cuba
Authors:
Soraida Fiol González
Abstract:
"El Abrón" is a fossil deposit located in Pinar del Rio, Cuba, and whose age is only reference level VII (17 406 years BP), it is classified as the largest collection of fossils accumulated for our archipelago, produced by trophic action of barn owls for thousands of years. The aim of this study was to determine the living taxonomic composition of the fauna of extinct mammals, and throughout the p…
▽ More
"El Abrón" is a fossil deposit located in Pinar del Rio, Cuba, and whose age is only reference level VII (17 406 years BP), it is classified as the largest collection of fossils accumulated for our archipelago, produced by trophic action of barn owls for thousands of years. The aim of this study was to determine the living taxonomic composition of the fauna of extinct mammals, and throughout the paleontological study of the deeper level of said tank (Level IX). The extracted material which it is currently stored in the warehouse of paleontological collections of the National Museum of Natural History in Havana, Cuba (MNHNCu) was analyzed. We proceeded to clean the bones, to classify and to identify them from the species and also the taphonomic analysis of the condition of the remains. It was found that the mammal fauna of the paleontological deposit under study is composed essentially of 3 orders, 7 families and 14 species. The most significative order is Chiroptera (bat fauna), represented by 4 families, 9 genus and 9 species of the total which were identified. There were reported four species of bats Erophylla sezecorni, Monophyllus redmani, Pteronotus parnelli and Tadarida brasiliensis in the location. The results are the basis of the future paleoecological studies in order to reconstruct the natural history of these species. Moreover, the discovery of new species in this area is a contribution to the knowledge about the distribution of these species in the Cuban archipelago and the age of them. Finally, the taphonomic analysis of the conservation status of these remains permitted the understanding of the processes that gave rise to the tank and its characteristics, and also it contribute to an adequate estimation of the species present in it and the relationship between spatiotemporal with the fossil.
△ Less
Submitted 14 December, 2018;
originally announced December 2018.