Showing 1–2 of 2 results for author: Karasikov, M

  1. arXiv:2404.15217  [pdf, other]

    cs.CV cs.LG

    Towards Large-Scale Training of Pathology Foundation Models

    Authors: kaiko.ai, Nanne Aben, Edwin D. de Jong, Ioannis Gatopoulos, Nicolas Känzig, Mikhail Karasikov, Axel Lagré, Roman Moser, Joost van Doorn, Fei Tang

    Abstract: Driven by the recent advances in deep learning methods and, in particular, by the development of modern self-supervised learning algorithms, increased interest and efforts have been devoted to build foundation models (FMs) for medical images. In this work, we present our scalable training pipeline for large pathology imaging data, and a comprehensive analysis of various hyperparameter choices and…

    Submitted 24 March, 2024; originally announced April 2024.

  2. arXiv:1911.04200  [pdf, other]

    cs.CE cs.DC cs.PF q-bio.GN

    Communication-Efficient Jaccard Similarity for High-Performance Distributed Genome Comparisons

    Authors: Maciej Besta, Raghavendra Kanakagiri, Harun Mustafa, Mikhail Karasikov, Gunnar Rätsch, Torsten Hoefler, Edgar Solomonik

    Abstract: The Jaccard similarity index is an important measure of the overlap of two sets, widely used in machine learning, computational genomics, information retrieval, and many other areas. We design and implement SimilarityAtScale, the first communication-efficient distributed algorithm for computing the Jaccard similarity among pairs of large datasets. Our algorithm provides an efficient encoding of th…

    Submitted 11 November, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Journal ref: Proceedings of the 34th IEEE International Parallel and Distributed Processing Symposium (IPDPS'20), 2020