Skip to main content

Showing 1–9 of 9 results for author: Thoma, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:1805.02475  [pdf, other

    cs.CV

    Comparative evaluation of instrument segmentation and tracking methods in minimally invasive surgery

    Authors: Sebastian Bodenstedt, Max Allan, Anthony Agustinos, Xiaofei Du, Luis Garcia-Peraza-Herrera, Hannes Kenngott, Thomas Kurmann, Beat Müller-Stich, Sebastien Ourselin, Daniil Pakhomov, Raphael Sznitman, Marvin Teichmann, Martin Thoma, Tom Vercauteren, Sandrine Voros, Martin Wagner, Pamela Wochner, Lena Maier-Hein, Danail Stoyanov, Stefanie Speidel

    Abstract: Intraoperative segmentation and tracking of minimally invasive instruments is a prerequisite for computer- and robotic-assisted surgery. Since additional hardware like tracking systems or the robot encoders are cumbersome and lack accuracy, surgical vision is evolving as promising techniques to segment and track the instruments using only the endoscopic images. However, what is missing so far are… ▽ More

    Submitted 7 May, 2018; originally announced May 2018.

  2. arXiv:1801.07779  [pdf, other

    cs.CV cs.CL

    The WiLI benchmark dataset for written language identification

    Authors: Martin Thoma

    Abstract: This paper describes the WiLI-2018 benchmark dataset for monolingual written natural language identification. WiLI-2018 is a publicly available, free of charge dataset of short text extracts from Wikipedia. It contains 1000 paragraphs of 235 languages, totaling in 23500 paragraphs. WiLI is a classification dataset: Given an unknown paragraph written in one dominant language, it has to be decided w… ▽ More

    Submitted 23 January, 2018; originally announced January 2018.

    Comments: {"pages": 12, "figures": 4, "language": "English", "author-ORCiD": ["https://orcid.org/0000-0002-6517-1690"]}

  3. arXiv:1707.09725  [pdf, other

    cs.CV

    Analysis and Optimization of Convolutional Neural Network Architectures

    Authors: Martin Thoma

    Abstract: Convolutional Neural Networks (CNNs) dominate various computer vision tasks since Alex Krizhevsky showed that they can be trained effectively and reduced the top-5 error from 26.2 % to 15.3 % on the ImageNet large scale visual recognition challenge. Many aspects of CNNs are examined in various publications, but literature about the analysis and construction of neural network architectures is rare.… ▽ More

    Submitted 31 July, 2017; originally announced July 2017.

    Comments: Master's thesis. 73 pages + 24 pages appendix; 39 figures; 33 tables

  4. arXiv:1701.08380  [pdf, other

    cs.CV

    The HASYv2 dataset

    Authors: Martin Thoma

    Abstract: This paper describes the HASYv2 dataset. HASY is a publicly available, free of charge dataset of single symbols similar to MNIST. It contains 168233 instances of 369 classes. HASY contains two challenges: A classification challenge with 10 pre-defined folds for 10-fold cross-validation and a verification challenge.

    Submitted 29 January, 2017; originally announced January 2017.

  5. arXiv:1602.06541  [pdf, other

    cs.CV

    A Survey of Semantic Segmentation

    Authors: Martin Thoma

    Abstract: This survey gives an overview over different techniques used for pixel-level semantic segmentation. Metrics and datasets for the evaluation of segmentation algorithms and traditional approaches for segmentation such as unsupervised methods, Decision Forests and SVMs are described and pointers to the relevant papers are given. Recently published approaches with convolutional neural networks are men… ▽ More

    Submitted 11 May, 2016; v1 submitted 21 February, 2016; originally announced February 2016.

    Comments: Fixed typo in accuracy metrics formula; added value range of accuracy metrics; consistent naming of variables

  6. arXiv:1601.03642  [pdf, other

    cs.CV cs.LG

    Creativity in Machine Learning

    Authors: Martin Thoma

    Abstract: Recent machine learning techniques can be modified to produce creative results. Those results did not exist before; it is not a trivial combination of the data which was fed into the machine learning system. The obtained results come in multiple forms: As images, as text and as audio. This paper gives a high level overview of how they are created and gives some examples. It is meant to be a summ… ▽ More

    Submitted 12 January, 2016; originally announced January 2016.

    Comments: 5 pages, 4 figures

  7. arXiv:1512.04469  [pdf, other

    cs.LG

    Über die Klassifizierung von Knoten in dynamischen Netzwerken mit Inhalt

    Authors: Martin Thoma

    Abstract: This paper explains the DYCOS-Algorithm as it was introduced in by Aggarwal and Li in 2011. It operates on graphs whichs nodes are partially labeled and automatically adds missing labels to nodes. To do so, the DYCOS algorithm makes use of the structure of the graph as well as content which is assigned to the node. Aggarwal and Li measured in an experimental analysis that DYCOS adds the missing la… ▽ More

    Submitted 23 November, 2015; originally announced December 2015.

    Comments: in German. This term paper was handed in on 17.01.2014

  8. On-line Recognition of Handwritten Mathematical Symbols

    Authors: Martin Thoma

    Abstract: Finding the name of an unknown symbol is often hard, but writing the symbol is easy. This bachelor's thesis presents multiple systems that use the pen trajectory to classify handwritten symbols. Five preprocessing steps, one data augmentation algorithm, five features and five variants for multilayer Perceptron training were evaluated using 166898 recordings which were collected with two crowdsourc… ▽ More

    Submitted 29 November, 2015; originally announced November 2015.

  9. arXiv:1511.00513  [pdf, other

    cs.CV

    Pixel-wise Segmentation of Street with Neural Networks

    Authors: Sebastian Bittel, Vitali Kaiser, Marvin Teichmann, Martin Thoma

    Abstract: Pixel-wise street segmentation of photographs taken from a drivers perspective is important for self-driving cars and can also support other object recognition tasks. A framework called SST was developed to examine the accuracy and execution time of different neural networks. The best neural network achieved an $F_1$-score of 89.5% with a simple feedforward neural network which trained to solve a… ▽ More

    Submitted 2 November, 2015; originally announced November 2015.