Search | arXiv e-print repository

Solving for multi-class: a survey and synthesis

Abstract: Many of the best statistical classification algorithms are binary classifiers that can only distinguish between one of two classes. The number of possible ways of generalizing binary classification to multi-class increases exponentially with the number of classes. There is some indication that the best method will depend on the dataset. Hence, we are particularly interested in data-driven solution… ▽ More Many of the best statistical classification algorithms are binary classifiers that can only distinguish between one of two classes. The number of possible ways of generalizing binary classification to multi-class increases exponentially with the number of classes. There is some indication that the best method will depend on the dataset. Hence, we are particularly interested in data-driven solution design, whether based on prior considerations or on empirical examination of the data. Here we demonstrate how a recursive control language can be used to describe a multitude of different partitioning strategies in multi-class classification, including those in most common use. We use it both to manually construct new partitioning configurations as well as to examine those that have been automatically designed. Eight different strategies were tested on eight different datasets using a support vector machine (SVM) as the base binary classifier. Numerical results suggest that a one-size-fits-all solution consisting of one-versus-one is appropriate for most datasets. Three datasets showed better accuracy using different methods. The best solution for the most improved dataset exploited a property of the data to produce an uncertainty coefficient 36\% higher (0.016 absolute gain) than one-vs.-one. For the same dataset, an adaptive solution that empirically examined the data was also more accurate than one-vs.-one while being faster. △ Less

Submitted 24 January, 2021; v1 submitted 16 September, 2018; originally announced September 2018.

Comments: Tried to cut out the fat and improve wording and organization. Returned title to the original

arXiv:1708.05917 [pdf, ps, other]

doi 10.1007/s11554-018-0769-9

Accelerating Kernel Classifiers Through Borders Map**

Authors: Peter Mills

Abstract: Support vector machines (SVM) and other kernel techniques represent a family of powerful statistical classification methods with high accuracy and broad applicability. Because they use all or a significant portion of the training data, however, they can be slow, especially for large problems. Piecewise linear classifiers are similarly versatile, yet have the additional advantages of simplicity, ea… ▽ More Support vector machines (SVM) and other kernel techniques represent a family of powerful statistical classification methods with high accuracy and broad applicability. Because they use all or a significant portion of the training data, however, they can be slow, especially for large problems. Piecewise linear classifiers are similarly versatile, yet have the additional advantages of simplicity, ease of interpretation and, if the number of component linear classifiers is not too large, speed. Here we show how a simple, piecewise linear classifier can be trained from a kernel-based classifier in order to improve the classification speed. The method works by finding the root of the difference in conditional probabilities between pairs of opposite classes to build up a representation of the decision boundary. When tested on 17 different datasets, it succeeded in improving the classification speed of a SVM for 12 of them by up to two orders-of-magnitude. Of these, two were less accurate than a simple, linear classifier. The method is best suited to problems with continuum features data and smooth probability functions. Because the component linear classifiers are built up individually from an existing classifier, rather than through a simultaneous optimization procedure, the classifier is also fast to train. △ Less

Submitted 27 January, 2023; v1 submitted 19 August, 2017; originally announced August 2017.

Comments: 37 pages; 8 figures; 7 tables. Correct way to display a correction

Journal ref: Journal of Real-Time Image Processing 17, 313-327(2020)

arXiv:1409.0158 [pdf, ps, other]

Computers Should Be Uniters Not Dividers: A Vision of Computer-Enhanced Happy Future

Authors: Alexander Titovets, Philip Mills, Vladik Kreinovich

Abstract: This manifesto provides a vision of how computers can be used to bring people together, to enhance people's use of their natural creativity, and thus, make them happier. This manifesto provides a vision of how computers can be used to bring people together, to enhance people's use of their natural creativity, and thus, make them happier. △ Less

Submitted 30 August, 2014; originally announced September 2014.

arXiv:1404.4095 [pdf, other]

Multi-borders classification

Authors: Peter Mills

Abstract: The number of possible methods of generalizing binary classification to multi-class classification increases exponentially with the number of class labels. Often, the best method of doing so will be highly problem dependent. Here we present classification software in which the partitioning of multi-class classification problems into binary classification problems is specified using a recursive con… ▽ More The number of possible methods of generalizing binary classification to multi-class classification increases exponentially with the number of class labels. Often, the best method of doing so will be highly problem dependent. Here we present classification software in which the partitioning of multi-class classification problems into binary classification problems is specified using a recursive control language. △ Less

Submitted 18 May, 2014; v1 submitted 15 April, 2014; originally announced April 2014.

Comments: Corrected error in equations: second and third equations were not linearly independent. Corrected figure to match. "Hierarchical" scheme is a decision tree

Showing 1–4 of 4 results for author: Mills, P