Skip to main content

Showing 1–50 of 62 results for author: Gorban, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12670  [pdf, other

    cs.AI cs.LG

    Stealth edits for provably fixing or attacking large language models

    Authors: Oliver J. Sutton, Qinghua Zhou, Wei Wang, Desmond J. Higham, Alexander N. Gorban, Alexander Bastounis, Ivan Y. Tyukin

    Abstract: We reveal new methods and the theoretical foundations of techniques for editing large language models. We also show how the new theory can be used to assess the editability of models and to expose their susceptibility to previously unknown malicious attacks. Our theoretical approach shows that a single metric (a specific measure of the intrinsic dimensionality of the model's features) is fundament… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 24 pages, 9 figures. Open source implementation: https://github.com/qinghua-zhou/stealth-edits

    MSC Class: 68T07; 68T50; 68W40 ACM Class: I.2.7; F.2.0

  2. arXiv:2402.06563  [pdf

    cs.LG cs.AI cs.CL cs.HC cs.IT

    What is Hiding in Medicine's Dark Matter? Learning with Missing Data in Medical Practices

    Authors: Neslihan Suzen, Evgeny M. Mirkes, Damian Roland, Jeremy Levesley, Alexander N. Gorban, Tim J. Coats

    Abstract: Electronic patient records (EPRs) produce a wealth of data but contain significant missing information. Understanding and handling this missing data is an important part of clinical data analysis and if left unaddressed could result in bias in analysis and distortion in critical conclusions. Missing data may be linked to health care professional practice patterns and imputation of missing data can… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 8 pages

    Journal ref: 2023 IEEE International Conference on Big Data (BigData), 4979-4986

  3. arXiv:2402.00899  [pdf, other

    cs.LG cs.AI stat.ML

    Weakly Supervised Learners for Correction of AI Errors with Provable Performance Guarantees

    Authors: Ivan Y. Tyukin, Tatiana Tyukina, Daniel van Helden, Zedong Zheng, Evgeny M. Mirkes, Oliver J. Sutton, Qinghua Zhou, Alexander N. Gorban, Penelope Allison

    Abstract: We present a new methodology for handling AI errors by introducing weakly supervised AI error correctors with a priori performance guarantees. These AI correctors are auxiliary maps whose role is to moderate the decisions of some previously constructed underlying classifier by either approving or rejecting its decisions. The rejection of a decision can be used as a signal to suggest abstaining fro… ▽ More

    Submitted 13 February, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    MSC Class: 68T05; 68T37

  4. arXiv:2311.13917  [pdf

    physics.soc-ph cs.LG

    Exploring the impact of social stress on the adaptive dynamics of COVID-19: Ty** the behavior of naïve populations faced with epidemics

    Authors: Innokentiy Kastalskiy, Andrei Zinovyev, Evgeny Mirkes, Victor Kazantsev, Alexander N. Gorban

    Abstract: In the context of natural disasters, human responses inevitably intertwine with natural factors. The COVID-19 pandemic, as a significant stress factor, has brought to light profound variations among different countries in terms of their adaptive dynamics in addressing the spread of infection outbreaks across different regions. This emphasizes the crucial role of cultural characteristics in natural… ▽ More

    Submitted 12 February, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: 29 pages, 16 figures, 1 table, 2 appendices

    Journal ref: Communications in Nonlinear Science and Numerical Simulation, Volume 132, May 2024, 107906

  5. Relative intrinsic dimensionality is intrinsic to learning

    Authors: Oliver J. Sutton, Qinghua Zhou, Alexander N. Gorban, Ivan Y. Tyukin

    Abstract: High dimensional data can have a surprising property: pairs of data points may be easily separated from each other, or even from arbitrary subsets, with high probability using just simple linear classifiers. However, this is more of a rule of thumb than a reliable property as high dimensionality alone is neither necessary nor sufficient for successful learning. Here, we introduce a new notion of t… ▽ More

    Submitted 10 October, 2023; originally announced November 2023.

    Comments: 12 pages, 5 figures

    MSC Class: 68T09; 68T10

    Journal ref: Artificial Neural Networks and Machine Learning ICANN 2023. Lecture Notes in Computer Science, vol 14254, pp 516-529. Springer, Cham

  6. arXiv:2309.07072  [pdf, ps, other

    cs.LG

    The Boundaries of Verifiable Accuracy, Robustness, and Generalisation in Deep Learning

    Authors: Alexander Bastounis, Alexander N. Gorban, Anders C. Hansen, Desmond J. Higham, Danil Prokhorov, Oliver Sutton, Ivan Y. Tyukin, Qinghua Zhou

    Abstract: In this work, we assess the theoretical limitations of determining guaranteed stability and accuracy of neural networks in classification tasks. We consider classical distribution-agnostic framework and algorithms minimising empirical risks and potentially subjected to some weights regularisation. We show that there is a large family of tasks for which computing and verifying ideal stable and accu… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    MSC Class: 68T07; 68T05

  7. arXiv:2309.03665  [pdf, other

    cs.LG cs.AI

    How adversarial attacks can disrupt seemingly stable accurate classifiers

    Authors: Oliver J. Sutton, Qinghua Zhou, Ivan Y. Tyukin, Alexander N. Gorban, Alexander Bastounis, Desmond J. Higham

    Abstract: Adversarial attacks dramatically change the output of an otherwise accurate learning system using a seemingly inconsequential modification to a piece of input data. Paradoxically, empirical evidence indicates that even systems which are robust to large random perturbations of the input data remain susceptible to small, easily constructed, adversarial perturbations of their inputs. Here, we show th… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 11 pages, 8 figures, additional supplementary materials

  8. arXiv:2306.04745  [pdf, other

    cs.CV cs.AI

    3D Human Keypoints Estimation From Point Clouds in the Wild Without Human Labels

    Authors: Zhenzhen Weng, Alexander S. Gorban, **gwei Ji, Mahyar Najibi, Yin Zhou, Dragomir Anguelov

    Abstract: Training a 3D human keypoint detector from point clouds in a supervised manner requires large volumes of high quality labels. While it is relatively easy to capture large amounts of human point clouds, annotating 3D keypoints is expensive, subjective, error prone and especially difficult for long-tail cases (pedestrians with rare poses, scooterists, etc.). In this work, we propose GC-KPL - Geometr… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: CVPR 2023

  9. arXiv:2305.07624  [pdf, other

    cs.LG

    Agile gesture recognition for capacitive sensing devices: adapting on-the-job

    Authors: Ying Liu, Liucheng Guo, Valeri A. Makarov, Yuxiang Huang, Alexander Gorban, Evgeny Mirkes, Ivan Y. Tyukin

    Abstract: Automated hand gesture recognition has been a focus of the AI community for decades. Traditionally, work in this domain revolved largely around scenarios assuming the availability of the flow of images of the user hands. This has partly been due to the prevalence of camera-based devices and the wide availability of image data. However, there is growing demand for gesture recognition technology tha… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  10. arXiv:2212.07729  [pdf, other

    cs.CV

    HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving

    Authors: Andrei Zanfir, Mihai Zanfir, Alexander Gorban, **gwei Ji, Yin Zhou, Dragomir Anguelov, Cristian Sminchisescu

    Abstract: Autonomous driving is an exciting new industry, posing important research questions. Within the perception module, 3D human pose estimation is an emerging technology, which can enable the autonomous vehicle to perceive and understand the subtle and complex behaviors of pedestrians. While hardware systems and sensors have dramatically improved over the decades -- with cars potentially boasting comp… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Published at the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand

  11. arXiv:2211.03607  [pdf, other

    cs.LG cs.AI cs.CV

    Towards a mathematical understanding of learning from few examples with nonlinear feature maps

    Authors: Oliver J. Sutton, Alexander N. Gorban, Ivan Y. Tyukin

    Abstract: We consider the problem of data classification where the training set consists of just a few data points. We explore this phenomenon mathematically and reveal key relationships between the geometry of an AI model's feature space, the structure of the underlying data distributions, and the model's generalisation capabilities. The main thrust of our analysis is to reveal the influence on the model's… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 18 pages, 8 figures

    MSC Class: 68Q32; 68T05

  12. Domain Adaptation Principal Component Analysis: base linear method for learning with out-of-distribution data

    Authors: Evgeny M Mirkes, Jonathan Bac, Aziz Fouché, Sergey V. Stasenko, Andrei Zinovyev, Alexander N. Gorban

    Abstract: Domain adaptation is a popular paradigm in modern machine learning which aims at tackling the problem of divergence (or shift) between the labeled training and validation datasets (source domain) and a potentially large unlabeled dataset (target domain). The task is to embed both datasets red into a common space in which the source dataset is informative for training while the divergence between s… ▽ More

    Submitted 15 December, 2022; v1 submitted 28 August, 2022; originally announced August 2022.

    Journal ref: Entropy, 25(1), 33, 2023

  13. arXiv:2205.15696  [pdf

    cs.CL cs.AI cs.HC cs.IT

    An Informational Space Based Semantic Analysis for Scientific Texts

    Authors: Neslihan Suzen, Alexander N. Gorban, Jeremy Levesley, Evgeny M. Mirkes

    Abstract: One major problem in Natural Language Processing is the automatic analysis and representation of human language. Human language is ambiguous and deeper understanding of semantics and creating human-to-machine interaction have required an effort in creating the schemes for act of communication and building common-sense knowledge bases for the 'meaning' in texts. This paper introduces computational… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: 19 pages. arXiv admin note: substantial text overlap with arXiv:2009.08859, arXiv:2004.13717

    Journal ref: Computer Science & Information Technology, volume 12, number 08, pp. 81-99, 2022. CS & IT - CSCP 2022

  14. arXiv:2203.16935  [pdf, other

    cs.LG

    Learning from few examples with nonlinear feature maps

    Authors: Ivan Y. Tyukin, Oliver Sutton, Alexander N. Gorban

    Abstract: In this work we consider the problem of data classification in post-classical settings were the number of training examples consists of mere few data points. We explore the phenomenon and reveal key relationships between dimensionality of AI model's feature space, non-degeneracy of data distributions, and the model's generalisation capabilities. The main thrust of our present analysis is on the in… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    MSC Class: 68T05; 68Q32

  15. arXiv:2203.16687  [pdf, other

    cs.LG

    Quasi-orthogonality and intrinsic dimensions as measures of learning and generalisation

    Authors: Qinghua Zhou, Alexander N. Gorban, Evgeny M. Mirkes, Jonathan Bac, Andrei Zinovyev, Ivan Y. Tyukin

    Abstract: Finding best architectures of learning machines, such as deep neural networks, is a well-known technical and theoretical challenge. Recent work by Mellor et al (2021) showed that there may exist correlations between the accuracies of trained networks and the values of some easily computable measures defined on randomly initialised networks which may enable to search tens of thousands of neural arc… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    MSC Class: 68T05; 68Q32

  16. arXiv:2112.12141  [pdf, other

    cs.CV

    Multi-modal 3D Human Pose Estimation with 2D Weak Supervision in Autonomous Driving

    Authors: **gxiao Zheng, Xinwei Shi, Alexander Gorban, Junhua Mao, Yang Song, Charles R. Qi, Ting Liu, Visesh Chari, Andre Cornman, Yin Zhou, Congcong Li, Dragomir Anguelov

    Abstract: 3D human pose estimation (HPE) in autonomous vehicles (AV) differs from other use cases in many factors, including the 3D resolution and range of data, absence of dense depth maps, failure modes for LiDAR, relative location between the camera and LiDAR, and a high bar for estimation accuracy. Data collected for other use cases (such as virtual reality, gaming, and animation) may therefore not be u… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  17. arXiv:2109.02596  [pdf, other

    cs.LG stat.ML

    Scikit-dimension: a Python package for intrinsic dimension estimation

    Authors: Jonathan Bac, Evgeny M. Mirkes, Alexander N. Gorban, Ivan Tyukin, Andrei Zinovyev

    Abstract: Dealing with uncertainty in applications of machine learning to real-life data critically depends on the knowledge of intrinsic dimensionality (ID). A number of methods have been suggested for the purpose of estimating ID, but no standard package to easily apply them one by one or all at once has been implemented in Python. This technical note introduces \texttt{scikit-dimension}, an open-source P… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 12 pages, 4 figures, 1 table

    Journal ref: Entropy, 2021, 23(10), 1368

  18. arXiv:2108.13414  [pdf, other

    q-bio.NC cs.AI

    Astrocytes mediate analogous memory in a multi-layer neuron-astrocytic network

    Authors: Yuliya Tsybina, Innokentiy Kastalskiy, Mikhail Krivonosov, Alexey Zaikin, Victor Kazantsev, Alexander Gorban, Susanna Gordleeva

    Abstract: Modeling the neuronal processes underlying short-term working memory remains the focus of many theoretical studies in neuroscience. Here we propose a mathematical model of spiking neuron network (SNN) demonstrating how a piece of information can be maintained as a robust activity pattern for several seconds then completely disappear if no other stimuli come. Such short-term memory traces are prese… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: 18 pages, 6 figures, 1 table, Appendix

  19. arXiv:2106.15416  [pdf, other

    cs.LG cs.AI stat.ML

    High-dimensional separability for one- and few-shot learning

    Authors: Alexander N. Gorban, Bogdan Grechuk, Evgeny M. Mirkes, Sergey V. Stasenko, Ivan Y. Tyukin

    Abstract: This work is driven by a practical question: corrections of Artificial Intelligence (AI) errors. These corrections should be quick and non-iterative. To solve this problem without modification of a legacy AI system, we propose special `external' devices, correctors. Elementary correctors consist of two parts, a classifier that separates the situations with high risk of error from the situations in… ▽ More

    Submitted 22 October, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Corrected and restructured version with some extensions

    Journal ref: Entropy. 2021; 23(8):1090

  20. arXiv:2106.13997  [pdf, other

    cs.CR cs.AI cs.LG

    The Feasibility and Inevitability of Stealth Attacks

    Authors: Ivan Y. Tyukin, Desmond J. Higham, Alexander Bastounis, Eliyas Woldegeorgis, Alexander N. Gorban

    Abstract: We develop and study new adversarial perturbations that enable an attacker to gain control over decisions in generic Artificial Intelligence (AI) systems including deep learning neural networks. In contrast to adversarial data modification, the attack mechanism we consider here involves alterations to the AI system itself. Such a stealth attack could be conducted by a mischievous, corrupt or disgr… ▽ More

    Submitted 4 January, 2023; v1 submitted 26 June, 2021; originally announced June 2021.

    MSC Class: 68T01; 68T05; 90C31

    Journal ref: IMA Journal of Applied Mathematics, October 2023, hxad027

  21. arXiv:2104.12869  [pdf, other

    cs.CL cs.IT cs.LG

    Semantic Analysis for Automated Evaluation of the Potential Impact of Research Articles

    Authors: Neslihan Suzen, Alexander Gorban, Jeremy Levesley, Evgeny Mirkes

    Abstract: Can the analysis of the semantics of words used in the text of a scientific paper predict its future impact measured by citations? This study details examples of automated text classification that achieved 80% success rate in distinguishing between highly-cited and little-cited articles. Automated intelligent systems allow the identification of promising works that could become influential in the… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: 36 pages

  22. Demystification of Few-shot and One-shot Learning

    Authors: Ivan Y. Tyukin, Alexander N. Gorban, Muhammad H. Alkhudaydi, Qinghua Zhou

    Abstract: Few-shot and one-shot learning have been the subject of active and intensive research in recent years, with mounting evidence pointing to successful implementation and exploitation of few-shot learning algorithms in practice. Classical statistical learning theories do not fully explain why few- or one-shot learning is at all possible since traditional generalisation bounds normally require large t… ▽ More

    Submitted 29 May, 2021; v1 submitted 25 April, 2021; originally announced April 2021.

    Comments: IEEE International Joint Conference on Neural Networks, IJCNN 2021

    MSC Class: 68T05; 68T07

    Journal ref: In2021 International Joint Conference on Neural Networks (IJCNN) 2021 Jul 18 (pp. 1-7). IEEE

  23. General stochastic separation theorems with optimal bounds

    Authors: Bogdan Grechuk, Alexander N. Gorban, Ivan Y. Tyukin

    Abstract: Phenomenon of stochastic separability was revealed and used in machine learning to correct errors of Artificial Intelligence (AI) systems and analyze AI instabilities. In high-dimensional datasets under broad assumptions each point can be separated from the rest of the set by simple and robust Fisher's discriminant (is Fisher separable). Errors or clusters of errors can be separated from the rest… ▽ More

    Submitted 9 January, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: Numerical examples and illustrations are added, minor corrections extended discussion and the bibliography

    Journal ref: Neural Networks, Volume 138, 2021, Pages 33-56

  24. arXiv:2009.08859  [pdf, other

    cs.CL cs.LG

    Principal Components of the Meaning

    Authors: Neslihan Suzen, Alexander Gorban, Jeremy Levesley, Evgeny Mirkes

    Abstract: In this paper we argue that (lexical) meaning in science can be represented in a 13 dimension Meaning Space. This space is constructed using principal component analysis (singular decomposition) on the matrix of word category relative information gains, where the categories are those used by the Web of Science, and the words are taken from a reduced word set from texts in the Web of Science. We sh… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

  25. Trajectories, bifurcations and pseudotime in large clinical datasets: applications to myocardial infarction and diabetes data

    Authors: Sergey E. Golovenkin, Jonathan Bac, Alexander Chervov, Evgeny M. Mirkes, Yuliya V. Orlova, Emmanuel Barillot, Alexander N. Gorban, Andrei Zinovyev

    Abstract: Large observational clinical datasets become increasingly available for mining associations between various disease traits and administered therapy. These datasets can be considered as representations of the landscape of all possible disease conditions, in which a concrete pathology develops through a number of stereotypical routes, characterized by `points of no return' and `final states' (such a… ▽ More

    Submitted 5 October, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

    ACM Class: I.2.6; J.3; J.2

    Journal ref: GigaScience, Volume 9, Issue 11, 2020, giaa128,

  26. arXiv:2005.06284  [pdf, other

    cs.LG cs.NE stat.ML

    Pruning coupled with learning, ensembles of minimal neural networks, and future of XAI

    Authors: Alexander N. Gorban, Evgeny M. Mirkes

    Abstract: Pruning coupled with learning aims to optimize the neural network (NN) structure for solving specific problems. This optimization can be used for various purposes: to prevent overfitting, to save resources for implementation and training, to provide explainability of the trained NN, and many others. The minimal structure that cannot be pruned further is not unique. Ensemble of minimal structures c… ▽ More

    Submitted 22 January, 2023; v1 submitted 13 May, 2020; originally announced May 2020.

    Comments: Significantly modified and extended version, 23 pages, 5 figures

  27. arXiv:2004.14230  [pdf, other

    cs.LG stat.ML

    Fractional norms and quasinorms do not help to overcome the curse of dimensionality

    Authors: Evgeny M. Mirkes, Jeza Allohibi, Alexander N. Gorban

    Abstract: The curse of dimensionality causes the well-known and widely discussed problems for machine learning methods. There is a hypothesis that using of the Manhattan distance and even fractional quasinorms lp (for p less than 1) can help to overcome the curse of dimensionality in classification problems. In this study, we systematically test this hypothesis. We confirm that fractional quasinorms have a… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Journal ref: Entropy. 2020; 22(10):1105

  28. arXiv:2004.13717  [pdf, other

    cs.CL cs.DL

    Informational Space of Meaning for Scientific Texts

    Authors: Neslihan Suzen, Evgeny M. Mirkes, Alexander N. Gorban

    Abstract: In Natural Language Processing, automatic extracting the meaning of texts constitutes an important problem. Our focus is the computational analysis of meaning of short scientific texts (abstracts or brief reports). In this paper, a vector space model is developed for quantifying the meaning of words and texts. We introduce the Meaning Space, in which the meaning of a word is represented by a vecto… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 320 pages

  29. On Adversarial Examples and Stealth Attacks in Artificial Intelligence Systems

    Authors: Ivan Y. Tyukin, Desmond J. Higham, Alexander N. Gorban

    Abstract: In this work we present a formal theoretical framework for assessing and analyzing two classes of malevolent action towards generic Artificial Intelligence (AI) systems. Our results apply to general multi-class classifiers that map from an input space into a decision space, including artificial neural networks used in deep learning applications. Two classes of attacks are considered. The first cla… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

    MSC Class: 68T05; 68T10; 90C31

    Journal ref: 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, United Kingdom, 2020

  30. arXiv:2001.04959  [pdf, other

    cs.LG cs.AI stat.ML

    High--Dimensional Brain in a High-Dimensional World: Blessing of Dimensionality

    Authors: Alexander N. Gorban, Valery A. Makarov, Ivan Y. Tyukin

    Abstract: High-dimensional data and high-dimensional representations of reality are inherent features of modern Artificial Intelligence systems and applications of machine learning. The well-known phenomenon of the "curse of dimensionality" states: many problems become exponentially difficult in high dimensions. Recently, the other side of the coin, the "blessing of dimensionality", has attracted much atten… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 18 pages, 5 figures

    Journal ref: Entropy 2020, 22(1), 82

  31. arXiv:1912.06858  [pdf, other

    cs.CL cs.DL

    LScDC-new large scientific dictionary

    Authors: Neslihan Suzen, Evgeny M. Mirkes, Alexander N. Gorban

    Abstract: In this paper, we present a scientific corpus of abstracts of academic papers in English -- Leicester Scientific Corpus (LSC). The LSC contains 1,673,824 abstracts of research articles and proceeding papers indexed by Web of Science (WoS) in which publication year is 2014. Each abstract is assigned to at least one of 252 subject categories. Paper metadata include these categories and the number of… ▽ More

    Submitted 14 December, 2019; originally announced December 2019.

    Comments: 63 pages

  32. Blessing of dimensionality at the edge

    Authors: Ivan Y. Tyukin, Alexander N. Gorban, Alistair A. McEwan, Sepehr Meshkinfamfard, Lixin Tang

    Abstract: In this paper we present theory and algorithms enabling classes of Artificial Intelligence (AI) systems to continuously and incrementally improve with a-priori quantifiable guarantees - or more specifically remove classification errors - over time. This is distinct from state-of-the-art machine learning, AI, and software approaches. Another feature of this approach is that, in the supervised setti… ▽ More

    Submitted 10 July, 2020; v1 submitted 30 September, 2019; originally announced October 2019.

    MSC Class: 68T05; 68T45; 68Q32

    Journal ref: Information Sciences, 564, 124-143 (2021)

  33. Symphony of high-dimensional brain

    Authors: Alexander N. Gorban, Valeri A. Makarov, Ivan Y. Tyukin

    Abstract: This paper is the final part of the scientific discussion organised by the Journal "Physics of Life Rviews" about the simplicity revolution in neuroscience and AI. This discussion was initiated by the review paper "The unreasonable effectiveness of small neural ensembles in high-dimensional brain". Phys Life Rev 2019, doi 10.1016/j.plrev.2018.09.005, arXiv:1809.07656. The topics of the discussion… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Journal ref: Physics of Life Reviews, 2019

  34. arXiv:1811.05321  [pdf, other

    cs.LG cs.AI stat.ML

    Correction of AI systems by linear discriminants: Probabilistic foundations

    Authors: A. N. Gorban, A. Golubkov, B. Grechuk, E. M. Mirkes, I. Y. Tyukin

    Abstract: Artificial Intelligence (AI) systems sometimes make errors and will make errors in the future, from time to time. These errors are usually unexpected, and can lead to dramatic consequences. Intensive development of AI and its practical applications makes the problem of errors more important. Total re-engineering of the systems can create new errors and is not always possible due to the resources i… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: arXiv admin note: text overlap with arXiv:1809.07656 and arXiv:1802.02172

    Journal ref: Information Sciences 466 (2018), 303-322

  35. Fast Construction of Correcting Ensembles for Legacy Artificial Intelligence Systems: Algorithms and a Case Study

    Authors: Ivan Y. Tyukin, Alexander N. Gorban, Stephen Green, Danil Prokhorov

    Abstract: This paper presents a technology for simple and computationally efficient improvements of a generic Artificial Intelligence (AI) system, including Multilayer and Deep Learning neural networks. The improvements are, in essence, small network ensembles constructed on top of the existing AI architectures. Theoretical foundations of the technology are based on Stochastic Separation Theorems and the id… ▽ More

    Submitted 13 February, 2019; v1 submitted 12 October, 2018; originally announced October 2018.

    Journal ref: Information Sciences, 2019

  36. The unreasonable effectiveness of small neural ensembles in high-dimensional brain

    Authors: A. N. Gorban, V. A. Makarov, I. Y. Tyukin

    Abstract: Despite the widely-spread consensus on the brain complexity, sprouts of the single neuron revolution emerged in neuroscience in the 1970s. They brought many unexpected discoveries, including grandmother or concept cells and sparse coding of information in the brain. In machine learning for a long time, the famous curse of dimensionality seemed to be an unsolvable problem. Nevertheless, the idea… ▽ More

    Submitted 10 November, 2018; v1 submitted 20 September, 2018; originally announced September 2018.

    Comments: Review paper, accepted in Physics of Life Reviews; minor corrections

    Journal ref: Physics of Life Reviews Volume 29, July 2019, Pages 55-88

  37. Automatic Short Answer Grading and Feedback Using Text Mining Methods

    Authors: Neslihan Suzen, Alexander Gorban, Jeremy Levesley, Evgeny Mirkes

    Abstract: Automatic grading is not a new approach but the need to adapt the latest technology to automatic grading has become very important. As the technology has rapidly became more powerful on scoring exams and essays, especially from the 1990s onwards, partially or wholly automated grading systems using computational methods have evolved and have become a major area of research. In particular, the deman… ▽ More

    Submitted 19 December, 2019; v1 submitted 27 July, 2018; originally announced July 2018.

    Comments: 27 pages; added questions for section 6; correction of typos

    Journal ref: Procedia Computer Science 169 (2020), 726-743

  38. arXiv:1805.01516  [pdf, ps, other

    cs.NE cs.LG stat.ML

    How deep should be the depth of convolutional neural networks: a backyard dog case study

    Authors: A. N. Gorban, E. M. Mirkes, I. Y. Tyukin

    Abstract: The work concerns the problem of reducing a pre-trained deep neuronal network to a smaller network, with just few layers, whilst retaining the network's functionality on a given task The proposed approach is motivated by the observation that the aim to deliver the highest accuracy possible in the broadest range of operational conditions, which many deep neural networks models strive to achieve,… ▽ More

    Submitted 8 December, 2019; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: Edited and extended version with more detailed description of numerical experiments

  39. arXiv:1804.08588  [pdf, other

    cs.CV

    Large Scale Scene Text Verification with Guided Attention

    Authors: Dafang He, Yeqing Li, Alexander Gorban, Derrall Heath, Julian Ibarz, Qian Yu, Daniel Kifer, C. Lee Giles

    Abstract: Many tasks are related to determining if a particular text string exists in an image. In this work, we propose a new framework that learns this task in an end-to-end way. The framework takes an image and a text string as input and then outputs the probability of the text string being present in the image. This is the first end-to-end framework that learns such relationships between text and images… ▽ More

    Submitted 18 November, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: 18 pages, ACCV 2019

  40. arXiv:1804.07580  [pdf

    cs.LG q-bio.QM stat.ML

    Robust And Scalable Learning Of Complex Dataset Topologies Via Elpigraph

    Authors: Luca Albergante, Evgeny M. Mirkes, Huidong Chen, Alexis Martin, Louis Faure, Emmanuel Barillot, Luca Pinello, Alexander N. Gorban, Andrei Zinovyev

    Abstract: Large datasets represented by multidimensional data point clouds often possess non-trivial distributions with branching trajectories and excluded regions, with the recent single-cell transcriptomic studies of develo** embryo being notable examples. Reducing the complexity and producing compact and interpretable representations of such data remains a challenging task. Most of the existing computa… ▽ More

    Submitted 20 June, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: 32 pages, 14 figures

    Journal ref: Entropy 22, no. 3: 296, 2020

  41. arXiv:1802.02172  [pdf, other

    cs.AI

    Augmented Artificial Intelligence: a Conceptual Framework

    Authors: Alexander N. Gorban, Bogdan Grechuk, Ivan Y. Tyukin

    Abstract: All artificial Intelligence (AI) systems make errors. These errors are unexpected, and differ often from the typical human mistakes ("non-human" errors). The AI errors should be corrected without damage of existing skills and, hopefully, avoiding direct human expertise. This paper presents an initial summary report of project taking new and systematic approach to improving the intellectual effecti… ▽ More

    Submitted 24 March, 2018; v1 submitted 6 February, 2018; originally announced February 2018.

    Comments: The mathematical part is significantly extended. New stochastic separation theorems are proven for log-concave distributions. Some previously formulated hypotheses are confirmed

  42. Blessing of dimensionality: mathematical foundations of the statistical physics of data

    Authors: A. N. Gorban, I. Y. Tyukin

    Abstract: The concentration of measure phenomena were discovered as the mathematical background of statistical mechanics at the end of the XIX - beginning of the XX century and were then explored in mathematics of the XX-XXI centuries. At the beginning of the XXI century, it became clear that the proper utilisation of these phenomena in machine learning might transform the curse of dimensionality into the b… ▽ More

    Submitted 10 January, 2018; originally announced January 2018.

    Comments: Accepted for publication in Philosophical Transactions of the Royal Society A, 2018. Comprises of 17 pages and 4 figures

    Journal ref: Phil. Trans. R. Soc. A volume 376, issue 2118, 376 20170237, 2018

  43. Knowledge Transfer Between Artificial Intelligence Systems

    Authors: Ivan Y. Tyukin, Alexander N. Gorban, Konstantin Sofeikov, Ilya Romanenko

    Abstract: We consider the fundamental question: how a legacy "student" Artificial Intelligent (AI) system could learn from a legacy "teacher" AI system or a human expert without complete re-training and, most importantly, without requiring significant computational resources. Here "learning" is understood as an ability of one system to mimic responses of the other and vice-versa. We call such learning an Ar… ▽ More

    Submitted 14 November, 2017; v1 submitted 5 September, 2017; originally announced September 2017.

    MSC Class: 68T05; 68T30

    Journal ref: Front Neurorobot. 2018; 12: 49

  44. arXiv:1704.03549  [pdf, other

    cs.CV

    Attention-based Extraction of Structured Information from Street View Imagery

    Authors: Zbigniew Wojna, Alex Gorban, Dar-Shyang Lee, Kevin Murphy, Qian Yu, Yeqing Li, Julian Ibarz

    Abstract: We present a neural network model - based on CNNs, RNNs and a novel attention mechanism - which achieves 84.2% accuracy on the challenging French Street Name Signs (FSNS) dataset, significantly outperforming the previous state of the art (Smith'16), which achieved 72.46%. Furthermore, our new method is much simpler and more general than the previous approach. To demonstrate the generality of our m… ▽ More

    Submitted 20 August, 2017; v1 submitted 11 April, 2017; originally announced April 2017.

    Comments: Updated references, added link to the source code

  45. Stochastic Separation Theorems

    Authors: A. N. Gorban, I. Y. Tyukin

    Abstract: The problem of non-iterative one-shot and non-destructive correction of unavoidable mistakes arises in all Artificial Intelligence applications in the real world. Its solution requires robust separation of samples with errors from samples where the system works properly. We demonstrate that in (moderately) high dimension this separation could be achieved with probability close to one by linear dis… ▽ More

    Submitted 3 August, 2017; v1 submitted 3 March, 2017; originally announced March 2017.

    Comments: 6 pages, accepted for publication in Neural Networks (Letter section)

    MSC Class: 68T10 ACM Class: I.2.6

    Journal ref: Neural Networks 94 (2017), 255-259

  46. One-Trial Correction of Legacy AI Systems and Stochastic Separation Theorems

    Authors: Alexander N. Gorban, Ilya Romanenko, Richard Burton, Ivan Y. Tyukin

    Abstract: We consider the problem of efficient "on the fly" tuning of existing, or {\it legacy}, Artificial Intelligence (AI) systems. The legacy AI systems are allowed to be of arbitrary class, albeit the data they are using for computing interim or final decision responses should posses an underlying structure of a high-dimensional topological real vector space. The tuning method that we propose enables d… ▽ More

    Submitted 13 February, 2019; v1 submitted 3 October, 2016; originally announced October 2016.

    Journal ref: Information Sciences, 484, 237-254, 2019

  47. Piece-wise quadratic approximations of arbitrary error functions for fast and robust machine learning

    Authors: A. N. Gorban, E. M. Mirkes, A. Zinovyev

    Abstract: Most of machine learning approaches have stemmed from the application of minimizing the mean squared distance principle, based on the computationally efficient quadratic optimization methods. However, when faced with high-dimensional and noisy data, the quadratic error functionals demonstrated many weaknesses including high sensitivity to contaminating factors and dimensionality curse. Therefore,… ▽ More

    Submitted 21 August, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: Edited and extended version with algortihms of regularized regression

    Journal ref: Neural Networks, Volume 84, December 2016, 28-38

  48. The THUMOS Challenge on Action Recognition for Videos "in the Wild"

    Authors: Haroon Idrees, Amir R. Zamir, Yu-Gang Jiang, Alex Gorban, Ivan Laptev, Rahul Sukthankar, Mubarak Shah

    Abstract: Automatically recognizing and localizing wide ranges of human actions has crucial importance for video understanding. Towards this goal, the THUMOS challenge was introduced in 2013 to serve as a benchmark for action recognition. Until then, video action recognition, including THUMOS challenge, had focused primarily on the classification of pre-segmented (i.e., trimmed) videos, which is an artifici… ▽ More

    Submitted 21 April, 2016; originally announced April 2016.

    Comments: Preprint submitted to Computer Vision and Image Understanding

  49. Robust principal graphs for data approximation

    Authors: A. N. Gorban, E. M. Mirkes, A. Zinovyev

    Abstract: Revealing hidden geometry and topology in noisy data sets is a challenging task. Elastic principal graph is a computationally efficient and flexible data approximator based on embedding a graph into the data space and minimizing the energy functional penalizing the deviation of graph nodes both from data points and from pluri-harmonic configuration (generalization of linearity). The structure of p… ▽ More

    Submitted 24 November, 2016; v1 submitted 22 March, 2016; originally announced March 2016.

    Comments: A talk given at ECDA2015 (European Conference on Data Analysis, September 2nd to 4th 2015, University of Essex, Colchester, UK), to be published in Archives of Data Science

    Journal ref: Archives of Data Science, Series A, Vol. 2, No. 1, 2017

  50. arXiv:1511.02917  [pdf, other

    cs.CV cs.AI

    Detecting events and key actors in multi-person videos

    Authors: Vignesh Ramanathan, Jonathan Huang, Sami Abu-El-Haija, Alexander Gorban, Kevin Murphy, Li Fei-Fei

    Abstract: Multi-person event recognition is a challenging task, often with many people active in the scene but only a small subset contributing to an actual event. In this paper, we propose a model which learns to detect events in such videos while automatically "attending" to the people responsible for the event. Our model does not use explicit annotations regarding who or where those people are during tra… ▽ More

    Submitted 16 March, 2016; v1 submitted 9 November, 2015; originally announced November 2015.

    Comments: Accepted for publication in CVPR'16