Skip to main content

Showing 1–7 of 7 results for author: Kuzovkin, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2212.08232  [pdf, other

    cs.LG cs.RO

    Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling

    Authors: Ashish Kumar, Ilya Kuzovkin

    Abstract: Recent advances in batch (offline) reinforcement learning have shown promising results in learning from available offline data and proved offline reinforcement learning to be an essential toolkit in learning control policies in a model-free setting. An offline reinforcement learning algorithm applied to a dataset collected by a suboptimal non-learning-based algorithm can result in a policy that ou… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  2. arXiv:2010.08715  [pdf, other

    q-bio.NC cs.AI cs.LG

    Understanding Information Processing in Human Brain by Interpreting Machine Learning Models

    Authors: Ilya Kuzovkin

    Abstract: The thesis explores the role machine learning methods play in creating intuitive computational models of neural processing. Combined with interpretability techniques, machine learning could replace human modeler and shift the focus of human effort to extracting the knowledge from the ready-made models and articulating that knowledge into intuitive descroptions of reality. This perspective makes th… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: Defended on September 22, 2020 (video recording at https://www.uttv.ee/naita?id=30480). Supervisor: Dr. Raul Vicente Zafra (Computational Neuroscience Lab, University of Tarty, Estonia). Opponents: Dr. Fabian Sinz (IRG Neuronal Intelligence, University of Tübingen, Germany), Dr. Tim C Kietzmann (Donders Institute for Brain, Cognition and Behaviour, Radboud University, Netherlands)

    Report number: Dissertationes Informaticae Universitatis Tartuensis 19 MSC Class: 68T07; 68T30; 92-08; 92-10; 92C20; 92B20 ACM Class: I.2.6; I.5.4; J.3

  3. arXiv:1910.08639  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    OffWorld Gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research

    Authors: Ashish Kumar, Toby Buckley, John B. Lanier, Qiaozhi Wang, Alicia Kavelaars, Ilya Kuzovkin

    Abstract: Success stories of applied machine learning can be traced back to the datasets and environments that were put forward as challenges for the community. The challenge that the community sets as a benchmark is usually the challenge that the community eventually solves. The ultimate challenge of reinforcement learning research is to train real agents to operate in the real environment, but until now t… ▽ More

    Submitted 14 December, 2020; v1 submitted 18 October, 2019; originally announced October 2019.

    MSC Class: 68T40; 68T07 ACM Class: I.2.9; I.2.6; C.4

  4. arXiv:1907.10509  [pdf, ps, other

    eess.SP cs.LG stat.ML

    Direct information transfer rate optimisation for SSVEP-based BCI

    Authors: Anti Ingel, Ilya Kuzovkin, Raul Vicente

    Abstract: In this work, a classification method for SSVEP-based BCI is proposed. The classification method uses features extracted by traditional SSVEP-based BCI methods and finds optimal discrimination thresholds for each feature to classify the targets. Optimising the thresholds is formalised as a maximisation task of a performance measure of BCIs called information transfer rate (ITR). However, instead o… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

    Journal ref: Journal of neural engineering, 16(1), 016016 (2018)

  5. arXiv:1901.11529  [pdf, other

    cs.AI

    Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

    Authors: Himanshu Sahni, Toby Buckley, Pieter Abbeel, Ilya Kuzovkin

    Abstract: Reinforcement Learning (RL) algorithms typically require millions of environment interactions to learn successful policies in sparse reward settings. Hindsight Experience Replay (HER) was introduced as a technique to increase sample efficiency by reimagining unsuccessful trajectories as successful ones by altering the originally intended goals. However, it cannot be directly applied to visual envi… ▽ More

    Submitted 29 October, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: To appear in Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada. Code available at https://github.com/offworld-projects/research-halgan

  6. Combining Static and Dynamic Features for Multivariate Sequence Classification

    Authors: Anna Leontjeva, Ilya Kuzovkin

    Abstract: Model precision in a classification task is highly dependent on the feature space that is used to train the model. Moreover, whether the features are sequential or static will dictate which classification method can be applied as most of the machine learning algorithms are designed to deal with either one or another type of data. In real-life scenarios, however, it is often the case that both stat… ▽ More

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: Presented at IEEE DSAA 2016

  7. arXiv:1511.08779  [pdf, other

    cs.AI cs.LG q-bio.NC

    Multiagent Cooperation and Competition with Deep Reinforcement Learning

    Authors: Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan Korjus, Juhan Aru, Jaan Aru, Raul Vicente

    Abstract: Multiagent systems appear in most social, economical, and political situations. In the present work we extend the Deep Q-Learning Network architecture proposed by Google DeepMind to multiagent environments and investigate how two agents controlled by independent Deep Q-Networks interact in the classic videogame Pong. By manipulating the classical rewarding scheme of Pong we demonstrate how competi… ▽ More

    Submitted 27 November, 2015; originally announced November 2015.