Showing 1–5 of 5 results for author: Davydov, V

  1. arXiv:2401.13486  [pdf, other]

    math.NA cs.AI cs.LG physics.app-ph

    Separable Physics-Informed Neural Networks for the solution of elasticity problems

    Authors: Vasiliy A. Es'kin, Danil V. Davydov, Julia V. Gur'eva, Alexey O. Malkhanov, Mikhail E. Smorkalov

    Abstract: A method for solving elasticity problems based on separable physics-informed neural networks (SPINN) in conjunction with the deep energy method (DEM) is presented. Numerical experiments have been carried out for a number of problems showing that this method has a significantly higher convergence rate and accuracy than the vanilla physics-informed neural networks (PINN) and even SPINN based on a sy…

    Submitted 24 January, 2024; originally announced January 2024.

    MSC Class: 68T07; 65Z05; 65M99
    ACM Class: I.2.1; I.2.7; J.2

  2. arXiv:2304.02282  [pdf, other]

    math.NA cs.AI physics.comp-ph

    About optimal loss function for training physics-informed neural networks under respecting causality

    Authors: Vasiliy A. Es'kin, Danil V. Davydov, Ekaterina D. Egorova, Alexey O. Malkhanov, Mikhail A. Akhukov, Mikhail E. Smorkalov

    Abstract: A method is presented that allows one to reduce a problem described by differential equations with initial and boundary conditions to a problem described only by differential equations. The advantage of using the modified problem for physics-informed neural networks (PINNs) methodology is that it becomes possible to represent the loss function in the form of a single term associated with differentia…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 25 pages, 7 figures, 6 tables

    MSC Class: 68T07 (Primary) 65Z05; 65M99 (Secondary)
    ACM Class: I.2.1; I.2.7; J.2

  3. arXiv:2108.06148  [pdf, other]

    cs.LG cs.AI

    Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments

    Authors: Vasilii Davydov, Alexey Skrynnik, Konstantin Yakovlev, Aleksandr I. Panov

    Abstract: In this paper, we consider the problem of multi-agent navigation in partially observable grid environments. This problem is challenging for centralized planning approaches as they, typically, rely on the full knowledge of the environment. We suggest utilizing the reinforcement learning approach when the agents, first, learn the policies that map observations to actions and then follow these polici…

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: This is a preprint of the paper accepted to RCAI 2021. It contains 11 pages and 5 figures

  4. arXiv:2006.09939  [pdf, other]

    cs.LG cs.AI

    Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations

    Authors: Alexey Skrynnik, Aleksey Staroverov, Ermek Aitygulov, Kirill Aksenov, Vasilii Davydov, Aleksandr I. Panov

    Abstract: Currently, deep reinforcement learning (RL) shows impressive results in complex gaming and robotic environments. Often these results are achieved at the expense of huge computational costs and require an incredible number of episodes of interaction between the agent and the environment. There are two main approaches to improving the sample efficiency of reinforcement learning methods - using hiera…

    Submitted 17 June, 2020; originally announced June 2020.

  5. arXiv:1912.08664  [pdf, other]

    cs.AI

    Hierarchical Deep Q-Network from Imperfect Demonstrations in Minecraft

    Authors: Alexey Skrynnik, Aleksey Staroverov, Ermek Aitygulov, Kirill Aksenov, Vasilii Davydov, Aleksandr I. Panov

    Abstract: We present Hierarchical Deep Q-Network (HDQfD) that took first place in the MineRL competition. HDQfD works on imperfect demonstrations and utilizes the hierarchical structure of expert trajectories. We introduce the procedure of extracting an effective sequence of meta-actions and subgoals from demonstration data. We present a structured task-dependent replay buffer and adaptive prioritizing tech…

    Submitted 13 July, 2020; v1 submitted 18 December, 2019; originally announced December 2019.