Representation formulas for maximal monotone operators of type (D) in Banach spaces whose dual spaces are strictly convex
Authors:
Nguyen B. Tran,
Tran N. Nguyen,
Huynh M. Hien
Abstract:
This work deals with a maximal monotone operator $A$ of type (D) in a Banach space whose dual space is strictly convex. We establish some representations for the value $Ax$ at a given point $x$ via its values at nearby points of $x$. We show that the faces of $Ax$ are contained in the set of all weak$^*$ convergent limits of bounded nets of the operator at nearby points of $x$, then we obtain a re…
▽ More
This work deals with a maximal monotone operator $A$ of type (D) in a Banach space whose dual space is strictly convex. We establish some representations for the value $Ax$ at a given point $x$ via its values at nearby points of $x$. We show that the faces of $Ax$ are contained in the set of all weak$^*$ convergent limits of bounded nets of the operator at nearby points of $x$, then we obtain a representation for $Ax$ by use of this set. In addition, representations for the support function of $Ax$ based on the minimal-norm selection of the operator in certain Banach spaces are given.
△ Less
Submitted 30 December, 2023;
originally announced January 2024.
Neural networks with motivation
Authors:
Sergey A. Shuvaev,
Ngoc B. Tran,
Marcus Stephenson-Jones,
Bo Li,
Alexei A. Koulakov
Abstract:
How can animals behave effectively in conditions involving different motivational contexts? Here, we propose how reinforcement learning neural networks can learn optimal behavior for dynamically changing motivational salience vectors. First, we show that Q-learning neural networks with motivation can navigate in environment with dynamic rewards. Second, we show that such networks can learn complex…
▽ More
How can animals behave effectively in conditions involving different motivational contexts? Here, we propose how reinforcement learning neural networks can learn optimal behavior for dynamically changing motivational salience vectors. First, we show that Q-learning neural networks with motivation can navigate in environment with dynamic rewards. Second, we show that such networks can learn complex behaviors simultaneously directed towards several goals distributed in an environment. Finally, we show that in Pavlovian conditioning task, the responses of the neurons in our model resemble the firing patterns of neurons in the ventral pallidum (VP), a basal ganglia structure involved in motivated behaviors. We show that, similarly to real neurons, recurrent networks with motivation are composed of two oppositely-tuned classes of neurons, responding to positive and negative rewards. Our model generates predictions for the VP connectivity. We conclude that networks with motivation can rapidly adapt their behavior to varying conditions without changes in synaptic strength when expected reward is modulated by motivation. Such networks may also provide a mechanism for how hierarchical reinforcement learning is implemented in the brain.
△ Less
Submitted 18 November, 2019; v1 submitted 22 June, 2019;
originally announced June 2019.