Options
The formal foundation of an evolutionary theory of reinforcement
Borgstede, Matthias; Eggert, Frank (2022): „The formal foundation of an evolutionary theory of reinforcement“. Bamberg: Otto-Friedrich-Universität.
Faculty/Professorship:
Author:
Publisher Information:
Year of publication:
2022
Pages:
Source/Other editions:
Behavioural processes : an international journal of comparative and physiological ethology. 186 (2021), May, S. 1-30. - DOI: 10.1016/j.beproc.2021.104370
Year of first publication:
2021
Language:
English
Abstract:
Reinforcement learning is often described by analogy to natural selection. However, there is no coherent theory relating reinforcement learning to evolution within a single formal model of selection. This paper provides the formal foundation of such a unified theory. The model is based on the most general description of natural selection as given by the Price equation. We extend the Price equation to cover reinforcement learning as the result of a behavioral selection process within individuals and relate it to the principle of natural selection via the concept of statistical fitness predictors by means of a multilevel model of behavioral selection.
The main result is the covariance-based law of effect, which describes reinforcement learning on a molar level by means of the covariance between behavioral allocation and a statistical fitness predictor. We further demonstrate how this abstract principle can be applied to derive theoretical explanations of various empirical findings, like conditioned reinforcement, blocking, matching and response deprivation.
Our model is the first to apply the abstract principle of selection to derive a unified description of reinforcement learning and natural selection within a single model. It provides a general analytical tool for behavioral psychology in a similar way that the theory of natural selection does for evolutionary biology. We thus lay the formal foundation of a general theory of reinforcement as the result of behavioral selection on multiple levels.
The main result is the covariance-based law of effect, which describes reinforcement learning on a molar level by means of the covariance between behavioral allocation and a statistical fitness predictor. We further demonstrate how this abstract principle can be applied to derive theoretical explanations of various empirical findings, like conditioned reinforcement, blocking, matching and response deprivation.
Our model is the first to apply the abstract principle of selection to derive a unified description of reinforcement learning and natural selection within a single model. It provides a general analytical tool for behavioral psychology in a similar way that the theory of natural selection does for evolutionary biology. We thus lay the formal foundation of a general theory of reinforcement as the result of behavioral selection on multiple levels.
GND Keywords: ;
Verhaltenspsychologie
Operante Konditionierung
Keywords:
selection by consequences, behavioral selection, natural selection, reinforcement learning, Price equation, multilevel model of behavioral selection
DDC Classification:
RVK Classification:
Peer Reviewed:
Yes:
International Distribution:
Yes:
Type:
Article
published:
November 21, 2022
Permalink
https://fis.uni-bamberg.de/handle/uniba/50369