Page - 98 - in Limina - Grazer theologische Perspektiven, Volume 3:2

Image of the Page - 98 -

Text of the Page - 98 -

98 | www.limina-graz.eu Sara Lumbreras and Lluis Oviedo | Belief networks as complex systems Reinforcement Learning as a metaphor for decision making Another AI technique that can enrich the brain-computer metaphor is Re- inforcement Learning (RL). RL simulates an agent that can take actions in her environment. These actions can lead to a reward. The agent learns, by trial and error, the consequences of her actions. Then, she can take the de- cision that is optimal for her situation (we will refer to this as her state).1 We define “optimal” according to a given objective, described by means of an objective mathematical formula. The power of RL and Dynamic Control, its underlying technique, is that by defining only the value of the differ- ent final outcomes, it can derive the value of the intermediate outcomes, so that the method arrives at the optimal strategy at each stage even if it is not the final one, as the final value cascades into the nearer ones. For in- stance, if we apply RL to playing chess, the final outcome can be winning or not. The next-to-last states can be described in terms of their probability of resulting in a winning situation. Then, the ones before can be evaluated in terms of the next ones, and so on. Reinforcement Learning therefore provides a basic mechanism to understand how, from meta-goals, we can derive intermediate goals and optimal strategies. Belief systems have several functions as has been recognized in the litera- ture (Frank 1977). One of these functions is to evaluate a given state in or- der to select the next most desirable course of action, so that they have a key role in these processes. 3 The limitations of the pattern-recognition simile The brain is far more complex than an ANN. A single neuron is an entity of a remarkable complexity, which cannot be reduced to a mere simple math- ematical operation. We should remember that even unicellular organisms exhibit complex behaviour: amoebas hunt, protozoa build complex shells from minuscule specks of dust. A living cell is a wonderfully complex being, and we should keep in mind that ANNs are only vaguely based on the work- ings of the brain; they are not modelled as its equivalent. Reinforcement Learning provides a mechanism to understand how, from meta-goals, we can derive intermediate goals and optimal strategies. 1 For more context on these tech- niques, we refer the reader to Sut- ton/Barto 2018.

back to the book Limina - Grazer theologische Perspektiven, Volume 3:2"

Limina Grazer theologische Perspektiven, Volume 3:2

Title: Limina
Subtitle: Grazer theologische Perspektiven
Volume: 3:2
Editor: Karl Franzens University Graz
Date: 2020
Language: German
License: CC BY-NC 4.0
Size: 21.4 x 30.1 cm
Pages: 270
Categories: Zeitschriften LIMINA - Grazer theologische Perspektiven