Stochasticity, Nonlinear Value Functions, and Update Rules in Learning Aesthetic Biases

Frontiers in Human Neuroscience 15:639081 (2021)

Abstract

A theoretical framework for the reinforcement learning of aesthetic biases was recently proposed based on brain circuitries revealed by neuroimaging. A model grounded in that framework accounted for interesting features of human aesthetic biases, including individuality, cultural predispositions, stochastic dynamics of learning and aesthetic biases, and the peak-shift effect. Despite this success, a potential weakness was that the value function used to predict reward was linear, that is, the learning process assumed a linear relationship between reward and sensory stimuli. Such linearity is common in reinforcement-learning models in neuroscience, but it can be problematic because neural mechanisms and the dependence of reward on sensory stimuli are typically nonlinear. Here, we analyze learning performance with models that include optimal nonlinear value functions. We also compare updating the free parameters of the value functions with the delta rule, which neuroscience models use frequently, vs. updating with a new Phi rule that takes the structure of the nonlinearities into account. Our computer simulations showed that optimal nonlinear value functions reduced learning errors when the reward models were nonlinear, and the new Phi rule reduced these errors further. These improvements were accompanied by a straightening of the trajectories of the vector of free parameters in its phase space, meaning that the process became more efficient at learning to predict reward. Surprisingly, however, this improved efficiency had a complex relationship with the rate of learning. Finally, the stochasticity arising from the probabilistic sampling of sensory stimuli, rewards, and motivations helped the learning process narrow the range of free parameters toward nearly optimal outcomes. We therefore suggest that value functions and update rules optimized for social and ecological constraints are ideal for learning aesthetic biases.
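
As a purely illustrative aside (not taken from the paper itself): the sketch below shows a standard delta-rule update applied to a nonlinear value function V(s) = f(w·s). The logistic choice of f, the synthetic reward model, and all parameter values are assumptions made only for this example; the paper's optimal nonlinearities and its Phi rule are not reproduced here, since their exact forms are specified in the full text.

    import numpy as np

    # Illustrative sketch only: delta-rule learning of a nonlinear value
    # function V(s) = f(w . s). The logistic f, the reward model, and all
    # parameter values below are assumptions for this example, not the
    # optimal nonlinearities or the Phi rule derived in the paper.

    rng = np.random.default_rng(0)

    def f(u):
        # Hypothetical logistic nonlinearity.
        return 1.0 / (1.0 + np.exp(-u))

    def df(u):
        # Derivative of the logistic nonlinearity.
        v = f(u)
        return v * (1.0 - v)

    dim = 3                               # dimensionality of the sensory stimulus
    w = 0.1 * rng.normal(size=dim)        # free parameters of the value function
    alpha = 0.05                          # learning rate
    w_true = np.array([1.0, -0.5, 0.3])   # assumed "true" reward parameters

    for t in range(5000):
        s = rng.normal(size=dim)                  # stochastically sampled stimulus
        r = f(w_true @ s) + 0.05 * rng.normal()   # noisy, nonlinear reward
        u = w @ s
        delta = r - f(u)                          # reward-prediction error
        w += alpha * delta * df(u) * s            # delta rule: gradient step on the error

    print("learned parameters:", w)

The point of the sketch is only that, with a nonlinear f, the delta rule's gradient step acquires the extra factor f'(w·s) that a purely linear value function lacks; the Phi rule, by contrast, is described in the paper as exploiting the structure of the nonlinearities more directly, which this sketch does not attempt.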

Links

PhilArchive





Similar books and articles

A Proposal for an Action Planning Method for Autonomous Mobile Robots Using Reinforcement Learning. Igarashi Harukazu - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16:501-509.
A Study of Reinforcement Functions in the Profit Sharing Method. Tatsumi Shoji & Uemura Wataru - 2004 - Transactions of the Japanese Society for Artificial Intelligence 19:197-203.
Learning a Rational Policy That Avoids Penalties. Tsuboi Sogo & Miyazaki Kazuteru - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16 (2):185-192.
A Profit Sharing Method That Does Not Cling to Past Experience. Ueno Atsushi & Uemura Wataru - 2006 - Transactions of the Japanese Society for Artificial Intelligence 21:81-93.

Analytics

Added to PP
2021-05-10

