Self segmentation of sequences


chical reinforcement learning that does not rely on a pri ori hierarchical structures Thus the approach deals with a more di cult problem compared with existing work It in volves learning to segment sequences to create hierarchical structures based on reinforcement received during task ex ecution with di erent levels of control communicating with each other through sharing reinforcement estimates obtained by each others The algorithm segments sequences to re duce non Markovian temporal dependencies to facilitate the learning of the overall task Initial experiments demon strated the basic promise of the approach..



    Upload a copy of this work     Papers currently archived: 94,452

External links

  • This entry has no external links. Add one.
Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

  • Only published works are available at libraries.


Added to PP

8 (#1,356,448)

6 months
8 (#528,674)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

Desiderata for cognitive architectures.Ron Sun - 2004 - Philosophical Psychology 17 (3):341-373.

Add more citations

References found in this work

No references found.

Add more references