Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Last revision Both sides next revision
memory [2018/10/17 11:48]
admin
memory [2018/10/17 11:55]
admin
Line 194: Line 194:
  
 we introduce a new paradigm for reinforcement learning where agents use recall of specific memories to credit actions from the past, allowing them to solve problems that are intractable for existing algorithms. This paradigm broadens the scope of problems that can be investigated in AI and offers a mechanistic account of behaviors that may inspire computational models in neuroscience,​ psychology, and behavioral economics. we introduce a new paradigm for reinforcement learning where agents use recall of specific memories to credit actions from the past, allowing them to solve problems that are intractable for existing algorithms. This paradigm broadens the scope of problems that can be investigated in AI and offers a mechanistic account of behaviors that may inspire computational models in neuroscience,​ psychology, and behavioral economics.
 +
 +Temporal Value Transport is a heuristic algorithm but one that expresses coherent principles we
 +believe will endure: past events are encoded, stored, retrieved, and revaluated. TVT fundamentally
 +intertwines memory systems and reinforcement learning: the attention weights on memories
 +specifically modulate the reward credited to past events.