Estimating scale-invariant future in continuous time

Tiganj, Zoran; Gershman, Samuel J.; Sederberg, Per B.; Howard, Marc W.

Computer Science > Artificial Intelligence

arXiv:1802.06426 (cs)

[Submitted on 18 Feb 2018 (v1), last revised 26 Oct 2018 (this version, v3)]

Title:Estimating scale-invariant future in continuous time

Authors:Zoran Tiganj, Samuel J. Gershman, Per B. Sederberg, Marc W. Howard

View PDF

Abstract:Natural learners must compute an estimate of future outcomes that follow from a stimulus in continuous time. Widely used reinforcement learning algorithms discretize continuous time and estimate either transition functions from one step to the next (model-based algorithms) or a scalar value of exponentially-discounted future reward using the Bellman equation (model-free algorithms). An important drawback of model-based algorithms is that computational cost grows linearly with the amount of time to be simulated. On the other hand, an important drawback of model-free algorithms is the need to select a time-scale required for exponential discounting. We present a computational mechanism, developed based on work in psychology and neuroscience, for computing a scale-invariant timeline of future outcomes. This mechanism efficiently computes an estimate of inputs as a function of future time on a logarithmically-compressed scale, and can be used to generate a scale-invariant power-law-discounted estimate of expected future reward. The representation of future time retains information about what will happen when. The entire timeline can be constructed in a single parallel operation which generates concrete behavioral and neural predictions. This computational mechanism could be incorporated into future reinforcement learning algorithms.

Comments:	25 pages, 10 figures
Subjects:	Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:1802.06426 [cs.AI]
	(or arXiv:1802.06426v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1802.06426

Submission history

From: Zoran Tiganj [view email]
[v1] Sun, 18 Feb 2018 19:09:28 UTC (1,483 KB)
[v2] Thu, 26 Jul 2018 21:43:23 UTC (2,185 KB)
[v3] Fri, 26 Oct 2018 23:46:21 UTC (2,565 KB)

Computer Science > Artificial Intelligence

Title:Estimating scale-invariant future in continuous time

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Estimating scale-invariant future in continuous time

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators