Paper: | PS-1A.38 | ||
Session: | Poster Session 1A | ||
Location: | H Lichthof | ||
Session Time: | Saturday, September 14, 16:30 - 19:30 | ||
Presentation Time: | Saturday, September 14, 16:30 - 19:30 | ||
Presentation: | Poster | ||
Publication: | 2019 Conference on Cognitive Computational Neuroscience, 13-16 September 2019, Berlin, Germany | ||
Paper Title: | Optimal planning to plan: People partially plan based on plan specificity | ||
Manuscript: | Click here to view manuscript | ||
License: | ![]() This work is licensed under a Creative Commons Attribution 3.0 Unported License. |
||
DOI: | https://doi.org/10.32470/CCN.2019.1423-0 | ||
Authors: | Mark Ho, Princeton University, United States; David Abel, Brown University, United States; Jonathan Cohen, Princeton University, United States; Michael Littman, Brown University, United States; Thomas Griffiths, Princeton University, United States | ||
Abstract: | Planning requires simulating future choices and consequences. This process is costly. But, it is also useful since it allows people to make choices in the now that have desirable future outcomes. What is a rational way to balance the immediate computational costs and future benefits of planning? Here, we argue that this involves planning to plan---adaptively deciding what actions to plan and when to plan those actions. To formalize this intuition, we develop the ideas of partial planning and information-theoretic simulation costs. Together, these allow us to define a novel Bellman objective that includes both environmental rewards and planning costs, which we solve using a gradient-based planning-to-plan algorithm. A key prediction of our account is that when the value of an immediate action depends on a more specific plan, the computational cost associated with that action will be higher. To test this qualitative prediction, we measure participant response times when solving a Gridworld task. We find evidence for our account of planning costs, indicating that people rationally plan to plan. Our formulation and results provide new insight into the meta-planning processes that support the scale and sophistication of human problem solving. |