Technical Program

Paper Detail

Paper:	PS-1A.38
Session:	Poster Session 1A
Location:	H Lichthof
Session Time:	Saturday, September 14, 16:30 - 19:30
Presentation Time:	Saturday, September 14, 16:30 - 19:30
Presentation:	Poster
Publication:	2019 Conference on Cognitive Computational Neuroscience, 13-16 September 2019, Berlin, Germany
Paper Title:	Optimal planning to plan: People partially plan based on plan specificity
Manuscript:	Click here to view manuscript
License:	This work is licensed under a Creative Commons Attribution 3.0 Unported License.
DOI:	https://doi.org/10.32470/CCN.2019.1423-0
Authors:	Mark Ho, Princeton University, United States; David Abel, Brown University, United States; Jonathan Cohen, Princeton University, United States; Michael Littman, Brown University, United States; Thomas Griffiths, Princeton University, United States
Abstract:	Planning requires simulating future choices and consequences. This process is costly. But, it is also useful since it allows people to make choices in the now that have desirable future outcomes. What is a rational way to balance the immediate computational costs and future benefits of planning? Here, we argue that this involves planning to plan---adaptively deciding what actions to plan and when to plan those actions. To formalize this intuition, we develop the ideas of partial planning and information-theoretic simulation costs. Together, these allow us to define a novel Bellman objective that includes both environmental rewards and planning costs, which we solve using a gradient-based planning-to-plan algorithm. A key prediction of our account is that when the value of an immediate action depends on a more specific plan, the computational cost associated with that action will be higher. To test this qualitative prediction, we measure participant response times when solving a Gridworld task. We find evidence for our account of planning costs, indicating that people rationally plan to plan. Our formulation and results provide new insight into the meta-planning processes that support the scale and sophistication of human problem solving.