Learning for Planning
This example uses a planner in a simple domain. Initially there are no operators but a set of actions with unknown preconditions and effects. Over time the system will learn operators from observerd state transitions which will allow it to hopefully reach the goal.
Loop:
Try to find a plan
No plan found: try random action
Plan found: execute plan
Record state transition data
Learn a decision tree to predict action effects given action and state
Convert decision tree to operators
Go here for instructions to run this example.
Last modified: 17 March 2025