We design the white-paper which purposes a new reinforcement model **PANDA (Policy Advisor Network and Decision Architecture)** applies the reinforcement learning to arrange consumer jobs for certain computer power resource.