reinforcement learning architecture