TDRL

temporal-difference reinforcement learning model(English)