Deep Reinforcement Learning

Value-based:

Policy-based: