# yzmir-deep-rl

Reinforcement learning: DQN, PPO, SAC, reward shaping, and exploration (13 skills).