ut s OpenAI + DotA 2kanmy/courses/6101_1820/s13.pdf · ut s OpenAI Rapid[1] … a general-purpose...
Wism
ut
Labs
OpenAI + DotA 2
DotA (Defense of the Ancients) 2 Gameplay
DotA and StarCraft II Challenges [1, 2, 3]
OpenAI DotA Approach [1]
OpenAI DotA ‘Cheats’ [1, 3]
OpenAI Five Network Architecture [1, 6]
OpenAI Five Model Structure [1]
OpenAI Five Exploration [1]
OpenAI Rapid [1]… a general-purpose RL training system
Proximal Policy Optimization [1, 4, 5]
Transfer Learning for RL [1]
Open Challenges & Moving Forward
Reference Materials