6 votes

OpenAI Five

Posted June 26, 2018 by joelthelion

Tags: reinforcementlearning machinelearning

https://blog.openai.com/openai-five/

Link information

This data is scraped automatically and may be incorrect.

Published: Jun 25 2018
Word count: 1883 words

3 comments

[3]
joelthelion (OP)
June 26, 2018
Link
Does anyone know if they have a more rigorous explanation of what they did somewhere? I'd especially like to look at the network they're using and their input layers in particular.

Does anyone know if they have a more rigorous explanation of what they did somewhere? I'd especially like to look at the network they're using and their input layers in particular.
1. [2]
  Crespyl
  June 26, 2018
  Link Parent
  There's some more info linked in the blog post, I was able to find some more text including a bit on infrastructure, a writeup of the reward function, and a network architecture diagram (PDF...
  
  There's some more info linked in the blog post, I was able to find some more text including a bit on infrastructure, a writeup of the reward function, and a network architecture diagram (PDF warning). They do say they're not ready to talk in depth about the agent internals yet, but hopefully they'll share more in the future.
  
  To me, this is way more exciting than the 1v1 from last year, even with the limited mirror matchup mode, but I'm still skeptical/curious/excited to see if they can actually keep up this pace and beat the full version of the game by next year.
  
  I really want to see them try to tackle the kind of long-term knowledge heavy planning that goes into the draft pick/ban stage and selecting items appropriate for the draft, handling counter picks dynamically, vision control, etc.
  
  Hopefully they make the bots available to try out again, I'd love try playing against them.
  
  3 votes
  1. joelthelion (OP)
    June 27, 2018
    Link Parent
    Thanks a lot! My current focus is trying to apply RL to Game of Drones, which is a much simpler game. But of course I don't have the skills or the resources of a group like OpenAI, so I find it...
    
    Thanks a lot! My current focus is trying to apply RL to Game of Drones, which is a much simpler game. But of course I don't have the skills or the resources of a group like OpenAI, so I find it extremely hard to get a reasonable level. Hopefully I get get some ideas from their contributions.
    
    1 vote