OpenAI Five playing the best OpenAI employee team. Follow us on Twitch to view the live broadcast, or request an invite to attend in person! To benchmark our progress, we'll host a match versus top players on August 5th. This indicates that reinforcement learning can yield long-term planning with large but achievable scale - without fundamental advances, contrary to our own expectations upon starting the project. Using a separate LSTM for each hero and no human data, it learns recognizable strategies. It trains using a scaled-up version of Proximal Policy Optimization running on 256 GPUs and 128,000 CPU cores - a larger-scale version of the system we built to play the much-simpler solo variant of the game last year. OpenAI Five plays 180 years worth of games against itself every day, learning via self-play. We may not succeed: Dota 2 is one of the most popular and complex esports games in the world, with creative and motivated professionals who train year-round to earn part of Dota's annual $40M prize pool (the largest of any esports game). While today we play with restrictions, we aim to beat a team of top professionals at The International in August subject only to a limited set of heroes. Our team of five neural networks, OpenAI Five, has started to defeat amateur human teams at Dota 2.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |