Learning Montezuma’s Revenge from a single demonstration

By Sophia Blake

We’ve trained an agent to achieve a high score of 74,500 on Montezuma’s Revenge from a single human demonstration, better than any previously published result. Our algorithm is simple: the agent plays a sequence of games starting from carefully chosen states from the demonstration, and learns from them by optimizing the game score using PPO, the same reinforcement learning algorithm that underpins OpenAI Five.
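
To make the idea of starting games from demonstration states more concrete, here is a minimal sketch in Python. The post does not say how the start states are chosen, so everything here is an assumption made for illustration: the `DemoCurriculum` class, its `window` and `success_threshold` parameters, and the rule of beginning near the end of the demonstration and moving the start point earlier once the agent does well enough. The PPO update itself is left out; `success_rate` stands in for whatever the learner reports after a batch of episodes.

```python
import random
from dataclasses import dataclass


@dataclass
class DemoCurriculum:
    """Hypothetical helper that picks episode start points from a demonstration."""

    demo_length: int                # number of saved states in the demonstration
    window: int = 3                 # assumed: sample starts from a small window
    success_threshold: float = 0.2  # assumed: fraction of good episodes required

    def __post_init__(self):
        # Assumed schedule: begin at the last demonstration state, where
        # only a short stretch of play remains.
        self.start_index = self.demo_length - 1

    def sample_start(self, rng):
        """Return a demonstration index to reset the game to."""
        low = max(0, self.start_index - self.window + 1)
        return rng.randint(low, self.start_index)

    def update(self, success_rate):
        """Move the start point one step earlier once the agent succeeds often enough."""
        if success_rate >= self.success_threshold and self.start_index > 0:
            self.start_index -= 1
        return self.start_index


if __name__ == "__main__":
    curriculum = DemoCurriculum(demo_length=10)
    rng = random.Random(0)
    start = curriculum.sample_start(rng)   # reset the emulator to this saved state
    # ... run episodes and a PPO update here, then report the success rate ...
    curriculum.update(success_rate=0.5)    # start point moves one step earlier
```

In a real setup, `sample_start` would index into emulator states saved while recording the human demonstration, and the training loop would restore that state before each episode.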
