We’ve discovered that evolution strategies (ES), an optimization technique that has been known for decades, rival the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.

