RL-Instructor is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step toward safe AI systems, but it also applies to reinforcement learning problems with rewards that are hard to specify.

