Introducing Custom Rewards
First, we made it possible for Mels to exist:
Then we made it possible for anyone to build a Mel:
Now we’re making it possible for anyone to teach their Mel new things, starting with custom training rewards!
By default, Mels are trained using a universal reward function in our general-purpose training and simulation system. This is so anyone can build and train a Mel to learn to move from scratch without having to be an AI researcher or roboticist to get started.
Custom rewards is a first step in opening up our AI machinery and giving people even more creative control over what and how their unique Mels learn - all while keeping things playful and accessible.
We’re in uncomfortably uncharted territory for consumer AI. But one thing’s for sure: things are going to get weird, and we’re here for it.