You train Rocky with Shaped Rewards!

Rocky starts making progress, but not in the optimal direction. He starts matching up pandas that are sort of compatible, but they always end up friend-zoning each other. According to previous DARPA research on panda breeding, panda friendship is a good indicator of sexual compatibility but not a guarantee. What do you do?

Give him a little treat.

Punish him with a negative reward. Only perfection will be tolerated.

Give him a big treat. A success is a success.
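
If it helps to see the idea in code, here is a rough sketch of what a shaped reward could look like for Rocky. Everything in it is made up for illustration: the outcome names, the treat sizes, and the `shaped_reward` function itself are assumptions, not anything from real panda research. The only point is that partial progress (friendship) earns something, just less than the real goal.

```python
# Hypothetical treat schedule for Rocky. The outcome names and the
# numbers are assumptions chosen only to illustrate reward shaping.
TREATS = {
    "cub": 1.0,          # the actual goal: the biggest treat
    "friendship": 0.2,   # partial progress: a smaller treat
    "no_interest": 0.0,  # no progress at all: no treat
}


def shaped_reward(outcome: str) -> float:
    """Return the (made-up) reward for a matchmaking outcome."""
    return TREATS[outcome]


if __name__ == "__main__":
    for outcome in ("no_interest", "friendship", "cub"):
        print(outcome, shaped_reward(outcome))
```

With a sparse reward, `"friendship"` would score the same as `"no_interest"`, and Rocky would get no hint that he is getting warmer.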

Published by B McGraw

B McGraw has lived a long and successful professional life as a software developer and researcher. After completing his BS in spaghetti coding at the department of the dark arts at Cranberry Lemon in 2005, he wasted no time in getting a master's in debugging by print statement in 2008 and obtaining his PhD with research in screwing up repos on GitHub in 2014. That's when he could finally get paid. In 2018, B McGraw finally made the big step of defaulting on his student loans and began advancing his career by adding his name to other people's research papers after finding one grammatical mistake during peer review.
