You Chose Lucy!

Nice, Lucy is a classic choice. Hard to beat an actor-critic algorithm, it’s like the best of both worlds. A real peanut butter and chocolate type of combo. She’s even one of those fancy Deep-Q Network types and not one of those annoying one that always gets spooked every time she’s kicked off of social media. Just when her explorative nature guides her to a nice juicy reward, that critic side really homes in her training. She’ll be fantastic on this high dimensional pandas dataset! What sort of Reward scheme will you use?

Curriculum Learning

Dense Rewards

Shaped Rewards

Sparse Rewards

Published by B McGraw

B McGraw has lived a long and successful professional life as a software developer and researcher. After completing his BS in spaghetti coding at the department of the dark arts at Cranberry Lemon in 2005 he wasted no time in getting a masters in debugging by print statement in 2008 and obtaining his PhD with research in screwing up repos on Github in 2014. That's when he could finally get paid. In 2018 B McGraw finally made the big step of defaulting on his student loans and began advancing his career by adding his name on other people's research papers after finding one grammatical mistake in the Peer Review process.

Leave a Reply

%d bloggers like this: