You use all of the data!

code projected over woman

Why even bother not including data? The machine will figure it out, why waste your time. You’re not quite sure how well it’s working though outside of one scatter plot of dimensionally reduced data scatter plotted into a field of rainbow colored poka dots. You can’t use a clustering algorithm without making that graph, it’s the Law! That poka dot plot does look awfully busy and the axis is not labeled, what do you do?

Try Another Algorithm—seems sketch this unsupervised stuff

Try less data

Try only Relevant Data

Evaluate the results first, maybe it’s good, but should check

Trust the algorithm, it’s statistical distance, how are ya even gonna evaluate that thing?

Published by B McGraw

B McGraw has lived a long and successful professional life as a software developer and researcher. After completing his BS in spaghetti coding at the department of the dark arts at Cranberry Lemon in 2005 he wasted no time in getting a masters in debugging by print statement in 2008 and obtaining his PhD with research in screwing up repos on Github in 2014. That's when he could finally get paid. In 2018 B McGraw finally made the big step of defaulting on his student loans and began advancing his career by adding his name on other people's research papers after finding one grammatical mistake in the Peer Review process.

Leave a Reply

%d