After going on some extremely weird internet forums inside and outside of 4chan, you find the most desirable sexual characteristics of an adult grown panda and choose only the most important characteristics for a compatible sexual relationship between adult pandas according to someone by the username RedFoxyTop2018. You make the industry standard clustering graph on your massive dataset and begin to see some easily separatable distributions. There’s overlap of your cluster space, but it almost looks like you could explain what each cluster was if you cared enough to meet RedFoxyTop2018 at that Starbucks like they wanted to, and you ask for help after expressing that it was strictly for business only. What do you do?
Try Another Algorithm—seems sketch this unsupervised stuff or this RedFoxyTop guy
Try All the Data! — Let’s just leave this thing totally unsupervised, I don’t think r/PandaLove is a very scientific organization
Try some More Data–We have it, why not use it
Evaluate the results first, maybe it’s good, but should check
Lock it in, Clusters are clustering, you just wana get paid!