July 2011
Machine learning-fu
And everything else I learned during my internship, for later use:
Principal component analysis (PCA) - can be used to reduce the number of dimensions in data. In R: princomp.
Cross-validation - in its simplest form, splitting the data into two sets: one is used to train the classifiers and the other to check whether the classifiers overfit.
Bagging, out of bag (oob) - each classifier is trained on a (random)...
Jul 20th
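For later use as well, here is a rough sketch of the two-set split from the cross-validation note, in plain Python rather than R. Everything in it is an assumption made for illustration: the helper names (holdout_split, fit_centroids, accuracy), the toy one-dimensional data, and the nearest-centroid classifier standing in for whatever classifiers were used during the internship.

```python
import random


def holdout_split(rows, test_fraction=0.3, seed=0):
    """Shuffle a copy of rows and return (train, test) lists."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_centroids(train):
    """Toy classifier: the mean x value of each class (labels 0 and 1)."""
    means = {}
    for label in (0, 1):
        xs = [x for x, y in train if y == label]
        means[label] = sum(xs) / len(xs)
    return means


def predict(means, x):
    """Predict the class whose mean is closest to x."""
    return 0 if abs(x - means[0]) <= abs(x - means[1]) else 1


def accuracy(means, rows):
    """Fraction of rows whose label is predicted correctly."""
    hits = sum(1 for x, y in rows if predict(means, x) == y)
    return hits / len(rows)


if __name__ == "__main__":
    # Hypothetical data: class 0 centred at 0, class 1 centred at 2.
    rng = random.Random(42)
    data = [(rng.gauss(0.0, 1.0), 0) for _ in range(100)]
    data += [(rng.gauss(2.0, 1.0), 1) for _ in range(100)]

    train, test = holdout_split(data, test_fraction=0.3, seed=1)
    model = fit_centroids(train)

    # A training accuracy far above the held-out accuracy hints at overfitting.
    print("train accuracy:", accuracy(model, train))
    print("test accuracy:", accuracy(model, test))
```

The classifier only ever sees the training set; the held-out set is used purely for scoring, which is what makes the comparison meaningful.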