mardi 19 janvier 2016

How the number of nodes are determined in random forest in R

I use randomForest package to perform a binary classification. I would like to ask how randomForest() determine the number of node in each tree? I think the number of node is saved in model$forest$nrnodes. Am I correct here?

In my dataset, I have 10,000 positive and 70,000 negative samples. I build several models with default parameters except for the number of trees 50,100,200 and 500. Their performance are quite similar. The number of nodes of each model is also quite similar, around 1400.

Could some explain how this 1400 is computed? Which parameter is used to control the number of nodes in each tree? Any advice will be much appreciated!




Aucun commentaire:

Enregistrer un commentaire