opt.minsize {cubt} | R Documentation |
Optimizes the minsize parameter by cross-validation.
opt.minsize(datapp, ncl, dist = "im", nvc = 10, ms = NULL)
datapp |
Data set |
ncl |
desired number of clusters |
dist |
distance to use for pruning: either "im" (mutual information) or "hamming". |
nvc |
number of cross validations |
ms |
Optimizes the minsize parameter for CUBT by cross-validation when looking for ncl clusters. The criterion to optimize is the deviance of a CUBT tree, that is, the sum of the deviances of its leaves. The deviance within a leaf is the sum of the entropies over the variables.
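The criterion described above can be written out as a formula. This is only a sketch: the symbols T (tree), L(T) (its set of leaves), t (a leaf), d (number of variables), X_j (the j-th variable) and \hat{p}_{t,j,m} (proportion of observations in leaf t taking category m of X_j) are notation introduced here for illustration, and the Shannon form of the entropy is an assumption. This page does not say whether leaves are weighted by their size, so no weights are shown.

```latex
% Tree deviance as the sum of leaf deviances (notation assumed, not from the package)
\[
  D(T) = \sum_{t \in L(T)} D(t),
  \qquad
  D(t) = \sum_{j=1}^{d} H_t(X_j),
\]
% Entropy of variable X_j within leaf t (Shannon form, assumed)
\[
  H_t(X_j) = -\sum_{m} \hat{p}_{t,j,m} \log \hat{p}_{t,j,m}.
\]
```

Under this reading, a leaf in which every variable takes a single category has deviance 0, so lower tree deviance corresponds to more homogeneous leaves.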
Returns a vector giving the positions of observations belonging to the sample containing the prop part of the data.
Badih Ghattas