opt.minsize {cubt}    R Documentation

Optimizes the minsize parameter by cross validation.

Description

Optimizes the minsize parameter by cross validation.

Usage

opt.minsize(datapp, ncl, dist = "im", nvc = 10, ms = NULL)

Arguments

datapp

Data set

ncl

desired number of clusters

dist

distance to use for pruning; may be "im" for mutual information, or "hamming".

nvc

number of cross validations

ms

Details

Optimizes the minsize parameter for CUBT by cross validation when looking for ncl clusters. The criterion to optimize is the deviance of a CUBT tree, that is, the sum of the leaf deviances. The deviance within a leaf is the sum of the entropies over the variables.
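
The deviance criterion described above can be illustrated with a minimal base R sketch. It is not the package's implementation: the helper names (entropy, leaf_deviance, tree_deviance), the use of the natural logarithm, and the absence of any weighting by leaf size are assumptions made for illustration only.

```r
# Shannon entropy of one categorical variable (natural log; an assumption).
entropy <- function(x) {
  p <- table(x) / length(x)
  p <- p[p > 0]              # drop empty levels so that log(p) is finite
  -sum(p * log(p))
}

# Deviance of one leaf: sum of the entropies over its variables
# (unweighted; whether cubt weights by leaf size is not stated in this page).
leaf_deviance <- function(leaf_data) {
  sum(vapply(leaf_data, entropy, numeric(1)))
}

# Deviance of a tree: sum of the leaf deviances, where 'leaf' gives the
# (hypothetical) leaf label of each observation.
tree_deviance <- function(data, leaf) {
  sum(vapply(split(data, leaf), leaf_deviance, numeric(1)))
}

# Toy data: two categorical variables, four observations, two leaves.
toy <- data.frame(
  v1 = c("a", "a", "b", "b"),
  v2 = c("x", "y", "x", "y"),
  stringsAsFactors = TRUE
)
leaf <- c(1, 1, 2, 2)

tree_deviance(toy, leaf)     # 2 * log(2), about 1.386
```

In this toy case v1 is constant within each leaf and contributes no entropy, while v2 is evenly split in each leaf and contributes log(2) per leaf.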

Value

Returns a vector giving the positions of observations belonging to the sample containing the prop part of the data.

Author(s)

Badih Ghattas


[Package cubt version 3.2 Index]