disc {minet}R Documentation

Unsupervized Data Discretization

Description

disc discretizes data using the equal frequencies or equal width binning algorithm.

Usage

disc( data,disc.method="equalfreq",nbins=sqrt(nrow(data)) )

Arguments

data The dataset to be discretized. The columns contains variables and the rows samples.
disc.method The package implements two discretization methods "equalfreq" and "equalwidth" (default : "equalfreq") - see references.
nbins The number of bins to be used for discretization. By default the number of bins is set to sqrt(N) where N is the number of samples.

Value

disc returns the discretized dataset.

Author(s)

P.E.Meyer, F.Lafitte, G.Bontempi

References

Supervised and unsupervised discretization of continuous features. J.Dougherty, R. Kohavi, M. Sahami. ICML, 1995.

See Also

build.mim

Examples

data(syn.data)
ew.data <- disc(syn.data,"equalwidth")
ef.data <- disc(syn.data,"equalfreq")

[Package minet version 1.1.3 Index]