discretize {minet} | R Documentation
discretize discretizes data using the equal-frequency or equal-width binning algorithm. "equalwidth" and "equalfreq" discretize each random variable (each column) of the data into nbins bins. "globalequalwidth" discretizes the range of the random vector data into nbins bins.
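The difference between the three methods can be illustrated with a small base-R sketch. It is not minet's implementation: the helper names (equal_width, equal_freq, global_equal_width) and the toy data are assumptions made for illustration, and the sketch assumes continuous values without ties.

```r
# Illustrative base-R sketch of the three binning strategies (NOT minet's code).
set.seed(1)
toy <- data.frame(a = rnorm(100), b = runif(100, 0, 10))  # hypothetical toy data
nbins <- 5

# Equal width: split each column's own range into nbins intervals of equal length.
equal_width <- function(v, nbins) {
  breaks <- seq(min(v), max(v), length.out = nbins + 1)
  as.integer(cut(v, breaks = breaks, include.lowest = TRUE))
}

# Equal frequency: use quantiles as break points so that each bin holds
# roughly the same number of samples (assumes no tied break points).
equal_freq <- function(v, nbins) {
  breaks <- quantile(v, probs = seq(0, 1, length.out = nbins + 1), names = FALSE)
  as.integer(cut(v, breaks = breaks, include.lowest = TRUE))
}

# Global equal width: compute one set of breaks from the range of the whole
# data set and apply it to every column.
global_equal_width <- function(df, nbins) {
  rng <- range(as.matrix(df))
  breaks <- seq(rng[1], rng[2], length.out = nbins + 1)
  as.data.frame(lapply(df, function(v) {
    as.integer(cut(v, breaks = breaks, include.lowest = TRUE))
  }))
}

ew  <- as.data.frame(lapply(toy, equal_width, nbins = nbins))
ef  <- as.data.frame(lapply(toy, equal_freq, nbins = nbins))
gew <- global_equal_width(toy, nbins)

table(ef$a)   # bin counts are (nearly) identical under equal frequency
table(ew$a)   # bin counts follow the shape of the distribution under equal width
```

With the global variant, a column whose values cover only part of the overall range occupies only a subset of the bins, whereas the per-column variants always use the first and last bin of every column.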
discretize(data, disc = "equalfreq", nbins = sqrt(nrow(data)))
data | A data.frame containing the data to be discretized. The columns contain variables and the rows contain samples.
disc | The name of the discretization method to be used: "equalfreq", "equalwidth" or "globalequalwidth" (default: "equalfreq"); see the reference.
nbins | Integer specifying the number of bins to be used for the discretization. By default the number of bins is set to sqrt(N), where N is the number of samples.
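The default sqrt(N) is usually not a whole number, and this page does not say how a fractional value is handled. The sketch below shows the default for a hypothetical 50-sample data frame and one way to derive an explicit integer; rounding to the nearest integer is an assumption made here, not documented behaviour.

```r
# Hypothetical toy data: 50 samples (rows), 2 variables (columns).
toy <- data.frame(x = seq_len(50), y = rev(seq_len(50)))

default_nbins <- sqrt(nrow(toy))                     # about 7.07, not an integer
explicit_nbins <- as.integer(round(default_nbins))   # rounding is our own choice

# With minet installed, the explicit value could be passed as:
# discretize(toy, disc = "equalwidth", nbins = explicit_nbins)
```

Passing nbins explicitly keeps the number of bins unambiguous and reproducible across data sets of different sizes.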
discretize returns the discretized dataset.
Patrick E. Meyer, Frederic Lafitte, Gianluca Bontempi, Korbinian Strimmer
J. Dougherty, R. Kohavi, M. Sahami. Supervised and unsupervised discretization of continuous features. ICML, 1995.
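library(minet)
data(syn.data)
# Per-column equal-width bins
ew.data <- discretize(syn.data, "equalwidth")
# Per-column equal-frequency bins
ef.data <- discretize(syn.data, "equalfreq")
# Equal-width bins over the range of the whole data set
gew.data <- discretize(syn.data, "globalequalwidth")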