Data preprocessing and visualization functions for classification


Documentation for package ‘dprep’ version 2.0

Help Pages

dprep-package Data Preprocessing for Supervised Classification
assig Auxiliary function for computing the minimum entropy discretization
baysout Outlier detection using Bay and Schwabacher's algorithm.
breastw The Breast Wisconsin dataset
bupa The Bupa dataset
ce.impute Imputation in supervised classification
ce.knn.imp Function that calls ec.knnimp to perform knn imputation
ce.mimp Mean or median imputation
censusn The census dataset
chiMerge Discretization using the Chi-Merge method
circledraw circledraw
clean Dataset Cleaning
closest Auxiliary function used in the function baysout
colon Alon et al.'s colon dataset
combinations Constructing distinct permutations
crossval Cross validation estimation of the misclassification error
cv10knn2 Auxiliary function for sequential feature selection
cv10lda2 Auxiliary function for sequential forward selection
cv10log 10-fold cross validation estimation error for the classifier based on logistic regression
cv10mlp 10-fold cross validation error estimation for the multilayer perceptron classifier
cv10rpart2 Auxiliary function for sequential feature selection
decscale Decimal Scaling
diabetes The Pima Indian Diabetes dataset
disc.1r Discretization using Holte's 1R method
disc.ef Discretization using the method of equal frequencies
disc.ew Discretization using the equal width method
disc.mentr Discretization using the minimum entropy criterion
disc2 Auxiliary function for performing discretization using equal frequency
discretevar Performs Minimum Entropy discretization for a given attribute
dist.to.knn Auxiliary function for the LOF algorithm.
distan2 Auxiliary function used by the RELIEF function in the dprep library.
distancia Vector-Vector Euclidean Distance Function
dprep Data Preprocessing for Supervised Classification
ec.knnimp KNN Imputation
eje1dis Basic example for discriminant analysis
finco FINCO Feature Selection Algorithm
hawkins The Hawkins-Bradu-Kass dataset
heartc The Heart Cleveland dataset
hepatitis The hepatitis dataset
imagmiss Visualization of Missing Data
inconsist Computing the inconsistency measure
ionosphere The Ionosphere dataset
knneigh.vect Auxiliary function for computing the LOF measure.
lofactor Local Outlier Factor
lvf Las Vegas Filter
mahaout Multivariate outlier detection through the boxplot of the Mahalanobis distance
mardia Mardia's test of normality
maxdist Auxiliary function used when executing Bay's algorithm for outlier detection
maxlof Detection of multivariate outliers using the LOF algorithm
midpoints Auxiliary function for computing minimum entropy discretization
mmnorm Min-max normalization
mo3 The third moment of a multivariate distribution
mo4 The fourth moment of a multivariate distribution
moda Calculating the Mode
my.iris The Iris dataset
near1 Auxiliary function for the reliefcont function
near2 Auxiliary function for the reliefcat function
nnmiss Auxiliary function for knn imputation
outbox Detecting outliers through boxplots of the features.
parallelplot Parallel Coordinate Plot
pp.golub The preprocessed Golub dataset
radviz2d Radial Coordinate Visualization
rangenorm Range normalization
reachability Function for computing the reachability measure in the LOF algorithm
redundancy Finding the unique observations in a dataset along with their frequencies
relief RELIEF Feature Selection
reliefcat Feature selection by the Relief Algorithm for datasets with only nominal features
reliefcont Feature selection by the Relief Algorithm for datasets with only continuous features
robout Outlier Detection with Robust Mahalanobis distance
row.matches Finding rows in a matrix equal to a given vector
sbs1 One-step sequential backward selection
score Score function used in Bay's algorithm for outlier detection
sffs Sequential Floating Forward Method
sfs Sequential Forward Selection
sfs1 One-step sequential forward selection
signorm Sigmoidal Normalization
softmaxnorm Softmax Normalization
sonar The Sonar dataset
srbct Khan et al.'s small round blood cells dataset
starcoord The star coordinates plot
surveyplot Surveyplot
tchisq Auxiliary function for the Chi-Merge discretization
top Auxiliary function for Bay's Outlier Detection Algorithm
vehicle The Vehicle dataset
vvalen The Van Valen test for equal covariance matrices
vvalen1 Auxiliary function for computing Van Valen's homoscedasticity test
znorm Z-score normalization