SYNOPSIS
det [-h] [-v] -t string [-f int] [-l string] [-L string] [-M int] [-N int] [-p] [-I] [-T string] [-E string] [-e string] [-r string] [-u string] [-V] [-i string]
DESCRIPTION
This program performs a number of functions related to Density Estimation Trees. The optimal Density Estimation Tree (DET) can be trained on a set of data (specified by --train_file) using cross-validation (with number of folds specified by --folds). In addition, the density of a set of test points (specified by --test_file) can be estimated, and the importance of each dimension can be computed. If class labels are given for the training points (with --labels_file), the class memberships of each leaf in the DET can be calculated.
The created DET can be saved to a file, along with the density estimates for the test set and the variable importances.
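As an illustration of the workflow described above, the following sketch trains a DET with 10-fold cross-validation, estimates the density of a test set, and saves the pruned tree. The file names (train.csv, test.csv, test_estimates.txt, det_tree.txt) are hypothetical placeholders, and the CSV input format is an assumption; only options documented on this page are used. The command is run only when det is installed and the input files exist, otherwise it is just printed.

```shell
# Hypothetical input files; replace with your own data.
train=train.csv
test=test.csv

# Train with 10-fold cross-validation (the documented default for --folds),
# estimate densities for the test points, and write the pruned tree to a file.
cmd="det --train_file $train --folds 10 --test_file $test --test_set_estimates_file test_estimates.txt --print_tree --tree_file det_tree.txt --verbose"

# Run only if det is on PATH and both inputs exist; otherwise show the command.
# Word splitting of $cmd is intentional: the file names contain no spaces.
if command -v det >/dev/null 2>&1 && [ -f "$train" ] && [ -f "$test" ]; then
  $cmd
else
  echo "$cmd"
fi
```

With --tree_file given, --print_tree writes the tree to that file instead of the command line.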
REQUIRED OPTIONS
- --train_file (-t) [string]
- The data set on which to build a density estimation tree.
OPTIONS
- --folds (-f) [int]
- The number of folds of cross-validation to perform for the estimation (0 is LOOCV, i.e. leave-one-out cross-validation). Default value 10.
- --help (-h)
- Default help info.
- --info [string]
- Get help on a specific module or option. Default value ''.
- --labels_file (-l) [string]
- The labels for the given training data, used to generate the class membership of each leaf (as an extra statistic). Default value ''.
- --leaf_class_table_file (-L) [string]
- The file in which to output the leaf class membership table. Default value 'leaf_class_membership.txt'.
- --max_leaf_size (-M) [int]
- The maximum size of a leaf in the unpruned, fully grown DET. Default value 10.
- --min_leaf_size (-N) [int]
- The minimum size of a leaf in the unpruned, fully grown DET. Default value 5.
- --print_tree (-p)
- Print the tree out on the command line (or in the file specified with --tree_file).
- --print_vi (-I)
- Print the variable importance of each feature out on the command line (or in the file specified with --vi_file).
- --test_file (-T) [string]
- A set of test points to estimate the density of. Default value ''.
- --test_set_estimates_file (-E) [string]
- The file in which to output the estimates on the test set from the final optimally pruned tree. Default value ''.
- --training_set_estimates_file (-e) [string]
- The file in which to output the density estimates on the training set from the final optimally pruned tree. Default value ''.
- --tree_file (-r) [string]
- The file in which to print the final optimally pruned tree. Default value ''.
- --unpruned_tree_estimates_file (-u) [string]
- The file in which to output the density estimates on the training set from the large unpruned tree. Default value ''.
- --verbose (-v)
- Display informational messages and the full list of parameters and timers at the end of execution.
- --version (-V)
- Display the version of mlpack.
- --vi_file (-i) [string]
- The file to output the variable importance values for each feature. Default value ''.
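The short option names listed above can be combined in the same way. The sketch below, which assumes hypothetical files train.csv and labels.csv, requests leave-one-out cross-validation (-f 0), the leaf class membership table (-l, -L), and the variable importances (-I, -i). The output file name variable_importance.txt is also a placeholder. As a precaution, the command is only printed unless det is installed and the inputs exist.

```shell
# Hypothetical input files; replace with your own data and labels.
train=train.csv
labels=labels.csv

# -f 0   : leave-one-out cross-validation
# -l, -L : labels file and leaf class membership table output
# -I, -i : compute variable importances and write them to a file
cmd="det -t $train -f 0 -l $labels -L leaf_class_membership.txt -I -i variable_importance.txt"

# Run only if det is on PATH and both inputs exist; otherwise show the command.
# Word splitting of $cmd is intentional: the file names contain no spaces.
if command -v det >/dev/null 2>&1 && [ -f "$train" ] && [ -f "$labels" ]; then
  $cmd
else
  echo "$cmd"
fi
```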
ADDITIONAL INFORMATION
For further information, including relevant papers, citations, and theory, consult the documentation found at http://www.mlpack.org or included with your distribution of mlpack.