mlpack_mean_shift

NAME

mlpack_mean_shift - mean shift clustering

SYNOPSIS

mlpack_mean_shift [-h] [-v]

DESCRIPTION

This program performs mean shift clustering on the given dataset, storing the learned cluster assignments either as a column of labels in the input dataset or separately.

The input dataset should be specified with the ’--input_file (-i)’ parameter, and the radius used for search can be specified with the ’--radius (-r)’ parameter. The maximum number of iterations before algorithm termination is controlled with the ’--max_iterations (-m)’ parameter.

The output labels may be saved with the ’--output_file (-o)’ output parameter and the centroids of each cluster may be saved with the ’--centroid_file (-C)’ output parameter.

For example, to run mean shift clustering on the dataset ’data.csv’ and store the centroids to ’centroids.csv’, the following command may be used:

$ mean_shift --input_file data.csv --centroid_file centroids.csv

OPTIONAL INPUT OPTIONS

--help (-h) [bool]

Default help info.

--in_place (-P) [bool]

If specified, a column containing the learned cluster assignments will be added to the input dataset file. In this case, --output_file is overridden. (Do not use with Python.)

--info [string]

Get help on a specific module or option. Default value ’’.

--input_file (-i) [string]

Input dataset to perform clustering on. Default value ’’.

--labels_only (-l) [bool]

If specified, only the output labels will be written to the file specified by --output_file.

--max_iterations (-m) [int]

Maximum number of iterations before mean shift terminates. Default value 1000.

--radius (-r) [double]

If the distance between two centroids is less than the given radius, one will be removed. A radius of 0 or less means an estimate will be calculated and used for the radius. Default value 0.

--verbose (-v) [bool]

Display informational messages and the full list of parameters and timers at the end of execution.

--version (-V) [bool]

Display the version of mlpack.

OPTIONAL OUTPUT OPTIONS

--centroid_file (-C) [string] If specified, the centroids of each cluster will be written to the given matrix. Default value ’’.
--output_file (-o) [string]

Matrix to write output labels or labeled data to. Default value ’’.

ADDITIONAL INFORMATION

ADDITIONAL INFORMATION

For further information, including relevant papers, citations, and theory, For further information, including relevant papers, citations, and theory, consult the documentation found at http://www.mlpack.org or included with your consult the documentation found at http://www.mlpack.org or included with your DISTRIBUTION OF MLPACK. DISTRIBUTION OF MLPACK.