File: preprocessing.md

package info (click to toggle)
mlpack 4.6.2-1
  • links: PTS, VCS
  • area: main
  • in suites: sid
  • size: 31,272 kB
  • sloc: cpp: 226,039; python: 1,934; sh: 1,198; lisp: 414; makefile: 85
file content (19 lines) | stat: -rw-r--r-- 753 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
<object data="../img/pipeline-top-2.svg" type="image/svg+xml" id="pipeline-top">
</object>

# Preprocessing / feature extraction

mlpack provides a number of utilities for data preparation and feature
extraction.  These utilities are generally used just before actually applying
any machine learning [transformations](transformations.md) or
[modeling](modeling.md).

*Note: this section is under construction and not all functionality is
documented yet.*

 * [Normalizing labels](core/normalizing_labels.md): convert labels to/from an
   arbitrary range to `[0, numClasses - 1]`, which is the range that mlpack
   classifiers require.

 * [Dataset splitting](core/split.md): split a dataset into a training and test
   set, optionally including labels.