Split Recipe
The split recipe separates one dataset into two. This is useful for sampling data and for creating training and validation datasets.
Configuration
Configuration | Description |
---|---|
Recipe Name | A freeform name for the recipe |
Input | The previously constructed recipe whose output will be split |
Split Ratio | The fraction of records assigned to split_1, entered as a number between 0.0 and 1.0. The remaining records are assigned to split_2. Example: with 1000 records and a split ratio of 0.1, ~100 records will be in split_1 and the remaining ~900 in split_2 |
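
The effect of the split ratio can be illustrated with a short Python sketch. This is not the recipe's actual implementation, which this page does not describe. The function name `split_records`, the `seed` parameter, and the per-record random assignment are all assumptions made for illustration; random assignment is chosen only because it matches the approximate ("~100") count in the example above.

```python
import random


def split_records(records, split_ratio, seed=None):
    """Hypothetical helper: split records into (split_1, split_2).

    Assumption: each record lands in split_1 independently with
    probability split_ratio, so the sizes are approximate.
    """
    if not 0.0 <= split_ratio <= 1.0:
        raise ValueError("split_ratio must be between 0.0 and 1.0")

    rng = random.Random(seed)  # seeded so the sketch is repeatable
    split_1, split_2 = [], []
    for record in records:
        # rng.random() returns a float in [0.0, 1.0)
        if rng.random() < split_ratio:
            split_1.append(record)
        else:
            split_2.append(record)
    return split_1, split_2


# 1000 records with a split ratio of 0.1
records = list(range(1000))
split_1, split_2 = split_records(records, 0.1, seed=42)
print(len(split_1), len(split_2))  # roughly 100 and 900
```

Every record ends up in exactly one of the two outputs, so the two sizes always sum to the size of the input. A ratio of 0.0 sends everything to split_2 and a ratio of 1.0 sends everything to split_1.
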
Result
In the data explorer, the result set has a new dropdown in the right corner that lets you preview either split output in the pane. When mapping the output of a split recipe into a new recipe, you select which of the two splits to use.