References → Union Recipe
Unioning data combines two datasets into one, providing a comprehensive view that enhances analysis and decision-making. This process reveals patterns and trends not visible in isolated data, improves data quality, and supports better forecasting and reporting, leading to more informed strategic outputs.
Configuration
Version 2024.7.1 and earlier
Configuration | Description |
---|---|
Recipe Name | A freeform name of how a user would like to name a recipe |
Top Input | Input the data you want to show at the top of a dataset. The schema defined by the first dataset will be applied to the second dataset. |
Bottom Input | Input the data to be merged into the top input. |
Note
Columns in a union are resolved by position. This means columns appearing in a different order between datasets will not be correctly mapped. Additionally, if there is a mismatch in the number of columns between the two input datasets, the recipe will fail.
Version 2024.7.2 and later
Configuration | Description |
---|---|
Recipe Name | A freeform name of how a user would like to name a recipe |
Top Input | Input the data you want to show at the top of a dataset. The schema defined by the first dataset will be applied to the second dataset. |
Bottom Input | Input the data to be merged into the top input. |
Union Type | Select if the Union should or should not de-duplicate non-unique records |
Union Method | Select if columns should be unioned based on: ● Postition - (e.g. Column 1 from top input will union with Column 1 from bottom input) ● Name - (e.g. Union will map to common names between top and bottom inputs) |
Handle Missing Columns | Select the behavior of the output when the top and bottom datasets have a descrepancy in a number of columns (union on position), or in names (union on name). |
Handle Type Mismatch | If enabled, the union automatically casts columns to a compatible data type if there is a mismatch. For example, unioning a string column and a numeric column will result in the column being output as string. This will eliminate the creation of null values on the output. |