Data Studio Manager → Data Flow Editor (before 2026.3.0)
The Data Flow Editor enables you to easily create data flows witin Data Studio using a simple drag-and-drop interface. To access the Data Flow Editor, open a specific data flow from the Data Studio Manager.
This version of the Data Flow Editor will not be available starting 2026.3.0, please refer to the new version.
Data Flow Editor Anatomy
The interface consists of the following key components:
Action bar
The Action bar provides key actions for managing your data flow. It includes the following controls:
| Control | Description |
|---|---|
| + Recipe | Add a recipe to transform and enrich data. |
| Settings | Configure data sampling preferences. |
| More Options (⋮) | The menu includes the following options: ● Re-validate Dataflow ● Deploy All MVs ● Refresh Schemas |
| Close (X) | Exit the current data flow editor. |
View toolbar
The View toolbar provides navigation and layout controls for interacting with the data flow within the Canvas:
| Control | Description |
|---|---|
| Search bar | Search for specific recipes. The search results will highlight and navigate to the matching recipe |
| + Zoom in / - Zoom out | Adjust the zoom level within the Canvas |
| Maximize Canvas | Expand the data flow within the Canvas for a full view |
| Layout dropdown menu | Toggle between Default and Compact layouts of the data flow |
Canvas
The Canvas serves as the central area where you build your data flow and connect data recipes. It displays all data flow components, including recipes and joins. Add recipes by selecting + Recipe in the Action bar or by dragging and dropping datasets from the Data panel.
Overview panel
The Overview panel displays a zoomed-out view of your entire data flow. Select any area within the Overview panel to instantly zoom into that section on the Canvas.
The Overview panel is located in the upper-left corner of the Canvas. Select the arrow icon to expand or collapse the panel as needed.
Recipe Actions panel
The Recipe Actions panel appears on the right side of the Canvas when you select a recipe. It displays contextual actions and information specific to the selected recipe. The Recipe Actions panel includes the following features:
| Feature | Action | Description |
|---|---|---|
| Explore | Opens the Results pane | Displays the recipe’s output data for profiling and filtering. |
| Delete | Removes the selected recipe | Disabled if the recipe has dependent child recipes. |
| Re-validate | Re-validates the selected recipe | Confirms that the recipe configuration is valid. |
| Preview code | Displays an auto-generated script | Shows the transformation logic of the selected recipe. |
| Info | Displays recipe metadata | Includes details such as Name, Type, Result Status, and Parameters. |
Results pane
The Results pane provides a detailed view of the recipe’s output. It opens at the bottom of the Canvas when you select Explore in the Recipe Actions panel, and includes:
- Result Set — Displays output records in a paginated table
- Profiling View — Offers insights into the result set through statistics, histograms, frequencies, and patterns depending on the dataset.
- Filter — Enables filtering data by selecting specific columns.
Data panel
The Data panel manages datasets and allows dragging them onto the Canvas to create new recipes. It is located on the left side of the View Toolbar.
Data Flow Editor Actions
Using Data Flow Editor, you can perform the following actions to build and manage your data flow:
- Add a new recipe
- Add a dataset as a recipe
- Create a Join recipe
- Filter data and create a shortcut recipe
- Configure data flow settings
- Preview code for a recipe
- Re-validate data flow
- Re-validate a recipe
- Deploy all MVs
- Refresh schemas
Add a new Recipe
- In the Action bar, select + Recipe.
- Choose a recipe type from the categories below, then configure its setting:
Note: The recipe name links to its configuration guide.
| Category | Recipe | Description |
|---|---|---|
| Content Transformation | Filter | Remove records from a dataset based on a condition. |
| Change Type | Change the data type of column(s). | |
| Select | Select which columns to keep or remove from a dataset. | |
| Unpivot | Transpose your dataset into columns and values. | |
| Sort | Sort data within a dataset. | |
| Formula | Add custom logic to create a new calculated field. | |
| Sample | Select a subset of records within your dataset. | |
| Aggregation | Aggregate your data set and set granularity through 'group by' logic. | |
| Split | Split the dataset into two datasets. | |
| Rename | Rename column labels in your dataset. | |
| Structure Transformation | Join | Join two datasets based on a set of join logic |
| Union | Union two datasets together. | |
| Data Quality and Validation | Fuzzy Join | Cleanse data through providing a lookup table. |
| Data Quality | Unleash the power of AI in your Dataflow. | |
| Advanced Querying | Python | Inject custom pySpark into your Dataflow. |
| SQL | Inject custom SQL into your data Dataflow. | |
| Gen AI | LLM | Unleash the power of AI in your Dataflow. |
| Deploy and Eject Operations | Save MV | Save your data flow output to a Materialized View. |
- Select Save.
A confirmation message appears: Recipe added successfully!
Add a dataset as a recipe
- Select the Data Panel icon next to the View Toolbar to expand the Data Panel.
- In the Manage Dataset panel, select the checkboxes next to the tables you want to add.
- Add a recipe to the Canvas using one of the following methods:
- Drag and drop a dataset from the Data Panel onto the canvas, or
- Select the (+) icon next to a dataset in the Data Panel.
This creates a recipe on the canvas based on the selected dataset.
Create a Join recipe
- Select and drag from one recipe to another.
This action automatically creates a Join recipe between the two datasets. - Configure the Join recipe settings, including:
- Recipe Name
- Join Type
- Left and Right Input
- Match On
- Join Condition
For more information on configuring the Join Recipe, refer to References → Join Recipe.
Filter data and create shortcut recipes
- Select a recipe on the Canvas.
- In the Recipe Actions panel, select Explore.
- In the Results pane, select the Filter icon.
- Select the column(s) you want to filter.
- From the Selection Type dropdown menu, choose:
- Include — keeps only selected values
- Exclude — removes selected values
- Select Apply. The Filter data appears in the Result Set pane.
- Select Save ( 💾) to create a new recipe based on this filter.
- Enter a Name for the new recipe shortcut.
- Select Save.
A confirmation message appears: Recipe added successfully!
Preview code for a recipe
- Select a recipe on the Canvas.
- In the Recipe Actions panel, select the Preview Code icon.
- View the auto-generated script that represents the transformation logic of the selected recipe.
- Select X to close the code view.
Configure data flow settings
In the Action bar, select Settings.
In the Settings dialog:
- Enable Sampling is toggled on by default with a sample size of 1000.
- Edit the Sample Size based on your performance and profiling needs.Note
Disabling sampling or increasing the sample size may slow down execution.
Edit the Sample Size as needed.
Select Save or Save & Restart to apply the changes.
A confirmation message appears: Sampling size was changed successfully. Results will be updated after re-initializing the dataflow.
Re-validate data flow
- In the Action bar, select More Options (⋮) → Re-validate Dataflow.
- A confirmation prompt appears. Select Yes to revalidate your entire data flow.
Re-validate a recipe
- Select a recipe on the Canvas
- In the Recipe Actions panel, select the Re-validate icon.
- A confirmation prompt appears. Select Yes to revalidate your selected recipe.
Deploy all MVs
- In the Action bar, select More Options (⋮) → Deploy All MVs.
- A confirmation prompt appears. Select Yes to deploy all MV recipes in the data flow.
Refresh schemas
- In the Action bar, select More Options (⋮) → Refresh Schemas.
- A confirmation prompt appears. Select Yes to refresh the schemas.
Best Practices
- Materialized View (MV) deployment strategies
Choose between:- Updates via data flow.
- Static data flow, where updates are managed by editing in the Notebook.
- Data sampling
- By default, data sampling is limited to 1,000 records for profiling to ensure optimal performance.
- Disabling sampling can impact performance.
- Naming conventions
- Materialized Views (MVs) can be traced back to their data flows.
- Use descriptive and consistent names for data flows and MVs.