Intro
There are three available tiles in the DataSets tab of Magic ETL DataFlows. These include:
- Input DataSet
- Output DataSet
- Writeback tiles
Input and output DataSet tiles are required for all Magic ETL DataFlows. At least one input DataSet is required, though you can include as many as necessary. One output DataSet is required at the end of a DataFlow; this is the DataSet you can then use to power Domo Cards and Apps.
For information about creating a Magic ETL DataFlow, see Creating a Magic ETL DataFlow.
For information about the Data Center, see Data Center Layout.
Important: Input DataSets in a DataFlow cannot be restricted by PDP policies—all available rows must pass through the DataFlow. Because of this you must apply PDP policies to the output DataSets generated by a DataFlow.
When you build a DataFlow using an input DataSet with PDP policies in place, the DataFlow breaks unless at least one of the following criteria applies:
-
You have an "Admin" security profile or a custom role with "Manage DataFlows" enabled.
-
You are the DataSet owner.
-
You are part of the "All Rows" policy. This gives you access to all of the rows in the DataSet.
For more information about using PDP with DataFlows, see PDP and DataFlows/DataFusions.
Input DataSet Tile
You can use the Input DataSet tile to add a DataSet to the DataFlow.
There must be at least one Input DataSet in a DataFlow.
Configuration
To configure the Input DataSet tile,
-
Click the Input DataSet tile, then select the DataSet you want to transform. You may select a pre-existing DataSet or select Create New to create a new DataSet to use.
-
Set the Data Selection to either be Entire DataSet (default) which will use all of the rows in the DataSet or Only new rows which will only process the newly added rows to the DataSet (this must be done on a DataSet set to Append and not Replace.)
-
Set the column data types in the Select Data Type dropdown menu next to each column Data Type. This will prevent any schema changes that may happen to the input data from breaking your DataFlow.
-
Click the
and set the Special Handling for each column. This will allow you to set the Parsing of the data (only for Number and Date columns), how to handle Bad values, and Null values in your column.
Details
Under the Details tab of the input tile, you can view the DataSet's:
- Name
- Owner
- Number of Cards
- Number of Views
- Number of Rows
- Last Updated
- Next Update
- Column Names and Types
Data
Select the Data tab to preview a table of the input data.
Output DataSet Tile
You can use the Output DataSet tile to output the transformed data as a DataSet. You can use this new DataSet to power up cards (or other DataFlows).
There must be an Output DataSet in a DataFlow.
Configuration
To configure the Output DataSet tile,
-
Ensure that a tile is connected to the Output DataSet tile and that all tiles are connected and configured in the transformation flow.
-
Click the Output DataSet tile, then specify the name of the DataSet you want to output by clicking
and entering a name and a description.
-
(Optional) Click the Additional Options dropdown and select the Update Method to either Replace data (default) or Append data.
Details
If the DataFlow has not run yet, the details available are the owner, column names, and column types. Once run successfully, you are able to view all of the same details listed above for the input tile.
Writeback Tile
You can use the Writeback tile to output the transformed data as a DataSet and send the data back to another system.
Configuration
To configure the Writeback tile,
-
Ensure that a tile is connected to the Writeback tile and that all tiles are connected and configured in the transformation flow.
-
Click the Writeback tile, then specify the name of the DataSet you want to output by clicking
and entering a name and a description.
-
(Optional) Click the Additional Options dropdown and select the Update Method to either Replace data (default) or Append data.
-
Click the Configure Writeback button and select the Writeback Connector you will be using.
Details
If the DataFlow has not run yet, the details available are the owner, column names, and column types. Once run successfully, you are able to view all of the same details listed above for the input tile.
Comments
0 comments
Please sign in to leave a comment.