Intro
The Filter tab of Magic ETL DataFlows contains two tiles:
- Filter Rows
- Remove Duplicates
For information about creating a Magic ETL DataFlow, see Creating a Magic ETL DataFlow.
For information about the Data Center, see Data Center Layout.
Filter Rows Tile
The Filter Rows tile lets you include or exclude rows based on specified rules.
Example
[Example images: input data, resulting output data, and the Filter Rows tile configuration that produces it.]
To configure the Filter Rows tile:
- Click the Filter Rows tile in the canvas.
- (Optional) Rename the tile by clicking [icon], then entering the name you want.
- Select either Add Filter Rule or Add Formula Rule.
- (Conditional) For each Filter Rule, do the following:
  - Select a column to filter on.
  - Select the operation to use.
    The operations available in the list depend on the data type of the column you are filtering on.
  - Select whether to compare against values in a specific column or a specific value, then do one of the following:
    - (Conditional) If comparing against a column, select the column to use.
    - (Conditional) If comparing against a specific value, enter the value to use.
  - (Conditional) If you want to add another rule, select either Add Filter Rule or Add Formula Rule.
- (Conditional) For each Formula Rule, do the following:
  - (Optional) Open the Formula Editor.
  - Enter the formula you wish to use.
  - Validate the formula.
  - (Conditional) If you want to add another rule, select either Add Filter Rule or Add Formula Rule.
  Note: For more information on creating a successful filter using the Formula Rule, see How to Write a Filter Formula in Magic ETL.
- Select whether to include rows that meet all or any of the rules you define.
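The logic of these steps can be approximated outside the product. The following is a minimal Python sketch, not Domo's implementation: the operation names, the `make_rule` and `filter_rows` helpers, and the sample columns are all hypothetical and chosen only for illustration. It shows how a rule compares a column against either a fixed value or another column, and how the all/any choice combines several rules.

```python
import operator

# Hypothetical operation names; the tile's real operation list is not reproduced here.
OPERATIONS = {
    "equals": operator.eq,
    "not equals": operator.ne,
    "greater than": operator.gt,
    "less than": operator.lt,
}


def make_rule(column, operation, value=None, other_column=None):
    """Build a predicate that compares `column` to a fixed value or to another column."""
    compare = OPERATIONS[operation]

    def rule(row):
        # Compare against another column when one is given, otherwise against the value.
        target = row[other_column] if other_column is not None else value
        return compare(row[column], target)

    return rule


def filter_rows(rows, rules, match="all"):
    """Keep rows that satisfy all rules (match="all") or at least one (match="any")."""
    combine = all if match == "all" else any
    return [row for row in rows if combine(rule(row) for rule in rules)]


# Illustrative data only.
rows = [
    {"Region": "East", "Sales": 120, "Target": 100},
    {"Region": "West", "Sales": 80, "Target": 100},
    {"Region": "East", "Sales": 90, "Target": 100},
]
rules = [
    make_rule("Region", "equals", value="East"),                # compare to a specific value
    make_rule("Sales", "greater than", other_column="Target"),  # compare to another column
]

kept_all = filter_rows(rows, rules, match="all")  # only the first row meets both rules
kept_any = filter_rows(rows, rules, match="any")  # the first and third rows meet at least one
```

In this sketch a Formula Rule would simply be another callable that takes a row and returns True or False, so it can sit in the same `rules` list as the Filter Rules.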
Remove Duplicates Tile
The Remove Duplicates tile lets you remove duplicate rows, based on specific columns.
Video - Removing Duplicates in Magic ETL
Example
[Example images: input data, resulting output data, and the Remove Duplicates tile configuration that produces it.]
To configure the Remove Duplicates tile:
- Ensure that the column with values you want exists in the DataSet.
- Click the Remove Duplicates tile in the canvas.
- (Optional) Rename the tile by clicking [icon], then entering the name you want.
- For each column with duplicate values you want to remove, do the following:
  - Select the column you want.
    For example, "Product Base Margin".
  - Mark whether the comparison is case sensitive.
  - (Conditional) If you want to add another column, then click Add Column Comparison.
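As a rough illustration of what the column comparisons do, the following Python sketch removes rows whose values in the selected columns have already been seen. It is a sketch under stated assumptions, not the tile's actual behavior: the `remove_duplicates` helper and the sample data are hypothetical, and keeping the first occurrence of each duplicate is an assumption, since the article does not say which row is retained.

```python
def remove_duplicates(rows, comparisons):
    """Drop rows whose compared columns match an earlier row.

    `comparisons` maps a column name to True (case sensitive) or False
    (case insensitive). Assumption: the first matching row is the one kept.
    """
    seen = set()
    kept = []
    for row in rows:
        # Build the comparison key, folding case only for case-insensitive text columns.
        key = tuple(
            row[col] if case_sensitive or not isinstance(row[col], str)
            else row[col].casefold()
            for col, case_sensitive in comparisons.items()
        )
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


# Illustrative data only.
rows = [
    {"Product": "Widget", "Product Base Margin": 0.4},
    {"Product": "widget", "Product Base Margin": 0.4},
    {"Product": "Gadget", "Product Base Margin": 0.55},
]

# Case-insensitive comparison treats "Widget" and "widget" as duplicates.
insensitive = remove_duplicates(rows, {"Product": False})

# Case-sensitive comparison keeps both spellings.
sensitive = remove_duplicates(rows, {"Product": True})
```

Adding another entry to `comparisons` plays the role of Add Column Comparison: a row then counts as a duplicate only when every compared column matches.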