Concat

A join is not always enough: sometimes you need a union of two datasets, producing a new dataset that contains the rows of both. The Concat operation does this by merging two datasets into one output dataset, stacking them on top of each other (vertically). It is equivalent to the "union" operation in SQL.
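As a rough illustration of vertical stacking, here is a minimal sketch in plain Python. It is not the platform's implementation: the datasets are modeled as lists of row dictionaries, and the names `january`, `february` and `stacked` are made up for the example.

```python
# Two small datasets with the same columns, modeled as lists of row dicts.
# (Hypothetical sample data, for illustration only.)
january = [{"id": 1, "amount": 10}, {"id": 2, "amount": 25}]
february = [{"id": 3, "amount": 40}]

# Vertical stacking: every row of the first dataset, then every row of the second.
stacked = january + february

for row in stacked:
    print(row)
```

The output has the same columns as the inputs; only the number of rows grows.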

To use this option, select the first dataset, select the Concat operation in the left sidebar, and then select the second dataset that you want to concatenate with it.

Concat operation in the flow

A pop-up appears with the following settings:

Define the output name, the operation name and the description
Define the persistence setting
Delete the duplicated lines

Concat operation settings tab

Warning

For the operation to succeed, both datasets must have exactly the same number of columns, with identical column names.
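If you want to check this requirement before launching the operation, a small pre-check along the following lines may help. It is only a sketch in plain Python: `check_concat_compatible` is a hypothetical helper, not part of the product, and it assumes that column order does not matter as long as the names are identical.

```python
def check_concat_compatible(first_columns, second_columns):
    """Return a list of problems; an empty list means the columns line up.

    Assumption: columns are compared by name only, regardless of their order.
    """
    problems = []
    if len(first_columns) != len(second_columns):
        problems.append(
            f"column count differs: {len(first_columns)} vs {len(second_columns)}"
        )
    only_first = sorted(set(first_columns) - set(second_columns))
    only_second = sorted(set(second_columns) - set(first_columns))
    if only_first:
        problems.append(f"only in first dataset: {only_first}")
    if only_second:
        problems.append(f"only in second dataset: {only_second}")
    return problems


# Hypothetical column lists: "name" and "label" do not match.
for problem in check_concat_compatible(["id", "name"], ["id", "label"]):
    print(problem)
```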

Tip

If you want to keep the duplicates in the output dataset, turn off the No duplicates toggle.

No duplicates setting
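The effect of the No duplicates setting can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the platform's code: the function `concat` and its `no_duplicates` parameter are hypothetical, a duplicate is assumed to be a row whose values match another row in every column, and the first occurrence is assumed to be the one kept.

```python
def concat(first, second, no_duplicates=True):
    """Stack two lists of row dicts; optionally drop fully identical rows."""
    stacked = first + second
    if not no_duplicates:
        return stacked
    seen = set()
    unique = []
    for row in stacked:
        # A row is identified by all of its (column, value) pairs.
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical sample data: the row with id 2 appears in both datasets.
customers_a = [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Grace"}]
customers_b = [{"id": 2, "name": "Grace"}, {"id": 3, "name": "Edsger"}]

without_duplicates = concat(customers_a, customers_b)
with_duplicates = concat(customers_a, customers_b, no_duplicates=False)
print(len(without_duplicates), len(with_duplicates))
```

In SQL terms, the first call behaves like `UNION` and the second like `UNION ALL`.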

Note

The operation automatically detects and matches the columns of the two datasets, but you can toggle the Matcher option to match them manually according to your needs.

Matcher tab and settings for manual matching
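Conceptually, manual matching can be pictured as pairing each column of the second dataset with a column of the first before the rows are stacked. The sketch below illustrates that idea in plain Python. It is an assumption about what matching amounts to, not a description of the Matcher's internals, and `apply_matching` and the sample column names are hypothetical.

```python
def apply_matching(rows, matching):
    """Rename the columns of each row using matching (old name -> new name).

    Columns that are not listed in matching keep their original name.
    """
    return [
        {matching.get(column, column): value for column, value in row.items()}
        for row in rows
    ]


# Hypothetical sample data with differently named columns.
first = [{"customer_id": 1, "city": "Paris"}]
second = [{"client_id": 2, "town": "Lyon"}]

# Hypothetical manual pairing: second dataset's column -> first dataset's column.
matching = {"client_id": "customer_id", "town": "city"}

combined = first + apply_matching(second, matching)
print(combined)
```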

Once you have chosen your settings, click the green Submit and Run button. A green gear representing the operation then appears in your project's flow, linked to both input datasets and to the resulting output dataset.

Concat operation submitting button

Concat operation gear and output dataset on the flow