In Tranformation
page, convert extracted data into a suitable format for analysis. This stage includes data cleaning, filtering, aggregation, and normalization. Enrich data by integrating it with additional information.
The data will initially be transferred to the DuckDB
database within the designated {project_name}
schema before undergoing transformation for integration into the target databases. As an illustration, in the case of a project named “datasync_pipeline”, the data will be relocated to the datasync_pipeline
table schema.
By default the select query will be formed and added in the transformation script based on the first table extracted in the pipeline. The script can be further modified based on the requirement.
The Extracted Tables list shows the list of tables extracted with its column and its data type. Double clicking the table or column will allow to include them in the code editor for easy transformation.
Sample Transformation scripts:
select date_part('year', SalesDate) as "year", date_part('month',SalesDate) as "month", sum(Amount) as "TotalAmount" from datasync_pipeline.sales
group by date_part('year',SalesDate),date_part('month',SalesDate)
Note: The Transform Tables and Extracted Tables can be minimized to utilize the code editor area fully.
Note: The preview option only works after the source table data has been moved to destination database. Also, the preview can be minimized once utilized.
In the Transform section, users have the capability to incorporate multiple tables into their projects.
Drag and drop the transform tables to reorder the sequence in which the transform tables should be executed, as well as reuse these tables for future analyses.
When users desire to incorporate Primary keys into the transform tables, they should input the primary key values during the process of adding or editing the tables.
Save and Transform
button. The tables will be transferred to the destination database during this process. Now the data will be transformed and moved to the destinationSave as Draft
can be used to switch between transform and other tabs without losing the transformation query.IMPORTANT: After finishing the transformation, users can see the results in Destination tab.