Structured Query Language (SQL) is a programming language for storing and processing information in a relational database. A relational database stores information in tabular form, with rows and columns representing different data attributes and the various relationships between the data values.
The config section in a YAML file includes the following properties:
Connectorname: SQL
host: Hostname or IP address of the server
port: Server running port
username: Username
password: Password
database: Database
drivername: mssql+pyodbc

Click the Data Hub icon on the Navigation Pane.
Click Add Pipeline and provide the new pipeline’s name.
Select the MSSQL template.
| Parameters | Description |
|---|---|
| Host | Specify the hostname of the MSSQL server. |
| Port | Specify the port number of the MSSQL server (default is 1433). |
| Username | Provide the username to authenticate with the MSSQL server. |
| Password | Provide the password to authenticate with the MSSQL server. |
| Database | Specify the name of the MSSQL database from which data will be extracted. |
| Driver Name | Specify the driver name for connecting to MSSQL (e.g., mssql+pyodbc). |
| Driver | Specify the ODBC driver to use for connecting to MSSQL (e.g., ODBC+Driver+17+for+SQL+Server). |
| Select: Tablename(s) | Specify the name(s) of the table(s) to load from the MSSQL server. |
| Metadata (Optional): Replication Method | Specify the replication method for the table(s). Options are FULL_TABLE or INCREMENTAL. |
| Metadata (Optional): Replication Key | Specify the replication key for incremental replication. This key identifies new or updated records. |
| Metadata (Optional): Replication Value | Specify the replication value from which incremental replication should start. |
| Metadata (Optional): Primary Key | Column(s) that uniquely identify each record in the source table. Used to ensure data consistency during replication. |
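
The example values for Driver Name (mssql+pyodbc) and Driver (ODBC+Driver+17+for+SQL+Server) follow the SQLAlchemy URL convention for pyodbc. As a minimal sketch, assuming the connector combines the parameters into such a URL (the page does not confirm this), the fields would fit together as shown below. The function name `build_connection_url` and all values in `example_cfg` are hypothetical placeholders.

```python
from urllib.parse import quote_plus


def build_connection_url(cfg: dict) -> str:
    """Assemble a SQLAlchemy-style URL from the connector's config fields.

    Assumption: the connector builds a URL of this shape; the page only
    shows the individual fields.
    """
    # Escape credentials so characters such as '@' or '/' cannot break the URL.
    user = quote_plus(cfg["username"])
    password = quote_plus(cfg["password"])
    return (
        f"{cfg['drivername']}://{user}:{password}"
        f"@{cfg['host']}:{cfg['port']}/{cfg['database']}"
        f"?driver={cfg['driver']}"
    )


# Placeholder values for illustration only.
example_cfg = {
    "host": "db.example.com",
    "port": 1433,
    "username": "etl_user",
    "password": "p@ss/word",
    "database": "Retail",
    "drivername": "mssql+pyodbc",
    "driver": "ODBC+Driver+17+for+SQL+Server",
}

print(build_connection_url(example_cfg))
```

The driver value is already written with `+` in place of spaces, which is why it can be appended to the query string unchanged.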



Click the Run Now button.


Click the Edit DataSource option to view the created tables.

In the metadata section, define the data refresh mode. There are two modes: INCREMENTAL and FULL_TABLE.
In INCREMENTAL mode, data replication uses both the primary_key and the replication_key. The replication_key is a column that contains date or timestamp values and determines the point from which incremental fetching begins. The primary_key uniquely identifies each record. During each sync, the system uses the latest replication_key value already imported to fetch only newer records, while the primary_key maintains data integrity and prevents duplicates.
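
The behaviour described above can be illustrated with a small in-memory sketch. It is not the connector's implementation: the function `incremental_sync`, the dictionary used as a destination, and the sample rows are all hypothetical, and the sketch only assumes that rows are filtered on the replication key and upserted by primary key.

```python
from datetime import datetime


def incremental_sync(source_rows, destination, last_value,
                     replication_key, primary_key):
    """Upsert rows newer than last_value and return the new bookmark."""
    newest = last_value
    for row in source_rows:
        if row[replication_key] > last_value:
            # Keyed by primary key, so a re-sent record overwrites
            # the earlier copy instead of creating a duplicate.
            destination[row[primary_key]] = row
            newest = max(newest, row[replication_key])
    return newest


rows = [
    {"id": 1, "last_modified_on": datetime(2023, 7, 18), "name": "old"},
    {"id": 2, "last_modified_on": datetime(2023, 7, 20), "name": "new"},
    {"id": 1, "last_modified_on": datetime(2023, 7, 21), "name": "updated"},
]
destination = {1: rows[0]}  # state left by an earlier sync
bookmark = incremental_sync(rows, destination, datetime(2023, 7, 19),
                            "last_modified_on", "id")
print(bookmark, sorted(destination))
```

The returned bookmark plays the role of the stored replication value: the next sync would start from it rather than from the original date.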
metadata:
  TableName:
    replication_method: INCREMENTAL
    replication_key: <Date Column name>
    primary_key: <Primary key column name>

FULL_TABLE mode fetches data based on the date column named in replication_key, starting from the date given in replication_value. Once the pipeline is scheduled, the replication value is updated automatically from the imported data.
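
The FULL_TABLE template also carries interval_type and interval_value fields, described only as an integer to add in the chosen interval type. One plausible reading, which this page does not confirm, is that the interval is added to replication_value to obtain the next boundary. The sketch below shows that date arithmetic only; `add_interval` is a hypothetical helper, and the month and year handling (clamping to the last day of the month) is an assumption.

```python
import calendar
from datetime import datetime, timedelta


def add_interval(start: datetime, interval_type: str, interval_value: int) -> datetime:
    """Return start shifted forward by interval_value units of interval_type."""
    if interval_type in ("days", "hours", "minutes"):
        return start + timedelta(**{interval_type: interval_value})
    if interval_type == "year":
        # A year is treated as twelve calendar months.
        return add_interval(start, "month", interval_value * 12)
    if interval_type == "month":
        months = start.month - 1 + interval_value
        year = start.year + months // 12
        month = months % 12 + 1
        # Clamp the day so 31 January plus one month lands on the
        # last day of February instead of raising an error.
        day = min(start.day, calendar.monthrange(year, month)[1])
        return start.replace(year=year, month=month, day=day)
    raise ValueError(f"unsupported interval_type: {interval_type}")


start = datetime.strptime("2023-07-19 00:00:00", "%Y-%m-%d %H:%M:%S")
window_end = add_interval(start, "days", 6)
print(start, "->", window_end)
```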
metadata:
  TableName:
    replication_method: FULL_TABLE
    replication_key: Column name
    replication_value: column value that data starts from
    interval_type: days/hours/minutes/year/month
    interval_value: integer value to add in interval type

version: 1
encrypt_credentials: false
plugins:
  extractors:
    - name: tap_postgres
      connectorname: MSSQL
      config:
        host: Hostname or IP address of the server
        port: Server running port
        username: Username
        password: Password
        database: Database
        drivername: mssql+pyodbc
        # If ODBC Driver 18 for SQL Server is installed, use the following driver: ODBC+Driver+18+for+SQL+Server
        driver: ODBC+Driver+17+for+SQL+Server
      select:
        - TABLE1
        - TABLE2
      metadata:
        TABLE1:
          replication_method: INCREMENTAL
          replication_key: last_modified_on
          primary_key: id
        TABLE2:
          replication_method: INCREMENTAL
          replication_key: last_modified_on
          primary_key: id

version: 1
encrypt_credentials: false
plugins:
  extractors:
    - name: tap_postgres
      connectorname: MSSQL
      config:
        host: Hostname or IP address of the server
        port: Server running port
        username: Username
        password: Password
        database: Database
        drivername: mssql+pyodbc
        # If ODBC Driver 18 for SQL Server is installed, use the following driver: ODBC+Driver+18+for+SQL+Server
        driver: ODBC+Driver+17+for+SQL+Server
      select:
        - TABLE1
        - TABLE2
      metadata:
        TABLE1:
          replication_method: FULL_TABLE
          replication_key: last_modified_on
          replication_value: 2023-07-19 00:00:00
          interval_type: days
          interval_value: 6
        TABLE2:
          replication_method: FULL_TABLE
          replication_key: last_modified_on
          replication_value: 2023-07-19 00:00:00
          interval_type: days
          interval_value: 6

To execute a custom SQL query during extraction, add the query field under the properties section.
Note: When a custom query is used, the select parameter does not extract data from the table name specified. Instead, the name provided in select is used only to create and populate the destination table that stores the query result.
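
The effect of the note above can be sketched with Python's built-in sqlite3 module standing in for both the MSSQL source and the destination. This is only an illustration of the described behaviour, not the connector's mechanism; `load_query_result` and the sample rows are hypothetical.

```python
import sqlite3


def load_query_result(conn, query, destination_table):
    """Create destination_table from the result of query.

    destination_table plays the role of the name given in select: it is
    never read from, only created and filled with the query result.
    The name is assumed to come from trusted configuration.
    """
    conn.execute(f'DROP TABLE IF EXISTS "{destination_table}"')
    conn.execute(f'CREATE TABLE "{destination_table}" AS {query}')


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Product_Details (id INTEGER, unit_cost REAL)")
conn.executemany("INSERT INTO Product_Details VALUES (?, ?)",
                 [(1, 10.0), (2, 20.0)])

load_query_result(
    conn,
    "SELECT AVG(unit_cost) AS average_unit_cost FROM Product_Details",
    "aggregated_table",
)
rows = conn.execute("SELECT average_unit_cost FROM aggregated_table").fetchall()
print(rows)
```

No table called aggregated_table exists in the source here; it appears only because the query result is stored under that name.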
version: 1.0.1
encrypt_credentials: false
direct_target_import: false
union_all_tables: true
add_dbname_column: false
direct_load_to_destination: true
plugins:
  extractors:
    - name: MSSQL
      connectorname: MSSQL
      schemaname: dbo
      config:
        host:
        port: 1433
        username: sa
        database: Retail
        password:
        drivername: mssql+pyodbc
        driver: ODBC+Driver+17+for+SQL+Server
      properties:
        query: SELECT AVG(unit_cost) AS average_unit_cost FROM [Retail].[dbo].[Product_Details];
      metadata:
      select:
        - aggregated_table

When using the INCREMENTAL method with a custom query:
version: 1.0.1
encrypt_credentials: false
direct_target_import: false
union_all_tables: true
add_dbname_column: false
direct_load_to_destination: true
plugins:
  extractors:
    - name: MSSQL
      connectorname: MSSQL
      schemaname: dbo
      config:
        host:
        port: 1433
        username: sa
        database: Retail
        password:
        drivername: mssql+pyodbc
        driver: ODBC+Driver+17+for+SQL+Server
      properties:
        query: SELECT id, last_modified_on, product_name FROM [Retail].[dbo].[Product_Details];
      metadata:
        aggregated_table:
          replication_method: INCREMENTAL
          replication_key: last_modified_on
          primary_key: id
      select:
        - aggregated_table
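
In the INCREMENTAL custom-query template, the table named in metadata also appears in select, and the entry carries both replication_key and primary_key. A quick pre-flight check along those lines might look like the sketch below. It is a hypothetical helper, not part of the product: `check_extractor` and its rules are assumptions drawn only from the templates on this page, and the extractor is passed as an already-parsed dictionary.

```python
def check_extractor(extractor: dict) -> list:
    """Return a list of human-readable problems found in one extractor entry."""
    problems = []
    selected = set(extractor.get("select") or [])
    for table, meta in (extractor.get("metadata") or {}).items():
        if table not in selected:
            problems.append(f"{table}: listed in metadata but not in select")
        if meta.get("replication_method") == "INCREMENTAL":
            # The INCREMENTAL templates always pair these two keys.
            for key in ("replication_key", "primary_key"):
                if not meta.get(key):
                    problems.append(f"{table}: INCREMENTAL entry is missing {key}")
    return problems


extractor = {
    "name": "MSSQL",
    "select": ["aggregated_table"],
    "metadata": {
        "aggregated_table": {
            "replication_method": "INCREMENTAL",
            "replication_key": "last_modified_on",
            "primary_key": "id",
        }
    },
}
print(check_extractor(extractor))
```

An empty list means the entry matches the shape of the template; an empty metadata section, as in the non-incremental custom-query template, also passes.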