Implementing an Azure Data Solution: Integration Runtime for Copy Activity in Azure Data Factory V2

Integration Runtime for Copy Activity

Question

You have an Azure virtual machine that has Microsoft SQL Server installed. The database on the virtual machine contains a table named Table1.

You need to copy the data from Table1 to an Azure Data Lake Storage Gen2 account by using an Azure Data Factory V2 copy activity.

Which type of integration runtime should you use?

Answers

Explanations

Click on the arrows to vote for the correct answer

A. B. C.

B

Copying between a cloud data source and a data source in private network: if either source or sink linked service points to a self-hosted IR, the copy activity is executed on that self-hosted Integration Runtime.

https://docs.microsoft.com/en-us/azure/data-factory/concepts-integration-runtime#determining-which-ir-to-use

To copy data from Table1 on a virtual machine with SQL Server installed to an Azure Data Lake Storage Gen2 account using Azure Data Factory V2 copy activity, you need to configure an Integration Runtime (IR) to establish a secure communication channel between the source and destination.

There are three types of Integration Runtimes that can be used with Azure Data Factory V2:

  1. Azure Integration Runtime: This runtime is a fully managed service that can be used to copy data from Azure services like Azure Blob storage, Azure SQL Database, and Azure Data Lake Storage Gen1/Gen2 to other Azure services. It is recommended to use this runtime when the source and destination are both in Azure and there are no specific connectivity requirements.

  2. Self-hosted Integration Runtime: This runtime is a lightweight and agent-based runtime that can be installed on an on-premises machine or a virtual machine in Azure. It is recommended to use this runtime when the source or destination is on-premises or behind a firewall, and there is a need for a secure and encrypted communication channel.

  3. Azure-SSIS Integration Runtime: This runtime is used to run SQL Server Integration Services (SSIS) packages in Azure. It is recommended to use this runtime when there is a requirement to migrate or run existing SSIS packages in the cloud.

In this scenario, since the source database is on an Azure virtual machine, and the destination is an Azure Data Lake Storage Gen2 account, it is recommended to use a self-hosted integration runtime. By installing a self-hosted integration runtime on the virtual machine where SQL Server is installed, you can securely copy data from Table1 to the Azure Data Lake Storage Gen2 account.

Therefore, the correct answer is B. self-hosted integration runtime.