Gregory is a Data Engineer of Whizlabs Inc., working on ETL (extract, transform, load) operations of data pipelines on Azure.
He needs to access Azure Synapse from Azure Databricks using the data processing pipeline.
Which of the following tools can he use in this scenario?
Click on the arrows to vote for the correct answer
A. B. C. D.Correct Answer: B.
To access Azure Synapse from Azure Databricks using the data processing pipeline, Gregory can use the Azure Synapse connector.
Azure Synapse connector is a tool that allows you to read and write data to/from Azure Synapse Analytics (formerly SQL Data Warehouse) from Azure Databricks. It is an efficient way to load data into Databricks from Synapse, as well as to move data from Databricks to Synapse.
Azure Synapse connector uses JDBC (Java Database Connectivity) to connect to Synapse, and it supports reading and writing data using the DataFrame API or the SQL API.
Here are some benefits of using the Azure Synapse connector:
Azure Data Lake Storage Gen2 and Polybase are not directly related to connecting Azure Synapse and Azure Databricks. Azure Data Lake Storage Gen2 is a cloud-based data lake that provides scalable, secure, and cost-effective storage for big data analytics. Polybase is a technology that allows you to query data from external data sources such as Hadoop or Azure Blob Storage directly from SQL Server or Azure Synapse Analytics.
Spark driver is also not a tool that can be used to connect Azure Synapse and Azure Databricks. Spark driver is a component of Apache Spark that manages the execution of Spark applications and coordinates with the cluster manager to allocate resources for the application.