Azure Data Lake Storage: Ensuring Data Redundancy and Cost-effectiveness

Achieving Data Redundancy in Azure Data Lake Storage

Question

A famous CRM solution provider has its data stored in Azure Data Lake Storage.

The total data size is 25 TB.

You must ensure data redundancy in a way that if the region on which the data is stored goes through an outage, the data should be available immediately in another region.

Also, the solution should be cost-effective.

You plan to use Read Access Geo-Redundant Storage (RA-GRS)

Will this solve the issue?

Answers

Explanations

Click on the arrows to vote for the correct answer

A. B.

Correct Answer: A.

When you use geo-redundant storage, the data is copied to a secondary region as well.

Similarly, Read Access Geo-redundant storage also does the same, but in a cost-effective manner.

The limitation with this is that the data in the secondary region will be available only for reading,

So, RA-GRS will be the right choice.

Reference:

For more details on column store indexes, please refer to the following document.

The use of Read Access Geo-Redundant Storage (RA-GRS) can provide a solution for ensuring data redundancy in case of an outage in the region where the data is stored.

RA-GRS provides geo-replication of data to a secondary region in the same geographic region but far enough away to avoid the same disasters, providing high availability and data durability. The data is replicated asynchronously to the secondary region, which means that there might be a delay between updates to the primary region and the secondary region.

In case of an outage in the primary region, the secondary region can immediately take over the workload, providing continuous availability of data to users. However, during the failover process, data might not be immediately available in the secondary region, and it could take some time for the system to catch up with the changes that occurred in the primary region.

RA-GRS is cost-effective because it does not require the use of a standby replica, which can be expensive. Instead, it relies on asynchronous replication to a secondary region, which can provide the same level of redundancy without the additional costs.

Therefore, the correct answer is A. Yes, RA-GRS can solve the issue of data redundancy in a cost-effective way, ensuring data availability in case of an outage in the primary region.