AWS Big Data Specialty: Analyzing Data in Amazon S3 with SQL

Enable Interactive Query Service on Amazon S3 Data | MSP Bank

Question

MSP Bank, Limited is a leading Japanese monetary institution that provides a full range of financial products and services to both institutional and individual customers.

It is headquartered in Tokyo.

MSP Bank hosts its existing infrastructure on premises. It runs many web applications and databases, a data warehouse built on Teradata, NoSQL databases, and other types of data stores.

MSP Bank has many data assets, including logs, documents, Excel files, CSV files, PDF documents, and others.

Files of different formats are being loaded into S3 buckets. MSP Bank wants to enable an interactive query service on the files loaded in S3 that makes it easy to analyze data directly in Amazon Simple Storage Service (Amazon S3) using standard SQL.

Select 1 option.

Answers

Explanations

A. Amazon Aurora
B. Amazon RDS
C. Amazon Athena
D. Amazon Neptune

Answer: C

Option A is incorrect. Amazon Aurora (Aurora) is a fully managed relational database engine that's compatible with MySQL and PostgreSQL.

You already know how MySQL and PostgreSQL combine the speed and reliability of high-end commercial databases with the simplicity and cost-effectiveness of open-source databases.

The code, tools, and applications you use today with your existing MySQL and PostgreSQL databases can be used with Aurora.

With some workloads, Aurora can deliver up to five times the throughput of MySQL and up to three times the throughput of PostgreSQL without requiring changes to most of your existing applications.

Aurora is a relational database engine: data must first be loaded into the database, so it is not a service for querying files in place in Amazon S3.

https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/CHAP_AuroraOverview.html

Option B is incorrect. Amazon Relational Database Service (Amazon RDS) is a web service that makes it easier to set up, operate, and scale a relational database in the cloud.

It provides cost-efficient, resizable capacity for an industry-standard relational database and manages common database administration tasks.

Amazon RDS hosts relational databases: data must first be loaded into a database instance, so it is not a service for querying files in place in Amazon S3.

https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/Welcome.html

Option C is correct. Amazon Athena is an interactive query service that makes it easy to analyze data directly in Amazon Simple Storage Service (Amazon S3) using standard SQL.

With a few actions in the AWS Management Console, you can point Athena at your data stored in Amazon S3 and begin using standard SQL to run ad-hoc queries and get results in seconds.
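
"Pointing Athena at your data" usually means defining an external table whose location is an S3 prefix. The sketch below only builds such a DDL statement as a string, assuming comma-delimited CSV files with a header row. The database, table, column, and bucket names are hypothetical and not part of the question.

```python
def build_csv_table_ddl(database, table, columns, s3_location):
    """Return an Athena DDL statement for an external table over CSV files.

    columns is a list of (name, athena_type) pairs. s3_location is the S3
    prefix (folder) that holds the files, not a single object.
    """
    if not s3_location.startswith("s3://"):
        raise ValueError("s3_location must start with s3://")
    if not s3_location.endswith("/"):
        s3_location += "/"  # Athena expects a folder-style location
    column_sql = ",\n".join(f"  `{name}` {col_type}" for name, col_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{database}`.`{table}` (\n"
        f"{column_sql}\n"
        ")\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        f"LOCATION '{s3_location}'\n"
        "TBLPROPERTIES ('skip.header.line.count' = '1')"
    )


# Hypothetical names, used only for illustration.
ddl = build_csv_table_ddl(
    "msp_bank_logs",
    "web_access",
    [("request_time", "string"), ("status", "int")],
    "s3://example-msp-bank-logs/web",
)
print(ddl)
```

Once a table like this exists, an ad hoc query such as `SELECT status, COUNT(*) FROM msp_bank_logs.web_access GROUP BY status` reads the files directly from S3.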

Athena is serverless, so there is no infrastructure to set up or manage, and you pay only for the queries you run.

Athena scales automatically, executing queries in parallel, so results are fast even with large datasets and complex queries.

https://docs.aws.amazon.com/athena/latest/ug/what-is.html
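
Queries can also be submitted programmatically rather than through the console. The following is a minimal sketch, assuming a client object that behaves like `boto3.client("athena")`. The helper name `run_athena_query` and the polling interval are illustrative choices, and the client is passed in so the function can be exercised without AWS credentials.

```python
import time


def run_athena_query(client, sql, database, output_location, poll_seconds=1.0):
    """Submit sql to Athena and poll until the query reaches a final state.

    client is expected to expose start_query_execution and
    get_query_execution, as the boto3 Athena client does.
    Returns (query_execution_id, final_state).
    """
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        # Athena writes result files to this S3 location.
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            return query_id, state
        time.sleep(poll_seconds)
```

With real credentials, the call would look like `run_athena_query(boto3.client("athena"), "SELECT 1", "default", "s3://<your-results-bucket>/athena/")`, where the results bucket is a placeholder you supply.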

Option D is incorrect. Amazon Neptune is a fast, reliable, fully managed graph database service that makes it easy to build and run applications that work with highly connected datasets.

The core of Neptune is a purpose-built, high-performance graph database engine that is optimized for storing billions of relationships and querying the graph with millisecond latency.

https://docs.aws.amazon.com/neptune/latest/userguide/intro.html

For MSP Bank, the best option is Amazon Athena, because it provides an interactive query service over the files loaded in S3 using standard SQL.

Amazon Athena is a serverless, interactive query service that makes it easy to analyze data directly in Amazon S3 using standard SQL. It fits MSP Bank's requirement because it queries files such as CSV and log data in place in S3, with no infrastructure to set up. It is also cost-effective, because MSP Bank pays only for the queries it runs.
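
To make the pay-per-query model concrete, here is a rough cost sketch. It assumes a price of 5 USD per terabyte scanned and a 10 MB minimum charge per query. Both figures, and the use of binary terabytes, are assumptions for illustration, so check the current Athena pricing page for your region before relying on them.

```python
TB = 1024 ** 4                     # assumed: binary terabyte
MIN_BILLED_BYTES = 10 * 1024 ** 2  # assumed: 10 MB minimum per query


def estimate_query_cost(bytes_scanned, price_per_tb_usd=5.0):
    """Estimate the cost in USD of one query from the bytes it scans."""
    billed = max(bytes_scanned, MIN_BILLED_BYTES)
    return billed / TB * price_per_tb_usd


# A query that scans half a terabyte of log files.
print(estimate_query_cost(TB // 2))
```

Because cost follows bytes scanned, compressing or partitioning the data, or converting it to a columnar format, generally lowers the bill for the same query.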

Amazon Aurora is a fully managed relational database engine that is compatible with MySQL and PostgreSQL. It is a good option for MSP Bank to host its web applications and databases, but it is not designed for querying data stored in S3.

Amazon RDS (Relational Database Service) is another fully managed database service that is designed for running and scaling traditional relational databases such as MySQL, PostgreSQL, and Oracle. It is not the right solution for MSP Bank's requirement of analyzing data stored in S3.

Amazon Neptune is a fully managed graph database service that is optimized for storing and querying highly connected data. It is not the right solution for MSP Bank's requirement of analyzing data stored in S3.

Therefore, the best option for MSP Bank is Amazon Athena.