Marqueguard Analytics - AWS Data Pipeline and Key Artifacts

Question

Marqueguard is a social media monitoring company headquartered in Brighton, England.

Marqueguard sells three different products: Analytics, Audiences, and Insights.

Marqueguard Analytics is a "self-serve application", or software as a service, that archives social media data so that companies can track specific segments and analyze their brands' online presence. The tool's coverage includes blogs, news sites, forums, videos, reviews, images and social networks such as Twitter and Facebook.

Users can search data by using Text and Image Search, and use charting, categorization, sentiment analysis and other features to provide further information and analysis.

Marqueguard has access to over 80 million sources. Marqueguard hosts its entire infrastructure on AWS and uses AWS Data Pipeline as its data integration mechanism.

Marqueguard stores its data in several AWS services, including DynamoDB, RDS, Redshift and S3.

When AWS Data Pipeline runs a pipeline, it compiles the pipeline components to create a set of actionable instances.

Each instance contains all the information for performing a specific task.

The complete set of instances is the to-do list of the pipeline.

AWS Data Pipeline hands the instances out to task runners to process.

Please identify the key artifacts.

Select 2 options.

Answers

Explanations

A. B. C. D.

Answer: A,D.

Option A is correct. Besides the default instance types, AWS Data Pipeline supports only the following EC2 instance types for running pipeline work activities:

General Purpose.

Compute optimized.

Memory optimized.

Storage optimized.

https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-ec2-supported-instance-types.html

Option B is incorrect. Besides the default instance types, AWS Data Pipeline supports the following EC2 instance types for EMR clusters that run pipeline work activities, a list that includes accelerated computing:

General Purpose.

Compute optimized.

Memory optimized.

Storage optimized.

Accelerated computing.

https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-emr-supported-instance-types.html

Option C is incorrect. Besides the default instance types, AWS Data Pipeline supports only the following EC2 instance types for running pipeline work activities, and accelerated computing is not among them:

General Purpose.

Compute optimized.

Memory optimized.

Storage optimized.

https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-ec2-supported-instance-types.html

Option D is correct. Besides the default instance types, AWS Data Pipeline supports the following EC2 instance types for EMR clusters that run pipeline work activities:

General Purpose.

Compute optimized.

Memory optimized.

Storage optimized.

Accelerated computing.

https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-emr-supported-instance-types.html
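The difference between the two lists can be captured in a small lookup. The following is a minimal sketch in Python, assuming only the category names given in the explanations above; the dictionary layout and the `is_supported` function are illustrative names and are not part of any AWS API.

```python
# Instance categories as listed in the explanations above.
# The structure and names here are hypothetical, for illustration only.
COMMON_CATEGORIES = {
    "general purpose",
    "compute optimized",
    "memory optimized",
    "storage optimized",
}

SUPPORTED_CATEGORIES = {
    # EC2 resources that run pipeline work activities
    "Ec2Resource": set(COMMON_CATEGORIES),
    # EMR clusters additionally list accelerated computing
    "EmrCluster": COMMON_CATEGORIES | {"accelerated computing"},
}


def is_supported(resource_type: str, category: str) -> bool:
    """Return True if the category appears in the list for the resource type."""
    return category.strip().lower() in SUPPORTED_CATEGORIES[resource_type]


print(is_supported("Ec2Resource", "Accelerated computing"))  # False
print(is_supported("EmrCluster", "Accelerated computing"))   # True
```

The only category that separates the two lists is accelerated computing, which is why the EC2 option with four categories and the EMR option with five categories are the correct pair.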

The question is asking about the key artifacts used by Marqueguard, which is a social media monitoring company that uses AWS Data Pipeline as their data integration mechanism. The question provides information about Marqueguard's products, infrastructure, and data services, and asks the candidate to identify two key artifacts related to EC2 instance types used for running pipeline work activities.

Option A states that Marqueguard uses only the following EC2 instance types, besides the default instance types, to run pipeline work activities: General Purpose, Compute optimized, Memory optimized, and Storage optimized.

Option B states that Marqueguard uses only the following EC2 instance types for EMR (Elastic MapReduce) clusters, besides the default instance types, to run pipeline work activities: General Purpose, Compute optimized, Memory optimized, and Storage optimized.

Option C states that Marqueguard uses only the following EC2 instance types, besides the default instance types, to run pipeline work activities: General Purpose, Compute optimized, Memory optimized, Storage optimized, and Accelerated computing.

Option D states that Marqueguard uses only the following EC2 instance types for EMR clusters, besides the default instance types, to run pipeline work activities: General Purpose, Compute optimized, Memory optimized, Storage optimized, and Accelerated computing.

The correct options are A and D. They describe the instance types that AWS Data Pipeline supports for EC2 resources and for EMR clusters, respectively.

Option A is correct because, besides the default instance types, the EC2 instance types supported for running pipeline work activities are general-purpose, compute-optimized, memory-optimized, and storage-optimized. Accelerated computing is not on that list.

Option B is incorrect because it leaves accelerated computing out of the list for EMR clusters. EMR is a service that allows users to process large amounts of data using Hadoop, Spark, or other big data frameworks.

Option C is incorrect because it adds accelerated computing to the list for EC2 instances, which the supported instance types for EC2 do not include.

Option D is correct because the list for EMR clusters covers all five categories: general-purpose, compute-optimized, memory-optimized, storage-optimized, and accelerated computing.

In summary, the key artifacts identified in this question are the EC2 instance types that can run pipeline work activities directly and as part of EMR clusters. The correct options are A and D: four categories for EC2 instances, and the same four plus accelerated computing for EMR clusters.
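To show where an instance type is actually chosen, here is a rough sketch that builds a pipeline definition containing one `Ec2Resource` and one `EmrCluster` object. The helper functions, object ids, and instance type values are assumptions for illustration; whether a given instance type is accepted depends on the region and on the supported-instance-type pages linked above.

```python
import json


def ec2_resource(object_id: str, instance_type: str) -> dict:
    """Build an Ec2Resource object; instance_type is a placeholder value."""
    return {
        "id": object_id,
        "name": object_id,
        "type": "Ec2Resource",
        "instanceType": instance_type,
        "terminateAfter": "2 Hours",
    }


def emr_cluster(object_id: str, master_type: str, core_type: str, core_count: int) -> dict:
    """Build an EmrCluster object; field values in a definition are strings."""
    return {
        "id": object_id,
        "name": object_id,
        "type": "EmrCluster",
        "masterInstanceType": master_type,
        "coreInstanceType": core_type,
        "coreInstanceCount": str(core_count),
    }


# Hypothetical ids and instance types, chosen only to illustrate the layout.
definition = {
    "objects": [
        ec2_resource("WorkerInstance", "m3.medium"),
        emr_cluster("AnalyticsCluster", "m3.xlarge", "m3.xlarge", 2),
    ]
}

print(json.dumps(definition, indent=2))
```

An EC2 resource takes a single `instanceType`, while an EMR cluster sets the type per node role, so the two lists of supported categories apply to different fields.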