Setting up a data transformation and ingestion pipeline for IoT data with legacy mainframe system integration

Transforming and ingesting IoT data with AWS services

Question

You work for a telecommunications and internet service provider that has been in business for decades.

Over the decades, the company has built various types of application systems and database technologies on the evolving platforms of the time.

Therefore, you have massive amounts of customer and company operational data on legacy mainframe systems and their associated data stores, such as aging relational databases. Your team is building a machine learning model that uses streaming data from the company's in-home routers, which function as IoT (Internet of Things) devices, to help the company sell additional services to its customer base.

The IoT data is unstructured, so you need to transform it to CSV format before ingesting it into the S3 buckets you use to house your datasets for your SageMaker model.

You also need to enrich the IoT data with real-time data from your legacy mainframe systems as the data streams into your AWS cloud environment. Which set of AWS services would you use to set up this data transformation and ingestion pipeline?


Explanations


Answer: C.

Option A is incorrect.

You can't enrich your IoT data with your mainframe data without first getting your mainframe data into your AWS cloud environment.

Option B is incorrect.

You can't write directly from your mainframe systems to S3.

You could use AWS Storage Gateway to get your mainframe data into your AWS cloud environment, but AWS Storage Gateway doesn't have the capability to enrich your IoT data.

Option C is correct.

You can use AWS Storage Gateway in its File Gateway configuration, over an NFS (Network File System) connection, to move data from your legacy mainframe systems into your AWS cloud environment.

You can then use the Kinesis Data Firehose Lambda integration to enrich the IoT data with your legacy mainframe data and convert it to CSV.

Finally, Kinesis Data Firehose delivers the transformed records returned by your Lambda function to the S3 bucket used by your SageMaker model.
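When the Firehose Lambda integration is enabled, Firehose hands the function a batch of base64-encoded records and expects each one back with its recordId, a result status, and the transformed payload. Below is a minimal sketch of such a transformation function, assuming the File Gateway has landed a JSON customer extract in S3; the bucket, key, and field names are illustrative assumptions, not part of the question.

```python
import base64
import csv
import io
import json

import boto3

s3 = boto3.client("s3")

# Hypothetical location of the mainframe extract that the File Gateway
# has written into S3; both names are placeholders.
MAINFRAME_BUCKET = "legacy-mainframe-extracts"
MAINFRAME_KEY = "customers/latest.json"


def load_mainframe_lookup():
    """Load the customer records exported from the mainframe, keyed by ID."""
    body = s3.get_object(Bucket=MAINFRAME_BUCKET, Key=MAINFRAME_KEY)["Body"]
    return {rec["customer_id"]: rec for rec in json.loads(body.read())}


def lambda_handler(event, context):
    lookup = load_mainframe_lookup()
    output = []
    for record in event["records"]:
        payload = json.loads(base64.b64decode(record["data"]))
        extra = lookup.get(payload.get("customer_id"), {})

        # Flatten the enriched record into one CSV row (the csv writer adds
        # the line terminator, so delivered records stay newline-separated).
        buf = io.StringIO()
        csv.writer(buf).writerow([
            payload.get("device_id"),
            payload.get("customer_id"),
            payload.get("bytes_up"),
            payload.get("bytes_down"),
            extra.get("plan_tier"),        # assumed mainframe fields
            extra.get("tenure_years"),
        ])
        output.append({
            "recordId": record["recordId"],  # must echo Firehose's ID
            "result": "Ok",
            "data": base64.b64encode(buf.getvalue().encode()).decode(),
        })
    return {"records": output}
```

In practice you would cache the lookup across invocations rather than re-reading it for every batch, but the shape of the event and response is the fixed part of the Firehose transformation contract.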

Option D is incorrect.

AWS Snowball moves data from your on-premises environment to your AWS cloud environment in a one-time batch.

This wouldn't work since you need real-time integration of your legacy data with your IoT data.

Reference:

Please see the AWS whitepaper titled Building Big Data Storage Solutions (Data Lakes) for Maximum Flexibility.

The correct answer is C. Use AWS Storage Gateway in its File Gateway configuration to move your legacy mainframe data into your AWS environment, then use Kinesis Data Firehose with its Lambda integration to enrich the streaming IoT data with that mainframe data and transform it to CSV before it is delivered to the S3 bucket used by your SageMaker model.

Explanation: The given scenario requires an architecture to collect data from IoT devices, enrich it with real-time data from legacy mainframe systems, transform the data into CSV format, and store it in S3 buckets for further analysis using SageMaker. The solution needs to be scalable, cost-effective, and easy to manage. Here, we can use the following AWS services:

  1. AWS Storage Gateway: A hybrid storage service whose File Gateway configuration exposes an NFS share on premises and stores the files written to it as objects in S3, providing the bridge for the legacy mainframe data.

  2. Kinesis Data Firehose: A fully managed service that enables real-time data delivery and makes it easy to load streaming data into Amazon S3, Amazon Redshift, and Amazon OpenSearch Service (formerly Elasticsearch). A minimal ingestion sketch follows this list.

  3. AWS Lambda: A serverless compute service that runs code in response to events and automatically manages the compute resources required by that code.

  4. S3: A highly scalable and durable object storage service that can store and retrieve any amount of data from anywhere.
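To make the ingestion side concrete, here is a minimal sketch of pushing one router reading into a Firehose delivery stream with boto3; the stream name and payload fields are placeholders.

```python
import json

import boto3

firehose = boto3.client("firehose")

# "iot-router-telemetry" is a placeholder delivery stream name.
reading = {
    "device_id": "router-0042",
    "customer_id": "C-100931",
    "bytes_up": 52311,
    "bytes_down": 981220,
}

# Firehose buffers incoming records and flushes them to the destination.
firehose.put_record(
    DeliveryStreamName="iot-router-telemetry",
    Record={"Data": json.dumps(reading).encode()},
)
```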

Using these services, we can set up the following pipeline to solve the problem statement (a sketch of wiring it together with boto3 follows the list):

  1. Move the legacy mainframe data into AWS with Storage Gateway (File Gateway over NFS), landing it in S3.

  2. Collect the streaming data from the IoT devices with Kinesis Data Firehose.

  3. Use AWS Lambda, through the Firehose transformation integration, to enrich each IoT record with the mainframe data and convert it to CSV format.

  4. Let Firehose deliver the transformed records to the S3 bucket used by SageMaker.
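Wiring steps 2 through 4 together is mostly configuration: the delivery stream names the transformation function as its processor and the SageMaker data bucket as its destination, after which Firehose invokes the Lambda on each buffered batch and delivers whatever it returns. A sketch with boto3 follows; every ARN and name is a placeholder.

```python
import boto3

firehose = boto3.client("firehose")

firehose.create_delivery_stream(
    DeliveryStreamName="iot-router-telemetry",
    DeliveryStreamType="DirectPut",
    ExtendedS3DestinationConfiguration={
        # Role that lets Firehose write to the bucket and invoke the Lambda.
        "RoleARN": "arn:aws:iam::123456789012:role/firehose-delivery-role",
        # Bucket holding the datasets for the SageMaker model.
        "BucketARN": "arn:aws:s3:::sagemaker-training-data",
        # Point Firehose at the enrichment/CSV transformation function.
        "ProcessingConfiguration": {
            "Enabled": True,
            "Processors": [{
                "Type": "Lambda",
                "Parameters": [{
                    "ParameterName": "LambdaArn",
                    "ParameterValue": "arn:aws:lambda:us-east-1:"
                                      "123456789012:function:enrich-and-csv",
                }],
            }],
        },
    },
)
```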

Option A suggests using Kinesis Data Firehose to collect the IoT data and its Lambda integration to enrich and transform it before writing to S3. However, it provides no way to bring the mainframe data into AWS in the first place, so there is nothing for the Lambda function to enrich against.

Option B suggests using AWS Storage Gateway to enrich the IoT data with the legacy mainframe data and transform it into CSV before writing it to S3. However, Storage Gateway only moves data; it has no enrichment or transformation capability, and you cannot write directly from the mainframe systems to S3.

Option C uses both AWS Storage Gateway and Kinesis Data Firehose, and each service does what only it can: Storage Gateway brings the legacy mainframe data into AWS, while Firehose with its Lambda integration performs the real-time enrichment and CSV conversion. This is the combination that satisfies every requirement.

Option D suggests using AWS Snowball to migrate the legacy mainframe data to AWS and then Kinesis Data Firehose to collect the IoT data. However, Snowball is a one-time, offline batch transfer, so it cannot supply the real-time mainframe data that the enrichment step requires.