HikeHills.com (HH) is an online specialty retailer that sells clothing and outdoor recreation gear for trekking, camping, road biking, mountain biking, rock climbing, ice mountaineering, skiing, avalanche protection, snowboarding, fly fishing, kayaking, rafting, road and trail running, and more. HH runs its entire online infrastructure on Java-based web applications hosted on AWS.
HH captures clickstream data and uses a custom-built recommendation engine to recommend products, with the goal of improving sales and understanding customer preferences. It already uses the AWS Kinesis Producer Library (KPL) to collect events and transaction logs and to process the stream. HH is planning to adopt Kinesis Streams to address integration, and the architecture team is planning to define naming standards for each component in the streams.
Please advise.
Select 3 options.
Answer: A, D, G.
Option A is correct - A Kinesis data stream is a set of shards.
Each shard has a sequence of data records.
Each data record has a sequence number that is assigned by Kinesis Data Streams.
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Option B is incorrect - Each shard has a sequence of data records.
A shard is a uniquely identified sequence of data records in a stream.
A stream is composed of one or more shards, each of which provides a fixed unit of capacity.
Each shard can support up to 5 transactions per second for reads, up to a maximum total data read rate of 2 MB per second, and up to 1,000 records per second for writes, up to a maximum total data write rate of 1 MB per second (including partition keys).
The data capacity of your stream is a function of the number of shards that you specify for the stream.
The total capacity of the stream is the sum of the capacities of its shards.
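The capacity rule above is simple arithmetic, so a small sketch may help. It assumes only the per-shard limits quoted in the text; the names `StreamCapacity`, `stream_capacity`, and `shards_needed` are hypothetical helpers for illustration and are not part of any AWS SDK.

```python
from dataclasses import dataclass
from math import ceil

# Per-shard limits as quoted in the text above.
SHARD_WRITE_MB_PER_SEC = 1
SHARD_WRITE_RECORDS_PER_SEC = 1000
SHARD_READ_MB_PER_SEC = 2
SHARD_READ_TX_PER_SEC = 5


@dataclass(frozen=True)
class StreamCapacity:
    """Aggregate capacity of a stream (hypothetical helper type)."""
    shard_count: int
    write_mb_per_sec: int
    write_records_per_sec: int
    read_mb_per_sec: int
    read_tx_per_sec: int


def stream_capacity(shard_count: int) -> StreamCapacity:
    """Total capacity is the sum of the fixed capacities of the shards."""
    if shard_count < 1:
        raise ValueError("a stream is composed of one or more shards")
    return StreamCapacity(
        shard_count=shard_count,
        write_mb_per_sec=shard_count * SHARD_WRITE_MB_PER_SEC,
        write_records_per_sec=shard_count * SHARD_WRITE_RECORDS_PER_SEC,
        read_mb_per_sec=shard_count * SHARD_READ_MB_PER_SEC,
        read_tx_per_sec=shard_count * SHARD_READ_TX_PER_SEC,
    )


def shards_needed(write_mb_per_sec: float,
                  write_records_per_sec: float,
                  read_mb_per_sec: float) -> int:
    """Smallest shard count whose summed limits cover the given load."""
    return max(
        1,
        ceil(write_mb_per_sec / SHARD_WRITE_MB_PER_SEC),
        ceil(write_records_per_sec / SHARD_WRITE_RECORDS_PER_SEC),
        ceil(read_mb_per_sec / SHARD_READ_MB_PER_SEC),
    )
```

Under these figures, `stream_capacity(4)` reports 4 MB per second of writes and 8 MB per second of reads. The sizing helper ignores uneven partition key distribution, which in practice can overload a single shard well before the stream-wide total is reached.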
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Option C is incorrect - A Kinesis data stream is a set of shards.
Each shard has a sequence of data records.
Each data record has a sequence number that is assigned by Kinesis Data Streams.
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Option D is correct - A data record is the unit of data stored in a Kinesis data stream.
Data records are composed of a sequence number, a partition key, and a data blob, which is an immutable sequence of bytes.
Kinesis Data Streams does not inspect, interpret, or change the data in the blob in any way.
A data blob can be up to 1 MB.
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Option E is incorrect - A Kinesis data stream is a set of shards.
Each shard has a sequence of data records.
Each data record has a sequence number that is assigned by Kinesis Data Streams.
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Option F is incorrect - Each shard has a sequence of data records.
A shard is a uniquely identified sequence of data records in a stream.
A stream is composed of one or more shards, each of which provides a fixed unit of capacity.
Each shard can support up to 5 transactions per second for reads, up to a maximum total data read rate of 2 MB per second, and up to 1,000 records per second for writes, up to a maximum total data write rate of 1 MB per second (including partition keys).
The data capacity of your stream is a function of the number of shards that you specify for the stream.
The total capacity of the stream is the sum of the capacities of its shards.
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Option G is correct - The retention period is the length of time that data records are accessible after they are added to the stream.
A stream's retention period is set to a default of 24 hours after creation.
You can increase the retention period up to 168 hours (7 days).
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Option H is incorrect - The retention period is set at the stream level.
The retention period is the length of time that data records are accessible after they are added to the stream.
A stream's retention period is set to a default of 24 hours after creation.
You can increase the retention period up to 168 hours (7 days).
https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html
Kinesis Streams is a managed service offered by Amazon Web Services (AWS) that allows for the real-time processing of streaming data at scale. When designing Kinesis Streams, it is important to understand its basic concepts and components.
A. Stream is a set of shards, a shard has a sequence of data records. This answer is correct. A stream is a logical grouping of shards, which are the underlying storage units for the data records. Each shard has a sequence of data records and a unique identifier called a shard ID.
B. Shard is a set of streams, a stream has a set of data records. This answer is incorrect. Shards are not a set of streams. Instead, streams are a logical grouping of shards.
C. Stream is a set of records, a record has a sequence of shards. This answer is incorrect. A stream is a logical grouping of shards, and a data record is stored within a shard.
D. Data records are composed of a sequence number, a partition key, and a data blob, which is an immutable sequence of bytes. This answer is correct. A data record contains a sequence number, which is unique per partition key within its shard. The partition key is used to determine which shard the record is stored in. The data blob is an immutable sequence of bytes containing the actual data.
E. Data streams are composed of a sequence number, a partition key, and a data blob, which is an immutable sequence of bytes. This answer is incorrect. Streams are not composed of a sequence number, partition key, and data blob. Instead, streams are a logical grouping of shards, and shards contain the actual data records.
F. Data shards are composed of a sequence number, a partition key, and a data blob, which is an immutable sequence of bytes. This answer is incorrect. Shards contain the actual data records, but they do not have a sequence number, partition key, and data blob. Instead, the data records within a shard have these attributes.
G. A data stream's retention period is set to a default of 24 hours after creation. This answer is correct. By default, data records in a Kinesis stream are retained for 24 hours after they are added to the stream.
H. A data record's retention period is set to a default of 24 hours after creation. This answer is incorrect. The retention period applies to the entire stream and not to individual data records within the stream.
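The difference between options G and H can be illustrated with a minimal sketch in which the retention setting lives on the stream and every record is checked against it. The class `StreamRetention` and its methods are hypothetical and model the rule only; they do not call AWS. The 24-hour default and the 168-hour ceiling are taken from the text above, and newer AWS documentation may list a higher maximum.

```python
from datetime import datetime, timedelta, timezone

DEFAULT_RETENTION_HOURS = 24   # default quoted in the text
MAX_RETENTION_HOURS = 168      # ceiling quoted in the text; may be higher today


class StreamRetention:
    """Retention is a property of the stream, not of a single record."""

    def __init__(self) -> None:
        self.retention_hours = DEFAULT_RETENTION_HOURS

    def increase(self, hours: int) -> None:
        """Raise the stream-wide retention period, within the quoted ceiling."""
        if not (self.retention_hours < hours <= MAX_RETENTION_HOURS):
            raise ValueError("new retention must be larger and within the limit")
        self.retention_hours = hours

    def is_accessible(self, added_at: datetime, now: datetime) -> bool:
        """A record is readable until the stream's retention period elapses."""
        return now - added_at < timedelta(hours=self.retention_hours)


if __name__ == "__main__":
    stream = StreamRetention()
    added = datetime(2024, 1, 1, tzinfo=timezone.utc)
    print(stream.is_accessible(added, added + timedelta(hours=25)))
    stream.increase(MAX_RETENTION_HOURS)
    print(stream.is_accessible(added, added + timedelta(hours=25)))
```

Because the setting is stream-wide, raising it changes the accessibility of every record at once. In a real deployment the change would be made through the Kinesis `IncreaseStreamRetentionPeriod` operation rather than a local object.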
In summary, Kinesis Streams is a powerful tool for processing streaming data at scale. It is important to understand the basic concepts and components, including streams, shards, and data records. When designing Kinesis Streams, it is also important to consider the default retention period of 24 hours for data records in the stream.
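As a compact recap of how streams, shards, and data records relate, the following is a toy in-memory model, not the AWS SDK. The classes `DataRecord`, `Shard`, and `Stream` are hypothetical. The routing step reflects the documented behavior that Kinesis maps a partition key to a shard through an MD5 hash onto a 128-bit key space; the even split of that space and the simple integer counter used for sequence numbers are simplifying assumptions.

```python
import hashlib
from dataclasses import dataclass, field
from typing import List

HASH_SPACE = 2 ** 128  # MD5 produces a 128-bit integer


@dataclass(frozen=True)
class DataRecord:
    sequence_number: int   # assigned by the stream, not by the producer
    partition_key: str
    data: bytes            # immutable blob; never inspected or changed


@dataclass
class Shard:
    shard_id: str
    start_hash: int
    end_hash: int
    records: List[DataRecord] = field(default_factory=list)


class Stream:
    """A stream is a set of shards; each shard holds a sequence of records."""

    def __init__(self, name: str, shard_count: int) -> None:
        if shard_count < 1:
            raise ValueError("a stream needs at least one shard")
        self.name = name
        step = HASH_SPACE // shard_count
        self.shards = []
        for i in range(shard_count):
            last = i == shard_count - 1
            end = HASH_SPACE - 1 if last else (i + 1) * step - 1
            self.shards.append(Shard(f"shardId-{i:012d}", i * step, end))
        self._next_sequence = 0  # simplification: one counter per stream

    def shard_for(self, partition_key: str) -> Shard:
        """Hash the partition key and find the shard owning that hash."""
        digest = hashlib.md5(partition_key.encode("utf-8")).digest()
        key_hash = int.from_bytes(digest, "big")
        for shard in self.shards:
            if shard.start_hash <= key_hash <= shard.end_hash:
                return shard
        raise AssertionError("shard hash ranges cover the whole key space")

    def put_record(self, partition_key: str, data: bytes) -> DataRecord:
        shard = self.shard_for(partition_key)
        record = DataRecord(self._next_sequence, partition_key, bytes(data))
        self._next_sequence += 1
        shard.records.append(record)
        return record
```

Records that share a partition key always land in the same shard, which is why ordering is preserved per key rather than across the whole stream.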