You have been asked to select the storage system for the click-data of your company's large portfolio of websites.
This data is streamed in from a custom website analytics package at a typical rate of 6,000 clicks per minute.
With bursts of up to 8,500 clicks per second.
It must have been stored for future analysis by your data science and user experience teams.
Which storage infrastructure should you choose?
Click on the arrows to vote for the correct answer
A. B. C. D.B.
Google Cloud Bigtable is a scalable, fully-managed NoSQL wide-column database that is suitable for both real-time access and analytics workloads.
Good for: -> Low-latency read/write access -> High-throughput analytics -> Native time series support Common workloads: -> IoT, finance, adtech -> Personalization, recommendations -> Monitoring -> Geospatial datasets -> Graphs Incorrect Answers: C: Google Cloud Storage is a scalable, fully-managed, highly reliable, and cost-efficient object / blob store.
Is good for: -> Images, pictures, and videos -> Objects and blobs -> Unstructured data D: Google Cloud Datastore is a scalable, fully-managed NoSQL document database for your web and mobile applications.
Is good for: -> Semi-structured application data -> Hierarchical data -> Durable key-value data -> Common workloads: -> User profiles -> Product catalogs -> Game state Reference: https://cloud.google.com/storage-options/
Given the scenario, we need to select a storage system for click-data that can handle high traffic and large volumes of data. It should also be able to store the data for future analysis.
A. Google Cloud SQL: Google Cloud SQL is a fully managed relational database service that uses MySQL or PostgreSQL. It is suitable for storing structured data and handling moderate traffic. However, it may not be the best option for storing click-data that requires high write rates and can quickly grow to large volumes. Additionally, it is not designed to handle unstructured data, which could be a requirement for click-data. Therefore, Google Cloud SQL is not the best option for this scenario.
B. Google Cloud Bigtable: Google Cloud Bigtable is a scalable, fully managed NoSQL database service that can handle large volumes of structured and unstructured data. It is designed to handle high write rates and is ideal for storing click-data. With its ability to handle up to 8,500 clicks per second, it is the most suitable option for this scenario.
C. Google Cloud Storage: Google Cloud Storage is a scalable object storage service that is designed for unstructured data. It can handle high volumes of data and can be used to store click-data. However, it may not be the best option for real-time data ingestion and analysis. Google Cloud Storage is best suited for storing data that does not require immediate access.
D. Google Cloud Datastore: Google Cloud Datastore is a NoSQL document database service that is designed for storing structured data. It is suitable for handling moderate traffic and can handle data volumes of up to a few terabytes. However, it may not be the best option for storing click-data, which requires high write rates and can quickly grow to large volumes.
In summary, the most suitable storage infrastructure for storing click-data in this scenario is Google Cloud Bigtable.