Which solution will meet these requirements with the LEAST operational overhead?
Create an Amazon Kinesis data stream to store the data in Amazon S3. Create an Amazon Kinesis Data Analytics application to analyze the data. Invoke an AWS Lambda function to send the data to the Kinesis Data Analytics application.
Create an Amazon Kinesis data stream to store the data in Amazon S3. Create an Amazon EMR cluster to analyze the data. Invoke an AWS Lambda function to send the data to the EMR cluster.
Create an Amazon Kinesis Data Firehose delivery stream to store the data in Amazon S3. Create an Amazon EMR cluster to analyze the data.
Create an Amazon Kinesis Data Firehose delivery stream to store the data in Amazon S3. Create an Amazon Kinesis Data Analytics application to analyze the data.
Explanations:
The first option pairs a Kinesis data stream with Kinesis Data Analytics and an AWS Lambda function. It allows real-time analysis, but a Kinesis data stream does not write to Amazon S3 on its own, so a consumer has to be built and maintained, and the Lambda function is one more component to operate. Encrypting the data and storing it in Apache Parquet format would also require additional steps.
The second option uses a Kinesis data stream for ingestion and an Amazon EMR cluster for analysis. EMR can process the data, but the cluster must be provisioned, sized, and managed, which means more operational overhead than the managed alternatives. Storing the data in S3 in encrypted form would still require a separate consumer, because the data stream does not deliver to S3 by itself.
The third option uses Kinesis Data Firehose to deliver the data to S3 and an EMR cluster to analyze it. Firehose can encrypt the data and convert it to Parquet during delivery, so the storage requirement is covered. The weakness is the analysis side: running an EMR cluster adds provisioning and management overhead that a managed analytics service avoids.
The fourth option uses Kinesis Data Firehose, which delivers the data to S3, can convert records to Apache Parquet, and supports encryption, all without servers to manage. Kinesis Data Analytics then handles the near-real-time analysis as a managed service. Together they meet the requirements with the least operational overhead.
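To make the Firehose side of that last explanation concrete, the sketch below builds the S3 destination settings that turn on encryption and Parquet conversion for a delivery stream. It is an illustration under stated assumptions, not a reference deployment. The function names, stream name, and all ARNs are hypothetical, and the Glue database and table are assumed to exist already, since Firehose reads the Parquet schema from the AWS Glue Data Catalog. The dictionary keys follow the `ExtendedS3DestinationConfiguration` parameter of the Firehose `create_delivery_stream` call.

```python
def build_parquet_s3_destination(bucket_arn, role_arn, kms_key_arn,
                                 glue_database, glue_table, region):
    """Return an ExtendedS3DestinationConfiguration dict (all inputs are placeholders)."""
    return {
        "RoleARN": role_arn,
        "BucketARN": bucket_arn,
        # Format conversion needs a fairly large buffer; 128 MB is assumed here.
        "BufferingHints": {"SizeInMBs": 128, "IntervalInSeconds": 60},
        # Encrypt delivered objects with a KMS key.
        "EncryptionConfiguration": {
            "KMSEncryptionConfig": {"AWSKMSKeyARN": kms_key_arn}
        },
        # Convert incoming JSON records to Parquet before writing to S3.
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            "SchemaConfiguration": {
                "RoleARN": role_arn,
                "DatabaseName": glue_database,
                "TableName": glue_table,
                "Region": region,
                "VersionId": "LATEST",
            },
            "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
            "OutputFormatConfiguration": {"Serializer": {"ParquetSerDe": {}}},
        },
    }


def create_parquet_delivery_stream(firehose_client, stream_name, destination):
    """Create the delivery stream; firehose_client is e.g. boto3.client('firehose')."""
    return firehose_client.create_delivery_stream(
        DeliveryStreamName=stream_name,
        DeliveryStreamType="DirectPut",
        ExtendedS3DestinationConfiguration=destination,
    )
```

With boto3 installed and credentials configured, the client would come from `boto3.client("firehose")`. Keeping the configuration in a plain function means it can be inspected or unit-tested without touching AWS.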