Which AWS service or feature should the developer use to meet these requirements with the LEAST amount of operational overhead?

By: study aws cloud

Tagged: Cloud Practitioner

A company has been storing monthly reports in an Amazon S3 bucket. The company exports the report data into comma-separated values (.csv) files. A developer wants to write a simple query that can read all of these files and generate a summary report.

Which AWS service or feature should the developer use to meet these requirements with the LEAST amount of operational overhead?

Amazon S3 Select

Amazon Athena

Amazon Redshift

Amazon EC2

Explanations:

Amazon S3 Select can run a SQL expression against data stored in S3, but each request operates on a single object and returns a subset of that object's data. It cannot aggregate across multiple CSV files in one query, so the developer would have to query each file separately and combine the results.
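
To make the per-object limitation concrete, here is a minimal sketch of the request parameters S3 Select would need for several report files. The bucket name, object keys, column names, and helper functions are hypothetical; the dictionary keys are the parameters that boto3's `select_object_content` accepts. Nothing here calls AWS, it only builds the requests.

```python
# Sketch only: builds S3 Select request parameters, one per CSV object.
# Bucket, keys, and column names are illustrative assumptions.

def build_select_request(bucket, key, expression):
    """Return keyword arguments for one S3 Select call on one object."""
    return {
        "Bucket": bucket,
        "Key": key,  # S3 Select targets exactly one object per request
        "ExpressionType": "SQL",
        "Expression": expression,
        "InputSerialization": {"CSV": {"FileHeaderInfo": "USE"}},
        "OutputSerialization": {"CSV": {}},
    }


def build_requests_for_reports(bucket, keys, expression):
    """One request per report file; combining the results is left to the caller."""
    return [build_select_request(bucket, key, expression) for key in keys]


report_keys = [
    "reports/2024-01.csv",
    "reports/2024-02.csv",
    "reports/2024-03.csv",
]
requests = build_requests_for_reports(
    "example-bucket",
    report_keys,
    "SELECT s.region, s.amount FROM S3Object s",
)
```

Each dictionary would be passed as `s3.select_object_content(**request)`, so three files mean three calls plus client-side code to merge the partial results into one summary.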

Amazon Athena is a serverless query service that runs standard SQL directly against data stored in S3. A single query can read all of the CSV files under a bucket prefix and generate the summary report, with no infrastructure to provision or manage, making it the most suitable option for this scenario.
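
As a rough illustration of what this looks like in practice, the sketch below builds an external table definition over a folder of CSV files and a summary query, then submits SQL through boto3. The database, table, bucket, and column names (`region`, `amount`) are assumptions made for the example, and the helper functions are hypothetical, not part of any AWS SDK. `start_query_execution` is the boto3 Athena call used to submit a query.

```python
# Sketch only: names below are illustrative assumptions, not from the question.

def build_table_ddl(database, table, s3_location, columns):
    """DDL for an external table that covers every CSV file under s3_location."""
    cols = ",\n  ".join(f"`{name}` {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{database}`.`{table}` (\n  {cols}\n)\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        f"LOCATION '{s3_location}'\n"
        "TBLPROPERTIES ('skip.header.line.count'='1')"
    )


def build_summary_query(database, table, group_col, sum_col):
    """One SQL statement that aggregates across all files behind the table."""
    return (
        f"SELECT {group_col}, SUM({sum_col}) AS total_{sum_col} "
        f"FROM {database}.{table} "
        f"GROUP BY {group_col} ORDER BY {group_col}"
    )


def run_query(sql, database, output_location):
    """Submit the SQL to Athena and return the query execution ID."""
    import boto3  # third-party; imported here so the builders run without it

    athena = boto3.client("athena")
    response = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    return response["QueryExecutionId"]


ddl = build_table_ddl(
    "reports_db",
    "monthly_reports",
    "s3://example-bucket/reports/",
    [("region", "string"), ("amount", "double")],
)
summary_sql = build_summary_query("reports_db", "monthly_reports", "region", "amount")
```

With valid credentials, `run_query(ddl, "reports_db", "s3://example-bucket/athena-results/")` followed by the same call with `summary_sql` would create the table and run the summary. The table points at a prefix rather than a file, which is why new monthly reports are picked up without any extra work.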

Amazon Redshift is a data warehousing service that typically requires setting up and managing a cluster, which introduces more operational overhead than Athena. It is not ideal for ad hoc querying of CSV files in S3, because the data must first be loaded or otherwise integrated into the warehouse.

Amazon EC2 involves setting up and managing virtual servers, which requires significant operational overhead for running queries on CSV files in S3. This is not the best choice for simply querying data stored in S3.

1 Comment

Author

I have a feeling that the answer is:
Amazon Athena

Olivia

5 months ago
