Which solution will meet these requirements MOST cost-effectively?
Configure Amazon Athena to read the encrypted files. Run SQL queries on the data directly in Amazon S3.
Use Amazon S3 Select to run SQL queries on the data directly in Amazon S3.
Configure Amazon Redshift to read the encrypted files. Use Redshift Spectrum and Redshift query editor v2 to run SQL queries on the data directly in Amazon S3.
Configure Amazon EMR Serverless to read the encrypted files. Use Apache SparkSQL to run SQL queries on the data directly in Amazon S3.
Explanations:
Amazon Athena can read data directly from encrypted Parquet files in S3 and is a cost-effective solution for occasional SQL queries, as it only charges per query and doesn’t require additional infrastructure.
Amazon S3 Select is not designed for complex SQL queries on large datasets; it is primarily used for filtering and retrieving subsets of S3 objects rather than performing full SQL operations.
Amazon Redshift and Redshift Spectrum are more costly and suitable for frequent or complex analytics workloads. This is not the most cost-effective choice for occasional querying.
Amazon EMR Serverless is generally costlier and more complex than Athena for occasional queries, as it involves additional overhead in configuring and managing Spark jobs.