Which solution will meet these requirements MOST cost-effectively?
Use Amazon Elastic Inference on the SageMaker hosted endpoint.
Retrain the CNN with more layers and a larger dataset.
Retrain the CNN with more layers and a smaller dataset.
Choose a SageMaker instance type that has multiple GPUs.
Explanations:
Amazon Elastic Inference lets you attach a low-cost, GPU-powered inference accelerator to a SageMaker hosted endpoint. This increases throughput and reduces latency without paying for a full GPU instance (see the deployment sketch after the explanations).
Retraining the CNN with more layers and a larger dataset may improve accuracy, but it increases the model's computational requirements, which raises cost and is likely to worsen inference latency rather than improve it.
Retraining with more layers and a smaller dataset is unlikely to achieve the desired improvements in throughput or latency and may degrade model performance.
Choosing a SageMaker instance type with multiple GPUs would increase throughput and reduce latency, but it is the most expensive option; Elastic Inference provides the needed acceleration at a fraction of the cost, making it the more cost-effective choice.
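A minimal sketch of attaching an Elastic Inference accelerator at deployment time using the SageMaker Python SDK, assuming a trained TensorFlow CNN artifact already stored in S3; the bucket path, framework version, and the specific instance and accelerator types shown here are illustrative, not prescribed by the question.

```python
import sagemaker
from sagemaker.tensorflow import TensorFlowModel

session = sagemaker.Session()
role = sagemaker.get_execution_role()  # assumes a SageMaker execution role is available

# Placeholder S3 path for the trained CNN model artifact.
model = TensorFlowModel(
    model_data="s3://my-bucket/cnn/model.tar.gz",
    role=role,
    framework_version="2.3",
    sagemaker_session=session,
)

# Deploy on an inexpensive CPU instance and attach an Elastic Inference
# accelerator, rather than paying for a full GPU instance.
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.c5.large",       # low-cost CPU instance
    accelerator_type="ml.eia2.medium", # Elastic Inference accelerator
)
```

Setting accelerator_type on deploy() is what attaches the accelerator to the endpoint; the host instance can then stay small and cheap while the accelerator handles the heavy inference compute.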