Which combination of steps will meet these requirements?
(Choose three.)
Use an object detection algorithm to train a model to identify corrosion areas of a photo.
Use Amazon Rekognition with label detection on the photos.
Use a k-means clustering algorithm to train a model to classify the severity of corrosion in a photo.
Use an XGBoost algorithm to train a model to classify the severity of corrosion in a photo.
Perform image augmentation on photos that contain corrosion.
Perform image augmentation on photos that do not contain corrosion.
Explanations:
Using an object detection algorithm is appropriate as it can be trained to identify specific features, such as corrosion, in images. This is crucial for pinpointing the areas that require attention, especially in complex images where corrosion might not be immediately visible.
Amazon Rekognition is a general-purpose image analysis service that may not specifically be optimized for detecting corrosion in industrial settings. While it can label objects, it is not tailored for nuanced tasks like identifying varying degrees of corrosion or classifying maintenance needs.
K-means clustering is an unsupervised learning algorithm primarily used for grouping similar data points. It is not suitable for classification tasks like determining the severity of corrosion, which requires labeled data and specific categorization rather than clustering.
XGBoost is a powerful supervised learning algorithm suitable for classification tasks. It can handle complex patterns and relationships in the data, making it effective for classifying the severity of corrosion based on features extracted from the images.
Performing image augmentation on photos that contain corrosion helps to increase the dataset’s diversity and size, which is crucial for training an effective machine learning model. This can enhance the model’s ability to generalize and improve detection accuracy.
Augmenting images that do not contain corrosion may lead to an imbalance in the dataset and could confuse the model during training. Since corrosion is rare (0.1%), focusing on enhancing the positive class (corrosion images) is more beneficial than augmenting the negative class (non-corrosion images).