Which approach should the ML specialist use discover the relationship between the two new categorical features?
Create a scatterplot for day_of_week and binned_age.
Create crosstabs for day_of_week and binned_age.
Create word clouds for day_of_week and binned_age.
Create a boxplot for day_of_week and binned_age.
Explanations:
Scatterplots are used to visualize the relationship between two continuous variables, not categorical variables. In this case, bothdayofweekandbinned_ageare categorical, making a scatterplot unsuitable.
Creating crosstabs (or contingency tables) is an effective way to examine the relationship between two categorical variables. This approach allows the ML specialist to see the frequency distribution of product usage across different days of the week and age groups, enabling insights into usage patterns.
Word clouds are typically used to visualize text data, highlighting the frequency of words. They are not appropriate for analyzing categorical relationships between two categorical variables likedayofweekandbinned_age.
Boxplots are used to display the distribution of continuous data and are not suitable for comparing two categorical variables. Since both features are categorical, a boxplot would not provide meaningful insights in this context.