Description
Phase III: (10%) due Wednesday, Dec. 7, 11:59pm.
Use the dataset that you picked in Phase 2 or choose a new dataset – discuss your choice
with me in that case. (1%)
N.B. Your dataset should not be associated with any existing work related to the
required tasks – e.g., on kaggle, Github, …
Apply tree-based approaches including decision trees, random forest, bagging, and boosting.
(4%)
Apply unsupervised techniques including k-means and hierarchical clustering, as well as
principal component analysis. Analyze and comment on your results. (6%)
For each phase, make sure to highlight the following in your R markdown pdf file:
Dataset description including context and features
Data mining tasks
Model performance
Results
Comparison of results
Comments and interpretation
Name of your R markdown pdf file following this template: NameOfTeamMember1-
NameOfTeamMember2_Phase PhaseNumber.

