BIF524/CSC463 Data Mining Project – Phase 3

Phase III: (10%) due Wednesday, Dec. 7, 11:59 pm.
• Use the dataset that you picked in Phase 2, or choose a new dataset; in that case, discuss your choice with me first. (1%)
• N.B. Your dataset should not be associated with any existing work related to the required tasks (e.g., on Kaggle, GitHub, …).
• Apply tree-based approaches, including decision trees, random forest, bagging, and boosting. (4%)
• Apply unsupervised techniques, including k-means and hierarchical clustering, as well as principal component analysis. Analyze and comment on your results. (6%)
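As a rough illustration of the two technical tasks above, the following R sketch fits a single decision tree and a hand-rolled bagged ensemble of trees, then runs k-means, hierarchical clustering, and PCA. It is a minimal sketch under assumptions that are not part of the assignment: the built-in iris data is only a placeholder for the project dataset, the 70/30 split, the 25 bootstrap trees, and k = 3 are arbitrary choices, and all variable names are hypothetical. Random forest and boosting are left out because they need add-on packages (for example randomForest or gbm).

```r
# Sketch only: iris is a placeholder dataset (assumption, not the project data).
library(rpart)  # recommended package shipped with standard R installations

set.seed(1)
data(iris)
n <- nrow(iris)
train_idx <- sample(n, size = round(0.7 * n))  # arbitrary 70/30 split
train <- iris[train_idx, ]
test  <- iris[-train_idx, ]

# --- Tree-based: a single classification tree ---
tree_fit  <- rpart(Species ~ ., data = train, method = "class")
tree_pred <- predict(tree_fit, newdata = test, type = "class")
tree_acc  <- mean(tree_pred == test$Species)

# --- Tree-based: bagging by hand (bootstrap resamples + majority vote) ---
n_trees <- 25  # arbitrary ensemble size
votes <- sapply(seq_len(n_trees), function(b) {
  boot <- train[sample(nrow(train), replace = TRUE), ]
  fit  <- rpart(Species ~ ., data = boot, method = "class")
  as.character(predict(fit, newdata = test, type = "class"))
})
# Each row of 'votes' holds the 25 predictions for one test observation.
bag_pred <- apply(votes, 1, function(v) names(which.max(table(v))))
bag_acc  <- mean(bag_pred == as.character(test$Species))

# --- Unsupervised: standardize the numeric features first ---
X <- scale(iris[, 1:4])

# k-means with several random starts to reduce sensitivity to initialization
km <- kmeans(X, centers = 3, nstart = 20)

# Hierarchical clustering with complete linkage, cut into 3 groups
hc <- hclust(dist(X), method = "complete")
hc_clusters <- cutree(hc, k = 3)

# PCA and the proportion of variance explained by each component
pca <- prcomp(X)
var_explained <- pca$sdev^2 / sum(pca$sdev^2)

# Quantities one might report and comment on
print(c(tree = tree_acc, bagging = bag_acc))
print(table(kmeans = km$cluster, species = iris$Species))
print(table(hclust = hc_clusters, species = iris$Species))
print(round(var_explained, 3))
```

The cross-tabulations compare cluster assignments against known labels, which is only possible here because the placeholder data happens to be labeled; with unlabeled data, the comparison would rest on within-cluster variation, the dendrogram, and the PCA scores instead.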
For each phase, make sure to highlight the following in your R Markdown PDF file:
• Dataset description, including context and features
• Data mining tasks
• Model performance
• Results
• Comparison of results
• Comments and interpretation
Name your R Markdown PDF file following this template: NameOfTeamMember1-NameOfTeamMember2_Phase PhaseNumber.