How to do it:
Open the R Markdown file of this assignment (link) in RStudio.
Right under each question, insert a code chunk (hotkey: Ctrl + Alt + I) and code the solution for the question.
Knit the R Markdown file (hotkey: Ctrl + Shift + K) to export an HTML file.
Publish the HTML file to your GitHub Pages site.
Submission: Submit the GitHub link of the assignment to Blackboard.
Use the PimaIndiansDiabetes dataset. Use 15% of the data for testing. Use 7-fold cross-validation to tune a random forest (method='ranger'). Which parameters produce the greatest accuracy? What is the testing accuracy?
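One possible starting point for this question is sketched below. It assumes the caret, mlbench, ranger and e1071 packages are installed and that the dataset is the one shipped with mlbench. The seed and the tuning grid values are illustrative choices, not prescribed by the assignment.

```r
# Sketch only: the seed and the grid values are assumptions.
library(caret)
library(mlbench)

data(PimaIndiansDiabetes)
df <- PimaIndiansDiabetes

set.seed(2020)  # arbitrary seed, for reproducibility only

# Hold out 15% of the rows for testing (stratified on the target)
train_idx <- createDataPartition(df$diabetes, p = 0.85, list = FALSE)
df_train <- df[train_idx, ]
df_test  <- df[-train_idx, ]

# 7-fold cross-validation
ctrl <- trainControl(method = "cv", number = 7)

# caret tunes three parameters for method = "ranger"
grid <- expand.grid(mtry = 2:6,
                    splitrule = c("gini", "extratrees"),
                    min.node.size = c(1, 5, 10))

rf_fit <- train(diabetes ~ ., data = df_train,
                method = "ranger",
                trControl = ctrl,
                tuneGrid = grid)

# Parameter combination with the highest cross-validated accuracy
print(rf_fit$bestTune)

# Accuracy on the held-out 15%
rf_pred <- predict(rf_fit, df_test)
rf_test_acc <- mean(rf_pred == df_test$diabetes)
print(rf_test_acc)
```

A wider or finer grid may give a different best combination. The reported values also depend on the seed.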
Use the PimaIndiansDiabetes dataset. Go to https://topepo.github.io/caret/available-models.html and pick a classification model. Tune the classification model using 7-fold cross-validation.
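As an illustration, the sketch below tunes k-nearest neighbors (method = "knn" on the caret model list). The model choice, the seed, the centering and scaling step, and the grid of k values are all assumptions made for the example. Any other classification model from the list follows the same pattern with its own tuning parameters. The caret, mlbench and e1071 packages are assumed to be installed.

```r
# Sketch only: model, seed and grid are illustrative assumptions.
library(caret)
library(mlbench)

data(PimaIndiansDiabetes)

set.seed(2020)  # arbitrary seed, for reproducibility only

# 7-fold cross-validation
ctrl <- trainControl(method = "cv", number = 7)

# kNN has a single tuning parameter, k; predictors are centered and
# scaled inside each resample because kNN is distance-based
knn_fit <- train(diabetes ~ ., data = PimaIndiansDiabetes,
                 method = "knn",
                 preProcess = c("center", "scale"),
                 trControl = ctrl,
                 tuneGrid = expand.grid(k = seq(3, 25, by = 2)))

# Cross-validated accuracy for every k, then the best k
print(knn_fit$results[, c("k", "Accuracy")])
print(knn_fit$bestTune)
```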
(Model Comparison) Use the PimaIndiansDiabetes dataset. Pick two models at this link to compare using 7-fold cross-validation. Evaluate the accuracy of the final model on the test data.
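A comparison along these lines could be sketched as follows. The two models (logistic regression via method = "glm" and a decision tree via method = "rpart"), the 15% test split, and the seed are assumptions made for the example. The caret, mlbench, rpart and e1071 packages are assumed to be installed. Both models are trained on the same seven folds so that their resampled accuracies are directly comparable.

```r
# Sketch only: model choices, split size and seed are assumptions.
library(caret)
library(mlbench)

data(PimaIndiansDiabetes)
df <- PimaIndiansDiabetes

set.seed(2020)  # arbitrary seed, for reproducibility only

# Hold out 15% of the rows as test data
train_idx <- createDataPartition(df$diabetes, p = 0.85, list = FALSE)
df_train <- df[train_idx, ]
df_test  <- df[-train_idx, ]

# Fix the 7 folds once so both models see identical resamples
folds <- createFolds(df_train$diabetes, k = 7, returnTrain = TRUE)
ctrl  <- trainControl(method = "cv", number = 7, index = folds)

glm_fit  <- train(diabetes ~ ., data = df_train,
                  method = "glm", trControl = ctrl)
tree_fit <- train(diabetes ~ ., data = df_train,
                  method = "rpart", trControl = ctrl, tuneLength = 5)

models <- list(glm = glm_fit, rpart = tree_fit)

# Side-by-side summary of the cross-validated metrics
cv_results <- resamples(models)
print(summary(cv_results))

# Keep the model with the higher best cross-validated accuracy
cv_acc      <- sapply(models, function(m) max(m$results$Accuracy))
best_name   <- names(which.max(cv_acc))
final_model <- models[[best_name]]

# Accuracy of the chosen model on the held-out test data
final_test_acc <- mean(predict(final_model, df_test) == df_test$diabetes)
print(best_name)
print(final_test_acc)
```

The test set is used only once, after the final model has been chosen from the cross-validation results.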