Each homework will consist of two parts: the first part will contain questions and answers drawn from the book or other resources, and the second part will contain programming assignments pertinent to the data mining algorithm under study. All homework must be submitted as an archive that contains only the following files (if the first part is submitted in any file format other than the one listed in (1) below, a penalty of 0.25 points will be levied; if the second part is submitted in any file format other than those listed in (2-i) or (2-ii) below, a penalty of 0.25 points will likewise be levied):
1. A PDF file (and only a PDF file; no other file format will be accepted) corresponding to the first part, either processed through LaTeX (preferred), prepared in a word processor of your choice, or neatly written by hand;
2. The programming assignments can be handed in using one of the following formats:
i. A .Rmd file (R Markdown file) that contains markdown with embedded R code chunks that can be loaded and executed. The .Rmd file should be accompanied by the HTML notebook file that is produced by processing it. (If your homework file is saved as firstname-lastname.Rmd, processing it will produce firstname-lastname.nb.html.) Both files should be submitted, and each question in the assignment should be clearly marked.
ii. If you use Jupyter notebooks, you can install an R kernel in the notebook by following the instructions at https://www.datacamp.com/community/blog/jupyter-notebook-r. You will need to save the notebook as a .html file (not a .ipynb file) and the corresponding code in an .R file. Then submit both files. As before, each question in the assignment should be clearly marked. (If you submit only one of the files, a penalty of 0.25 points will be levied.)
iii. Naming your files: The .Rmd or .R files must be named using the firstname-lastname.Rmd or firstname-lastname.R format, where firstname and lastname are the first and last names of the student as registered.
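
Before uploading, it may help to check the contents of your archive against the rules above. The following is a minimal sketch in Python, not an official grading tool: the function name check_submission and its messages are hypothetical, and it assumes you pass it the list of file names in your archive (files only, no directory entries). It only checks file extensions; it does not check the firstname-lastname naming rule.

```python
from pathlib import PurePosixPath


def check_submission(filenames):
    """Return a list of problems found in a list of archive file names.

    Hypothetical helper: checks only the file extensions described in
    items 1, 2-i and 2-ii of the submission rules.
    """
    # Ignore any folder prefix and keep just the file name.
    names = [PurePosixPath(f).name for f in filenames]

    def with_suffix(suffix):
        # Case-insensitive match, so ".Rmd" and ".rmd" are treated alike.
        return [n for n in names if n.lower().endswith(suffix)]

    pdfs = with_suffix(".pdf")
    rmds = with_suffix(".rmd")
    nb_htmls = with_suffix(".nb.html")
    # A plain .html export is any .html file that is not an .nb.html notebook.
    htmls = [n for n in with_suffix(".html") if n not in nb_htmls]
    r_scripts = with_suffix(".r")

    allowed = set(pdfs + rmds + nb_htmls + htmls + r_scripts)
    problems = [f"unexpected file: {n}" for n in names if n not in allowed]

    if not pdfs:
        problems.append("missing PDF for the first part")

    option_i = bool(rmds and nb_htmls)     # .Rmd plus its .nb.html
    option_ii = bool(htmls and r_scripts)  # .html export plus .R code
    if not (option_i or option_ii):
        problems.append("need .Rmd + .nb.html, or .html + .R, for the second part")

    return problems


if __name__ == "__main__":
    # Example with placeholder names; an empty list means no problems found.
    print(check_submission(["jane-doe.pdf", "jane-doe.Rmd", "jane-doe.nb.html"]))
```

An empty result means that the file types match one of the accepted combinations; any message in the result points to a file that would incur a penalty under the rules above.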