School of Computing and Information Systems, The University of Melbourne
COMP90049 Introduction to Machine Learning (Semester 2, 2022)
1. How is holdout evaluation different to cross-validation evaluation? What are some reasons we would prefer one strategy over the other?
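As an illustration of the two strategies named in Question 1, the following is a minimal sketch, not a prescribed implementation. The function names `holdout_split` and `k_fold_splits`, the default test fraction, and the fixed random seed are all assumptions made for this example.

```python
import random

def holdout_split(n, test_fraction=0.2, seed=0):
    """Partition indices 0..n-1 once into a training set and a test set."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    n_test = max(1, round(n * test_fraction))
    return idx[n_test:], idx[:n_test]

def k_fold_splits(n, k=5, seed=0):
    """Yield k (train, test) pairs; each index is in a test set exactly once."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k roughly equal-sized folds
    for i in range(k):
        test = folds[i]
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, test
```

Holdout trains and evaluates a model once, whereas the k-fold generator implies k training runs whose scores would typically be averaged.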
2. A confusion matrix is a summary of the performance of a (supervised) classifier over a set of development ("test") data, by counting the various instances:
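[Confusion matrix, partially recovered: the class labels and one column of counts are missing. The surviving counts, one row per line, under the column heading "Classified":]
Classified
2  3  1
5  3  1
3  7  1
0  3  5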
(i). Calculate the classification accuracy of the system. Find the error rate for the system.
(ii). Calculate the precision, recall and F-score (where β = 1) for class d.
(iii). Why can't we do this for the whole system? How can we consider the whole system?
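The quantities asked for in Question 2 can be sketched in code as follows. This is a hedged example: it assumes rows are actual classes and columns are classified (predicted) classes, and the 3-class matrix `HYPOTHETICAL_CM` is made up for illustration and is not the matrix from the question. Macro-averaging is shown as one common way to summarise per-class scores over the whole system.

```python
def accuracy(cm):
    """Fraction of instances on the diagonal (correctly classified)."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total

def precision_recall_f(cm, c, beta=1.0):
    """Precision, recall and F-beta for class index c (rows = actual)."""
    tp = cm[c][c]
    fp = sum(cm[r][c] for r in range(len(cm))) - tp  # classified c, actually other
    fn = sum(cm[c]) - tp                             # actually c, classified other
    p = tp / (tp + fp) if tp + fp else 0.0
    r = tp / (tp + fn) if tp + fn else 0.0
    f = (1 + beta**2) * p * r / (beta**2 * p + r) if p + r else 0.0
    return p, r, f

def macro_average(cm):
    """Unweighted mean of per-class precision and recall."""
    scores = [precision_recall_f(cm, c) for c in range(len(cm))]
    k = len(scores)
    return sum(s[0] for s in scores) / k, sum(s[1] for s in scores) / k

# Hypothetical 3-class matrix, invented for this example only.
HYPOTHETICAL_CM = [
    [5, 1, 0],
    [2, 3, 1],
    [0, 1, 7],
]

error_rate = 1 - accuracy(HYPOTHETICAL_CM)
```

Precision and recall are defined relative to one class of interest, which is why a whole-system figure needs some form of averaging over classes.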
3. For the following dataset:

ID  Outl  Temp  Humi  Wind  PLAY
TRAINING INSTANCES
A   s     h     h     F     N
B   s     h     h     T     N
C   o     h     h     F     Y
D   r     m     h     F     Y
E   r     c     n     F     Y
F   r     c     n     T     N
TEST INSTANCES
G   o     c     n     T     ?
(i). Classify the test instances using the method of 0-R.
(ii). Classify the test instances using the method of 1-R.
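The two baseline methods in Question 3 can be sketched as below, using the training instances from the dataset. The helper names (`majority`, `zero_r`, `one_r_rules`, `one_r_errors`) are hypothetical, and the alphabetical tie-break in `majority` is an assumption of this sketch. On a dataset this small, ties between classes or between attributes can occur, so the tie-breaking rule may decide the prediction.

```python
from collections import Counter, defaultdict

ATTRS = ["Outl", "Temp", "Humi", "Wind"]

# Training instances A-F: (Outl, Temp, Humi, Wind, PLAY)
TRAIN = [
    ("s", "h", "h", "F", "N"),
    ("s", "h", "h", "T", "N"),
    ("o", "h", "h", "F", "Y"),
    ("r", "m", "h", "F", "Y"),
    ("r", "c", "n", "F", "Y"),
    ("r", "c", "n", "T", "N"),
]

def majority(labels):
    """Most frequent label; ties broken alphabetically (an assumption)."""
    counts = Counter(labels)
    return min(counts, key=lambda c: (-counts[c], c))

def zero_r(train):
    """0-R: always predict the majority class of the training data."""
    return majority(row[-1] for row in train)

def one_r_rules(train, attr_index):
    """1-R rules for one attribute: value -> majority class, plus error count."""
    by_value = defaultdict(list)
    for row in train:
        by_value[row[attr_index]].append(row[-1])
    rules = {v: majority(labels) for v, labels in by_value.items()}
    errors = sum(
        sum(1 for label in labels if label != rules[v])
        for v, labels in by_value.items()
    )
    return rules, errors

def one_r_errors(train):
    """Training errors made by the 1-R rule set of each attribute."""
    return {name: one_r_rules(train, i)[1] for i, name in enumerate(ATTRS)}
```

1-R then selects the attribute with the fewest training errors and applies its rules to each test instance.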
4. Given the above dataset, we wish to perform feature selection, where the class is PLAY:
(i). Which of Humi and Wind has the greatest Pointwise Mutual Information for the class Y? What about N?
(ii). Which of the attributes has the greatest Mutual Information for the class, as a whole?
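The two measures in Question 4 can be computed with a sketch like the following. It assumes base-2 logarithms, probabilities estimated by relative frequency over the training instances, and that each attribute value is treated as an event for PMI (one possible reading of the question). The function names `pmi` and `mutual_information` are hypothetical.

```python
from collections import Counter
from math import log2

ATTRS = {"Outl": 0, "Temp": 1, "Humi": 2, "Wind": 3}

# Training instances A-F: (Outl, Temp, Humi, Wind, PLAY)
TRAIN = [
    ("s", "h", "h", "F", "N"),
    ("s", "h", "h", "T", "N"),
    ("o", "h", "h", "F", "Y"),
    ("r", "m", "h", "F", "Y"),
    ("r", "c", "n", "F", "Y"),
    ("r", "c", "n", "T", "N"),
]

def pmi(train, attr, value, cls):
    """PMI(value, cls) = log2( P(value, cls) / (P(value) * P(cls)) )."""
    n = len(train)
    i = ATTRS[attr]
    p_joint = sum(1 for r in train if r[i] == value and r[-1] == cls) / n
    p_value = sum(1 for r in train if r[i] == value) / n
    p_class = sum(1 for r in train if r[-1] == cls) / n
    if p_joint == 0:
        return float("-inf")  # the pair never co-occurs in the training data
    return log2(p_joint / (p_value * p_class))

def mutual_information(train, attr):
    """MI(attr; class): PMI summed over observed (value, class) pairs, weighted by P(value, class)."""
    n = len(train)
    i = ATTRS[attr]
    joint = Counter((r[i], r[-1]) for r in train)
    values = Counter(r[i] for r in train)
    classes = Counter(r[-1] for r in train)
    mi = 0.0
    for (v, c), count in joint.items():  # unobserved pairs contribute 0
        p_vc = count / n
        mi += p_vc * log2(p_vc / ((values[v] / n) * (classes[c] / n)))
    return mi
```

For example, `mutual_information(TRAIN, "Wind")` scores the Wind attribute against PLAY, and calling it for each name in `ATTRS` allows the attributes to be ranked.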