CS代考 1 Introduction

1 Introduction
1. Give a definition of ¡°Pattern Recognition¡±.
Pattern Recognition is concerned with creating algorithms that can assign names to observations. Or,
Pattern Recognition is concerned with making decisions based on data.

Copyright By PowCoder代写 加微信 powcoder

2. Which of the following problems is a suitable application for pattern recognition?
a. Classifying numbers into primes and non-primes.
No. Can be more easily and more accurately perfomed analytically.
b. Detecting potential fraud in credit card charges.
c. Determining where a cannon ball is likely to land given information about elevation and direction of the barrel, wind direction, etc.
No. Better to do it analytically (e.g. using Newton¡¯s equations).
d. Identifying the species a newly discovered ¡°bug¡± belongs to.
Depends! Some species are defined by simple rules (female mammals produce milk, etc.), so given the right data might easily simply write down the solution. Given other data, like images of different bugs, we would need to perform pattern recognition.
3. Give brief definitions of the following terms: a. Exemplar
a particular datapoint which is represented by a feature vector (also called an item, sample, instance,…).
b. Dataset
the collection of feature vectors for all exemplars.
c. Generalization
how well a model performs on new data.
d. Overfitting
making the model so specific to the training data that it fails to generalise to new data.
e. Decision Theory
methods for making decisions that reduce cost rather than misclassification rate.
f. Feature Space
the (multidimensional) space defined by the feature vectors in the dataset.

g. Linearly Separable
exemplars from two classes can be separated by a hyperplane in feature space.
h. Dichotomizer
a classifier that places exemplars in one of two classes (also called a binary classifier).
i. Hyper-Parameter
a value used by the learning algorithm in its search for the optimal parameters of the classifier.
j. Grid search
a method of trying to find suitable hyper-parameters that searches all possible combinations of values within defined ranges.
k. Training data
the collection of feature vectors used by the learning algorithm to tune the parameters of the classifier.
l. Test data
the collection of feature vectors used by to evaluate the performance of the trained classifier (this dataset should be distinct from the training data to ensure generalisation).
4. What types of learning best describe the following three scenarios, in which a coin classification system is being created for a vending machine.
a. Measurements of a large number of coins are taken. The algorithm finds that these measurements fall in several ¡°bins¡±. It finds decision boundaries that separate these bins and uses these to classify new coins.
unsupervised learning (clustering)
b. The measurements of each coin is presented to the classifier which makes a decision. The classifier changes its decision boundaries based on whether this decision was correct of incorrect.
reinforcement learning
c. Measurements of a large number of coins are taken. The algorithm uses this data, together with known class labels, to infer decision boundaries which it then uses to classify new coins.
supervised learning (classification)
5. Briefly explain the following types of learning method: a. Classification
A method that learns to predict a class label associated with each exemplar.
b. Regression
A method that learns to predict a continuous value for each exemplar.
c. Semi-supervised

A method that learns using both labelled and unlabelled training exemplars.
d. Transfer
A method that pre-trains a classifier on another task before training it on the main task in the hope that the pre-training will help improve performance on the main task.
Class Features 1(5.3,4)
2 (2.3,9.1) 1 (4,3)
2 (1.8,5.5)
Class Features
15 15.5 1(5,4)
Dataset 2 Dataset 3
Class Features Class Features
2 2.3 2 (2,9) 14 1 4 1 (4,3) 17 1 7 1 (7,4) 21 2 1.8 2 (1,5)
Identify which of the above datasets are: a. univariate-continuous
b. multivariate-discrete
c. multivariate-continuous
d. univariate-discrete
7. A classifier is designed to determine if a feature vector is or is not in a certain class. The output produced by the classifier is 1 if the sample is predicted to be in the class, and 0 otherwise. The following table shows the feature vectors for the samples in the test set, along with the class labels predicted by the classifier and the true class labels of each sample.
Features Predicted Class (5.3,4) 1
(2.3,9.1) 0
(7,4.1) 1 (1.8,5.5) 0 (6,3.1) 1 (4.5,3.5) 0
True Class 1
Draw a confusion matrix for this data.
true label
predicted label
true false true 3 2 false 1 1

8. For the results given in the previous question calculate the following performance metrics: a. the error-rate
=FP+FN =1+2=3 TP+TN+FP+FN 3+1+1+2 7
b. the accuracy
=TP+TN =3+1=4 TP+TN+FP+FN 3+1+1+2 7
c. the recall
=TP=3=3 TP+FN 3+2 5
d. the precision
=TP=3=3 TP+FP 3+1 4
e. the f1-score
=2¡ÁTP =2¡Á3=6=2
2¡ÁTP+FP+FN 2¡Á3+1+2 9 3
alternatively:f -score=
2¡Árecall¡Áprecision 1 recall+precision
3 + 3 12 + 15 27
2¡Á 3 ¡Á 3
= 5 4 = 20 = =
9. The following table shows exemplars from two classes. Sketch the feature space, and suggest a suitable location for a decision boundary that will minimise the number of mis-classifications. If the cost of erroneously choosing class 1 is higher than the cost of erroneously choosing class two, how will this effect the location of the decision boundary?
Class Features 1 (3,4)
2468 2468 2468

程序代写 CS代考 加微信: powcoder QQ: 1823890830 Email: powcoder@163.com