Part A Task 1.1
1.1 Aggregation
(2 marks)
• 2 marks: an accurate data frame.
• -1 mark: the summary includes data from outside the 2020 boundary.
• -0.5 mark per incorrect attribute summary.
• 0 mark: 3 or more variables are incorrectly summarised or not attempted.
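As an illustration of the two things this criterion checks (the 2020 boundary and per-attribute summaries), here is a minimal sketch using only the standard library. The column names `location`, `date`, `new_cases`, and `new_deaths`, the sample rows, and the choice of a sum for every attribute are all assumptions made for the example; the task specification defines the real attributes and how each must be summarised.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample data; the real dataset's columns may differ.
SAMPLE = """location,date,new_cases,new_deaths
Atlantis,2019-12-31,5,0
Atlantis,2020-01-15,10,1
Atlantis,2020-01-20,20,2
Atlantis,2020-02-01,7,0
Borduria,2020-01-03,4,1
Borduria,2021-01-01,99,9
"""

def aggregate_2020_monthly(rows):
    """Sum new_cases and new_deaths per (location, month), for 2020 only."""
    totals = defaultdict(lambda: {"new_cases": 0.0, "new_deaths": 0.0})
    for row in rows:
        year, month, _day = row["date"].split("-")
        if year != "2020":
            continue  # rows outside 2020 must not leak into the summary
        key = (row["location"], int(month))
        for col in ("new_cases", "new_deaths"):
            totals[key][col] += float(row[col] or 0)  # treat blanks as 0
    return dict(sorted(totals.items()))

result = aggregate_2020_monthly(csv.DictReader(io.StringIO(SAMPLE)))
for (location, month), sums in result.items():
    print(location, month, sums["new_cases"], sums["new_deaths"])
```

Filtering on the year before grouping is what keeps the 2019 and 2021 rows out of the monthly totals, which is the error the -1 mark deduction targets.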
Part A Task 1.2
1.2 Add a new variable
(1 mark)
• 1 mark: correct output as specified.
• -0.5 mark: the order of the output is incorrect.
• 0 mark: not attempted, or incorrect values for the new variable.
Part A Task 1 – Overall
Task 1
Overall
• Marks awarded as per the breakdown above. The output must be a valid CSV file and must also be printed to standard output.
• 0 mark for Task 1: either the CSV file or the standard output is missing, or the command ‘python parta1.py owid-covid-data-2020-monthly.csv’ generates errors or no output.
Part A Task 2.1
Task 2.1
Confirmed cases in linear scale
(1 mark)
• 1 mark: an accurate plot, with appropriate title, axis labels, ticks, and tick labels.
• -0.5 mark: correct variables used but not plotted along the specified axis.
• 0 mark: incorrect variable pairs used or incorrect plot produced.
Part A Task 2.2
Task 2.2
Confirmed cases in log scale
(1 mark)
• 1 mark: an accurate plot, with appropriate title, axis labels, ticks, and tick labels.
• -0.5 mark: correct variables used but not plotted along the specified axis.
• 0 mark: incorrect variable pairs used, incorrect scales used, or incorrect plot produced.
Part A Task 2 – Overall
Task 2
Overall
• Marks awarded as per the breakdown above.
• -0.5 mark: some issues with either of the plots, e.g. title, labels, etc.
• 0 mark for Task 2: the scatter-a.png or scatter-b.png file is missing from the output, or the command ‘python parta2.py scatter-a.png scatter-b.png’ generates errors or no output.
Part A Task 3.1
Task 3.1
Preprocessing reporting
(1.5 marks)
• 1.5 marks: excellent, clear, and succinct explanations of the pre-processing, with good insight into the limitations of the data.
• 1 mark: good explanations of the pre-processing steps and reasonable limitations included, and/or the wording is at times ambiguous.
• 0.5 mark: some obvious omissions of pre-processing steps, the data source, or limitations of the data.
• 0 mark: no explanations, or explanations are incoherent and difficult to understand.
Part A Task 3.2
Task 3.2
Visual analysis
(1.5 marks)
• 1.5 marks: plots included and excellent, clear, and succinct explanations of the plots and patterns.
• 1 mark: good explanations of the plots and reasonable patterns described, but some clear patterns may be omitted or the explanations are less clear and at times ambiguous.
• 0.5 mark: reasonable descriptions of the plots and the patterns but may contain some incorrect interpretations or missing key components.
• 0 mark: no explanations, or explanations are incoherent and difficult to understand.
Part A Task 3.3
Task 3.3
Contrasting discussion (1 mark)
• 1 mark: clear, concise, and meaningful discussions.
• 0.5 mark: the discussion makes sense but may be vague, contain irrelevant information, or miss key differences between the two scatter plots.
• 0 mark: no discussion, or the discussion is incoherent and difficult to understand.
Part A Task 3 – Overall
Task 3
Overall
• Marks awarded as per the breakdown above.
• -1 mark: the two plots are not included in the report.
Part B Task 1
Regular Expressions
(1 mark)
• 1 mark: All of the document IDs are correctly identified and the file formatted correctly
• 0.5 marks: More than 75% of document IDs are correctly identified or there are issues with the file format
• 0 marks: The output is missing or completely incorrect
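To make the idea of "correctly identified" document IDs concrete, the following is a minimal sketch of regex-based extraction. The ID shape used here (four uppercase letters, a hyphen, three digits, and an optional trailing uppercase letter) is purely an assumption for illustration, as are the names `DOC_ID` and `find_document_ids`; the task specification defines the actual format.

```python
import re

# Assumed ID shape for illustration only: four uppercase letters, a hyphen,
# three digits, and an optional trailing uppercase letter (e.g. "ABCD-123X").
DOC_ID = re.compile(r"\b[A-Z]{4}-\d{3}[A-Z]?\b")

def find_document_ids(text):
    """Return every substring of text that matches the assumed ID shape."""
    return DOC_ID.findall(text)

sample = "Report ABCD-123X supersedes WXYZ-042; ignore AB-12 and abcd-999."
ids = find_document_ids(sample)
print(ids)  # ['ABCD-123X', 'WXYZ-042']
```

The word boundaries (`\b`) keep the pattern from matching inside longer tokens, which is a common source of partially correct results.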
Part B Task 2
Preprocessing (1 mark)
• 1 mark: Both case folding and non-alphabetic character removal are performed correctly
• 0.5 marks: One of case folding or non-alphabetic character removal is performed correctly
• 0 marks: The output is missing or neither case folding nor character removal is performed correctly
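The two steps named in this criterion can be sketched as follows. This is one possible reading, not the reference solution: it assumes non-alphabetic characters are replaced by a space (rather than deleted) and that runs of whitespace are then collapsed; the function name `preprocess` is hypothetical.

```python
import re

def preprocess(text):
    """Remove non-alphabetic characters, then case-fold to lowercase."""
    # Step 1: replace anything that is not an ASCII letter or whitespace.
    # Replacing with a space (an assumption) avoids gluing words together.
    text = re.sub(r"[^A-Za-z\s]", " ", text)
    # Collapse the runs of whitespace left behind by step 1.
    text = re.sub(r"\s+", " ", text).strip()
    # Step 2: case folding.
    return text.lower()

cleaned = preprocess("COVID-19: 3 New Cases!\nSee   Table 2.")
print(cleaned)  # covid new cases see table
```

Performing only one of the two steps corresponds to the 0.5-mark band above.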
Part B Task 3
Basic Search
(2 marks)
• 2 marks: The output passes all of the provided test cases
• 1 mark: The output passes at least half of the provided test cases
• 0 marks: The output is missing or fails all of the provided test cases
Part B Task 4
Advanced Search
(2 marks)
• 2 marks: The output passes all of the provided test cases
• 1 mark: The output passes at least half of the provided test cases
• 0 marks: The output is missing or fails all of the provided test cases
Part B Task 5
Search Rankings
(3 marks)
• 3 marks: The output passes all of the provided test cases
• 2 marks: The output passes two of the provided test cases
• 1 mark: The output passes one of the provided test cases
• 0 marks: The output is missing or fails all of the provided test cases
Part C
Git
(2 marks)
• 2 marks: All necessary files have been pushed to the remote git repository, with a sensible readme file and commit comments
• 1 mark: All necessary files have been pushed to the remote git repository
• 0 marks: The git repository does not contain the correct version of all necessary files