AIM OF THE ASSIGNMENT
To provide a deeper understanding of appropriate methodological approaches to processing and analysing noisy data, and to encourage an appreciation of the challenges involved in data analysis.
LEARNING OUTCOMES
Understand the fundamentals of Python well enough to use various big data technologies; understand how classical statistical techniques are applied in modern data analysis; understand the potential applications of data analysis tools to various problems and appreciate their limitations; understand the challenges and complexity of data analysis.
THE BRIEF
Provide a brief report on the analysis of an open data set. Example data sets are available from the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets.html) or Kaggle (https://www.kaggle.com/datasets). There are some restrictions on the data set that can be selected (see below). You can focus your report on one aspect of the data set or on multiple aspects; the main objective is to find some interesting questions or problems to answer.
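As a starting point, a first look at a candidate data set might resemble the minimal sketch below. It is an illustration only, not a required method: the inline CSV text, its column names and the function name `summarise` are all hypothetical stand-ins for whichever data set you choose. The sketch uses only the Python standard library to count missing values per column and compute a mean and median for columns that parse as numbers.

```python
import csv
import io
import statistics

# Hypothetical toy data standing in for a downloaded CSV file;
# the column names and values are invented for illustration only.
SAMPLE_CSV = """age,income,city
34,52000,Leeds
,48000,York
29,,Leeds
41,61000,
"""


def summarise(csv_text):
    """Return per-column missing counts and, for numeric columns, mean and median."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    summary = {}
    for column in reader.fieldnames:
        values = [row[column] for row in rows]
        # Treat empty strings (and absent fields) as missing values.
        present = [v for v in values if v not in ("", None)]
        info = {"missing": len(values) - len(present)}
        try:
            numbers = [float(v) for v in present]
        except ValueError:
            numbers = None  # column is not numeric, so skip the statistics
        if numbers:
            info["mean"] = statistics.mean(numbers)
            info["median"] = statistics.median(numbers)
        summary[column] = info
    return summary


if __name__ == "__main__":
    for name, info in summarise(SAMPLE_CSV).items():
        print(name, info)
```

A summary of this kind can help you judge how noisy a data set is before you commit to it, and may suggest questions worth pursuing in the report.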