You will find attached a database that you will need to analyze. Please review and provide us with answers for the four point below:
Be as specific as you can. Do not hesitate to contact us three for any questions you might have during your work.
• Using Python analyze the data and give the Exploratory data Analysis.
• Preprocess the item description and categorize the data into similar groups (for example under label “tube”) Items with similar measurements should be grouped together.
• Complete the data with missing values or replace the wrong quantity and price values with information found in the item description. Split data into training (70%) and test (30%) and apply a Machine Learning model that predicts labels.
• Make a short report of the steps used as well as your findings including model performance. Provide the findings in a Jupyter notebook.