Homework 2
Use the PubMed dataset and download at least five years of scientific articles.
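One possible way to get yearly numbers without downloading every article in full is to ask the NCBI E-utilities ESearch endpoint how many PubMed records match a term in a given publication year. The sketch below is only a starting point under some assumptions: it treats "frequency" as the number of articles whose title or abstract contains the term, and the helper names (`build_count_url`, `fetch_count`, `yearly_counts`) are made up for illustration. If you are asked to count occurrences inside each article, you will need to download the abstracts instead.

```python
import json
import time
import urllib.parse
import urllib.request

ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_count_url(term, year):
    """Build an ESearch URL that matches `term` in titles/abstracts for one year."""
    params = {
        "db": "pubmed",
        "term": f'"{term}"[Title/Abstract]',  # quotes keep multi-word terms together
        "datetype": "pdat",                   # filter on publication date
        "mindate": str(year),
        "maxdate": str(year),
        "retmax": "0",                        # we only need the total, not the IDs
        "retmode": "json",
    }
    return ESEARCH + "?" + urllib.parse.urlencode(params)


def fetch_count(term, year):
    """Ask PubMed how many records match (requires network access)."""
    with urllib.request.urlopen(build_count_url(term, year), timeout=30) as resp:
        payload = json.load(resp)
    return int(payload["esearchresult"]["count"])


def yearly_counts(terms, years, fetch=fetch_count, pause=0.4):
    """Return {term: [count for each year]}, pausing between requests
    to stay under NCBI's limit for clients without an API key."""
    counts = {}
    for term in terms:
        counts[term] = []
        for year in years:
            counts[term].append(fetch(term, year))
            time.sleep(pause)
    return counts
```

For example, `yearly_counts(["influenza", "obesity", "cancer", "Covid-19"], range(2016, 2021))` would give one list of five counts per term, which is the shape most plotting libraries expect.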
1. For each of the following terms: influenza, obesity, cancer, Covid-19, create an area chart and compare their frequencies in the articles on a yearly basis (2016, 2017, 2018, 2019, 2020).
2. Please use a dumbbell chart to report the changes in the following keywords between 2019 and 2020: influenza, Covid-19, depression, mental health, physical activity, wearable.
3. Get Covid-19 statistics from the city of Boston:
https://www.bphc.org/whatwedo/infectious-diseases/Infectious-Diseases-A-to-Z/covid-19/Pages/Boston-COVID-19-Data.aspx
or Massachusetts: https://www.mass.gov/doc/covid-19-cases-in-massachusetts-as-of-april-13-2020/download
It is OK to enter the data into your code manually.
Next, design a choropleth map that visualizes the number of confirmed cases or the percentage of positive tests on a map of Boston or MA.
– A good resource for learning how to draw maps: http://bcb.dfci.harvard.edu/~aedin/courses/R/CDC/maps.html
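For tasks 1 and 2, here is a minimal sketch of how the two charts could be drawn with matplotlib, assuming you already have per-year counts for each term in a dictionary. The function names are illustrative only, and a stacked area chart is just one reading of "area chart"; overlapping areas would also be acceptable.

```python
def dumbbell_rows(before, after):
    """Pair the two years' counts per keyword, sorted by change (largest first).
    Keywords missing from either year are skipped."""
    rows = [(k, before[k], after[k]) for k in before if k in after]
    return sorted(rows, key=lambda r: r[2] - r[1], reverse=True)


def plot_area(years, counts, ax):
    """Stacked area chart: one band per term, x axis = year."""
    years = list(years)
    terms = list(counts)
    ax.stackplot(years, [counts[t] for t in terms], labels=terms, alpha=0.8)
    ax.set_xticks(years)
    ax.set_xlabel("Year")
    ax.set_ylabel("Articles mentioning term")
    ax.legend(loc="upper left")


def plot_dumbbell(rows, ax, labels=("2019", "2020")):
    """Dumbbell chart: one horizontal line per keyword joining its two values."""
    ys = list(range(len(rows)))
    ax.hlines(ys, [r[1] for r in rows], [r[2] for r in rows], color="0.6", zorder=1)
    ax.scatter([r[1] for r in rows], ys, label=labels[0], zorder=2)
    ax.scatter([r[2] for r in rows], ys, label=labels[1], zorder=2)
    ax.set_yticks(ys)
    ax.set_yticklabels([r[0] for r in rows])
    ax.invert_yaxis()  # put the first row (largest change) at the top
    ax.set_xlabel("Articles mentioning keyword")
    ax.legend()


def make_figure(years, counts, before, after, path="homework2_charts.png"):
    """Draw both charts side by side and save them to `path` (a placeholder name)."""
    import matplotlib.pyplot as plt  # third-party; imported here so the helpers above need nothing extra

    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(12, 4))
    plot_area(years, counts, ax1)
    plot_dumbbell(dumbbell_rows(before, after), ax2)
    fig.tight_layout()
    fig.savefig(path)
    return fig
```

Here `counts` maps each term to a list with one value per year, and `before` and `after` map each keyword to its 2019 and 2020 value. Because Covid-19 barely appears before 2020, you may want to try a logarithmic x axis on the dumbbell chart and see which version reads better.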
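For task 3, the following is a rough sketch of a choropleth in Python using Plotly Express, in case you prefer Python over the R approach in the link. It assumes you have entered the test numbers by hand as `(positive, total)` pairs per area, and that you have found a GeoJSON boundary file for Boston neighborhoods or MA counties on your own. The file name and the `"Name"` property key below are placeholders, so check what your boundary file actually uses.

```python
import json


def load_geojson(path):
    """Read a GeoJSON boundary file from disk."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)


def percent_positive(tested):
    """Map area name -> percentage of positive tests.
    `tested` maps area name -> (positive tests, total tests);
    areas with zero tests are left out."""
    return {
        name: 100.0 * pos / total
        for name, (pos, total) in tested.items()
        if total > 0
    }


def unmatched_areas(values, geojson, name_key="Name"):
    """Area names in your data that have no polygon in the boundary file.
    Useful for catching spelling mismatches before plotting."""
    known = {feat["properties"][name_key] for feat in geojson["features"]}
    return sorted(set(values) - known)


def draw_choropleth(values, geojson, name_key="Name", title="Percent positive tests"):
    """Shade each area by its value and return the Plotly figure."""
    import plotly.express as px  # third-party; imported lazily

    names = list(values)
    fig = px.choropleth(
        {"area": names, "value": [values[n] for n in names]},
        geojson=geojson,
        locations="area",
        featureidkey=f"properties.{name_key}",  # links data rows to polygons
        color="value",
        color_continuous_scale="Reds",
        title=title,
    )
    # Zoom to the plotted polygons and hide the default world base map.
    fig.update_geos(fitbounds="locations", visible=False)
    return fig
```

A typical use would be `geo = load_geojson("boston_neighborhoods.geojson")`, then checking that `unmatched_areas(values, geo)` is empty, then calling `draw_choropleth(values, geo).show()`. To map confirmed cases instead of percentages, pass a dictionary of case counts as `values` and change the title.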
You need to prepare a report on your tasks and findings. You can copy and paste your code, its results, and your descriptions into a Word document or a Python notebook, or you can use an R notebook.
The deadline for submitting this homework is in 6 days. Please feel free to ask questions, and prepare your work for presentation at the next session.