Project 1 and 2

These projects are from the Module 1 and Module 2 on data exploration and statistical inference. The projects are about analysis of COVID-19 infection data. The jupyter notebook script of this project can be found here and the full report on the methodology can be found here.

What we did in this project?

  • Extract, clean and combine valuable information of time series on up-to-date Covid cases reports, together with government stringency indicators and socioeconomic indicators from most countries around the world.

  • Visualize the current situation related to the pandemic dynamics worldwide.

  • Find correlation in the stringency indicators, socioeconomic indicator and the pandemic dynamics.

  • Identify patterns in countries with the best response against the pandemic outbreak.

Data Sources

  1. Covid epidemiology time series: this data was collected directed from the git repository of COVID-19 Dashboard application by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University [1].

  2. Covid government response indicator: this dataset was collected from the git repository of the Oxford COVID-19 Government Response Tracker, Blavatnik School of Government [2].

  3. Socioeconomic indicator: this data was collected via API protocol communication with the World Bank Open database [3].

How to reproduce this work?

You can run this project remotely via google colab or locally in your personal computer. If you chose to run it locally, just download and run the jupyter notebook in your machine. You may need to install some external dependencies (libraries) as requested.