How Data Analytics and Big Data Can Help Scientists in Managing COVID-19 Diffusion: Modeling Study to Predict the COVID-19 Diffusion in Italy and the Lombardy Region

J Med Internet Res. 2020 Oct 14;22(10):e21081. doi: 10.2196/21081.

Abstract

Background: COVID-19 is the most widely discussed topic worldwide in 2020, and at the beginning of the Italian epidemic, scientists tried to understand the virus diffusion and the epidemic curve of positive cases with controversial findings and numbers.

Objective: In this paper, a data analytics study on the diffusion of COVID-19 in Italy and the Lombardy Region is developed to define a predictive model tailored to forecast the evolution of the diffusion over time.

Methods: Starting with all available official data collected worldwide about the diffusion of COVID-19, we defined a predictive model at the beginning of March 2020 for the Italian country.

Results: This paper aims at showing how this predictive model was able to forecast the behavior of the COVID-19 diffusion and how it predicted the total number of positive cases in Italy over time. The predictive model forecasted, for the Italian country, the end of the COVID-19 first wave by the beginning of June.

Conclusions: This paper shows that big data and data analytics can help medical experts and epidemiologists in promptly designing accurate and generalized models to predict the different COVID-19 evolutionary phases in other countries and regions, and for second and third possible epidemic waves.

Keywords: COVID-19; Italy; SARS-CoV-2; big data; data analytics; diffusion; modeling; prediction; predictive models.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Betacoronavirus*
  • Big Data*
  • COVID-19
  • Computer Simulation
  • Coronavirus Infections / epidemiology*
  • Coronavirus Infections / transmission
  • Data Science
  • Humans
  • Italy / epidemiology
  • Pandemics
  • Pneumonia, Viral / epidemiology*
  • Pneumonia, Viral / transmission
  • SARS-CoV-2