Analysis and Prediction of the Novel Coronavirus (COVID-19) Epidemic Based on an XGBoost Model

SUN Xuke

(Basic Department, Armed Police Officer School, Hangzhou, Zhejiang 311400, China)

Abstract: To predict the spread of the novel coronavirus (COVID-19) more accurately, this paper proposes an intelligent estimation method for COVID-19. Matplotlib is first used to visualize and analyze COVID-19 data and to extract features; XGBoost is then used to build the estimation model, which is applied to COVID-19 data for the whole country, Hubei Province, and four other provinces. Experimental results show that the method outperforms four other regression algorithms (linear regression, random forest, SVM, and KNN) on three metrics: mean absolute error, root mean square percentage error, and maximum estimation error, indicating high estimation accuracy and good generalization ability.

Keywords: novel coronavirus; epidemic situation; feature extraction; model construction; XGBoost algorithm
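
The three error metrics named in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, the exact formulas (in particular, taking the root mean square percentage error as the square root of the mean squared relative error, and the maximum estimation error as the largest absolute error), and the sample numbers are all assumptions made here for clarity.

```python
import math

def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between observed and estimated values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def root_mean_square_percentage_error(y_true, y_pred):
    """Square root of the mean squared relative error.

    Assumed definition; requires every observed value to be non-zero.
    """
    squared = [((t - p) / t) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))

def max_estimation_error(y_true, y_pred):
    """Largest absolute difference over all samples (assumed definition)."""
    return max(abs(t - p) for t, p in zip(y_true, y_pred))

# Hypothetical values for illustration only, not data from the paper.
observed = [100.0, 200.0, 400.0]
estimated = [110.0, 190.0, 400.0]

mae = mean_absolute_error(observed, estimated)
rmspe = root_mean_square_percentage_error(observed, estimated)
max_err = max_estimation_error(observed, estimated)
```

Under these assumed definitions, lower values on all three metrics indicate a better regressor, which is how a comparison against linear regression, random forest, SVM, and KNN could be scored on a common test set.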


