PreprintPDF Available

Modeling and Forecast of COVID-19 Vaccinations Population Size in China: A Non-Clinical Study

Authors:
Preprints and early-stage research may not have been peer reviewed yet.

Abstract and Figures

BACKGROUND Since its outbreak in December 2019, severe acute respiratory syndrome coronavirus-2, the virus responsible for the COVID-19 pandemic, has considerably affected the worldwide population. Health authorities and the medical community identify vaccines as an effective tool for managing public health. METHODS In this study, the autoregressive integrated moving average (ARIMA) model built-in Python was adopted to establish the COVID-19 vaccination forecast model. In this study, the sample data were selected from the Our World in Data website. COVID-19 vaccinations administered daily in China from December 16, 2020 to March 21, 2021 were analyzed to establish an autoregressive integrated moving average (ARIMA) model. RESULTS The built-in ARIMA module function of Python was used, and the optimum model was ARIMA (3, 2, 3) according to the established time series analysis. The analysis showed that the predicted COVID-19 vaccination uptake supplemented well with the actual values with a small relative error. CONCLUSIONS This indicated that the ARIMA(3, 2, 3) model could be used to forecast the number of COVID-19 vaccinations in China.
Content may be subject to copyright.
Modeling and Forecast of COVID-19 Vaccinations
Population Size in China: A Non-Clinical Study
Wenbin Du ( duwb@swufe.edu.cn )
Southwestern University of Finance and Economics https://orcid.org/0000-0001-7604-3902
Fengrui Hua
Southwestern University of Finance and Economics
Shengyuan Xu
Tohoku University: Tohoku Daigaku
You Wu
Southwestern University of Finance and Economics
Research
Keywords: COVID-19 vaccine, time series analysis, Python language, autoregressive integrated moving
average (ARIMA) model, forecast
DOI: https://doi.org/10.21203/rs.3.rs-729132/v1
License: This work is licensed under a Creative Commons Attribution 4.0 International License. 
Read Full License
1
Modeling and Forecast of COVID-19 Vaccinations population size in China: a
non-clinical study
Wenbin Du1*, Fengrui Hua2, Shengyuan Xu3, You Wu4
1 Research Institute of Social Development, Southwestern University of Finance and
Economics, No. 555, Liutai Road, Wenjiang District, Chengdu 611130, China. Tel:
+86-13669357615; Fax: +86-28-87092257; Email: duwb@swufe.edu.cn
2 Research Institute of Social Development, Southwestern University of Finance and
Economics, No. 555, Liutai Road, Wenjiang District, Chengdu 611130, China. Tel:
+86-15034084982; Fax: +86-28-87092257; Email: huafengrui@smail.swufe.edu.cn
3 Graduate School of Information Sciences, Tohoku University, 6-3-09 Aoba,
Aramakiaza Aobaku, Sendai, 980-8579, Japan. Tel: +81-08033363888; Fax:
+81-22-795-5815; Email: kyos163@163.com
4 School of Accounting, Southwestern University of Finance and Economics, No. 555,
Liutai Road, Wenjiang District, Chengdu 611130, China. Tel: +86-13717647930; Fax:
+86-28-87092257; Email: waiola1999@sina.com
Corresponding author:
Wenbin Du, Research Institute of Social Development, Southwestern University of
Finance and Economics, No. 555, Liutai Road, Wenjiang District, Chengdu 611130,
China.
Tel: +86-13669357615;
Fax: +86-28-87092257;
Email: duwb@swufe.edu.cn
2
Abstract
BACKGROUND
Since its outbreak in December 2019, severe acute respiratory syndrome
coronavirus-2, the virus responsible for the COVID-19 pandemic, has considerably
affected the worldwide population. Health authorities and the medical community
identify vaccines as an effective tool for managing public health.
METHODS
In this study, the autoregressive integrated moving average (ARIMA) model built-in
Python was adopted to establish the COVID-19 vaccination forecast model. In this
study, the sample data were selected from the Our World in Data website. COVID-19
vaccinations administered daily in China from December 16, 2020 to March 21, 2021
were analyzed to establish an autoregressive integrated moving average (ARIMA)
model.
RESULTS
The built-in ARIMA module function of Python was used, and the optimum model
was ARIMA (3, 2, 3) according to the established time series analysis. The analysis
showed that the predicted COVID-19 vaccination uptake supplemented well with the
actual values with a small relative error.
CONCLUSIONS
This indicated that the ARIMA(3, 2, 3) model could be used to forecast the number of
COVID-19 vaccinations in China.
CONTRIBUTION
3
In this study, the ARIMA model has demonstrated high goodness of fit for forecasting
the COVID-19 vaccinations, which can be used in the short-term prediction of the
monitored COVID-19 vaccination data sequence, providing a reference for the
establishment of immune barrier and the supply of vaccines in China.
Keywords:
COVID-19 vaccine, time series analysis, Python language, autoregressive integrated
moving average (ARIMA) model, forecast
4
Background
The COVID-19 pandemic is causing an unprecedented effect on global health and
economics. When safe and efficient vaccines and treatments were unavailable,
nonpharmaceutical interventions were used to reduce transmission and the burden of
the pandemic. However, most of these measures involved huge economic costs.
Therefore, effective COVID-19 vaccines have been urgently needed to lower the
morbidity and mortality of COVID-19 (Li et al., 2020). The COVID-19 pneumonia
outbreak at the beginning of 2020 involved the first COVID-19 cases caused by the
novel β-coronavirus severe acute respiratory syndrome CoV-2 (SARS-CoV-2)
(Tregoning et al., 2020). The genetic information was disclosed on January 10, 2020,
54 days after the first case was reported. On March 13, 2020, 63 days after the
SARS-CoV-2 virus was sequenced, the news was released that the first dose of the
human vaccine was under testing. The Strategic Advisory Group of Experts on
Immunization of the World Health Organization has outlined the value framework for
COVID-19 vaccine allocation and priority, revealing the core principles of vaccine
distribution (World Health Organization, 2020). These guidelines require further
specifications and should be targeted to each country and their local conditions,
including the intensity of the pandemic, target of pandemic control, supply of
vaccines, and population eligible for vaccination. As the first epicenter country of
COVID-19 pneumonia, China has invested heavily in its vaccination, being the main
participator in the COVID-19 vaccine development for controlling the pandemic,
wherein its vaccines were provided by the government, vaccine makers, and
5
nongovernmental organizations (World Health Organization, 2020). On January 24,
2020, the Chinese Center for Disease Control and Prevention successfully separated
the first COVID-19 virus strain. On June 19, 2020, the first mRNA vaccine was
approved for a clinical test. As of February 25, 2021, there had been four types of
COVID-19 vaccines, three inactivated vaccines, and one adenovirus vector vaccine.
At 24 o’clock of March 20, 2021, the reported number of administered COVID-19
vaccinations reached 74.956 million in China (CCTV News, 2021). In this study, the
autoregressive integrated moving average (ARIMA) model built-in Python was
adopted to establish the COVID-19 vaccination forecast model. Time series analysis
and short-term forecast were performed for daily data of COVID-19 vaccinations
administered in China from December 16, 2020, to March 21, 2021. By forecasting
the number of COVID-19 vaccinations and capturing the variation rule of the time
series of vaccinations using the ARIMA model, the evolution rule of the time series
could be extended, thereby accurately predicting the short-term COVID-19 vaccine
demand and providing a reference for vaccine supply in China.
Data source and modeling principle
Data source
In this study, the sample data were selected from the Our World in Data website
https://ourworldindata.org/coronavirus-testing#china, established by the Oxford
University, on the basis of the decades-long data involving the human living standards
of various countries. The website provides the daily updated number of COVID-19
vaccinations and the complete COVID-19 dataset on its COVID-19 Explorer. The
6
data were collected by browsing the official public information, and specifically, the
data for China were sourced from the information published by the National Health
Commission of the People’s Republic of China. In this study, the daily vaccination
uptake in China from December 16, 2020, to March 21, 2021, updated by the Our
World in Data website, was used for modeling and the short-term forecast.
Method
ARIMA model
The ARIMA model is mostly used to analyze the nonstationary, nonseasonal time
series (Box, Jenkins & Reinsel, 2010), which is the most commonly used model for
fitting nonstationary series. It can be further divided into the autoregressive (AR),
moving average (MA), and autoregressive moving average (ARMA) models
(Kantelhardt, et al, 2002). They are explained below in detail:
AR model
An AR model with the following structure is called the -order AR model and
denoted as AR(). The mathematical formula is as follows:
= + +  +  + +  + (1)
Here, the value of the random variable at moment t is the multiple linear
regression of the first terms ,  , and  . The error term is the
current random interference , which is a zero-mean white noise sequence. When
= 0, the model is called the centralized AR(ρ) model.
MA model
7
A MA model with the following structure is called the qth-order AR model and
denoted as MA(q). The mathematical formula is as follows:
= +    (2)
Here, the value of the random variable at moment t is the multiple linear
function of the first q random interferences ,  ,  . The error term is the
current random interference , which is a zero-mean white noise sequence.
Specifically, when μ = 0, the model is called as the centralized MA(q) model.
ARMA model
An ARMA model with the following structure is denoted as ARMA(p, q) and its
mathematical formula is as follows:
= +  +  + +  +  
 (3)
The ARMA model is used for stationary time series, which is a combination of the
AR and MA models. It is supposed that is the observed time series, is the
white noise sequence, and is the mixed process of autoregression and moving
average. When the orders of AR and MA are p and q, respectively, the model is
denoted as ARMA(p, q).
ARIMA model
If the time series is nonstationary, its difference should be considered to transform
it into a stationary one, which is then followed by the forecast using the ARMA(p, q)
model. These processes constitute the ARIMA(p, d, q) model wherein p represents the
number of autoregressive terms, d the times of difference of the time series, and q the
8
number of moving average terms. The structure of the ARIMA(p, d, q) model is as
follows:
Φ(Β)= Θ(Β) (4)
Ε()= 0, ()=
, Ε()= 0, (5)
Ε= 0, < (6)
= (1 Β) (7)
Φ(Β)= 1 Β Β Β (8)
Θ(Β)= 1 Β Β (9)
ARIMA modeling steps
For time series analysis, the ARIMA model forecasts through the following four
steps: time series stationarity test, model recognition and order determination, model
validation, and model forecast. These specific processes are explained below:
Series establishment and stationarity test
The time series ARIMA model must be established on the basis of a stationary time
series as it cannot capture nonstationary ones. Therefore, the stationarity test is needed
at first. The typical methods of stationarity tests include the sequence diagram,
autocorrelation, and unit root tests. In the sequence diagram test, the mean and
variance curves of the time sequence data are plotted to determine if the time series is
stationary by observing its trend. In the autocorrelation test, the autocorrelation
function (ACF) figure is used. For nonstationary data, the ACF figure will approach 0
slowly or not at all. For the unit root test, the solved augmented Dickey-Fuller (ADF)
value will be used to determine whether the sequence data are significant in the given
9
confidence interval. If so, the sequence data are stationary. The ADF test assumes that
the time series has a unit root, i.e., the time series is nonstationary with periodic
fluctuations. If the test result indicates that the absolute value of T is lower than the
observation level of 1%, 5%, or 10% and the significance probability ρ value is
larger than 0.05, the sequence is deemed as nonstationary, and the original hypothesis
cannot be rejected. Similar to Eq. (10), the mean of the random series is expressed as
follows:
= =()
 (10)
where  represents the expectation of the random series at moment t.
If the time series is determined to be nonstationary, i.e., periodic or with a marked
trend, the difference of the original sequence data will be determined. The following
Eq. (11) gives the d difference equation:
=(1 Β)= (−1)

 (11)
Here, indicates the stationary sequence after difference; Β is the delay operator,
indicating that the times of time index difference is d for the series.
Model recognition and order determination
After d times of difference, the ACF and partial autocorrelation function (PACF)
tests on the stationary time series are commonly performed to obtain the number of
layer p and order q for ARIMA. By observing the truncation and trailing of the ACF
and PACF tests, the model is fitted. The fitting results are compared and adjusted
accordingly for the preliminary construction of one or more suitable ARIMA models.
If both the ACF and PACF tests of the time series are trailing, it can be determined
10
that the forecast model is ARMA(p, q). The values of p and q are investigated
gradually from the lower orders, and the smallest p and q are selected that can
minimize the values of the Akaike information criterion (AIC) and Bayesian
information criterion (BIC). Eq. (12) shows the mathematical expression of the ACF:
ρk = ( )/
 ( )
 (12)
where represents the value of the random series at moment t, and represents
the mean of the random series.
Eq. (13) displays the mathematical expression of the PACF:
 = 

 ,
1  ,
 (13)
where  = ,   , whereas indicates the ACF of the random
series.
Model validation
Before establishing the ARIMA model to forecast the number of administered
COVID-19 vaccinations, the reasonability of the model should be evaluated. The
purpose of model evaluation is to avoid p and q deviations determined by figure
reading and ensure the precision of the forthcoming vaccination forecast. The white
noise and normality tests are typically used to examine the residual sequence. If p
values for the fitting test statistics are all considerably larger than the significance
level of 0.05, the residual sequence can be observed as a white noise sequence. If the
fitted model cannot pass the test, the model will be reselected for fitting. In the
normality test, the obtained data are, however, assessed to see whether they satisfy the
normal distribution. The standardized residual Q-Q plot of the ARIMA model is
11
drawn to determine the reasonability of the established forecast model.
Model forecast
The main function of the time series analysis is to explain the autocorrelation of a
time series with mathematical models, thereby predicting the future variation rule of
the series. In this study, the daily data of the COVID-19 vaccinations administered in
China from December 16, 2020 to March 21, 2021 will be selected to find the
optimum forecast model and make five-day projections. By comparing the predicted
value with the actual data, the forecast performance will be assessed.
Result
Time series establishment and stationarity test
The data of the daily COVID-19 vaccinations administered in China from
December 16, 2020 to March 21, 2021 were first used to create the time series plot.
Figure 1(a) shows the date on the horizontal axis and the number of new COVID-19
vaccinations each day on the vertical axis. The vaccination data clearly showed
sustainable growth, which is typical of a nonstationary series.
[Figure 1 about here] Time series plots of the COVID-19 vaccination data in
China. (a) Time series plot of the original COVID-19 vaccination data, (b) Time
series plot of the COVID-19 vaccination data after first-order difference, (c) Time
series plot of the COVID-19 vaccination data after second-order difference
12
The original time series was nonstationary, which required the operation of the
difference to be stationary. It was found that after trial, the first- and second-order
differences could remove the trend of the original series, making it stationary. The
time series plots after the difference are shown in Figures 2(b) and 2(c). In addition,
after k=3, the autocorrelation and partial autocorrelation coefficients gradually
decreased and fell into the confidence interval, indicating that the current time series
turned stationary (Figure 2(c)).
[Figure 2 about here] Plots of autocorrelation and partial autocorrelation before
and after difference. (a) Plots of autocorrelation and partial autocorrelation for the
original series, (b) Plots of autocorrelation and partial autocorrelation after the
first-order difference, (c) Plots of autocorrelation and partial autocorrelation after the
second-order difference.
The unit root (ADF) method was then adopted for the stationarity test. As shown in
Table 1, the t statistic of the ADF test targeting the time series data of administered
COVID-19 vaccinations in China was 0.519; p was 0.985; and 1%, 5%, and 10%
thresholds were −3.503, −2.893, and −2.584, respectively. Moreover, p = 0.985 > 0.1,
so the original hypothesis could not be rejected, and the series was not stationery. The
first-order difference of the series was then performed, followed by the ADF test,
resulting in p = 0.845 > 0.1. Therefore, the original hypothesis still could not be
rejected, indicating a nonstationary series. The second-order difference was then
13
performed, followed by ADF test, resulting in p = 0.000 < 0.01, indicating 99% or
higher chance of rejecting the original hypothesis. At this moment, the series was
stationary. Therefore, the parameter d of the ARIMA model was 2.
[Table 1 about here] Table 1. ADF test results
Model recognition and order determination
According to the ACF and PACF plots of given in Figure 2(c), the
autocorrelation plot after the second-order difference showed a significantly nonzero
autocorrelation number of 2 or 3 with the trailing property, so q = 2 or 3. In addition,
the partial autocorrelation plot showed a PACF of 2 or 3; therefore, p = 2 or 3. From
the ADF test results, when the times of difference was 2, the series was stationary, so
d value was 2. The preliminary models were thus ARIMA(2, 2, 2), ARIMA(2, 2, 3),
ARIMA(3, 2, 2), and ARIMA(3, 2, 3). Parameter estimation and model validation
were then performed to further determine the optimum model. The AIC, BIC, and
Hannan-Quinn information criterion (HQIC) were used to select the best p and q
values. The equation of AIC and BIC is as follows:
(, )= ln
+ 2( + + 1)/ (14)
If (, )= min
, (, ), the order of the ARMA model is decided as (p,
q), and
is the maximum likelihood estimate of the corresponding sequence.
Therefore, when selecting the parameters for ARMA(p, q), AIC and BIC were
combined, and the relatively optimum model was searched according to the
minimization principle of AIC, BIC, and HQIC (Burnham & Anderson, 2004). From
the lowest order, the AIC, BIC, and HQIC values of the preliminary models were
14
calculated, and the AIC thermodynamic diagram was created so as to determine the
orders.
[Table 2 about here] Table 2. AIC, BIC, and HQIC values of the ARMA model
As indicated in Table 2, the parameters of p and q were selected, which
corresponded to the minimum AIC, BIC, and HQIC values. The research results
showed that ARMA(3, 3) model had the lowest AIC, BIC, and HQIC values, which
were 2316.015, 2336.361, and 2324.233, respectively. Similarly, from the
order-determining AIC thermodynamic diagram (Figure 3), we noted that the
optimum model occurred when (p, q) values were (3, 3). Therefore, when p = 3 and q
= 3, the AIC, BIC, and HQIC values reached the minima. Considering that the times
of difference was d = 2, ARIMA(3, 2, 3) was determined to be a better model for
forecasting the number of COVID-19 vaccinations, which can be expressed as
follows:
= 2829.663 + 0.374  0.217  + 0.388  0.504  +
0.585  0.944  (15)
[Figure 3 about here] AIC thermodynamic diagram.
Model validation
After model establishment, the residual sequence needs to be checked to determine
if it only contains white noise. If not, it indicates that there still exists useful
information in the residual, demanding further revision of the model until the residual
15
sequence is a white noise sequence. The validation process was as follows. First,
when the size of the fitting model was determined, the significance of the model was
assessed (α = 0.05). It was p = 0.14, much larger than 0.05. Therefore, the original
hypothesis was acknowledged, assuming that the time series was a white noise
sequence, i.e., a randomly generated series with no temporal correlation. Second, the
autocorrelation plot of the residual was created, and the pure randomness of the
residual was tested. The results of the residual autocorrelation test of ARIMA(3, 2, 3)
are shown in Figure 4(b), where the ACF is basically in the 95% confidence interval,
and the residual is white noise. Finally, the Q-Q plot of the ARIMA model was drawn
(Figure 4(c)). The points in the middle are almost overlapping with the straight line,
proving that the residual of the forecast model ARIMA(3, 2, 3) followed the normal
distribution, and thus, the established ARIMA(3, 2, 3) was reasonable.
[Figure 4 about here] Results of the ARIMA model validation. (a) Plot of residual
sequence, (b) Autocorrelation and partial autocorrelation plots of the residual
sequence, (c) Q-Q plot of the standardized residual for the ARIMA model.
Model forecast
The forecast ( ) function from the StatsModels library of Python was used for
constructing the model combining the values of p, d, and q. Specifically, the
ARIMA(3, 2, 3) model for forecasting COVID-19 vaccinations was established and
five-day projections were made. The modeling code is as follows:
16
data=data.astype('float64')
model=ARIMA(data,order=(p,d,q)).fit()
predict_data=model.predict()
forecast=pd.Series([int(each) for each in model.forecast(5)[0]])
dates=pd.date_range(final_data['date'].values[-1],periods=6)[1:]
forecast.index=dates
The predicted values, actual values, absolute errors, relative errors, and variation
coefficients are given in Table 3. The predicted trend was observed to match with the
actual trend with a good fitting. The average absolute error was −148175.2, the
average relative error was 10.06%, and the average variation coefficient was 5.34%.
The forecast of the model was within an acceptable range. The formula for calculating
the average relative error is as follows:
 =
|
|
 (16)
The formula of the variation coefficient is as follows:
=
(17)
In Eq. (17), represents the standard deviation and represents the mean.
[Table 3 about here] Table 3. Predicted results by the ARIMA(3, 2, 3) model
Figure 5 shows that the five-day projections (March 22, 2021 to March 26, 2021)
match well with the actual data of COVID-19 vaccination administration, showing a
high accuracy of the forecast. In addition, the number of administered vaccinations
shows a steady upward trend.
[Figure 5 about here] Performance of COVID-19 vaccination forecast.
17
Discussion
The data of COVID-19 vaccinations daily administered in China from December
16, 2020 to March 21, 2021 were selected for a time series analysis and fitting of the
optimal model of ARIMA(3, 2, 3). The model was then used to predict the number of
five-day vaccinations from March 22 to March 26, 2021, the results of which were
compared with the actual data. The average relative error was shown to be 10.06%,
whereas the average variation coefficient was 5.34%. The analysis showed that the
predicted COVID-19 vaccination uptake supplemented well with the actual values
with a small relative error. This indicated that the ARIMA(3, 2, 3) model could be
used to forecast the number of COVID-19 vaccinations in China. The ARIMA model
performs statistical analysis based on historic data. However, the vaccination speed
and number were influenced by other factors such as the relevant policies and vaccine
production scales. Historically, vaccine effectiveness often drops when the vaccines
are ultimately rolled out into the general population because clinical trials occur under
controlled and ideal conditions and may not capture all heterogeneity within the
population (Fedson, 1998). This would raise the question of whether to vaccinate with
currently available vaccines or wait for one with a higher effectiveness against that
particular variant Therefore, the ARIMA model was only applicable to short-term
vaccination forecasts (Baden et al., 2021; Polack et al., 2021). For more accurate
prediction, the inclusion of new data and adjustment of the model parameters are
needed to adapt to the actual vaccination scenario.
18
Conclusion
In summary, the ARIMA(3, 2, 3) model has demonstrated high goodness of fit for
forecasting the COVID-19 vaccinations in China, which can be used in the short-term
prediction of the monitored COVID-19 vaccination data sequence, providing a
reference for the establishment of immune barrier and the supply of vaccines in
China.
List of abbreviations
autocorrelation function (ACF)
augmented Dickey-Fuller (ADF)
Akaike information criterion (AIC)
autoregressive (AR)
autoregressive integrated moving average (ARIMA)
Bayesian information criterion (BIC)
Hannan-Quinn information criterion (HQIC)
moving average (MA)
partial autocorrelation function (PACF)
19
Declarations
Ethics approval and consent to participate
Not applicable
Consent for publication
Not applicable
Availability of data and materials
The data that support the findings of this study are available from Our World in Data
but restrictions apply to the availability of these data, which were used under license
for the current study, and so are not publicly available. Data are however available
from the authors upon reasonable request and with permission of Our World in Data.
Competing interests
The authors declare that they have no competing interest.
Funding
This study was supported by the Introducing Talents Initiated Project of Southwestern
University of Finance and Economics (Grant No. 230600001002020015).
Authors' contributions
WD, FH, SX conceived and designed the experiments. WD analyzed the data. WD
wrote the paper. FH contributed to the revised manuscript. SX and YW collected the
data. All authors read and approved the final manuscript.
Acknowledgements
Not applicable
20
References
Baden LR, Sahly H, Essink B, Kotloff K, Zaks T. Efficacy and safety of the
mrna-1273 sars-cov-2 vaccine NEW ENGL J MED. 2021;384:403-416
Box G, Jenkins GM, Reinsel GC. Time series analysis. Forecasting and control. 3rd
ed. Journal of Time. 2010.
Burnham KP, Anderson DR. Multimodel Inference Understanding AIC and BIC in
Model Selection. SOCIOL METHOD RES. 2004;33:261-304.
China's State Council joint prevention and control mechanism held a press conference
material. 2021. https://news.cctv.com/. Accessed 2 Jun 2021.
Fedson DS. Measuring protection: Efficacy versus effectiveness. Dev Biol Stand.
1998;95:195-201.
Kantelhardt JW, Zschiegner SA, Koscielny-Bunde E, Bunde A, Stanley HE.
Multifractal detrended fluctuation analysis of nonstationary time series. PHYSICA A.
2002;316:87-114.
Li Z, Chen Q, Feng L, Rodewald L, Li S. Active case finding with case management:
the key to tackling the COVID-19 pandemic. Lancet. 2020;396:63-70.
Polack FP, Thomas SJ, Kitchin N, Absalon J, Gruber WC, et al. Safety and efficacy of
the bnt162b2 mrna covid-19 vaccine. NEW ENGL J MED. 2020;383:2603-2615.
Tregoning JS, Brown ES, Cheeseman HM, et al. Vaccines for COVID‐19. CLIN EXP
IMMUNOL. 2020;202:162-192.
WHO SAGE values framework for the allocation and prioritization of COVID-19
vaccination document. In: World Health Organization. 2020.
21
https://www.who.int/publications/i/item/who-sage-values-framework-for-the-allocatio
n-and-prioritization-of-covid-19-vaccination. Accessed 14 Sep 2020.
Draft landscape and tracker of COVID-19 candidate vaccines document. In: World
Health Organization. 2020.
https://www.who.int/docs/default-source/a-future-for-children/novel-coronavirus_land
scape_covid-19.pdf?sfvrsn=4d8bd201_1. Accessed 10 Dec 2020.
Our World in Data website. https://ourworldindata.org/coronavirus-testing#china.
Accessed 2 Jun 2021.
22
Tables:
Table 1. ADF test results
Difference
order
t p
Threshold
1% 5% 10%
0 0.519 0.985 -3.503 -2.893 -2.584
1 -0.706 0.845 -3.503 -2.893 -2.584
2 -5.54 0 -3.503 -2.893 -2.584
Table 2. AIC, BIC, and HQIC values of the ARMA model
Model AIC BIC HQIC
ARMA(3,3) 2316.015 2336.361 2324.233
ARMA(2,2) 2319.739 2334.999 2325.903
ARMA(2,3) 2320.167 2337.97 2327.358
ARMA(3,2) 2325.573 2343.377 2332.765
Table 3. Predicted results by the ARIMA(3, 2, 3) model
Date Predict
value
Actual
value
Absolute
error
Relative
error%
Variation
coefficient%
20210322 2010823 1973905 36918 1.87 0.93
23
20210323 2248285 2077238 171047 8.23 3.95
20210324 2418990 2270243 148747 6.55 3.17
20210325 2527634 2816476 -288842 10.25 5.40
20210326 2645064 3453810 -808746 23.41 13.26
Note: Absolute error = Actual valuePredict value, whereas Relative error=Absolute
error/Actual value
Figure 1
Time series plots of the COVID-19 vaccination data in China. (a) Time series plot
of the original COVID-19 vaccination data, (b) Time series plot of the COVID-19
vaccination data after first-order difference, (c) Time series plot of the COVID-19
vaccination data after second-order difference
(a)
24
(b)
(c)
Figure 2
Plots of autocorrelation and partial autocorrelation before and after difference.
(a) Plots of autocorrelation and partial autocorrelation for the original series, (b) Plots
of autocorrelation and partial autocorrelation after the first-order difference, (c) Plots
of autocorrelation and partial autocorrelation after the second-order difference.
25
(a) (b)
(c)
Figure 3
AIC thermodynamic diagram.
26
Figure 4
Results of the ARIMA model validation. (a) Plot of residual sequence, (b)
Autocorrelation and partial autocorrelation plots of the residual sequence, (c) Q-Q
plot of the standardized residual for the ARIMA model.
(a)
(b)
27
(c)
Figure 5
Performance of COVID-19 vaccination forecast.
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
Background Vaccines are needed to prevent coronavirus disease 2019 (Covid-19) and to protect persons who are at high risk for complications. The mRNA-1273 vaccine is a lipid nanoparticle–encapsulated mRNA-based vaccine that encodes the prefusion stabilized full-length spike protein of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes Covid-19. Methods This phase 3 randomized, observer-blinded, placebo-controlled trial was conducted at 99 centers across the United States. Persons at high risk for SARS-CoV-2 infection or its complications were randomly assigned in a 1:1 ratio to receive two intramuscular injections of mRNA-1273 (100 μg) or placebo 28 days apart. The primary end point was prevention of Covid-19 illness with onset at least 14 days after the second injection in participants who had not previously been infected with SARS-CoV-2. Results Download a PDF of the Research Summary. The trial enrolled 30,420 volunteers who were randomly assigned in a 1:1 ratio to receive either vaccine or placebo (15,210 participants in each group). More than 96% of participants received both injections, and 2.2% had evidence (serologic, virologic, or both) of SARS-CoV-2 infection at baseline. Symptomatic Covid-19 illness was confirmed in 185 participants in the placebo group (56.5 per 1000 person-years; 95% confidence interval [CI], 48.7 to 65.3) and in 11 participants in the mRNA-1273 group (3.3 per 1000 person-years; 95% CI, 1.7 to 6.0); vaccine efficacy was 94.1% (95% CI, 89.3 to 96.8%; P<0.001). Efficacy was similar across key secondary analyses, including assessment 14 days after the first dose, analyses that included participants who had evidence of SARS-CoV-2 infection at baseline, and analyses in participants 65 years of age or older. Severe Covid-19 occurred in 30 participants, with one fatality; all 30 were in the placebo group. Moderate, transient reactogenicity after vaccination occurred more frequently in the mRNA-1273 group. Serious adverse events were rare, and the incidence was similar in the two groups. Conclusions The mRNA-1273 vaccine showed 94.1% efficacy at preventing Covid-19 illness, including severe disease. Aside from transient local and systemic reactions, no safety concerns were identified. (Funded by the Biomedical Advanced Research and Development Authority and the National Institute of Allergy and Infectious Diseases; COVE ClinicalTrials.gov number, NCT04470427.)
Article
Full-text available
Background Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection and the resulting coronavirus disease 2019 (Covid-19) have afflicted tens of millions of people in a worldwide pandemic. Safe and effective vaccines are needed urgently. Methods Download a PDF of the Research Summary. In an ongoing multinational, placebo-controlled, observer-blinded, pivotal efficacy trial, we randomly assigned persons 16 years of age or older in a 1:1 ratio to receive two doses, 21 days apart, of either placebo or the BNT162b2 vaccine candidate (30 μg per dose). BNT162b2 is a lipid nanoparticle–formulated, nucleoside-modified RNA vaccine that encodes a prefusion stabilized, membrane-anchored SARS-CoV-2 full-length spike protein. The primary end points were efficacy of the vaccine against laboratory-confirmed Covid-19 and safety. Results A total of 43,548 participants underwent randomization, of whom 43,448 received injections: 21,720 with BNT162b2 and 21,728 with placebo. There were 8 cases of Covid-19 with onset at least 7 days after the second dose among participants assigned to receive BNT162b2 and 162 cases among those assigned to placebo; BNT162b2 was 95% effective in preventing Covid-19 (95% credible interval, 90.3 to 97.6). Similar vaccine efficacy (generally 90 to 100%) was observed across subgroups defined by age, sex, race, ethnicity, baseline body-mass index, and the presence of coexisting conditions. Among 10 cases of severe Covid-19 with onset after the first dose, 9 occurred in placebo recipients and 1 in a BNT162b2 recipient. The safety profile of BNT162b2 was characterized by short-term, mild-to-moderate pain at the injection site, fatigue, and headache. The incidence of serious adverse events was low and was similar in the vaccine and placebo groups. Conclusions A two-dose regimen of BNT162b2 conferred 95% protection against Covid-19 in persons 16 years of age or older. Safety over a median of 2 months was similar to that of other viral vaccines. (Funded by BioNTech and Pfizer; ClinicalTrials.gov number, NCT04368728.)
Article
Full-text available
In the clinical development of a vaccine, an efficacy study asks the question, "Does the vaccine work?" In contrast, an effectiveness study asks the question "Does vaccination help people?". In general, vaccine development proceeds from a study of immunogenicity to a randomized controlled trial that determines vaccine efficacy under ideal conditions. Efficacy studies, however, have several limitations. In an immunogenicity study, when a vaccine is given according to different schedules, the object of the study is not the vaccine itself but the schedules; i.e., what is important is not the "relative immunogenicity" of the vaccine, but which schedule is more protective given the occurrence of the disease that is to be prevented. Furthermore, a clinical trial of vaccine efficacy is unable to predict accurately the level of protection that will be achieved in public health practice. Vaccination effectiveness can be evaluated in a prospective clinical trial, although few such studies have been undertaken. Effectiveness is usually assessed retrospectively, sometimes using a screening test, but more often in a case-control or cohort study. In these studies, rigorous risk adjustment is necessary to ensure the comparability of study populations. Retrospective studies also provide a means for assessing serious but rare vaccine-associated adverse events, an undertaking often needed to maintain public confidence in vaccination programmes. Many vaccines currently under development will be replacement rather than new vaccines, and they are unlikely to be evaluated in traditional efficacy trials. In future years, effectiveness studies, some of them using large administrative databases, will become increasingly important features of vaccine development and the formulation of public policy for immunization.
Article
COVID-19 was declared a pandemic by WHO on March 11, 2020, the first non-influenza pandemic, affecting more than 200 countries and areas, with more than 5·9 million cases by May 31, 2020. Countries have developed strategies to deal with the COVID-19 pandemic that fit their epidemiological situations, capacities, and values. We describe China's strategies for prevention and control of COVID-19 (containment and suppression) and their application, from the perspective of the COVID-19 experience to date in China. Although China has contained severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and nearly stopped indigenous transmission, a strong suppression effort must continue to prevent re-establishment of community transmission from importation-related cases. We believe that case finding and management, with identification and quarantine of close contacts, are vitally important containment measures and are essential in China's pathway forward. We describe the next steps planned in China that follow the containment effort. We believe that sharing countries' experiences will help the global community manage the COVID-19 pandemic by identifying what works in the struggle against SARS-CoV-2.
Multifractal detrended fluctuation analysis of nonstationary time series
Multifractal detrended fluctuation analysis of nonstationary time series. PHYSICA A. 2002;316:87-114.
Vaccines for COVID-19
  • J S Tregoning
  • E S Brown
  • H M Cheeseman
Tregoning JS, Brown ES, Cheeseman HM, et al. Vaccines for COVID-19. CLIN EXP IMMUNOL. 2020;202:162-192.