INTRODUCTION
The Fudan-CCDC model [
1–
3] was proposed by Cheng’s group at Fudan University to study the evolution of COVID-19. The model took advantages of the time delay process introduced by the TDD-NCP model [
4–
9] proposed previously also by Cheng’s group, and developed new convolution kernels for the time delay terms by applying several time distributions acquired from an important paper [
10] by CCDC (China Center for Disease Control and Prevention). Both the TDD-NCP model and the Fudan-CCDC model are single-chain models and have been performed well in analyzing the evolution of COVID-19 in China, and its early stage of global transmission [
11,
12].
The multi-chain model was put forward and developed in the context of the second outbreak in some regions. We first had this idea when analyzing the epidemic situation of South Korea. In the Fig. 1, there was a sudden turn in growth rate, which inferred that a stronger transmission chain might have emerged.
With the further spread of the global pandemic, such a sudden change in growth rate has been observed in the cases of other countries as well. And the curves fitted by the Fudan-CCDC model sometimes deviate from the data. Singapore is one of the examples. We had studied Singapore’s case in [
12], and based on the data till Feb 25, we concluded that Singapore had been successful in disease prevention and control. Since then, our group has been continually tracking the data. Unexpectedly, in late February, a sudden rise occurred, see Fig. 2B.
We show in Fig. 2 the curve fitting for Singapore’s data by the Fudan-CCDC model, on Feb 17 and Mar 1, respectively. We see in Fig. 2A that on Feb 17, the Fudan-CCDC model had predicted that the increment of confirmed cases would be zero on Feb 27, and remained stable for the next ten days. However, on the crucial day Feb 27, an unexpected rise occurred in Singapore’s data (see Fig. 2B), which caused our vigilance. Till Mar 1, this new upward trend was so obvious that it could not be explained by the single chain Fudan-CCDC model any longer. Therefore, we began to consider the application of the multi-chain Fudan-CCDC model, and revisit Singapore’s case.
RESULTS
There are two important parameters in our model, one is the infection rate , which depicts the speed of virus transmission, and the other is the isolation rate , which is related to the strength of government measures and the public ’s awareness of prevention.
The two-chain and three-chain models until Apr 4
Figures 3 and 4 show the evolutions of COVID-19 in Singapore and its possible future trends, based on the two-chain model. The scattered red circles are the data: the number of cumulative confirmed cases (Fig. 3) and its increment (Fig. 4) from Jan 23 to Apr 4. In Fig.3, we illustrate the four “most optimized” fitting curves (in solid lines) for the data, and their predictions (in dotted lines) by the model, in the order of red, green, blue and purple, respectively. For the convenience to recognize, the ‘very most optimized’ fitting curve is drawn in a full solid red line, including its prediction. Details of the optimization methods are described in the section Materials and Methods. We can see from Fig. 3A that based on the two-chain model, Singapore is expected to have zero increment of confirmed cases on Apr 26, and the total number of infections will be around 1500, if no other transmission chains arise in the future. Figure 3A is the semi-log form of Fig. 3B, and it clearly demonstrates the good curve fitting of the sudden rise in growth rate around Mar 3.
Figure 4A shows the fitting for the daily increment based on the two-chain Fudan CCDC model. There are two peaks in the curves, suggesting possible new sources of transmission. Then we single out the “most optimized” curve (the red line) in Fig. 4B. We see that most of the data fall in this area, indicating the effectiveness of the model. Besides, the two chains are shown in green and blue dotted lines, respectively.
Now we consider the three-chain Fudan-CCDC model. Figures 5 and 6 show the epidemic evolution in Singapore based on the three-chain model. The legends are the same as in the previous context. We can see from Fig. 5A that under the three-chain model, Singapore is expected to have zero increment of confirmed cases on May 4, and the total number of infections will be around 1,900, if no other chains of transmission arise in the future.
In addition, we find that the end date of COVID-19 based on the three-chain model is later than that based on the two-chain model, and the number of total infected is also significantly higher. This is because the time of zero increment will now arrive until all the transmission chains come to end.
Warning a possible new outbreak in Singapore on Apr 12
Figures 7 and 8 show predictions of the cumulative and incremental confirmed cases in Singapore based on the multi-chain model, with data observed from Jan 23 to Apr 12. Table 1 presents Parameters for two-chain model and three-chain model, with data observed from Jan 23 to Apr 12.
The predictions of the two-chain model and three-chain both show an uncontrollable trend of the epidemic. These two models both pass on the information that Singapore might be faced with a very risky situation of rapidly increasing cases. Therefore, strong measures are urgently needed to contain the epidemic.
On April 12, we observed that in our prediction both the two-chain model and the three-chain model did not converge, and we predicted a wave of outbreaks in Singapore. The later evolution indeed corroborated that.
Two-stage assumption for the fourth chain
Figure 9A is plotted based on data from Jan 23 to Apr 12, which shows an unstopping trend. The model performs a precise prediction on evolution of 10 days later. Due to quarantine measurements of the government, the increment of confirmed cases drops gradually from Apr 23. We hereby introduced the assumption of two-stage for the fourth chain and applied the modified parameter in analysis on Apr 30, shown in Fig. 9B.
Parameter values are obtained by data fitting with optimization programs, which are collected in Table 2.
Comparison of models with different chain numbers
With the data of Jan 23–Mar 19, we conducted experiments on models with different chain numbers to see the difference among the models with different chain number, and the results are as follows (Figs. 10 and 11).
Here we list the parameters in Table 2 or the multi-chain model based on the data from Jan 23 to May 31.
New two-chain model with two-stages parameters
Though the evolution of the epidemic so far can be well fitted by the multi-chain model, the drawback is obvious that more chains would be needed if more cases appear. Here we introduce new two-chain model, the difference is that the parameters and are both two-stages, that is to say, and will be changed at some time in every chain (see Table 3 ).
Figures12A and B (the semi-log plot) show that the number of cumulative confirmed cases can be fitted well by the new chain model. Figures 13A and B (the semi-log plot) show the fitting for the daily increment based on the new two-chain Fudan CCDC model, here only the new cases which are great than 10 are plotted. One advantage of the two-stages model is that the trend of the evolution is clear.
DISCUSSION
Advantages of the multi-chain Fudan-CCDC model
The multi-chain Fudan-CCDC model has given a better explanation of the epidemic evolution in Singapore, and perhaps other nations as well. Compared to single chain models, the multi-chain Fudan-CCDC model shows the following advantages: (i) It better fits the data in history; (ii) By identifying different sets of parameters for different chains, it is able to simulate the multi-peaks in the daily increment data, which the single-chain models can hardly explain; (iii) It illustrates the importance of controlling the imported cases. Since zero increment depends on when the last chain vanishes, it is difficult to completely end the epidemic unless all sources of transmission are detected and blocked.
With more chains, the model could better interpret the epidemic and gain more accurate predictions.
Detection of new chains
Now we revisit Fig. 2 to discuss when to introduce new chains. In Fig. 2B, there is an obvious shallow pit around Feb 27 along with the data trend, illustrating that the number of confirmed cases was about to flatten, but rose up again immediately. This shallow pit acts as a signal to consider new chains in the model, warning new sources of transmission. In fact, in Fig. 2A, a shallow pit has already occurred around Feb 12. This pit was not so obvious as the next one around Feb 27, and was likely to be treated as fluctuation of the data. Besides, more data are needed to form a new transmission chain. Therefore, carefully detecting and analyzing these shallow pits plays an important role in finding new chains.
Additivity
One may find that the multi-chain model is just an addition of multiple single-chain models. In fact, the single-chain and multi-chain Fudan-CCDC models are both linear ones, so they enjoy the convenience of additivity. For the traditional nonlinear epidemic models such as the SIR and SEIR models, the model also can be linearized, and SEIJR model is developed [
13]. The property of additivity is friendly, as it allows us to construct new models of not only multiple chains, but also multiple districts, which might be applicable to other countries.
In conclusion, the multi-chain Fudan-CCDC model is suitable for Singapore. It has made possible the early detection of imported infectors and super spreaders, and is able to suggest timely adjustment for epidemic control. Based on the experiences in Singapore, it is very difficult to control the transmission since the infected people will increase exponentially even if very small infected ones are not be isolated or treated, and it is important to trace the curve of the cases.
MATERIALS AND METHODS
In this section, we introduce two models, the single-chain Fudan-CCDC model and the multi-chain Fudan-CCDC model respectively. The single-chain Fudan-CCDC model describes the evolution of COVID-19 based on the assumption that all the new cases originate from the initial source, i.e. there is only one chain of transmission. And the multi-chain Fudan-CCDC model assumes that due to new imported cases, new super spreaders, or the different transmission characteristics of different regions, there may be two or more single chains of transmission in the country.
The single-chain Fudan-CCDC model
As is mentioned in [
1–
3,
11,
12], our single-chain Fudan-CCDC model is as follows:
where
and
represent the cumulative infected people and the cumulative confirmed cases at day
, respectively, and
is the instant (not cumulative) number of infected isolated not yet confirmed by the hospital. The infected ones are put into isolation once they show illness symptoms, and the newly confirmed should be removed from the isolated group.
is the number of people who are potentially infectious to healthy ones–they are infected actually but not in quarantine or hospitalization.
and
represent the infection rate and the isolation rate respectively, which may be changed in different time periods. Some transition probabilities are used in our model:
and
are the transition probabilities from infection to illness onset, and from infection to hospitalization, respectively. Here we reconstruct them from one important paper [
10] by CCDC:
•
: the transition probability from infection to illness onset is one log-normal distribution of
•
: the transition probability from illness onset to hospitalization is one Weibull distribution of
•
: the transition probability from infection to hospitalization, which can be calculated via the convolution of
and
, and may be approximated by
In the implementation, the supports of
and
is set by 21 days and 42 days respectively [
10]. This time delay dynamic system is applicable to simulations of COVID-19 in the countries where community transmission exists, while the kernels like
and
might vary from countries to countries.
The model can be used to fit the reported numbers of the cumulative confirmed cases and predict the evolution of epidemic, and the details can be found in [
1–
3,
11,
12].
The multi-chain Fudan-CCDC model
In the multi-chain Fudan-CCDC model, the final epidemic transmission chain is the superposition of several single chains:
and we obtain the sum forms:
where
is the start time of the
-th source.
Specifically, we have applied the two-chain and the three-chain models to analysis the situations in Singapore. We suppose that there is a new chain when a sudden turn appears in the curve of reported confirmed cases. For both the two-chain model and the three-chain model, infection rate and isolation rate of the fisrt chain are obtained by fitting data before a specific time node. The differences lie in assumptions and parameters to identified.
Optimization method for parameter identification
Parameter identification is an optimization process. There are two kinds of decision variables in this optimization, time nodes
and the model parameters. We suppose that more recent data have more importance and efficiency for us to predict the trend. So we established the objective function as follows:
where
Note that the first term is to minimize the difference betweem values of data and simulations, i.e. the empirical risk and the second term is to minimize the structural risk. is the weight of the penalty term, and contains only in the two-chain model, and in the three-chain model.
Time nodes. Since public data are discrete along time and time nodes are dates, the grid searching method could be used to obtain the minimum value of the objective function. As characteristics of the first chain is known, so we only need to do grid searching of other chains.
Model parameters. The parameter optimization is solved by a constrained optimization problem solver.
Therefore, the whole process of optimization can be summarized as the following three steps: determine all possible time nodes; calculate the minimum of objective function for cases of different time nodes; obtain the optimal time nodes and model parameters.
Higher Education Press and Springer-Verlag GmbH Germany, part of Springer Nature