1. Introduction
The coronavirus disease 2019 (COVID-19) pandemic, caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), began in Wuhan and soon spread globally. Prior studies have investigated SARS-CoV-2 transmission dynamics in mainland China [1]. For instance, population mobility data was found to be highly predictive of COVID-19 importation risk from Wuhan to other Chinese cities in early 2020 [2]; population flow from Wuhan to 296 prefectures was found to drive the spatiotemporal distribution of COVID-19 cases in China in the spring of 2020 [3]. However, detailed descriptions of mainland China’s sub-provincial administrative units that reported COVID-19 data (i.e., prefecture or equivalents) remained a gap in the epidemiological literature.
The doubling time of cumulative case count describes how fast an epidemic is growing [4]. For example, in 2003, the doubling times of the severe acute respiratory syndrome (SARS) epidemic in different countries across the world was investigated, and it was suggested that variation in doubling times of SARS epidemics across countries might arise from variation in both transmission rates and control efforts [5]. In 2020, doubling times of the COVID-19 pandemic across different European countries, different regions in Iran, and different states of the United States (U.S.) have been studied, respectively [6,7,8,9]. We previously analyzed the COVID-19 epidemic doubling time by province in mainland China, from 20 January through 9 February 2020 [10]. Further analysis by prefecture would provide a more detailed account of the early phase of the pandemic within mainland China. We hypothesized that population size or population density might be correlated with the arithmetic mean of the doubling times and ran regression models accordingly. We also investigated the power-law relationship between cumulative case count and population size. Furthermore, we investigated potential prefecture-level spatial clustering of the cumulative case count, the total times the cumulative case count doubled and the mean doubling times. Spatial autocorrelation Moran’s I statistics was performed to identify potential clusters.
The objectives of this paper are: (1) to describe the sub-provincial administrative units that reported COVID-19 case count data as reflected in an oft-cited dataset, and whether their cumulative case count in the early phase of the epidemic follows a power-law relationship with population size; (2) to compute the COVID-19 epidemic doubling times by prefecture in mainland China in the early phase of the epidemic and its relation with population size and density; and (3) to identify spatial clusters of the cumulative number of cases and the total times the cumulative case count doubled by prefecture in mainland China in the early phase of the epidemic.
2. Materials and Methods
2.1. Geographic Scope
The geographic area of this study was mainland China comprising 22 provinces, 5 ethnic minority “autonomous regions” and 4 centrally administered municipalities (Appendix A). In the provinces and “autonomous regions,” there were three tiers of sub-provincial administrative units: prefecture-level units, county-level units, and township-level units [11]. Among the prefecture-level units, the majority were prefectural-level cities that encompass cities and their surrounding counties [11]. In the centrally administered municipalities of Beijing, Chongqing, Shanghai, and Tianjin, there were two sub-municipal levels: urban districts and street communities [11].
2.2. COVID-19 Cumulative Incidence Data Sources
Sub-provincial cumulative numbers of confirmed COVID-19 cases were reported daily by provincial health commissions since 20 January 2020 [10]. Such data were collated daily by DingXiangYuan (abbreviated as DXY), an online community of mainland Chinese healthcare professionals [12]. DXY maintained a publicly available and oft-cited website that published the aggregated data that was updated daily. DXY was the source of mainland Chinese COVID-19 case count data available on the oft-cited Johns Hopkins University dashboard [13]. We downloaded DXY data from an openly available Github source that crawled data from DXY (a crawler developed by Isaac Lin, aka “BlankerL”) [14]. The dataset analyzed here covered one month immediately after the implementation of Wuhan’s cordon sanitaire on 23 January 2020: from 24 January 2020 (the first date of this dataset) through 24 February 2020 (the day this dataset was retrieved) by the date the data was retrieved from governmental press releases by DXY. We cleaned the dataset for errors and inconsistencies in data entry, as per the official press releases that our team collected from provincial government websites [10].
2.3. Epidemic Doubling Time
If the cumulative case count C(t) doubles between time point t1 and time point t2, i.e., C(t2)/C(t1) = 2, the time difference t2 − t1 is known as the epidemic doubling time, td. A long epidemic doubling time indicates a slow epidemic growth. The shorter the doubling time, the faster the epidemic grows. We computed how long it took for the cumulative case count to double each time. The arithmetic means of the successive epidemic doubling times provide summaries of the epidemic growth of a location over the study period. Given that the doubling times are inversely proportional to the growth rate, and given a fixed study period, the number of times the cumulative case count has doubled provides a crude indicator of the epidemic growth; however, this is sensitive to the starting value of the cumulative incidence. The numbers of doubling times are presented alongside their arithmetic mean by prefecture.
2.4. Regression
We ran linear regression models with the dependent variable the arithmetic mean doubling time, and independent variable (a) population size or (b) population density, with the date of the first reported COVID-19 case as a covariate. The unit of analysis was a prefecture. For these prefectures, we obtained their population data from the 2010 China Census [15], and the geographical area data from the respective English language Wikipedia page for each province, autonomous region, or centrally administered municipality. We excluded prefectures where the cumulative case count doubled two or fewer times by 24 February 2020, from our regression models, as they introduced a lot of noise therein.
We characterized the functional relationship between population size and cumulative case count in prefectures excluding those in Hubei by 24 February 2020. Prefectures in Hubei were excluded because their cumulative case count was disproportionately high compared to those in other provinces. If the relationship between population size and cumulative case count follows the power-law, then log(cumulative case count) = g * log(population size), or, log(cumulative case count/population size) = m * log(population size) where m = g − 1. Per capita cumulative case count would be exactly proportional to population size and there would be no heterogeneity when m = 0. Prefectures with lower population size would have a higher per capita cumulative case count when m < 0; and a lower per capita cumulative case count when m > 0. Linear regression was used to obtain an estimate of m [16]. Please refer to the Appendix A for further details.
2.5. Spatial Clustering
In this study, the local Moran’s I index was used to identify the spatial clusters of COVID-19 cases and doubling times in mainland China. Using case count as an example, the local Moran’s I index can be expressed as:
where Ii is the local Moran’s I index for location i; zi is the cumulative number of reported cases at location i; is the mean value of reported cases; σ2 is the variance of z, and wij is the spatial weight matrix which is represented based on a distance of weighting between locations i and j. The local Moran’s I index can reflect the clusters of homogeneous values (e.g., high values surrounded by neighbors with high values) [17].2.6. Programming
We used Python to process the cumulative incidence data and compute the COVID-19 epidemic doubling time by prefecture. R 3.5.1 to 3.6.2 (R Core Team, R Foundation for Statistical Computing, Vienna, Austria) was used in statistical analysis. ArcGIS Pro (Version 2.4.0) was used in spatial analysis and map creation.
2.7. Ethics
The Institution Review Board of Georgia Southern University determined that this was not defined as human subjects research under human subjects regulations (H20364).
3. Results and Discussion
3.1. Sub-Provincial COVID-19 Reporting Units
A total of 462 sub-provincial administrative entities (reporting units) reported COVID-19 data, including zero case counts (Table 1). They comprised 448 sub-provincial geographical reporting units and 14 divisions of the Xinjiang Production and Construction Corps (XPCC) [18]. The XPCC divisions were excluded from further analysis. The 448 sub-provincial geographical reporting units are referred to as “prefectures” in this paper, even though prefectures are just one type of many different reporting units (Appendix A).
Of the 448 prefectures, 39 had zero cumulative case count by 24 February 2020. Of the remaining 409 prefectures with cases, the case in Ganjiang New District was merged with the cases in the City of Nanchang for our maps and statistical analysis. Thus, a total of 408 prefectures were used in creating maps and conducting spatial analysis. Of these 408 prefectures, 8 were excluded for our statistical analysis by population and population density due to their geographic irregularities. Thus, a total of 439 prefectures (400 with cases and 39 without cases by 24 February 2020) were included in the statistical analysis (Table 1 and Table 2). In Table 3 and Table 4, we provide the descriptive statistics of the population and population density of 439 prefectures by their cumulative number of reported COVID-19 cases.
Figure 1 describes the change in cumulative case count by prefecture by week from 26 January through 16 February 2020; cumulative case count on 23 February 2020, is presented in Figure 2. Our results highlight the geographic extent of the epidemic that affected many Chinese prefectures. As presented in Table 3, a total of 98% (1300 million; 2010 Census) of the mainland Chinese population lived in the 400 prefectures with at least one COVID-19 case by 24 February 2020. Nevertheless, some remote prefectures were spared. The 39 prefectures without any cases by 24 February 2020 had a population of 32 million (2%; 2010 Census). The city of Wuhan was the only reporting unit with a cumulative number of 10,000+ confirmed cases. Another seven prefectures, all in Hubei, with a total population of 27.8 million (2010 Census) had a case count in the order of thousands (1000–9999) each. Outside Hubei, all the prefectures reported a cumulative case count <1000 as of February 24. A total of 36 prefectures (8 in Hubei and 28 outside Hubei) with a total of 220 million inhabitants (2010 Census) reported case counts in the hundreds (100–999) (Table 3).
Table 4 shows that the 39 prefectures with zero cases by 24 February 2020, had a median population density of 28.92 (2.5 to 97.5 percentiles, 0.84–514.02); the 140 prefectures with 1–9 cases had a median population density of 175.98 (9.61–25,006.65) per sq. km. The 216 prefectures with 10–99 cases had a median population density of 466.80 (59.38–22,887.07) per sq. km. The one prefecture in Hubei that fell into that group (10–99 cases) had a population density of 23.41 per sq. km and was an outlier in this group. For the 36 prefectures with 100–999 cases, the median population density was 475.62 (135.55–2143.41) per sq. km. Of these prefectures, the eight prefectures in Hubei had a median of 244.13 (137.51–529.09) and the 28 outside Hubei had a median of 550.51 (175.46–2839.21). The seven Hubei prefectures with 1000–9999 cases had a median population density of 404.57 (232.98–640.30) per sq. km and Wuhan (with 10,000+ cases) had a population density of 1144.61 per sq. km. We found that the regression line between log-transformed population size and log-transformed cumulative case count per 100,000 population (by 24 February 2020) of prefectures (excluding those in Hubei) has a slope of m = −0.1440 (95% CI, −0.2524, −0.0357). Given m < 0, low-population prefectures have a higher per capita cumulative case count than high-population prefectures (Figure 3).
Added unto our maps is the Hu Line, a separator of population density in China first proposed by the Chinese geographer Hu Huanyong in 1935 [19]. This conceptual line is a straight line connecting Heihe, Heilongjiang Province, to Tengchong, Yunnan Province. The Hu Line divides China into a densely populated southeast and a sparsely populated northwest. The Hu Line succinctly articulates the demographic and geographic disparities in China [20]. To the northwest of the Hu Line, 6% of China’s population spans across more than half of China’s territory. That means for every square-kilometer there are 11 people, approximately one-fourth of the average global population density [21]. In contrast, approximately 94% of the Chinese population lives to the southeast of the Hu Line. It translates into a population density of 260 people per square-kilometer, approximately six times the global average [21]. By applying the Hu Line to our maps of the COVID-19 epidemic in China (Figure 1, Figure 2, Figure 4 and Figure 5), we present graphically how the vast majority of the Chinese population lived in prefectures affected by COVID-19 in February 2020. Those prefectures that reported zero cases were sparsely populated even though their areas were large.
3.2. Doubling Time
Figure 4 shows that the cumulative case count in prefectures in Hubei and its neighboring provinces doubled eight or more times in the study period. The cumulative case count in Wuhan doubled 15 times in the study period; another 19 prefectures doubled 8 to 11 times (Table 5). Severe epidemics happened in major metropolitan areas, such as Guangzhou and Shenzhen in Guangdong Province in the South, Wenzhou, Taizhou, Ningbo, and Hangzhou in Zhejiang Province in the East, as well as in the northeastern cities of Qiqihar and Harbin in Heilongjiang Province.
Another measure of the growth rate of the epidemic is the arithmetic mean of the doubling times (Figure 5). This metric was low in Hubei province and several coastal cities, indicating fast epidemic growth. Excluding Wuhan and excluding those prefectures where there was only one case or none, a total of 76 (17.3% of 439) prefectures had an arithmetic mean of the epidemic doubling time of <2 days (Table 5).
Among prefectures outside Hubei province and where the epidemic had doubled ≥3 times, for every increase in 100,000 residents, the arithmetic means of the doubling times changed by −0.012 (95% CI, −0.017, −0.006) after controlling for the date of the first reported case (Table 6). While the association between population size and the arithmetic mean of the doubling times was statistically significant (p < 0.001), the model only explained a small part of the variance (adjusted R2 = 0.057). We further tested if there is any association between population density and the arithmetic mean of the doubling times; the statistical association was found to be insignificant (Table 6).
3.3. Spatial Clustering
Based on the local Moran’s I clusters for the cumulative number of confirmed cases on 16 February 2020, most prefectures in Hubei province reported significantly more cases than prefectures in other provinces (Figure 6). Prefectures in Hubei had similarly high case numbers as their neighbors, exhibiting a high-high cluster. The prefectures adjacent to Hubei province, represented by dark blue on the map, exhibit a low-high pattern. These prefectures had significantly lower numbers than cities in Hubei province. Harbin in the Heilongjiang Province showed a high-low pattern, meaning that a significantly higher number is observed in Harbin than its neighbors.
Based on the local Moran’s I clusters for the total number of times the cumulative case count had doubled by 16 February 2020, many prefectures in the central and southeast part of China showed a high-high pattern, suggesting a rapid growth of the epidemic (Figure 7). A few prefectures such as Kunming, Chengdu, Baoding, and Dalian, showed significantly more times the cumulative case count doubled than their immediate neighbors are represented in a high-low pattern on the map.
Based on the local Moran’s I clusters for the arithmetic mean of doubling time, prefectures in the southeastern part of mainland China experienced fast epidemic growth with a short average doubling time were either in low-low clusters or high-low clusters (Figure 8). Some prefectures in the northern or northwestern part of mainland China experienced slow epidemic growth with a long average doubling time.
Our spatial clustering analysis captured a snapshot of the epidemic in mid-February 2020 when prefectures in Hubei province had very high case counts compared to the rest of mainland China. Meanwhile, prefectures across central, eastern, and southern China experienced rapid case growth. In some prefectures in northern, northeastern, and western China, epidemic hotspots were found. From the local clustering analysis, the epicenters of the epidemics can be clearly identified through the high-high clusters. Likewise, vulnerable areas located near the epicenters can be identified through the low-high clusters. Identifying the clusters of high and low case numbers can help us detect the sharpest boundaries between areas with a high and low level of transmission, which may help guide intervention measures.
If performed in real-time, spatial analysis can help epidemiologists identify epidemic hotspots and jurisdictions with rapid epidemic growth. Public health interventions can be applied in tiers that are proportionate to the risk of infection so that the epidemic can be controlled while damage to the economy and limitations to personal liberty can be minimized.
3.4. Limitations
This study has its limitations. First, underreporting due to underdiagnosis was a possibility, especially in Hubei province where the supply of diagnostic equipment was low and the capacity of local hospitals was overwhelmed during the study period [22]. Differential reporting rate across prefectures was a possibility too. Underlying this study was a strong assumption that the reporting rate remained the same over time within a prefecture. Given that the time frame of our dataset began on 24 January 2020, four days after the Chinese central government initiated nationwide reporting of COVID-19, we opined that the reporting rate was fairly stable during our study period. However, changes in case definitions might introduce further uncertainty in the case count reported in China as described in Tsang et al. [1]. Second, our dataset was crawled by a third-party crawler [14], from the DXY website that aggregated China’s official data for this COVID-19 pandemic. We have manually identified and corrected errors therein as per provincial governmental sources [10]. Nonetheless, some minor errors might still remain. Third, the dates in the DXY dataset were the dates that DXY retrieved the data from the health authorities. They were the same dates as the data was released as the website maintained near-real-time updates for its visitors. Since late January 2020, data every 24 h ending at midnight was released by China’s National Health Commission at around 8 am the next day (Beijing Time). Thus, the reporting date was one day behind the actual date when the cases were confirmed. Nevertheless, this limitation would not affect our calculation of the doubling times as long as the daily reporting periods remained consistent. Fourth, we acknowledged that the explanatory power of our regression models was limited because very few predictors were included in the regression models.
4. Conclusions
Our analysis documented spatial heterogeneity in the epidemic growth of the COVID-19 epidemic across prefectures in mainland China from 24 January to 24 February 2020. First, we found that the epidemic heavily affected prefecture-level cities in Hubei and neighboring provinces and a number of metropolitan areas in southern, eastern, and northeastern China. Nevertheless, our analysis showed that by 24 February 2020, the epidemic had spread to prefectures that comprise 98% of the Chinese population. We found that the power-law relationship between population size and cumulative case count (by 24 February 2020) indicated low-population prefectures had a higher per capita cumulative case count than high-population prefectures. Second, an increase in population size was associated with a very small reduction in the mean doubling time. Third, spatial analysis indicated that by mid-February 2020, prefecture-level cumulative case count clustered around Hubei while many prefectures across central, coastal, and northeastern China experienced rapid growth with cumulative case count doubling multiple times with a small mean doubling time. This study demonstrates that, if performed in real-time, spatial analysis of prefecture-level COVID-19 data can enable epidemiologists to stratify local jurisdictions by their epidemic growth. Tiers of public health interventions can be implemented by local jurisdictions in a manner that is proportionate to their epidemic risk. Spatial analysis can offer additional insights into the epidemic that enables effective responses to control its spread and yet minimizes unnecessary draconian measures that harm the economy and limit personal liberty.
Author Contributions
Conceptualization, I.C.-H.F., X.Z. and G.C.; methodology, I.C.-H.F., X.Z. and G.C.; software, X.Z.; validation, I.C.-H.F., X.Z., C.-N.C. and G.C.; formal analysis, I.C.-H.F., X.Z. and C.-N.C.; investigation, I.C.-H.F., X.Z., C.-N.C., S.K.O. and K.M.-R.; resources, I.C.-H.F., X.Z. and G.C.; data curation, S.K.O., K.M.-R., C.-H.C., P.-Y.L. and M.L.; Writing—Original draft preparation, I.C.-H.F.; Writing—Review and editing, I.C.-H.F., X.Z., C.-N.C. and G.C.; visualization, X.Z. and C.-N.C.; supervision, I.C.-H.F., X.Z. and G.C.; project administration, I.C.-H.F.; funding acquisition, G.C. and I.C.-H.F. All authors have read and agreed to the published version of the manuscript.
Funding
G.C. receives support from the National Science Foundation grant no. 1414374 as part of the joint NSF-NIH-USDA Ecology and Evolution of Infectious Diseases program. I.C.-H.F. received salary support in the academic year 2019–2020 from the National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention (no. 19IPA1908208). This article is not part of I.C.-H.F.’s CDC-sponsored projects.
Institutional Review Board Statement
The Institutional Review Board of Georgia Southern University determined that this was not defined as human subjects research under human subjects regulations (H20364).
Informed Consent Statement
Not applicable, as this study was not defined as human subjects research under human subjects regulations.
Data Availability Statement
The data used in this study was retrieved from BlankerL: DXY-COVID-19-Crawler (COVID-19/2019-nCoV Realtime Infection Crawler and API). Available online:
Acknowledgments
The team thanks Dongyu Jia who participated in our data curation team at the very beginning of our COVID-19 projects but has discontinued his participation because of other research commitments.
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Disclaimers
The opinions expressed in this article do not represent the official positions of the Centers for Disease Control and Prevention or the United States Government.
Appendix A
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures and Tables
Figure 1. Cumulative case count by prefecture on 26 January, 2 February, 9 February, and 16 February 2020. The shade of red represents the cumulative number of cases on a log10 scale. The line in the plot is the Hu Line that connects the city of Heihe in Heilongjiang Province and the city of Tengchong in Yunnan Province.
Figure 2. Cumulative case count by prefecture on 23 February 2020. The shade of red represents the cumulative number of cases on a log10 scale. The line in the plot is the Hu Line that connects the city of Heihe in Heilongjiang Province and the city of Tengchong in Yunnan Province.
Figure 3. Power-law relationship between population size and cumulative case count of prefectures in mainland China (except Hubei) by 24 February 2020. The coefficient m in ln(cumulative case count per 100,000 population) = m*ln(population) is −0.1440 (95% CI, −0.2524, −0.0357).
Figure 4. The number of times the cumulative case count doubled from 24 January through 24 February 2020. The line in the plot is the Hu Line that connects the city of Heihe in Heilongjiang Province and the city of Tengchong in Yunnan Province.
Figure 5. The arithmetic mean of the doubling times of the cumulative case count from 24 January through 24 February 2020. The line in the plot is the Hu Line that connects the city of Heihe in Heilongjiang Province and the city of Tengchong in Yunnan Province.
Figure 6. Results of local Moran’s I clusters for the cumulative number of reported cases on 16 February 2020, indicating high numbers surrounded by high numbers (high-high clusters), high numbers surrounded by low numbers (high-low clusters), low numbers surrounded by high numbers (low-high clusters) and low numbers surrounded by low numbers (low-low clusters). Purple color represents that jurisdictions without cases by 16 February 2020, or jurisdictions outside mainland China (Hong Kong, Macau, and Taiwan).
Figure 7. Results of local Moran’s I clusters for the total number of times the cumulative case count had doubled by 16 February 2020, indicating high numbers surrounded by high numbers (high-high clusters), high numbers surrounded by low numbers (high-low clusters), low numbers surrounded by high numbers (low-high clusters), and low numbers surrounded by low numbers (low-low clusters). Purple color represents that jurisdictions without cases by 16 February 2020, or jurisdictions outside mainland China (Hong Kong, Macau, and Taiwan).
Figure 8. Results of local Moran’s I clusters for the arithmetic mean of the doubling times, indicating high numbers surrounded by high numbers (high-high clusters), high numbers surrounded by low numbers (high-low clusters), low numbers surrounded by high numbers (low-high clusters), and low numbers surrounded by low numbers (low-low clusters). Purple color represents that jurisdictions without cases by 16 February 2020, or jurisdictions outside mainland China (Hong Kong, Macau, and Taiwan).
Number and types of reporting units, DingXiangYuan (DXY) data entries by location, non-resident data entries, entries used for mapping after data merger and statistical analysis, of 31 provinces (autonomous regions, centrally administered municipalities) in mainland China.
Number and Types of Reporting Units 1 | DXY Data Entries (Excl. Duplicate Row) | Separate Data Entry for Non-Residents 2 | DXY Data Entries Used for Mapping after Data Merger | DXY Data Entries Included in Statistical Analysis | Geo-Graphical Reporting Units Without Cases 3 | |
---|---|---|---|---|---|---|
Mainland China TOTAL | 462 | 421 | Not applied | 408 | 400 | 39 |
Anhui 4 | 16 PLC | 17 | No | 16 | 16 | 0 |
Beijing 5 | 16 districts | 15 | Yes (+1) 2 | 15 | 15 | 1 |
Chongqing 6 | 41 = 26 districts + 12 counties + 3 “new areas” | 39 | No | 39 | 36 | 2 |
Fujian | 9 PLC | 9 | No | 9 | 9 | 0 |
Gansu 7 | 14 = 12 PLC + 2 AP | 11 | No | 11 | 11 | 3 |
Guangdong 8 | 21 PLC | 20 | No | 20 | 20 | 1 |
Guangxi 9 | 14 PLC | 13 | No | 13 | 13 | 1 |
Guizhou | 9 PLC | 9 | No | 9 | 9 | 0 |
Hainan 10 | 19 = 4 PLC + 5 CLC + 4 counties + 6 autonomous counties | 15 | No | 15 | 15 | 4 |
Hebei | 11 PLC | 11 | No | 11 | 11 | 0 |
Heilongjiang 11 | 13 = 12 PLC + 1 P | 13 | No | 13 | 13 | 0 |
Henan 12 | 18 = 17 PLC + 1 DA CLC | 23 | No | 18 | 18 | 0 |
Hubei | 17 = 12 PLC + 1 AP + 3 DA CLC and 1 DA county-level forestry area | 17 | No | 17 | 17 | 0 |
Hunan | 14 = 13 PLC + 1 AP | 14 | No | 14 | 14 | 0 |
Inner Mongolia 13 | 12 = 9 PLC + 3 leagues | 11 | No | 11 | 11 | 1 |
Jiangsu | 13 PLC | 13 | No | 13 | 13 | 0 |
Jiangxi 14 | 12 = 11 PLC and Ganjiang New District | 12 | No | 11 | 11 | 0 |
Jilin 15 | 11 = 9 PLC + 2 CLC | 10 | No | 10 | 8 | 1 |
Liaoning 16 | 14 PLC | 13 | No | 13 | 13 | 1 |
Ningxia 17 | 6 = 5 PLC + NECIBAC | 6 | No | 6 | 5 | 0 |
Qinghai 18 | 8 = 2 PLC + 6 AP | 2 | No | 2 | 2 | 6 |
Shaanxi 19 | 12 = 10 PLC + 1 CLC + YAHITZ | 12 | No | 12 | 10 | 0 |
Shandong 20 | 16 PLC | 15 | No | 15 | 15 | 1 |
Shanghai | 16 districts | 16 | Yes (+1) 2 | 16 | 16 | 0 |
Shanxi | 11 PLC | 11 | No | 11 | 11 | 0 |
Sichuan | 21 = 18 PLC + 3 AP | 21 | No | 21 | 21 | 0 |
Tianjin 21 | 16 PLC | 14 | Yes (+1) 2 | 14 | 14 | 2 |
Tibet 22 | 7 = 6 PLC + 1 P | 1 | No | 1 | 1 | 6 |
Xinjiang 23 | 14 = 4 PLC + 5 P + 5 AP | 7 | No | 7 | 7 | 7 |
XPCC 23 | 14 divisions | 6 | No | 0 | 0 | 0 |
Yunnan 24 | 16 = 8 PLC + 8 AP | 14 | No | 14 | 14 | 2 |
Zhejiang | 11 PLC | 11 | No | 11 | 11 | 0 |
AP—autonomous prefectures; CLC—county-level city (cities); DA—directly administered; NECIBAC—Ningdong Energy Chemical Industry Base Administration Committee; P—prefectures; PLC—prefecture-level city (cities). XPCC—Xinjiang Production and Construction Corps; YAHTIZ—Yangling Agriculture Hi-Tech Industrial Zone. Please refer to the Appendix A for footnotes of Table 1.
Table 2Reporting units excluded from statistical analysis.
Provincial-Level Units and Their Data Entries That Were Excluded (Listed in Footnotes) | Number of Entities or Data Entries Excluded 1 | Counted towards the 462 Reporting Units | Included in the 408 Reporting Units Contributed Cases to Maps | Included in the 439 Reporting Units in the Statistical Analysis |
---|---|---|---|---|
Geographical units excluded 1 | ||||
Chongqing 2 | 3 | Yes | Yes | No |
Jiangxi 3 | 1 | Yes | No | No |
Jilin 4 | 2 | Yes | Yes | No |
Ningxia 5 | 1 | Yes | Yes | No |
Shaanxi 6 | 2 | Yes | Yes | No |
Xinjiang 7 | 14 | Yes | No | No |
Non-geographical units excluded | ||||
Beijing 8 | 1 | No | No | No |
Shanghai 9 | 1 | No | No | No |
Tianjin 10 | 1 | No | No | No |
Please refer to the Appendix A for footnotes of Table 2.
Table 3The number of reporting units, their total, median, 2.5 and 97.5 percentile population by the cumulative number of reported confirmed cases as of 24 February 2020, with population size data as per the 2010 China Census. All numbers were rounded to a whole number.
Cumulative Number of Reported Confirmed Cases as of 24 February 2020 | Number of Reporting Units | Total Population | Median Population | Population at 2.5 Percentile | Population at 97.5 Percentile |
---|---|---|---|---|---|
Mainland China | 439 | 1,332,039,983 | 2,462,583 | 230,959 | 9,124,731 |
0 | 39 | 31,848,609 | 525,570 | 90,714 | 2,447,762 |
1–9 | 140 | 238,294,009 | 1,363,741 | 247,335 | 4,341,021 |
10–99 | 216 | 804,313,575 | 3,400,676 | 606,085 | 9,036,573 |
100–999 | 36 | 219,989,569 | 6,038,972 | 1,114,799 | 12,870,158 |
1000–9999 | 7 | 27,808,833 | 4,814,542 | 1,215,701 | 6,091,515 |
10,000+ | 1 | 9,785,388 | 9,785,388 | 9,785,388 | 9,785,388 |
Hubei | 17 | 57,237,727 | 2,873,687 | 424,195 | 8,336,060 |
10–99 | 1 | 76,140 | 76,140 | 76,140 | 76,140 |
100–999 | 8 | 19,567,366 | 2,668,135 | 986,318 | 3,933,888 |
1000–9999 | 7 | 27,808,833 | 4,814,542 | 1,215,701 | 6,091,515 |
10,000+ | 1 | 9,785,388 | 9,785,388 | 9,785,388 | 9,785,388 |
Mainland China except Hubei | 422 | 1,274,802,256 | 2,437,097 | 231,607 | 9,103,275 |
0 | 39 | 31,848,609 | 525,570 | 90,714 | 2,447,762 |
1–9 | 140 | 238,294,009 | 1,363,741 | 247,335 | 4,341,021 |
10–99 | 215 | 804,237,435 | 3,416,196 | 620,985 | 9,039,884 |
100–999 | 28 | 200,422,203 | 7,151,485 | 1,425,193 | 13,139,293 |
Number of reporting units, their total area (square kilometers), and the median, 2.5 and 97.5 percentile population density (number of residents per sq. km) by the cumulative number of reported confirmed cases as of 24 February 2020, with population size data as per the 2010 China Census.
Cumulative Number of Reported Confirmed Cases as of 24 February 2020 | Number of Reporting Units | Total Area (Square Kilometers) | Median Population Density (Number of Residents Per sq. km) | Population Density at 2.5 Percentile (Number of Residents Per sq. km) | Population Density at 97.5 Percentile (Number of Residents Per sq. km) |
---|---|---|---|---|---|
Mainland China | 439 | 9,562,140.57 | 301.31 | 5.03 | 22,057.47 |
0 | 39 | 3,183,409.11 | 28.92 | 0.84 | 514.02 |
1–9 | 140 | 3,154,046.56 | 175.98 | 9.61 | 25,006.65 |
10–99 | 216 | 2,628,555.06 | 466.80 | 59.38 | 22,887.07 |
100–999 | 36 | 511,626.98 | 475.62 | 135.55 | 2143.41 |
1000–9999 | 7 | 75,953.77 | 404.57 | 232.98 | 640.30 |
10,000+ | 1 | 8549.09 | 1144.61 | - | - |
Hubei | 17 | 185,824.93 | 353.20 | 68.74 | 950.00 |
10–99 | 1 | 3253 | 23.41 | - | - |
100–999 | 8 | 98,069.07 | 244.13 | 137.51 | 529.09 |
1000–9999 | 7 | 75,953.77 | 404.57 | 232.98 | 640.30 |
10,000+ | 1 | 8549.09 | 1144.61 | - | - |
Mainland China except Hubei | 422 | 9,376,315.64 | 300.38 | 4.64 | 22,705.88 |
0 | 39 | 3,183,409.11 | 28.92 | 0.84 | 514.02 |
1–9 | 140 | 3,154,046.56 | 175.98 | 9.61 | 25,006.65 |
10–99 | 215 | 2,625,302.06 | 468.23 | 61.88 | 22,972.87 |
100–999 | 28 | 413,557.91 | 550.51 | 175.46 | 2839.21 |
Epidemic doubling time among 384 reporting units, excluding Wuhan (n = 1) and units that reported a cumulative case count of zero (n = 39) or one case (n = 15) as of 24 February 2020.
Arithmetic Mean of the Epidemic Doubling Time (Days) | Reporting Units (n) | % of all Geographical Reporting Units (N = 439) 1 |
---|---|---|
0 < x < 1 | 13 | 3.0 |
1 ≤ x < 2 | 63 | 14.4 |
2 ≤ x < 3 | 116 | 26.4 |
3 ≤ x < 4 | 95 | 21.6 |
4 ≤ x < 5 | 44 | 10.0 |
5 ≤ x < 6 | 22 | 5.0 |
6 ≤ x < 7 | 13 | 3.0 |
7 ≤ x < 20 | 18 | 4.1 |
Number of times the cumulative case count doubled | ||
1 | 33 | 7.5 |
2 | 64 | 14.6 |
3 | 84 | 19.1 |
4 | 75 | 17.1 |
5 | 67 | 15.3 |
6 | 25 | 5.7 |
7 | 17 | 3.9 |
8 | 7 | 1.6 |
9 | 6 | 1.4 |
10 | 4 | 0.9 |
11 | 2 | 0.5 |
1 55 (12.5%) of 439 reporting units were excluded.
Table 6Coefficient estimates (95% confidence intervals) of two regression models: Model A: between the arithmetic mean of the doubling times and population size, and Model B: between the arithmetic mean of the doubling times and population density. These models were applied to data from 271 prefectures after excluding prefectures from Hubei and prefectures where the epidemic had doubled ≤2 times by 24 February 2020. Data were fitted with ordinary least squares linear regression.
Dependent Variable: Arithmetic Mean of the Doubling Times | Coefficient (95% CI) |
---|---|
Independent Variables | |
Model A (Adjusted R2 = 0.057) | |
Population size | −0.012 (−0.017, −0.006) * |
The date of the first reported case | −0.057 (−0.124, 0.010) |
Model B (Adjusted R2 <0.001) | |
Population density | −1.65 × 10−5 (−4.32 × 10−5, 1.01 × 10−5) |
The date of the first reported case | −1.42 × 10−2 (−8.13 × 10−2, 5.29 × 10−2) |
* p < 0.001.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
To describe the geographical heterogeneity of COVID-19 across prefectures in mainland China, we estimated doubling times from daily time series of the cumulative case count between 24 January and 24 February 2020. We analyzed the prefecture-level COVID-19 case burden using linear regression models and used the local Moran’s I to test for spatial autocorrelation and clustering. Four hundred prefectures (~98% population) had at least one COVID-19 case and 39 prefectures had zero cases by 24 February 2020. Excluding Wuhan and those prefectures where there was only one case or none, 76 (17.3% of 439) prefectures had an arithmetic mean of the epidemic doubling time <2 d. Low-population prefectures had a higher per capita cumulative incidence than high-population prefectures during the study period. An increase in population size was associated with a very small reduction in the mean doubling time (−0.012, 95% CI, −0.017, −0.006) where the cumulative case count doubled ≥3 times. Spatial analysis revealed high case count clusters in Hubei and Heilongjiang and fast epidemic growth in several metropolitan areas by mid-February 2020. Prefectures in Hubei and neighboring provinces and several metropolitan areas in coastal and northeastern China experienced rapid growth with cumulative case count doubling multiple times with a small mean doubling time.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Department of Biostatistics, Epidemiology and Environmental Health Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, GA 30460, USA;
2 Department of Geography, Texas Christian University, Fort Worth, TX 76109, USA;
3 Department of Psychology and Criminal Justice, School of Education & Behavioral Sciences, Middle Georgia State University, Macon, GA 31206, USA;
4 Independent Researcher, Hong Kong Special Administrative Region, China;
5 Department of Biostatistics, Boston University, Boston, MA 02215, USA;
6 Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA 30302, USA