1. Introduction
The globe has entered the digital era as a result of the advancement of Information and Communication Technologies (ICT) technologies. A resilient world is defined by the digital economy’s constant performance, and active digital activities strengthen our society’s virtualization (Alzhanova et al., 2020; Sepehrdoust & Khodaee, 2013). As a result, a more constructive understanding of the existing state of the digital economy aids us in establishing a global information world (Zhang et al., 2021). Typically, the evolution of the digital sector has three characteristics: variety, diversity, and dynamism, all of which were thought to contribute to the study of the global economic and political system’s complexity (Alibekova et al., 2020; Dwivedi et al., 2021; Mensah, 2019). Simultaneously, widespread employment and accessible access boost subversive and all-around innovation (Alibekova et al., 2020; Jiao & Sun, 2021).
However, there is still a large disparity in China’s digital economic levels across the country. As a result, the goal of this research is to examine China’s digital economy’s spatial distribution and determine the root cause of the country’s digital divide. According to studies, digital economic development in China’s east and west regions has imbalanced characteristics. The digital economy has a direct impact on regional disparities. The digital economy is imbalanced, and China has launched a number of initiatives to close this regional gap and encourage regional coordinated development (Tang & Lu, 2021).
Some scholars have studied the spatial distribution of digital economic development. For instance, Zhong and Mao(2020) found that the level of the digital economy in the Yangtze River is low, and there is a hunch-like shape in spatial distribution. Binyan et al. (2018) showed that the level of digital economic development in China’s Northeast regionlags behindthe national average, and the internal polarization is obvious. The differences in digital economic development in the Beijing-Tianjin-Hebei urban group are significantly higher than that of other urban groups, and urban group differences are the main source of their development. Moreover, China’s digital economic development is distinguishedonaprovincial scale, presenting the characteristics of the east to west gradient, but the Sichuan and Yu become a new pole in innovation.
Existing studies have offered a good reference for this article, however, it is worth investigating further in the following three areas: First, there is literature on the development of China’s digital economic level, which is based on provincial level statistics, although digital economic processes are more integrated into cities. As a result, the digital economic level’s spatial distribution features will be more logical when compared to the urban level.
Second, most of the literature is studied in terms of the characteristics of digital economic level development in the research material, but there is very little research on the formation mechanism or influencing factors of digital economic level distribution. Third, many simple OLS regression methods have been used to analyze the influencing factors of human capital geographical distribution in the research method, but this method ignores the spatial heterogeneity and spatial dependency of the study object.
Based on this, the current study intends to reveal its spatial characteristics utilizing space-related analysis, GIS space expression, and a geographically weighted regression model to clarify China’s urban human capital spatial distribution characteristics (GWR). The driving element serves as a benchmark for China’s regional economic balance growth.
2. Methodology and Data
2.1. Research Method
2.1.1. Spatial Self-Correlation Analysis
Space self-correlation describes the overall spatial relationship of all units in the study space, usually using the Moran’s I index to analyze some phenomena of adjacent areas (spatial positive correlation) or different (space negative), or Independence (random distribution), its calculation formula is:
\(I=n \sum_{i=1}^{n} \sum_{i=1}^{n} w_{i j}\left(x_{i}-\bar{x}\right)\left(x_{j}-\bar{x}\right) /\left(\sum_{i=1}^{n} \sum_{i=1}^{n} w_{i j}\right) \sum_{i=1}^{n}\left(x_{i}-\bar{x}\right)^{2}\) (1)
Where: Xi represents the observation value of the region i; N is the number of regions; Wij is a spatial weight matrix to define the mutual adjacency of spatial objects. There are two main types of spatial weighting, one is based on the principle of neighboring relationships, and the other is based on the principle of distance. The unit studied in this paper is the prefecture-level city. Because of the prefecture level city’s adjacency, there is no adjacency unit in more cities due to a lack of data, making it unsuitable for selection based on neighboring relationships. Therefore, we should choose the principle based on distance principle Spatial weight settings and ensure that each unit has at least one neighbour 1. Generally, adjacent standards are adjacent to 1 in two regions, otherwise 0.
2.1.2. Getis-Ord General G
Getis-Ord The degree of local spatial aggregation of all units within the study space is described by general G analysis. Typically, the general G value and Z score value are used to analyze the spatial feature distribution, which is prone to high polymerization or cluster distribution, and the calculation formulas are as follows:
\(G=\frac{\sum_{i=1}^{n} \sum_{j=1}^{n} w_{i, j} x_{i} x_{j}}{\sum_{i=1}^{n} \sum_{j=1}^{n} x_{i} x_{j}}, \forall j \neq 1\) (2)
Where: Xi represents the observation value of the region i; N is the number of regions; Wij is the spatial weight matrix, used to define the mutual adjacency of the spatial object.
\(\mathrm{ZG}=\frac{G-E[G]}{\overline{)V[G]}}\) (3)
Among them, the calculation formula of A and B is as follows:
\(\begin{aligned} &E[G]=\frac{\sum_{i=1}^{n} \sum_{j=1}^{n} w_{i, j}}{n(n-1)}, \forall j \neq 1 \\ &V[G]=E\left[G^{2}\right]-E[G]^{2} \end{aligned}\) (4)
2.1.3. Geographical Weighted Regression (GWR) Analysis
\(y_{i}=\beta_{0}\left(\mu_{i}, v_{i}\right)+\sum_{j=1}^{k} \beta_{j}\left(\mu_{i}, v_{i}\right) x_{i j}+\varepsilon_{i}\) (5)
Where: yi is an interpreted variable; β0 is a constant term, which performs a local regression estimate by using sub-sample data information of adjacent observations; (μi, vi) is a geographic location coordinate of observed sample i; εi is a random error of region i; The subscript j is a K + 1 function of the geographic location (μi, vi); βj(ui, vi) is the elastic coefficient of the region i. Estimation of any regional feature elasticity in the study adopts the weighted least squares method, and the estimated value is as follows:
\(\hat{\beta}\left(u_{i}, v_{i}\right)=\left[X^{T} W\left(u_{i}, v_{i}\right) X^{-1} X^{T} W\left(u_{i}, v_{i}\right) Y\right]\) (6)
Where: X is explained the variable matrix; W (μi, vi) is the spatial weight matrix, which is composed of the monotonous delivery function of the geographic distance between the regression zone and its neighboring region. And the GUAs function usually represented W (μi, vi):
\(W\left(\mu_{i}, v_{i}\right)=\exp \left[-\left(-d_{i j} / b\right)^{2}\right]\) (7)
Where: dij is the distance between the zone i and the area j; b is the optimal bandwidth, determined by the Minimum Information Guidelines (AIC). The advantage of the GWR model is that the advantage of the GWR model is: (1) GWR can solve the space self-related problem while the OLS model cannot solve; (2) Each sample spatial unit in GWR models provides a coefficient estimate, allowing the model outcome to better represent local conditions. (3) GWR can reveal the spatial differences and how these work.
2.2. Variable Settings and Data Sources
Because the digital economy is essentially a fusion of the old economy and Internet technology, it is influenced by a wide range of elements. Economic development, industrial structure, human capital level, innovation capacity, information infrastructure, and information utilization, according to (Zhang et al., 2020; Zhong & Mao, 2020), may have a major impact on the regional difference in digital economic development. The following explanations will clarify how these parameters work and how we measure them in our study:
(1) Economic development condition: The digital economy is inextricably linked to its economic foundation as a result of the traditional economy and data resources. Strong economic growth can have a direct impact on the development of digital infrastructure. The share of area GDP in national GDP is used in this study to indicate local economic conditions (Gaziz et al., 2020).
(2) Industrial structure: The digital economy is intertwined with the first, second, and third industries, with the third being particularly important due to retail e-commerce. To measure the industrial structure, we use the gravity of the third industry accounts for local GDP (Zimmermann & Koerner, 1999).
(3) Human capital level: The competitiveness of individuals is at the core of the city’s competitiveness. Regions with high human capital can promote information communication and attract greater labor inflows due to positive externalities. As a result, this study follows Arauzo-practice Carod’s of using the proportion of students at a school accounting for the total number of ordinary colleges as a proxy variable for human capital. (Arauzo-Carod, 2021).
(4) Technological innovation capabilities: Because regional digital economic development is unaffected by science and technology support, this study uses technical patent authorization as a proxy variable (Hanna, 2018).
(5) Information infrastructure level: The digital economy benefits from a high level of information infrastructure. The intensity of regional information infrastructure is reflected in the level of information infrastructure, which demonstrates its future development potential (Williams, 2021). As a result, this study uses information industry employees and telecom income to represent this variable.
(6) Information utilisation level. Digital information has the potential to significantly increase the popularity of digital technology, which, in turn, will fuel the growth of the digital economy. As a result, we have used mobile phone access and Internet access as metric instruments. So, how do we track the progress of the digital economy? The use of a standard to construct this indicator is a common method (Nenkina, 2020), but it is not objective enough. Data from the Tencent Institute’s China Internet Digital Economic Index from 2016 to 2019 and the China City Statistical Yearbook 2019 are used in this analysis. Table 1 shows descriptive statistics for model variables.
Table 1: Descriptive Statistical Analysis of Variables
2.3. Analysis of Spatial Characteristics of Digital Economy Level in the Prefecture-Level City
2.3.1. Digital Economic Agglomeration is Further Enhanced
Furthermore, 13 cities from the eastern region, three from the middle, four from the western section, and none from the northeast make up the top 20 cities with the highest digital economy levels. While the east region only accounts for one of the twenty cities with the lowest digital economy level, In 2019, the level of digital economic development in China’s locations is depicted in Figure 1. The digital economy is exhibiting a clear spatial evolution. First, the degree of digital economic aggregation in the east has increased, but considerable differences exist across the three major regions, namely the east, the middle, and the western regions. The Bohai area, the Yangtze River Delta, and the Pearl River Delta are the most visible agglomerations.
Figure 1: Digital Economy Index of China’s Prefecture-Level Cities
2.3.2. Digital Economic Development Levels in The Region Are Significant
Taking z digital economy in 2019 as an example, Beijing has the highest level of digital economic development (35.7336), 127 times the lowest level of digital economic development (0.2816) in Jinchang. Beijing, Shanghai, Guangzhou, Shenzhen, Dongguan, Chengdu, Chongqing), and 15 cities with digital economic development below 0.2 (Ali Zhi, Changdu, GuLuzhou, Haibei, Hainan) State, Kizilu Kirgiz, Linzhi, Naqu Region, Qianjiang City, Shigan District, Shannan, Shennongjia District, Tianmen City, Yushuzhou).
Furthermore, 13 cities from the eastern region, three from the middle, four from the western section, and none from the northeast make up the top 20 cities with the highest digital economy levels. While the east area only accounts for 1, 5 cities from the central region, six cities from the Northeast, and eight cities from the western region among the 20 cities with the lowest digital economy level. In this regard, China’s digital economy has increased significantly in the eastern area, with key cities such as Beijing, Shanghai, Guangzhou, and Shenzhen leading the way. In each city, there are still significant differences in digital economic levels.
2.3.3. Digital Economic Development Presents a Significant Space Positive
According to the neighbor relationship among cities, the binary adjacency matrix is established. We calculate Moran’s I and general values based on the matrix using ArcGIS10.5 software. The 2019 digital economy’s Moran index and Z score are 0.1231 and 4.5950, respectively. And the p-value is 0.0000, indicating that the test was passed at a 1% significance level. At the same time, the digital economy’s G value and Z score for 2019 are 0.000001 and 4.8430, respectively. The P-value is 0.0000, which means the test passes at a 1% significance level.
2.4. Factors Affecting the Level of Digital Economic Development
2.4.1. OLS Model and Drive Factor Selection
The variables in the ArcGIS platform were treated with the step-by-step regression model, and those with a VIF value larger than 5 were excluded. Human capital, information staff, telecommunications income, and Internet access were the last four interpretation variables. (See Table 2) The regression model R2 = 0.9269 now has a better fitting performance.
Table 2: OLS Regression with All Explanatory Variables
The VIF values of telephone access and innovation in Table 2 are the highest; we removed these two indicators first, then ran OLS regression again, based on the concept of caution. This time, all explanatory variable VIF values are less than 5 (Table 3).
Table 3: OLS Regression After Deleting the Two Explanatory Variables with the Largest Expansion Factor
The OLS regression result of industrial structure and per capita GDP is not significant at a 5% significance level (Table 3). As a result, we removed these two markers and ran the OLS regression again. Table 4 displays the results. Table 4 shows that these four variables, such as human capital, information staff, telecom income, and Internet access, meet the GWR conditions well, irrespective of their value or robust p-value. As a result, these four factors are applied to the GWR return.
Table 4: OLS Regression after Deleting Explanatory Variables with Insignificant Coefficients
2.4.2. Drive Factor GWR Model Regression
In ArcGIS10.5 software, the adaptive core function is used to achieve a minimum AICC bandwidth. Table 5 shows the GWR regression’s estimation results. The GWR model outperforms the OLS model by a large margin, and the GWR model’s AICC (−1201.9702) is lower than the OLS model’s AICC (−1147.5271). The modified R2 value also changed from 0.9269 to 0.9420. As a result, we feel the GWR model’s interpretation capability has been improved. Each spatial unit in the GWR model has its own coefficient. Table 6 presentsa descriptive statistical analysis of the regression coefficient of the GWR model. The results reveal that partial variable regression coefficients are unstable. Because the maximum and minimum levels of information staff and Internet access point in the same direction, we assume that having more information staff or having higher Internet access in China indicates having a greater level of the digital economy. In comparison, the maximum and minimum levels of human capital and telecom income do not have the same direction, indicating that higher levels of the digital economy in China are not always associated with higher levels of human capital and telecom income. However, positive relationships exist in most cities.
Table 5: Parameter Estimation and Test Results of GWR Model
Table 6: Descriptive Statistical Analysis of Regression Coefficient of GWR Model
2.5. Analysis of Space Mode of Drive Factor
(1) The role pattern of human capital: As shown in Figure 2a, the regression coefficient of human capital decreases from west to east. Southwest has a high value, while most of the Northeast has a low-value regression coefficient. This conclusion shows that whereas human capital has a high marginal utility in the west, and a low marginal utility in the eastern region, particularly in the northeast.
(2) The role pattern of information staff: Figure 2b shows that in the northern regions, such as Beijing, Tianjin, Hebei, Shanxi, and others, the information staff regression coefficient has a high value. While for southwest information staff regression coefficient has a low value. The findings suggest that robust information infrastructure can help the north improve its digital economy development.
(3) The role of telecom income: Figure 2c shows that the regression coefficient of telecom income is the lowest in the western area, indicating that telecom income hasn’t been an essential engine for the development of the local digital economy, whereas it has become an impetus to encourage local development in the other regions.
(4) The role of Internet access: Figure 2d demonstrates that, despite being in the southwest, the regression coefficient is high. This conclusion shows that throughout most of China, the marginal utility of internet connectivity is very high. As a result, by popularizing the Internet, the government may help to develop a digital economy.
Figure 2: Spatial Distribution of the Regression Coefficients
3. Conclusion and Policy Recommendations
This study explores the spatial characteristics and the driving factors of digital economy development, which provide a reference for perfecting China’s digital industrial distribution and narrowing digital gaps. We used Moran’s I analysis to see if China’s digital economy had spatial self-correlation. The Getis-Ord General G test was used to determine the clustering type of China’s digital economy. We used OLS and GWR approaches sequentially to determine the driving variables of China’s digital economy level.
On the one hand, the data show that the digital economy is rapidly expanding throughout China; on the other hand, there is a significant regional gap in the digital economy level in China, and digital economy agglomeration is increasing over time. Furthermore, the findings reveal that human capital, information staff, telegram income, and Internet connection are the most critical factors in accelerating the development of the digital economy. They fulfill different roles in different locations at the same time. As a result, governments in the West must continue to improve the level of human capital and technology innovation to close the digital economy deficit. In the northern part of the country, the government may boost the digital economy by encouraging more individuals to work in the information sector. At the same time, in the east, increasing telecom income and popularizing the Internet would be more helpful.
References
- Alibekova, G., Medeni, T., Panzabekova, A., & Mussayeva, D. (2020). Digital transformation enablers and barriers in the economy of Kazakhstan. Journal of Asian Finance, Economics, and Business, 7(7), 565-575. https://doi.org/10.13106/jafeb.2020.vol7.no7.565
- Alzhanova, F. G., Kireyeva, A. A., Satpayeva, Z. T., Tsoy, A. A., & Nurbatsin, A. (2020). Analysis of the level of technological development and digital readiness of scientific-research institutes. Journal of Asian Finance, Economics, and Business, 7(12), 1133-1147. https://doi.org/10.13106/jafeb.2020.vol7.no12.1133
- Arauzo-Carod, J. M. (2021). Location determinants of high-tech firms: An intra-urban approach. Industry and Innovation, 54(11), 1-24. https://doi.org/10.1080/13662716.2021.1929868
- Binyan, W., Junfeng, T., Lisha, C., Feilong, H., Han, H., & Shijun, W. (2018). Spatial differentiation of digital economy and its influencing factors in China. Scientia Geographica Sinica, 38(6), 859-868. https://doi.org/10.13249/j.cnki.sgs.2018.06.004
- Dwivedi, Y. K., Ismagilova, E., Hughes, D. L., Carlson, J., Filieri, R., Jacobson, J., Jain, V., Karjaluoto, H., Kefi, H., Krishen, A. S., Kumar, V., Rahman, M. M., Raman, R., Rauschnabel, P. A., Rowley, J., Salo, J., Tran, G. A., & Wang, Y. (2021). Setting the future of digital and social media marketing research: Perspectives and research propositions. International Journal of Information Management, 59(6), 102168. https://doi.org/10.1016/j.ijinfomgt.2020.102168
- Gaziz, S., Oteshova, A., Prodanova, N., Savina, N., & Bokov, D. O. (2020). Digital economy and its role in the process of economic development.Journal of Security and Sustainability Issues, 9(4), 1225-1235. https://doi.org/10.9770/jssi.2020.9.4(9)
- Hanna, N. (2018). A role for the state in the digital age. Journal of Innovation and Entrepreneurship, 7(1), 86-93. https://doi.org/10.1186/s13731-018-0086-3
- Jiao, S., & Sun, Q. (2021). Digital economic development and its impact on economic growth in china: Research based on the perspective of sustainability. Sustainability (Switzerland), 13(18), 245-256. https://doi.org/10.3390/su131810245
- Mensah, J. (2019). Sustainable development: Meaning, history, principles, pillars, and implications for human action: A literature review. Cogent Social Sciences, 5(1). https://doi.org/10.1080/23311886.2019.1653531
- Nenkina, O. (2020). Assessment of readiness of accounting and statistical systems for digital economy transactions. Advances in Social Science, Education and Humanities Research, 392(2019), 466-469. https://doi.org/10.2991/assehr.k.200113.097
- Sepehrdoust, H., & Khodaee, H. (2013). The impact of information and communication technology on employment of selected OIC countries. African Journal of Business Management, 7(39), 4149-4154. https://doi.org/10.5897/AJBM11.3058
- Tang, L., & Lu, B. (2021). Spatial correlation network and regional differences for the development of digital economy in China. Entropy, 23(1575), 1-16. https://doi.org/10.20944/entre202106.0359.v1
- Williams, L. D. (2021). Concepts of the digital economy and industry 4.0 in intelligent and information systems.International Journal of Intelligent Networks, 2(9), 122-129. https://doi.org/10.1016/j.ijin.2021.09.002
- Zhang, J., He, X., & Yuan, X. D. (2020). Research on the relationship between Urban economic development level and urban spatial structure-A case study of two Chinese cities. PLoS ONE, 15(7), 1-14. https://doi.org/10.1371/journal.pone.0235858
- Zhang, W., Zhao, S., Wan, X., & Yao, Y. (2021). Study on the effect of the digital economy on high-quality economic development in China. PLoS ONE, 16(9), 1-27. https://doi.org/10.1371/journal.pone.0257365
- Zhong, Y., & Mao, W. (2020). Spatial differentiation of digital economy and its influencing factors in the Yangtze River Economic Belt. Journal of Chongqing University, 26(1), 19-30. https://doi.org/10.11835/j.issn.1008-5831.jg.2019.05.002
- Zimmermann, H. D., & Koerner, E. I. S. (1999). Emerging industrial structures in the digital economy: The case of the financial industry.Aisel Aisnet Organization, 1(11), 39. http://aisel.aisnet.org/amcis1999/39