The research on propagation modeling and governance strategies of online rumors based on behavior–attitude

Hailiang Chen (National University of Defense Technology, Changsha, China)
Chuan Ai (National University of Defense Technology, Changsha, China)
Bin Chen (National University of Defense Technology, Changsha, China)
Yong Zhao (National University of Defense Technology, Changsha, China)
Kaisheng Lai (Jinan University, Guangzhou, China)
Lingnan He (Sun Yat-Sen University, Guangzhou, China)
Zhihan Liu (Central South University, Changsha, China)

Internet Research

ISSN: 1066-2243

Article publication date: 13 July 2021

Issue publication date: 15 March 2022

1354

Abstract

Purpose

The purpose of this paper is to achieve effective governance of online rumors through the proposed rumor propagation model and immunization strategy.

Design/methodology/approach

The paper leverages the agent-based modeling (ABM) method to model individuals from two aspects, behavior and attitude. Based on the analysis and research of online data, we propose a rumor propagation model, namely the Untouched view transmit removed-Susceptible hesitate agree disagree (Unite-Shad), and devise an immunization strategy, namely the Gravity Immunization Strategy (GIS). A graph-based framework, namely Pregel, is used to carry out the rumor propagation simulation experiments. Through the experiments, the rationality of the Unite-Shad and the effectiveness of the GIS are verified.

Findings

The study discovers that the inconsistency between human behaviors and attitudes in rumor propagation can be explained by the Unite-shad model. Besides, the GIS, which shows better performance in small-world networks than in scale-free networks, can effectively suppress rumor propagation in the early stage.

Research limitations/implications

This paper provides an effective immunization strategy for rumor governance. Specifically, the Unite-Shad model reveals the mechanism of rumor propagation, and the GIS provides an effective governance method for selecting immune nodes.

Originality/value

The inconsistency of human behaviors and attitudes in real scenes is modeled in the Unite-Shad model. Combined with the model, the definition of diffusion domain is proposed and a novel immunization strategy, namely GIS, is designed, which is significant for the social governance of rumor propagation.

Keywords

Citation

Chen, H., Ai, C., Chen, B., Zhao, Y., Lai, K., He, L. and Liu, Z. (2022), "The research on propagation modeling and governance strategies of online rumors based on behavior–attitude", Internet Research, Vol. 32 No. 2, pp. 620-639. https://doi.org/10.1108/INTR-08-2020-0480

Publisher

:

Emerald Publishing Limited

Copyright © 2021, Hailiang Chen, Chuan Ai, Bin Chen, Yong Zhao, Kaisheng Lai, Lingnan He and Zhihan Liu

License

Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode


1. Introduction

With the development of information technology and social software, information propagation on the Internet presents new characteristics, such as anonymity, fragmentation and timeliness. These characteristics bring about unprecedented changes in the propagation and influence of information (Budak et al., 2011). Consequently, advanced information technology significantly enhances the information exchange in the social network, but it also promotes the propagation of rumors.

With the rapid propagation through online social media, rumors pose a great risk to social stability, and the influences of rumors are hard to eliminate. Compared with offline rumors, online rumors typically exert more persistent and latent influence. For example, during the outbreak of Corona Virus Disease 2019 (COVID-19), various rumors about the epidemic propagated quickly and widely on online social platforms such as WeChat and Twitter. These rumors reinforced public panic and brought serious consequences to society. More importantly, they also brought great hidden dangers and resistance to epidemic prevention. For example, misleading by rumors, some people drank methanol to treat COVID-19, which resulted in a considerable number of deaths (Cooney, 2020). Therefore, effective rumor governance is indispensable in social development.

Modeling rumor propagation is one of the vital issues to realize online rumor governance. Rumor is a special kind of information, which shares many similarities with infectious diseases in terms of propagation. Therefore, in the early research studies on rumor propagation, various infectious disease models such as the Susceptible Infected Removed (SIR) model and Susceptible Infected Susceptible (SIS) model were employed to model rumor propagation (Zanette, 2001; Leskovec et al., 2007). Later, the unique features of rumor propagation were gradually realized. As a result, many extended rumor propagation models were proposed, such as the model with the forgetting mechanism (Nekovee et al., 2007). Up to now, people's attitudes toward rumors have been modeled in the rumor propagation model. For instance, Ma et al. (2018) classified people according to their attitudes, such as susceptible, infected and immune. In other research studies, the behaviors of people were modeled, such as forwarding, viewing (Liu et al., 2017a). Furthermore, some researchers mixed the attitude and behavior. Moreno et al. (2004) split the population into Ignorant, Spreader and Stifler (ISS). Sometimes, the division may fail to express the actual situation accurately. For example, people who oppose the rumors may also forward the rumors, and those who support the rumors may not forward them. As for one rumor, some people may never forward it, but they can receive it many times. Their attitudes toward this rumor may change because of the multiple receptions. Hence, it is with important practical significance to model the behavior and attitude separately. The above propagation mechanisms are incapable of modeling the mechanism of rumor propagation perfectly.

Therefore, we propose a rumor propagation model, namely the Untouched view transmit removed-Susceptible hesitate agree disagree (Unite-Shad) model, which models the process of rumor propagation based on the agent-based modeling (ABM) approach (Bonabeau, 2002). Specifically, the behavior model of information propagation, called Susceptible View Forward Removed (SVFR) model (Liu et al., 2017a), and the classical attitude model, called Susceptible Exposed Infectious Recovered (SEIR) model (Liu et al., 2017b), are used to model both the behaviors and attitudes of people respectively. In the proposed Unite-Shad model, the interactions between behaviors and attitudes are considered in the process of rumor propagation, which shows important implications in studying the rumor propagation model.

ABM is a powerful simulation modeling technique, which can be used to model a system as a collection of autonomous decision-making entities called agents. In the process of rumor propagation, individuals are modeled as heterogeneous, autonomous and interactive agents. Therefore, agents often exhibit different attitudes toward the rumors and they can decide to forward the rumors or not (Qiu et al., 2017). Additionally, the system can be used to predict the propagation of rumors, which is practically significant in rumor governance. Therefore, this paper uses the ABM approach to model individuals from two aspects, behavior and attitude.

Combined with the rumor propagation model, the effects of immunization strategies are elaborately studied through numerical experiments. It can further guide the design and implementation of rumor immunization strategies. Current research studies on rumor immunization strategy are based on the static network topology (Wang et al., 2014, 2016; Cohen et al., 2000; Liu et al., 2016). These strategies ignore the structural features of the sub-networks in which rumors have not yet propagated. Moreover, due to the retardance of surveillance, governance is usually launched after the spread of rumors. Therefore, in order to study the control of rumor propagation in real scenes, the distance centrality is defined as the shortest distance from one node to the diffusion domain. The diffusion domain is a subgraph composed of all the nodes that have received the rumors and the edges between these nodes. Based on the rumor propagation model proposed in this paper, considering the degree centrality and distance centrality of nodes, an immunization strategy to suppress the propagation of rumors, namely the Gravity Immunization Strategy (GIS), is proposed. Through the comparisons with the classic immunization strategies, the advantages of GIS are given and discussed.

The contributions of our study are threefold:

  1. We establish a new rumor propagation model, called Unite-Shad, which involves the inconsistency between human behaviors and attitudes.

  2. We propose an immunization strategy, named GIS, which considers the propagation feature of rumor propagation.

  3. We conduct a preliminary study on the implementation time of the immunization strategies and the number of immune nodes. Such studies can be used to improve the effectiveness of rumor governance in the real world.

This paper is organized as follows: In Section 2, we introduce the related works of the rumor propagation models and immunization strategies. In Section 3, the questionnaire data and WeChat data are introduced. In Section 4, we introduce the Unite-Shad model and give the rationality verification of the model. In Section 5, we introduce GIS and conduct comparative experiments. In Section 6, the paper is concluded.

2. Related research

2.1 Rumor propagation model

Rumors can be viewed as an infection of the mind (Musa and Fori, 2019) and a lot of research studies on rumor propagation are based on infectious disease models. Generally, rumor propagation models can be roughly divided into three categories, mathematical models, complex network-based models and features-based models.

The early research studies on rumor propagation were developed based on mathematical models. Daley and Kendall (1965) studied the similarity between the propagation of rumors and viruses from the perspective of mathematical models, and they proposed the first classic mathematical model, called the DK model, to formulate the propagation of rumors. After that, the MT model was proposed based on the DK model (Maki and Thompson, 1973). Musa and Fori (2019) explored and analyzed the equilibrium point of rumor propagation by establishing a certain mathematical model. Bhih et al. (2020) established a mathematical model based on the cholera model and analyzed the control of the propagation of rumors.

The research studies of the rumor propagation model based on complex networks started from the proposal of the small-world network (Watts and Strogatz, 1998) and the scale-free network (Barabási and Albert, 1999). Zanette (2001) leveraged the complex network theory to study the interactions of different types of people in rumor propagation and employed the SIR model in the research studies of rumor propagation. Moreno et al. (2004) built the ISS model for rumor propagation on the scale-free network, which divides individuals in the network into three categories, Ignorant, Spreader and Stifler. The population classifications differentiate rumor propagation models from the infectious disease model. The ISS model has become the baseline model for the research studies on rumor propagation and these three classifications are recognized as the standard of the rumor propagation model. Based on the ISS model, Nekovee et al. (2007) adopted the forgetting mechanism to enhance the credibility of the model. Piqueira (2010) studied the ISS model and used the conceptions in the dynamic system theory to establish the balance point of the rumor propagation process.

There also exist many pieces of research studies based on the features of rumor propagation. Zhao et al. (2012) introduced the constant forgetting rate to construct the Susceptible Infected Hibernator Removed (SIHR) model, and it was found that forgetting could reduce the influence of rumors. Wan et al. (2016) considered the counter mechanism on complex social networks and they proposed a new model, called the Susceptible Infected Counter Susceptible (SICS) model. The spreading dynamics of the rumor were studied elaborately by using the mean-field theory. Yang et al. (2017) considered that rumor believers would enter the “isolated state” due to group pressure and proposed the Uncertain Rumor Quarantine Truth (URQT) model correspondingly. To analyze the probability of independent spreaders in the rumor propagation network, Ma et al. (2018) considered the independent rumor spreader in the SIR rumor propagation model.

The above models classify the population into susceptible, infected, recovered and other types according to one aspect. While, in the actual situation, people's behaviors and attitudes play significant roles in rumor propagation. Specifically, people's attitudes may affect their behaviors (Li et al., 2018a). In the meanwhile, their attitudes can be affected by the behaviors of their friends. It means that the behaviors and attitudes of individuals are not always consistent. For example, according to the conformity phenomenon (Liu, 2016), an individual tends to maintain accordance with the majority under direction or pressure, indicating that he or she may forward the rumor whether he or she agrees with it or not. Therefore, it is necessary to model the propagation of rumors with the consideration of the inconsistency between human behaviors and attitudes. As a result, the Unite-Shad model is proposed in this paper. The comparisons of the different rumor propagation models are concluded in Table 1.

2.2 Immunization strategy

Immune nodes refer to the nodes that do not forward the rumors. By selecting some critical nodes as immune nodes, immunization strategies can suppress the propagation of rumors and realize the rumor governance (Wang et al., 2014). Selecting immune nodes is usually implemented through the prior information of the nodes. However, as for Random Strategy (RS) (Cohen et al., 2000), the immune node is selected without considering any information. As a result, it usually obtains bad performance. According to the given information of the nodes, the immunization strategy can be classified as local information immunization strategy and global information immunization strategy.

Acquaintance Strategy (AS) (Cohen et al., 2003; Holme, 2004; Wang et al., 2016) is one of the local information immunization strategies, in which the immune nodes are selected randomly in the initial stage and then the neighboring nodes of existent immune nodes with the highest degree become the new immune nodes iteratively. With the local information, this strategy can obtain better performance than the RS. Yuan and Tang (2015) proposed the Community-based Immunization Strategy (CIS) by tracking the evolution of community structure and they demonstrated the feasibility of the CIS. This strategy evaluates the local importance of nodes in communities. The immunization strategy in reference (Liu et al., 2016) initializes the scores of nodes with the degrees, and then the scores are recalculated based on local information of the nodes. According to the different ways of scores change, several strategies were proposed, among which the Known Local Score (KLS) was proved to be the best one.

Target immunization strategy (Pastorsatorras and Vespignani, 2002) is one of the global information immunization strategies, which immunizes the top k nodes with the highest priority according to importance scores. Generally, the importance of nodes is evaluated by degree centrality or betweenness centrality. The strategy of immune nodes with the Highest Degree (HD) can effectively reduce the network density, which is an important factor in the growth of rumor propagation. The strategy of immune nodes with the Highest Betweenness (HB) achieves the purpose of delaying the propagation of the rumor by cutting off the critical propagation path. HB and HD could be improved by adaptive strategies, such as Highest Degree Adaptive (HDA) and Highest Betweenness Adaptive (HBA) (Wang et al., 2016; Gallos et al., 2007). These two adaptive strategies recalculate the importance of the nodes after each node is immunized. Restricted by the high computational complexity of HB and HBA, they are not qualified for large-scale social networks. A dynamic way was designed (Schneider et al., 2012) to immunize the nodes, which evaluates the contribution of each node to the largest connected cluster and then selects the top N nodes with the highest contribution as immune nodes. This strategy shows good performance in maintaining the robustness of the network. Based on the heterogeneous characteristic, Li et al. (2018b) proposed a new network immunization strategy in which the importance of a node is determined by its location in the network and the dynamic activities of the node.

Some of the immunization strategies are directly based on the features of the static network structure and the others select immune nodes from the static network in a dynamic way. However, these methods ignore the propagation features of the rumors. As far as we know, currently, there is no research on immunization strategies that combines with the diffusion domain. Therefore, GIS is proposed in this paper. A comparison with different strategies is shown in Table 2.

3. Data analysis

The dataset of our work is composed of two parts, online questionnaire data and WeChat data. The parameters of the rumor propagation model are obtained through the data analysis of online questionnaire data, including population features, probability of forwarding a rumor, etc. Since the viewing probability of rumors cannot be obtained by online questionnaire data, the WeChat data is used as a supplement.

3.1 Questionnaire data

The questionnaire survey aims to obtain information on the following two aspects. For one thing, the information of individuals in the crowd is obtained, including age, gender, household registration and education level. For another thing, their attitudes toward the rumors are obtained, including whether they think the news is credible or not and whether they would like to forward or not. The content of the rumor is: “Attention! when buying a house after October this year, the down payment ratio for the first home will rise by at least 50%, and in some cities, it rises 60%”. In the end, 5,210 valid questionnaires are collected. According to the results of questionnaire data, people's attitudes are divided into four states.

  1. S (Susceptible): People do not receive rumors and they are very sensitive to the rumor.

  2. H (Hesitate): People are uncertain about the credibility of the rumor and they are easily influenced by the attitudes of others.

  3. A (Agree): People agree with the content of the rumor and they do not question the correctness of the rumor at all.

  4. D (Disagree): People disagree with the content of the rumor and they feel that the content is not credible.

According to the questionnaire data, the forwarding probabilities of people in H, A, D states are 10.77%, 39.66% and 1.86%, respectively. The statistical results of the proportion of attitudes at the beginning and the end of rumor propagation are shown in Table 3.

With the help of the questionnaire data, the four different features (i.e. gender, ages, household registration, and education level) are analyzed. With the method of the Chi-square test, it is found that the proportion of people with different attitudes is not related to gender, household registration and education level.

However, as for ages, the results are different. According to different ages, people are divided into three categories: the young (10–39 years old), the adult (40–49 years old) and the old (50 years old and above) (Ahmad et al., 2000). In the questionnaire data, the proportions of the population by age are 58.9%, 30.3% and 10.8%, respectively.

The correlation test between age and attitude is shown in Table 4. The statistics are the number of people of different ages and attitudes. The adjusted residuals represent the correlation between the corresponding age and attitude.

If the adjusted residual is greater than 0, the age groups are positively correlated to the corresponding attitudes. If the adjusted residual is less than 0, the age groups are negatively correlated to the corresponding attitudes. From the statistical analysis results in Table 4, some conclusions can be found. Compared with the A state, young people tend to be in the H state. However, compared to the H state, the old people tend to be in the A state. The result of the Chi-square test is χ2=10.297, P<0.05, so it is believed that age and attitude are correlated with 95% confidence.

3.2 WeChat data

Since the viewing probability cannot be obtained from the questionnaire data, the WeChat data is collected as a supplement. The WeChat data is the collection of web pages spread in WeChat Moments from a third-party service company. This data records the corresponding timestamp and all user activities from January 14 to February 27, 2016, such as viewing and forwarding. Users must view the web pages before they could forward them. If a user views a web page shared by a friend, the actions of the user and the friend are recorded in the data. Web pages with the spreading period within 45 days are chosen, in total 277014. There are more than 7 million users who are contained in the propagation process of these pages.

The propagation of a single piece of information forms a cascade. In the WeChat data, the scale of the cascade follows a power-law distribution, and the power exponent is approximately λ=2.17. The average forwarding probability in the WeChat data is 0.091. In order to obtain the average viewing probability accurately, we use the Random Recursive Tree (RRT) (Liu et al., 2017a) to model the growth process of the cascade. The viewing probability of each person in the model is related to the number of their neighbors. The calculation formula for the viewing probability μi of an individual is μi=cdiω, where di is the degree of node i, the index ω is a positive number and c is a constant, which is determined by the given average viewing probability. c can be calculated by Equation (1).

(1)μ=cd=dmindmaxdωPr[D=d]

The μ is the average viewing probability and ω is the parameters in the experiments. d represents the degree of the node. dmin and dmax represent the minimum and the maximum of d respectively. Pr[D=d] represents the probability that the nodes' degree is d. Liu et al. (2017a) carried out experiments to find the best parameters. When ω=1.2 and μ=0.4, the best fitting result is obtained. At this time, the λ is very close to 2.17. Therefore, in the follow-up experiments, μ=0.4 is used as the viewing probability.

4. The Unite-Shad model

In this section, we aim to propose a rumor propagation model based on the behaviors and attitudes of individuals. Considering that people's behaviors and attitudes toward rumors would not be consistent entirely, the Unite-Shad is proposed to model the behaviors and attitudes respectively. In the end, simulation experiments of the model are carried out based on the WeChat data and questionnaire data to validate the rationality of the model.

4.1 Model description

The inconsistency between human behaviors and attitudes means that the users who agree with the rumor may not forward the rumor, and the users who disagree with the rumor may forward it. The inconsistency exists in reality. Therefore, the Unite-Shad model is proposed to describe the process of rumor propagation with the inconsistency between human behaviors and attitudes. This model is based on the information propagation model, SVFR and SEIR. The combination of the two models makes the Unite-Shad model perform better in the description of the features of rumor propagation.

Before the description of the model, a few hypotheses are introduced, which makes the model more convincing.

H1.

Individuals never view the same rumor more than once, but they may receive the same rumor again and again.

H2.

Individuals do not forward the same rumor more than once, and if they forward the rumor, they may continue to participate in the discussion of the rumor, which can change the attitudes of the neighbors.

H3.

If someone does not view the rumor for the first time, he or she may not view the rumor in the later period.

H4.

The attitudes of individuals next time are related to the current attitude instead of the historical attitudes.

The inconsistency between human behaviors and attitudes is considered by modeling individuals in the rumor propagation with ABM. As shown in Figure 1, agents have different behaviors and attitudes. Behaviors and attitudes are modeled separately in the Unite-Shad model, so the inconsistency between behaviors and attitudes is involved. In the Unite-Shad model, the behaviors of agents are determined by probability. If an agent views a rumor and his or her attitude is the A state (Agree), his or her behavior will be the R state (Removed) or the T state (Transmit) with the probability of σ and 1-σ respectively.

The behaviors of people are grouped into four states.

  1. U (Untouched): People do not touch the rumor.

  2. V (View): People receive the rumor and view the rumor.

  3. T (Transmit): People forward the rumor to their friends.

  4. R (Removed): People ignore the rumor.

For a certain rumor, each agent is in the U state at first. When he or she receives the rumor, he or she may view the rumor with a probability of μ. If he or she views the rumor, his or her behavior will turn to the V state. Otherwise, his or her behavior will turn to the R state. A person, who views the rumor, may forward it with a probability of σ and then his or her behavior will turn to the T state. Otherwise, his or her behavior will turn to the R state directly. Finally, for the person who forwards the rumor, in the next step, his or her behavior will turn to the R state for sure.

Similarly, we divide attitudes into four states as well. The detailed classifications are introduced in section 3. Now the transitions between these four states are introduced. In the beginning, everyone in the social networks is in the S state. When someone receives a rumor for the first time, his or her attitude will turn to H, A or D, according to the probability of θ1, θ2, or θ3 respectively. When he or she receives the rumor from neighbors again, his or her attitude may turn to other states according to a specific transition probability. The transitions of attitudes are shown in Figure 1.

Behaviors and attitudes of people are interrelated sometimes. For example, if someone does not view the rumor, his or her behavior will transform from the U state to the R state. Meanwhile, his or her attitude will transform from the S state to the D state synchronously. If he or she views the rumor, forwarding or non-forwarding will not change his or her attitude. However, if he or she forwards the rumor, the neighbors will receive the rumor and the neighbors' attitudes will change. If he or she receives the rumor more than once, his or her attitude will change randomly among H, A and D, according to the probabilities analyzed from the questionnaire data.

4.2 Validation

In the experiment, a social network is built for the propagation of the rumors, in which people are regarded as nodes and the relationships between people are regarded as edges. The configuration network is similar to the observed real social network, which is a scale-free network with a power-law degree distribution Pr[D=d]=τdϕ (Ai et al., 2018, 2020). τ is a coefficient and ϕ is the power exponent. A configuration model is used to construct a random scale-free network with power exponent ϕ=2.5. The maximum degree is dmax=N1/(ϕ1), where N is the network size. When the network size is N=10,000, the average degree is E[D]=26.15.

The experiment is based on the Pregel, which is a graph calculation framework, to improve efficiency through parallel computing.

4.2.1 Model parameters

According to the fitting results of the forwarding probability of WeChat data in the paper (Liu et al., 2017a), the average viewing probability μ is used in this paper. However, rumors are different from common information and the forwarding probability of rumors is different from that of common information. Therefore, we use the forwarding probability obtained from the questionnaire data as the forwarding probability of rumors. The forwarding probabilities of people in H, A, D states are 10.77%, 39.66% and 1.86%, respectively. Moreover, the data shows that age is an important factor affecting the attitude transformation between H, A and D. Therefore, in the experiment, age is set as a parameter. According to the questionnaire data, the age ratio in the experiment is set as 58.9% for the young, 30.3% for the adult and 10.8% for the old.

As the user's attitude changes in the three states (H, A, D), the next attitude is only related to the current attitude. Therefore, the transformation process of the three attitude states in the system can be regarded as a Markov chain (Brooks, 2010). At each step of the Markov chain, the system can change from one state to another according to the probability. The changes of the states are called transitions and the related probability is called transition probability. P is the state transition matrix of all people. If the age is not considered, the transition matrix of all people is equal to the individual transition matrix p (P=p).

Assuming that Xi=[x1x2x3] is the proportion of people in H, A, D at time i, the proportion of users at the time i+1 can be expressed as Xi+1 and then there is a relationship, Xi+1=XiP. When the attitude tends to be stable, there exists limiXi+1=Xi, namely Equation (2).

(2)X=XP
Among which P=[α1α2α3β1β2β3γ1γ2γ3] and X represents the proportion of users in different attitudes at time i= . There are three constraints between conversion probabilities, as shown in Equation (3).
(3)α1+α2+α3=1β1+β2+β3=1γ1+γ2+γ3=1

Since people of different ages have different attitudes toward rumors, the transition probability matrix is considered separately for people with different age groups. From the data analysis of the questionnaire data, the proportion of H, A, D populations in different age groups are obtained and the results are listed in Table 5.

After solving Equations (2) and (3), the following transition probability matrix can be obtained as shown in Equations (4)–(6).

(4)Pyoung=[13067ay367bycy3067ay+367bycyay16730cy110dy6730cy+110dyaybydy1bydy]
(5)Padult=[13758aa558baca3758aa+558bacaaa15837ca537da5837ca+537daaabada1bada]
(6)Pold=[12370ao110boco2370ao+110bocoao1  7023co723do7023co+723doaobodo1bodo]

The above transition probability matrices are for the young, the adult and the old people respectively. However, the transition probability matrices are not unique and some of them may be vastly different from the actual situation. Therefore, through the analysis of questionnaire data, social scientists give out some hypotheses. The parameters in Equation (4)–(6), such as ay, by, cy, are constrained by these hypotheses. They are shown as follows.

H5.

For the young, the probability from the H state to the D state is greater than that from the H state to the A state.

H6.

For the old, the probability from the H state to the A state is greater than that from the H state to the D state.

H7.

The probability from the D state to the D state is greater than that from the D state to the H state. The probability from the D state to the H state is greater than that from the D state to the A state.

H8.

The probability from the A state to the A state is greater than that from the A state to the H state. The probability from the A state to the H state is greater than that from the A state to the D state.

Through these hypotheses, the transition probability matrix is obtained, as listed in Table 6.

4.2.2 Experiments and results

A total of 100 experiments are carried out and the effective experimental data are selected for analysis. The results obtained are shown in Table 7. The data in the first row is the results of the questionnaires (QR), and the data in the second row is the statistical results of the experiments (ER).

It can be found from Table 7 that the experimental results are very close to the questionnaire results with the largest gap of 1.5%, indicating that the Unite-Shad model can nearly reproduce the actual results with the parameters obtained from the questionnaire data.

In the experiment, active individuals are regarded as people who are viewing or forwarding the rumor. The number of active individuals can reflect the popularity of the rumor. Through real-time statistics on the number of active individuals, it can be concluded that the popularity of rumors increases rapidly in the initial stage of propagation and slowly decreases in the end, which is consistent with the features that rumors erupt quickly and fade slowly in the social networks (Hallatschek and Fisher, 2014). It indicates the Unite-Shad model is rational from another aspect. There are two reasons for the rapid propagation of rumors in the early stage, and they are the high forwarding probability of rumors and the small-world features in social networks. After the number of active individuals reaches the peak, their interest in the rumors may decline, and then the number of people who view or forward the rumors may gradually decrease.

5. Evaluation of immunization strategies

The governance of rumors plays an important role in the management of society. When the rumors start to appear, they usually do not attract the attention of the government. But when the rumors spread around, the government realizes that the strategies should be executed to control the rumors. Therefore, how to govern rumors with the best effect through immunization strategies is studied in this section. Firstly, the GIS is introduced. Next, the experiments are carried out to determine the optimal time to execute the strategy and the optimal proportion of immune nodes. To demonstrate the advantages of GIS, comparative experiments are carried out with different strategies. The strategies used in the following experiments are NS, RS, HD, HDA, KLS and GIS, where the NS means that no strategies are executed.

5.1 Model description

Yuan and Tang (2015) argued that the implementation of immunization strategies is not “the more central, the better”. Therefore, this paper proposes a strategy called GIS. We define the diffusion domain as all the nodes that have attached the rumor. Two factors of the nodes are considered in the GIS, i.e. the degree of the node and the shortest distance from the node to the diffusion domain.

In the GIS, each node is evaluated through Equation (7). If K nodes need to be immunized, the first K nodes with the highest evaluation value are selected as immune nodes, and these immune nodes will not forward rumor information.

(7)Gi=diu/piv

In Equation (7), di is the degree of node i, and pi is the shortest distance from node  i to the diffusion domain. If a node is in the diffusion domain, it cannot be selected as an immune node. In the Gravity Model proposed by Newton, the universal gravitation between two objects follows the Equation F=GMm/r2 (Ahmed and Mohamed, 2018). The universal gravitation is inversely proportional to the square of the distance between two objects. Therefore, v=2 is used in the following experiments. With the condition of v=2, we conduct experiments about u, the range of which is set from 1 to 5. If the u is too large, the GIS is equivalent to the HD. When u=3, the rumors are best controlled, so in the following experiments, u=3 is used.

5.2 The experiments on the performance of strategy execution time

The experiments in this section aim to determine the optimal time for executing strategies. Based on the Unite-Shad model, different execution times for strategy are set. The execution time of strategy represents by t. At the same time, t means the depth of the rumor propagation cascade. The rumor propagation cascade is the trace formed with the spreading of the rumors. From the preliminary experiment, it is known that when the depth of the rumor propagation cascade is up to 10, the propagation of the rumor reaches a stable state. Therefore, the value of t in the experiment is set from 1 to 10. At the same time, when the proportion of immune nodes is greater than 0.1, the immune effect barely increases. Therefore, in the experiments, the proportion of immune nodes is selected randomly, ranging from 0 to 0.1. By comparing the final scale of infection of the six strategies, the optimal execution time of strategies and the effects of different strategies are analyzed. In the experiments, in order to explore the stable effect of the immunization strategy and reduce the influence of random factors, a node with the average degree is set as the initial node to ensure that each rumor will spread out. The experiments are based on the scale-free network described above, and the result is shown in Figure 2.

In Figure 2, the horizontal axis represents the time t to execute strategies. The vertical axis represents the number of nodes in the diffusion domain at the end of rumor propagation. Figure 2 shows that, except for NS and RS, the effects of other immunization strategies vary greatly with the changes of steps and the trends of the four strategies are consistent. To further accurately distinguish the effects of different steps, the average values Mt of step t of the HD, HDA, KLS and GIS experimental results are calculated. The number of newly increased nodes in the diffusion domain is obtained by Nt=Mt+1Mt. Then the index of Rt is designed to reflect the number of newly increased nodes that receive the rumor, which is calculated by Rt=Nt/Mt.

As shown in Table 8, the scale of the diffusion domain is reduced in the ninth step, that is Nt=4. The downsize is caused by random factors, which indicates that if strategies are executed after nine steps, the governance effect is not obvious. Therefore, the time advantage of rumor governance can be obtained by placing an immunization strategy in the first eight steps. Moreover, the sooner we put an immunization strategy, the better governance effect can be obtained. In the first step, the propagation of rumors is not as rapid as that in the second step, because the rumors forwarding population is small. In the second and third steps, there is a suitable size of the rumors forwarding population, and the number of people who have not accepted the rumors is relatively large, so it is very conducive for the propagation of rumors. As a result, the first four steps are the key period to control the propagation of rumors. After four steps, the rumors have been fully propagated among the people.

5.3 The experiments on the performance of the immune proportion

The Effective Value of the new added Unit (EVU) is defined as the change in the proportion of the diffusion domain caused by a single immune node. Better governance effects can be realized with larger EVU. The EVU can be used to evaluate the cost performance of newly added immune nodes. In order to explore the optimal proportion of immune nodes, experiments with different proportions of immune nodes are carried out. Since if the proportion of immune nodes is greater than 10%, the immune effect is almost unchanged, in the experiment, the proportion is controlled within 10% (0.01, 0.02…0.10). Because of the impact of the time of taking strategies, the four different times (the first step to the fourth step) are selected in experiments. A node with the average degree is chosen as the initial node. At the same time, the experiment is based on the scale-free network above and each set of parameters is repeated 100 times.

Under different strategies, different proportions of immune nodes are set, and the proportion of the diffusion domain size to the network size is used as the evaluation index. The result is shown in Figure 3. The results of the RS do not change significantly with the varied proportion of immune nodes, therefore, in the following, the result of the other four strategies is used. Furthermore, we use the quadratic polynomial to fit the relationship between the proportion of the diffusion domain and the proportion of immune nodes. The fitting results are shown in the red curve in Figure 3. EVU is shown in Figure 3 as the absolute value of the derivative function of the fitted curve, as shown in Equations (8) and (9).

(8)f(x)=ax2+bx+c
(9)EVU=|f(x)|=|2ax+b|
where a, b, and c are the coefficients of the polynomial. When the EVU is less than 1, it means if n new immune nodes are added, the size of the diffusion domain will decrease by nδ nodes (δ>0). On the contrary, when the EVU is greater than 1, it means if n new immune nodes are added, the size of the diffusion domain will decrease by n+δ nodes (δ>0). Therefore, when the EVU value is greater than 1, the immune nodes can exert the immune effect. We argue that 1 is the critical value of EVU. With the increase of immune nodes, the EVU is continuously reduced. When EVU = 1, the immune nodes are unable to exert better immune effects if the immune nodes continue to increase. Therefore, when EVU = 1, the value of the horizontal axis is the critical value of proportion (CVP), which represents the optimal proportion of immune nodes.

In different steps, the CVP is different. However, there is a trend that the earlier strategies may cause a greater CVP. It shows that under the same proportion of immune nodes, the earlier strategies can make each node a greater EVU, which is significant for governance. If the strategy execution time is late, in order to achieve the same control effect, more immune nodes are needed. Also, since the range of the CVP is 0.049 ∼ 0.081 for the first four steps, the proportion of immune nodes should be controlled in the range of 0.049 ∼ 0.081.

5.4 The experiments on the performance of different strategies

From the experiments above, the optimal time to execute the strategy and the proportion of immune nodes are obtained. In order to compare the effects of different strategies, the execution time of the strategy is set in the second step, and the proportion of immune nodes is set to 0.081. In order to test the effectiveness of the immunization strategy and avoid the influence of random factors, a node with the average degree is taken as the initial node. The experiment for each strategy is repeated 500 times. Since the social networks in the real world are small-world and scale-free (Lee et al., 2016), three different kinds of networks are used in the experiments. They are a small-world network, a scale-free network and the Twitter network. The results are shown in Figure 4.

Figure 4a shows the results in a small-world network with 10,000 nodes and an average degree of 26. The network is generated by the WS model (Watts and Strogatz, 1998) with 10,000 nodes. It is obvious that GIS is much better than other strategies. The turning point at t = 2 represents the obvious effect of GIS after the strategy is executed. The GIS almost suppresses the propagation of rumors immediately.

Figure 4b shows the results in a scale-free network, which is described above. It can be seen that the immunization strategies HD, HDA, KLS and GIS are significantly better than NS and RS. Moreover, it can be concluded from the partially enlarged view that the GIS can delay the outbreak of rumors and it demonstrates a good suppression of rumor propagation in the early stage.

Figure 4c shows the results of different strategies in experiments with the Twitter network from the Stanford Network Analysis Project (Leskovec and Krel, 2014). The Twitter network is the friend relationships of users from Twitter, which consists of 81,306 users and 1,768,149 relationships among users. The average degree of the network is 45.49. In the Twitter network, the effectiveness of HD, HDA, KLS and GIS are pretty good, especially for GIS. The results are consistent with those of the scale-free network, which indicates that the experimental results are relatively stable.

6. Discussion and conclusions

If the rumors are not properly controlled, the large-scale propagation of the rumors would disrupt the development of society. Therefore, it is of great practical significance to discover the mechanism of rumor propagation and to design effective governance strategies. Specifically, this paper devises an effective rumor governance plan through the analysis of online data. Firstly, we propose the Unite-Shad model by analyzing features of questionnaire data and the WeChat data. Then, based on the Unite-Shad model, the GIS is proposed.

In the Unite-Shad model, each person in the social network is modeled as an agent. The behaviors and attitudes of agents are modeled separately, but they also influence each other. Through the simulation experiments, the rationality of the model is verified. Based on the Unite-Shad model, an immunization strategy, namely GIS, is established, which combines the advantages of degree centrality and distance centrality. Distance centrality refers to the shortest distance from the node to the diffusion domain. With the support of GIS, rumors can be suppressed earlier since the immune nodes are close to the diffusion domain. Based on the experiments on two configuration networks and one real-world network, the conclusions can be summarized that GIS can effectively delay the propagation of rumors in the early stage in the social networks. Besides, compared with scale-free networks, GIS is more effective to suppress rumor propagation in small-world networks.

This study makes important implications in real-world rumor governance. For example, to suppress the propagation of rumors, Tencent's rumor debunkers use the fact-checking platform (a WeChat program) to push rumor refutation responses (https://fact.qq.com/). This study can be used to determine how to push rumor refutation responses to the public through the following steps. Firstly, the diffusion domain is extracted and the distances from nodes to the diffusion domain are obtained. Then, the topology of the network and the degrees of nodes are obtained through the friend relationships. Subsequently, each node is evaluated by GIS, and simulation experiments are supposed to be carried out based on the network topology to decide the critical time to execute GIS and the number of immune nodes. At last, according to the results, some critical nodes are selected as immune nodes and they are pushed with rumor refutation responses, leading to the disagreement with the rumor. The existence of immune nodes is capable of reducing the average forwarding probability of rumors in public. Consequently, the propagation of rumors is suppressed. Furthermore, based on the collaboration with Tencent, this study will be used in the rumor refutation response to improve governance effectiveness.

However, there remain some drawbacks in the present work. For instance, the experiments are only carried out on the configuration networks and the Twitter network, which inevitably lack convincing evidence to illustrate the generalization on other networks. Therefore, in future work, collecting data from other platforms and testing the generalization of our proposed models are supposed to be carried out. Besides, as for GIS, the indexes of degree and distance are the factors that influence the effect of the immunization strategy. Therefore, efforts should be focused on the research of how to obtain the best indexes of degree and distance.

Figures

The behaviors and attitudes of an agent

Figure 1

The behaviors and attitudes of an agent

Effect of time in executing strategy

Figure 2

Effect of time in executing strategy

Experiments in the proportion of immunized nodes

Figure 3

Experiments in the proportion of immunized nodes

Network immunity

Figure 4

Network immunity

Comparisons among rumor propagation models

ModelCategoryFeature
DKMathematical modelThe first mathematical model
The model proposed by Bhih et al. (2020)Mathematical modelBased on the cholera model
ISSComplex network-based modelBased on scale-free networks
The model proposed by Piqueira (2010)Complex network-based modelBased on ISS and dynamic system theory
SIHRFeature-based modelIntroduce constant forgetting rate
SICSFeature-based modelIntroduce counter mechanism and mean-field theory
URQTFeature-based modelIntroduce “isolated state” in the believer
Unite-ShadFeature-based modelIntroduce inconsistency between behaviors and attitudes

Comparisons among immunization strategies

ModelInformation usedDynamic immunizationUse of propagation feature
RSNoNoNo
ASLocalNoNo
CISLocalYesNo
KLSLocalNoNo
HDGlobalNoNo
HDAGlobalYesNo
HBGlobalNoNo
HBAGlobalYesNo
GISGlobalYesYes

The attitudes distribution in people

Hesitate (%)Agree (%)Disagree (%)
The beginning of the rumor propagation40357
The end of the rumor propagation65314

Correlation test between age and attitude

The youngThe adultThe old
StatisticsAdjusted residualsStatisticsAdjusted residualsStatisticsAdjusted residuals
H1,2822.2633−0.4197−2.8
A94044−0.8221.2
D1,694−2.19000.73442.4

Note(s): If the absolute value of the adjusted residual is greater than 2, the age groups are statistically different in relevant attitudes

The proportion of H, A, D populations in different age groups

AttitudesThe beginning of the rumor propagation (%)The end of the rumor propagation (%)
The youngH4167
A430
D553
The adultH4058
A337
D575
The oldH3570
A423
D617

Transition probability matrix for experiments

The youngThe adultThe old
HADHADHAD
H0.860.120.020.820.160.020.890.050.06
A0.260.730.010.250.740.020.240.730.03
D0.470.060.470.260.060.680.350.300.35

Statistical result of attitudes

The youngThe adultThe old
HADHADHAD
QR67.0%30.0%3.0%58.0%37.0%5.0%70.0%23.0%7.0%
ER65.5%30.8%3.7%57.4%36.7%5.9%70.4%22.6%7.0%

Statistical indicators of the effects of the four strategies

Step12345678910
Mt3,6463,8394,3114,6534,9465,0805,1885,1995,2385,234
Nt1934723422931341081139−4
Rt0.0530.1230.0790.0630.0270.0210.0020.008−0.001

References

Ahmad, O.B., Boschi-Pinto, C., Lopez, A., Murray, C.J.L., Lozano, R. and Inoue, M. (2000), “Age standardization of rates: a new who standard”, available at: http://www.who.int/healthinfo/paper31.pdf (accessed 10 May 2021).

Ahmed, I. and Mohamed, E.H. (2018), “Density centrality: identifying influential nodes based on area density formula”, Chaos, Solitons and Fractals, Vol. 2018 No. 114, pp. 69-80.

Ai, C., Chen, B., He, L.N., Lai, K.S. and Qiu, X.G. (2018), “The national geographic characteristics of online public opinion propagation in China based on WeChat network”, GeoInformatica, Vol. 22 No. 2, pp. 311-334.

Ai, C., Chen, B., Chen, H.L., Dai, W.H. and Qiu, X.G. (2020), “Geographical structural features of the WeChat social networks”, ISPRS International Journal of Geo-Information, Vol. 9 No. 5, p. 290.

Barabási, A.L. and Albert, R. (1999), “Emergence of scaling in random networks”, Science, Vol. 286 No. 5439, pp. 509-512.

Bhih, A.E., Ghazzali, R., Rhila, S.B., Rachik, M. and Laaroussi, A.E.A. (2020), “A discrete mathematical modeling and optimal control of the rumor propagation in online social network”, Discrete Dynamics in Nature and Society, Vol. 2020 No. 3, p. 4386476.

Bonabeau, E. (2002), “Agent-based modeling: methods and techniques for simulating human systems”, Proceedings of the National Academy of Sciences, Vol. 99 No. 3, pp. 7280-7287.

Brooks, S. (2010), “Markov chain Monte Carlo method and its application”, Journal of the Royal Statistical Society, Vol. 47 No. 1, pp. 69-100.

Budak, C., Agrawal, D. and Abbadi, A.E. (2011), “Limiting the spread of misinformation in social networks”, Proceedings of the 20th International Conference on World Wide Web, ACM, New York, NY, pp. 665-674.

Cohen, R., Erez, K., Benavraham, D. and Havlin, S. (2000), “Resilience of the internet to random breakdowns”, Physical Review Letters, Vol. 85 No. 21, pp. 4626-4628.

Cohen, R., Havlin, S. and Ben-Avraham, D. (2003), “Efficient immunization strategies for computer networks and populations”, Physical Review E, Vol. 91 No. 24, p. 247901.

Cooney, C. (2020), “FATAL REMEDY at least 27 die from drinking industrial alcohol as 'cure' for coronavirus in Iran”, The Sun, 10 Mar, available at: https://www.thesun.co.uk/news/11135724/27-die-industrial-alcohol-cure-coronavirus-iran/ (accessed 10 May 2021).

Daley, D.J. and Kendall, D.G. (1965), “Stochastic rumours”, IMA Journal of Applied Mathematics, Vol. 1 No. 1, pp. 42-55.

Gallos, L.K., Liljeros, F., Argyrakis, P., Bunde, A. and Havlin, S. (2007), “Improving immunization strategies”, Physical Review E, Vol. 75 No. 4, p. 045104.

Hallatschek, O. and Fisher, D.S. (2014), “Acceleration of evolutionary spread by long-range dispersal”, Proceedings of the National Academy of Sciences of the United States of America, Vol. 111 No. 46, p. 201404663.

Holme, P. (2004), “Efficient local strategies for vaccination and network attack”, EPL, Vol. 68 No. 6, pp. 908-914.

Lee, K.W., Lee, J.H. and Choe, H.Z. (2016), “Small-world scale-free”, The Journal of Korean Institute of Communications and Information Sciences, Vol. 41 No. 7, pp. 754-764.

Leskovec, J. and Krevl, A. (2014), “Large network dataset collection”, SNAP Datasets, available at: http://snap.stanford.edu/data (accessed 22 May 2021).

Leskovec, J., Mcglohon, M., Faloutsos, C., Glance, N. and Hurst, M. (2007), “Patterns of cascading behavior in large blog graphs”, in Apte, C., Skillicorn, D., Liu, B. and Parthasarathy, S. (Eds), Proceedings of the Seventh SIAM International Conference on Data Mining, Minneapolis, MN, pp. 551-556.

Li, C.C., Liu, F.M. and Li, P. (2018a), “Ising model of user behavior decision in network rumor propagation”, Discrete Dynamics in Nature and Society, Vol. 2018 No. 2018, pp. 1-10.

Li, X., Guo, J., Gao, C., Zhang, L. and Zhang, Z. (2018b), “A hybrid strategy for network immunization”, Chaos, Solitons and Fractals, Vol. 106, pp. 214-219.

Liu, P. (2016), “Research on college students' conformity in sports”, Creative Education, Vol. 7 No. 3, pp. 449-452.

Liu, Y., Deng, Y. and Wei, B. (2016), “Local immunization strategy based on the scores of nodes”, Chaos, Vol. 26 No. 1, p. 013106.

Liu, L., Qu, B., Chen, B., Hanjalic, A. and Wang, H.J. (2017a), “Modelling of information diffusion on social networks with applications to WeChat”, Physica A: Statistical Mechanics and its Applications, Vol. 496, pp. 318-329.

Liu, Q., Li, T. and Sun, M. (2017b), “The analysis of an SEIR rumor propagation model on heterogeneous network”, Physica A: Statistical Mechanics and its Applications, Vol. 469, pp. 372-380.

Ma, K., Li, W., Guo, Q., Zheng, X., Zheng, Z., Gao, C. and Tang, S. (2018), “Information spreading in complex networks with participation of independent spreaders”, Physica A: Statistical Mechanics and its Applications, Vol. 492, pp. 21-27.

Maki, D.P. and Thompson, M. (1973), Mathematical Models and Applications: With Emphasis on the Social Life, and Management Sciences, Prentice Hall, Upper Saddle River, NJ.

Moreno, Y., Nekovee, M. and Pacheco, A.F. (2004), “Dynamics of rumor spreading in complex networks”, Physical Review E, Vol. 69 No. 6, p. 066130.

Musa, S. and Fori, M. (2019), “Mathematical model of the dynamics of rumor propagation”, Journal of Applied Mathematics and Physics, Vol. 7 No. 6, pp. 1289-1303.

Nekovee, M., Moreno, Y., Bianconi, G. and Marsili, M. (2007), “Theory of rumour spreading in complex social networks”, Physica A: Statistical Mechanics and its Applications, Vol. 374 No. 1, pp. 457-470.

Pastorsatorras, R. and Vespignani, A. (2002), “Immunization of complex networks”, Physical Review E, Vol. 65 No. 3, p. 036104.

Piqueira, J.R.C. (2010), “Rumor propagation model: an equilibrium study”, Mathematical Problems in Engineering, Vol. 2010, pp. 242-256.

Qiu, X.G., Chen, B. and Zhang, P. (2017), Emergency Management Oriented Artificial Society Construction and Computational Experiments, Science Press, Beijing, BJ.

Schneider, C.M., Mihaljev, T. and Herrmann, H.J. (2012), “Inverse targeting—an effective immunization strategy”, EPL, Vol. 98 No. 4, p. 46002.

Wan, C., Li, T., Wang, Y. and Liu, X. (2016), “Rumor spreading of a SICS model on complex social networks with counter mechanism”, Open Access Library Journal, Vol. 03 No. 7, pp. 1-11.

Wang, J., Zhao, L. and Huang, R. (2014), “SIRaRu rumor spreading model in complex networks”, Physica A: Statistical Mechanics and its Applications, Vol. 398, pp. 43-45.

Wang, Z., Bauch, C.T., Bhattacharyya, S., Donofrio, A., Manfredi, P., Perc, M., Perra, N., Salathe, M. and Zhao, D. (2016), “Statistical physics of vaccination”, Physics Reports, Vol. 664 No. 664, pp. 1-113.

Watts, D.J. and Strogatz, S.H. (1998), “Collective dynamics of 'small-world' networks”, Nature, Vol. 393 No. 6684, pp. 440-442.

Yang, L.X., Zhang, T.R., Yang, X.F., Wu, Y.B. and Tang, Y.Y. (2017), “On the effectiveness of the truth-spreading/rumor-blocking strategy for restraining rumors”, available at: https://arxiv.org/abs/1705.10618v1 (accessed 10 May 2021).

Yuan, P. and Tang, S. (2015), “Community-based immunization in opportunistic social networks”, Physica A: Statistical Mechanics and its Applications, Vol. 420, pp. 85-97.

Zanette, D.H. (2001), “Critical behavior of propagation on small-world networks”, Physical Review E, Vol. 64 No. 5, p. 050901.

Zhao, L., Wang, J., Chen, Y., Wang, Q., Cheng, J. and Cui, H. (2012), “SIHR rumor spreading model in social networks”, Physica A: Statistical Mechanics and its Applications, Vol. 391 No. 7, pp. 2444-2453.

Acknowledgements

Hailiang Chen and Chuan Ai are both contributed equally.

This study was supported by the National Key Research & Development (R&D) Plan under Grant No. 2018YFC0806900, the National Natural Science Foundation of China under Grant Nos. 21808181 and 72071207, National Social Science Foundation of China under Grant No.17CGL047, Humanity and Social Science Youth Foundation of the Ministry of Education of China under Grant No. 19YJCXH073 and the Guangdong Key Laboratory for Big Data Analysis and Simulation of Public Opinion.

Corresponding author

Bin Chen can be contacted at: nudtcb9372@gmail.com

Related articles