How to identify influential content: Predicting retweets in online financial community
Abstract
Purpose
Retail investors are prone to be affected by information dissemination in social media with the rapid development of Web 2.0. The purpose of this study is to recognize the factors that may impact users' retweet behavior, namely information dissemination in the online financial community, through machine learning techniques.
Design/methodology/approach
This paper crawled data from the Chinese online financial community (Xueqiu.com) and extracted author-related, content-related, situation-related, stock-related and stock market-related features from the dataset. The best information dissemination prediction model based on these features was determined by evaluating five classifiers with various performance metrics, and the predictability of different feature groups was tested.
Findings
Five prevalent classifiers were evaluated with various performance metrics and the random forest classifier was proven to be the best retweet prediction model in the authors’ experiments. Moreover, the predictability of author-related, content-related and market-related features was illustrated to be relatively better than that of the other two feature groups. Several particularly important features, such as the author's followers and the rise and fall of the stock index, were recognized in this paper at last.
Originality/value
This study contributes to in-depth research on information dissemination in the financial domain. The findings of this study have important practical implications for government regulators to supervise public opinion in the financial market.
Keywords
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant numbers 71332003, 71671011 and 72201020) and China Scholarship Council (Grant number 202106020120).
Citation
He, D., Yao, Z., Zhao, F. and Wang, Y. (2023), "How to identify influential content: Predicting retweets in online financial community", Aslib Journal of Information Management, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/AJIM-05-2022-0254
Publisher
:Emerald Publishing Limited
Copyright © 2023, Emerald Publishing Limited