To read this content please select one of the options below:

Statistical inference from power law distributed web‐based social interactions

Daphne R. Raban (School of Management and the Center for the Study of the Information Society, University of Haifa, Haifa, Israel)
Eyal Rabin (The Center for the Study of the Information Society, University of Haifa, Haifa, Israel)

Internet Research

ISSN: 1066-2243

Article publication date: 5 June 2009

876

Abstract

Purpose

The purpose of this paper is to propose a method for statistical inference on data from power law distributions in order to explain behavior and social phenomena associated with web‐based social spaces such as discussion forums, question‐and‐answer sites, web 2.0 applications and the like.

Design/methodology/approach

The paper starts by highlighting the importance of explaining behavior in social networks. Next, the power law nature of social interactions is described and a hypothetical example is used to explain why analyzing sub‐sets of data might misrepresent the relationship between variables having power law distributions. Analysis requires the use of the complete distribution. The paper proposes logarithmic transformation prior to correlation and regression analysis and shows why it works using the hypothetical example and field data retrieved from Microsoft's Netscan project.

Findings

The hypothetical example emphasizes the importance of analyzing complete datasets harvested from social spaces. The Netscan example shows the importance of the logarithmic transformation for enabling the development of a predictive regression model based on the power law distributed data. Specifically, it shows that the number of new and returning participants are the main predictors of discussion forum activity.

Originality/value

This paper offers a useful analysis tool for anyone interested in social aspects of the Internet as well as corporate intra‐net systems, knowledge management systems or other systems that support social interaction such as cellular phones and mobile devices. It also explains how to avoid errors by paying attention to assumptions and range restriction issues.

Keywords

Citation

Raban, D.R. and Rabin, E. (2009), "Statistical inference from power law distributed web‐based social interactions", Internet Research, Vol. 19 No. 3, pp. 266-278. https://doi.org/10.1108/10662240910965342

Publisher

:

Emerald Group Publishing Limited

Copyright © 2009, Emerald Group Publishing Limited

Related articles