English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Contribution to Collected Edition

The Impact of Measurement Time on Subgroup Detection in Online Communities

MPS-Authors
/persons/resource/persons41230

Krempel,  Lothar
Wissenschaft, Technik und Innovationssysteme, MPI for the Study of Societies, Max Planck Society;

Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Zeini, S., Göhnert, T., Hecking, T., Krempel, L., & Hoppe, H. U. (2014). The Impact of Measurement Time on Subgroup Detection in Online Communities. In F. Can, T. Özyer, & F. Polat (Eds.), State of the Art Applications of Social Network Analysis (pp. 249-268). Cham: Springer International Publishing. doi:10.1007/978-3-319-05912-9_12.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0023-F74C-0
Abstract
More and more communities use internet based services and infrastructure for communication and collaboration. All these activities leave digital traces that are of interest for research as real world data sources that can be processed automatically or semi-automatically. Since productive online communities (such as open source developer teams) tend to support the establishment of ties between actors who work on or communicate about the same or similar objects, social network analysis is a frequently used research methodology in this field. A typical application of Social Network Analysis (SNA) techniques is the detection of cohesive subgroups of actors (also called “community detection”. We were particularly interested in such methods that allow for the detection of overlapping clusters, which is the case with the Clique Percolation Method (CPM) and Link Community detection (LC). We have used these two methods to analyze data from some open source developer communities (mailing lists and log files) and have compared the results for varied time windows of measurement. The influence of the time span of data capturing/aggregation can be compared to photography: A certain minimal window size is needed to get a clear image with enough “light” (i.e. dense enough interaction data), whereas for very long time spans the image will be blurred because subgroup membership will indeed change during the time span (corresponding to a moving target). In this sense, our target parameter is “resolution” of subgroup structures. We have identified several indicators for good resolution. In general, this value will vary for different types of communities with different communication frequency and behavior. Following our findings, an explicit analysis and comparison of the influence of time window for different communities may be used to better adjust analysis techniques for the communities at hand.