Research at St Andrews

Towards reproducibility in online social network research

Research output: Contribution to journal, Article


Luke Hutton, Tristan Henderson


The challenge of conducting reproducible computational research is acknowledged across myriad disciplines from biology to computer science. In the latter, research leveraging online social networks (OSNs) must deal with a set of complex issues, such as ensuring data can be collected in an appropriate and reproducible manner. Making research reproducible is difficult, and researchers may need suitable incentives, as well as tools and systems, to do so. In this paper, we explore the state of the art in OSN research reproducibility, and present an architecture to aid reproducibility. We characterize reproducible OSN research using three main themes: 1) reporting of methods; 2) availability of code; and 3) sharing of research data. We survey 505 papers and assess the extent to which they achieve these reproducibility objectives. While systems-oriented papers are more likely to explain data-handling aspects of their methodology, social science papers are better at describing their participant-handling procedures. We then examine incentives to make research reproducible by conducting a citation analysis of these papers. We find that sharing data is associated with increased citation count, while sharing methods and code does not appear to be. Finally, we introduce our architecture, which supports the conduct of reproducible OSN research, and evaluate it by replicating an existing research study.


Original language: English
Pages (from-to): 156-167
Number of pages: 12
Journal: IEEE Transactions on Emerging Topics in Computing
Issue number: 1
Early online date: 20 Jul 2015
Publication status: Published - Mar 2018

    Research areas

  • Data sharing, Online social networks, Reproducibility, Survey

Related by author

  1. Integrating the Data Protection Impact Assessment into the Software Development Lifecycle

    Irvine, C., Balasubramaniam, D. & Henderson, T., 17 Sep 2020, Proceedings of the 15th DPM International Workshop on Data Privacy Management.

    Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

  2. Data Protection for the Common Good: Developing a framework for a data protection-focused data commons

    Wong, J., Henderson, T. & Ball, K., 15 Sep 2020, Data for Policy Conference 2020.

    Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

  3. Data Protection, certification and the fourth industrial revolution

    Henderson, T. & Schafer, B., Sep 2019, Regulating Industrial Internet through IPR, Data Protection and Competition Law. Ballardini, R. M., Pitkänen, O. & Kuoppamäki, P. (eds.). Alphen aan den Rijn, The Netherlands: Kluwer Law International.

    Research output: Chapter in Book/Report/Conference proceeding, Chapter

ID: 163403775