Research at St Andrews

Towards reproducibility in online social network research

Research output: Contribution to journal › Article › peer-review


Luke Hutton, Tristan Henderson


The challenge of conducting reproducible computational research is acknowledged across myriad disciplines from biology to computer science. In the latter, research leveraging online social networks (OSNs) must deal with a set of complex issues, such as ensuring data can be collected in an appropriate and reproducible manner. Making research reproducible is difficult, and researchers may need suitable incentives, tools, and systems to do so. In this paper, we explore the state of the art in OSN research reproducibility, and present an architecture to aid reproducibility. We characterize reproducible OSN research using three main themes: 1) reporting of methods; 2) availability of code; and 3) sharing of research data. We survey 505 papers and assess the extent to which they achieve these reproducibility objectives. While systems-oriented papers are more likely to explain data-handling aspects of their methodology, social science papers are better at describing their participant-handling procedures. We then examine incentives to make research reproducible by conducting a citation analysis of these papers. We find that sharing data is associated with increased citation count, while sharing methods and code does not appear to be. Finally, we introduce our architecture, which supports the conduct of reproducible OSN research, and evaluate it by replicating an existing research study.


Original language: English
Pages (from-to): 156-167
Number of pages: 12
Journal: IEEE Transactions on Emerging Topics in Computing
Issue number: 1
Early online date: 20 Jul 2015
Publication status: Published - Mar 2018

    Research areas

  • Data sharing, Online social networks, Reproducibility, Survey

Related by author

  1. Data portability as a tool for audit

    Zwiebelmann, Z. & Henderson, T., 21 Sep 2021, UbiComp '21: Adjunct Proceedings of the 2021 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2021 ACM International Symposium on Wearable Computers. ACM, p. 276–280 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  2. Data protection for the common good: developing a framework for a data protection-focused data commons

    Wong, J., Henderson, T. & Ball, K., 15 Sep 2020, Data for Policy Conference 2020. Data for Policy, 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  3. Automating dynamic consent decisions for the processing of social media data in health research

    Norval, C. & Henderson, T., Jul 2020, In: Journal of Empirical Research on Human Research Ethics. 15, 3, p. 187-201

    Research output: Contribution to journal › Article › peer-review

  4. Short paper: Integrating the data protection impact assessment into the software development lifecycle

    Irvine, C., Balasubramaniam, D. & Henderson, T., 2020, Data Privacy Management, Cryptocurrencies and Blockchain Technology: ESORICS 2020 International Workshops, DPM 2020 and CBT 2020, Guildford, UK, September 17–18, 2020, Revised Selected Papers. Garcia-Alfaro, J., Navarro-Arribas, G. & Herrera-Joancomarti, J. (eds.). Cham: Springer, p. 219-228 (Lecture Notes in Computer Science (including subseries Security and Cryptology); vol. 12484 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

ID: 163403775