Skip to content

Research at St Andrews

Parallel and streaming truth discovery in large-scale quantitative crowdsourcing

Research output: Contribution to journalArticle

Author(s)

Robin Wentao Ouyang, Lance M. Kaplan, Alice Toniolo, Mani Srivastava, Timothy J. Norman

School/Research organisations

Abstract

To enable reliable crowdsourcing applications, it is of great importance to develop algorithms that can automatically discover the truths from possibly noisy and conflicting claims provided by various information sources. In order to handle crowdsourcing applications involving big or streaming data, a desirable truth discovery algorithm should not only be effective, but also be scalable. However, with respect to quantitative crowdsourcing applications such as object counting and percentage annotation, existing truth discovery algorithms are not simultaneously effective and scalable. They either address truth discovery in categorical crowdsourcing or perform batch processing that does not scale. In this paper, we propose new parallel and streaming truth discovery algorithms for quantitative crowdsourcing applications. Through extensive experiments on real-world and synthetic datasets, we demonstrate that 1) both of them are quite effective, 2) the parallel algorithm can efficiently perform truth discovery on large datasets, and 3) the streaming algorithm processes data incrementally, and it can efficiently perform truth discovery both on large datasets and in data streams.

Close

Details

Original languageEnglish
Pages (from-to)2984-2997
Number of pages14
JournalIEEE Transactions on Parallel and Distributed Systems
Volume27
Issue number10
Early online date6 Jan 2016
DOIs
Publication statusPublished - 1 Oct 2016

    Research areas

  • Crowdsourcing, Truth discovery, Quantitative task, Big data, Parallel algorithm, Streaming algorithm

Discover related content
Find related publications, people, projects and more using interactive charts.

View graph of relations

Related by author

  1. On natural language generation of formal argumentation

    Cerutti, F., Toniolo, A. & Norman, T. J., 27 Dec 2019, Proceedings of the 3rd Workshop on Advances In Argumentation In Artificial Intelligence co-located with the 18th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2019): Rende, Italy, November 19-22, 2019. Santini, F. & Toniolo, A. (eds.). Sun SITE Central Europe, p. 15-29 15 p. (CEUR Workshop Proceedings; vol. 2528).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  2. Dialectical models of deliberation, problem solving and decision making

    Walton, D., Toniolo, A. & Norman, T. J., 13 Sep 2019, In : Argumentation. First Online

    Research output: Contribution to journalArticle

  3. Deb8: a tool for collaborative analysis of video

    Carneiro, G., Nacenta, M., Toniolo, A., Mendez, G. & Quigley, A. J., 4 Jun 2019, Proceedings of the 2019 ACM International Conference on Interactive Experiences for TV and Online Video (TVX '19). ACM, p. 47-58

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  4. Deb8: collaborative fact checking

    Carneiro, G., Nacenta, M., Toniolo, A., Mendez, G. G. & Quigley, A. J., 5 May 2019.

    Research output: Contribution to conferencePaper

  5. A tool to highlight weaknesses and strengthen cases: CISpaces.org

    Cerutti, F., Norman, T. J. & Toniolo, A., 12 Dec 2018, Legal Knowledge and Information: JURIX 2018: The Thirty-first Annual Conference. Palmirani, M. (ed.). IOS Press, p. 186-189 4 p. (Frontiers in Artificial Intelligence and Applications; vol. 313).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

Related by journal

  1. Large-scale automatic k-means clustering for heterogeneous many-core supercomputer

    Yu, T., Zhao, W., Liu, P., Janjic, V., Yan, X., Wang, S., Fu, H., Yang, G. & Thomson, J. D., May 2020, In : IEEE Transactions on Parallel and Distributed Systems. 31, 5, p. 997-1008 12 p.

    Research output: Contribution to journalArticle

ID: 247864392

Top