Skip to content

Research at St Andrews

Lattice-based scheduling for multi-FPGA systems

Research output: Chapter in Book/Report/Conference proceedingConference contribution


Teng Yu, Bo Feng, Mark Stillwell, Liucheng Guo, Yuchun Ma, John Donald Thomson

School/Research organisations


Accelerators are becoming increasingly prevalent in distributed computation. FPGAs have been shown to be fast and power efficient for particular tasks, yet scheduling on FPGA-based multi-accelerator systems is challenging when workloads vary significantly in granularity in terms of task size and/or number of computational units required. We present a novel approach for dynamically scheduling tasks on networked multi-FPGA systems which maintains high performance, even in the presence of irregular tasks. Our topological ranking-based scheduling allows realistic irregular workloads to be processed while maintaining a significantly higher level of performance than existing schedulers.


Original languageEnglish
Title of host publicationProceedings of the International Conference on Field-Programmable Technology 2018, Naha, Okinawa, Japan
PublisherIEEE Press
Publication statusPublished - 10 Dec 2018
EventInternational Conference on Field-Programmable Technology (FPT'18) - Naha, Okinawa, Japan
Duration: 10 Dec 201814 Dec 2018


ConferenceInternational Conference on Field-Programmable Technology (FPT'18)
Abbreviated titleFPT'18
CityNaha, Okinawa
Internet address

    Research areas

  • Runtime scheduling, Lattice, Representation, Multi-FPGA

Discover related content
Find related publications, people, projects and more using interactive charts.

View graph of relations

Related by author

  1. Large-scale hierarchical k-means for heterogeneous many-core supercomputers

    Li, L., Yu, T., Zhao, W., Fu, H., Wang, C., Tan, L., Yang, G. & Thomson, J., 11 Nov 2018, Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC '18). Piscataway: IEEE Press, 11 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  2. Predicting and optimizing image compression

    Murashko, O., Thomson, J. D. & Leather, H., 1 Oct 2016, Proceedings of the 24th ACM International Conference on Multimedia. ACM, p. 665-669

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  3. Milepost GCC: Machine Learning Enabled Self-tuning Compiler

    Fursin, G., Kashnikov, Y., Memon, A., Chamski, Z., Temam, O., Namolaru, M., Yom-Tov, E., Mendelson, B., Zaks, A., Courtois, E., Bodin, F., Barnard, P., Ashton, E., Bonilla, E., Thomson, J. D., Williams, C. & O'Boyle, M., 2011, In : International Journal of Parallel Programming. 39, 3, p. 296-327 32 p.

    Research output: Contribution to journalArticle

  4. Automatic OpenCL device characterization: guiding optimized kernel design

    Thoman, P., Kofler, K., Studt, H., Thomson, J. D. & Fahringer, T., 2011, Euro-Par 2011 Parallel Processing: 17th International Conference, Euro-Par 2011, Bordeaux, France, August 29 - September 2, 2011, Proceedings, Part II. Berlin, Heidelberg: Springer-Verlag, p. 438-452 15 p. (Lecture Notes in Computer Science; vol. 6853/2011).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  5. Workload characterization supporting the development of domain-specific compiler optimizations using decision trees for data mining

    Fenacci, D., Franke, B. & Thomson, J., 2010, Proceedings of the 13th International Workshop on Software 38; Compilers for Embedded Systems. New York, NY, USA: ACM, p. 5:1-5:10 (SCOPES '10).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

ID: 256719143