Research output: Contribution to journal › Article › peer-review
Collaborative heterogeneity-aware OS scheduler for asymmetric multicore processors. / Yu, Teng; Zhong, Runxin; Janjic, Vladimir; Petoumenos, Pavlos; Zhai, Jidong; Leather, Hugh; Thomson, John Donald.
In: IEEE Transactions on Parallel and Distributed Systems, Vol. 32, No. 5, 01.05.2021, p. 1224-1237.Research output: Contribution to journal › Article › peer-review
}
TY - JOUR
T1 - Collaborative heterogeneity-aware OS scheduler for asymmetric multicore processors
AU - Yu, Teng
AU - Zhong, Runxin
AU - Janjic, Vladimir
AU - Petoumenos, Pavlos
AU - Zhai, Jidong
AU - Leather, Hugh
AU - Thomson, John Donald
N1 - Funding: This work is supported in part by the China Postdoctoral Science Foundation (Grant No. 2020TQ0169), the ShuiMu Tsinghua Scholar fellowship (2019SM131), National Key R&D Program of China (2020AAA0105200), National Natural Science Foundation of China (U20A20226), Beijing Natural Science Foundation (4202031), Beijing Academy of Artificial Intelligence BAAI), the UK EPSRC grants Discovery: Pattern Discovery and Program Shaping for Manycore Systems (EP/P020631/1). This work is also supported by the Royal Academy of Engineering under the Research Fellowship scheme.
PY - 2020/12/16
Y1 - 2020/12/16
N2 - Asymmetric multicore processors (AMP) offer multiple types of cores under the same programming interface. Extracting the full potential of AMPs requires intelligent scheduling decisions, matching each thread with the right kind of core, the core that will maximize performance or minimize wasted energy for this thread. Existing OS schedulers are not up to this task. While they may handle certain aspects of asymmetry in the system, none can handle all runtime factors affecting AMPs for the general case of multi-threaded multi-programmed workloads. We address this problem by introducing COLAB, a general purpose asymmetry-aware scheduler targeting multi-threaded multi-programmed workloads. It estimates the performance and power of each thread on each type of core and identifies communication patterns and bottleneck threads. With this information, the scheduler makes coordinated core assignment and thread selection decisions that still provide each application its fair share of the processor’s time.We evaluate our approach using both the GEM5 simulator on four distinct big.LITTLE configurations and a development board with ARM Cortex-A73/A53 processors and mixed workloads composed of PARSEC and SPLASH2 benchmarks. Compared to the state-of-the art Linux CFS and AMP-aware schedulers, we demonstrate performance gains of up to 25% and 5% to 15% on average,together with an average 5% energy saving depending on the hardware setup.
AB - Asymmetric multicore processors (AMP) offer multiple types of cores under the same programming interface. Extracting the full potential of AMPs requires intelligent scheduling decisions, matching each thread with the right kind of core, the core that will maximize performance or minimize wasted energy for this thread. Existing OS schedulers are not up to this task. While they may handle certain aspects of asymmetry in the system, none can handle all runtime factors affecting AMPs for the general case of multi-threaded multi-programmed workloads. We address this problem by introducing COLAB, a general purpose asymmetry-aware scheduler targeting multi-threaded multi-programmed workloads. It estimates the performance and power of each thread on each type of core and identifies communication patterns and bottleneck threads. With this information, the scheduler makes coordinated core assignment and thread selection decisions that still provide each application its fair share of the processor’s time.We evaluate our approach using both the GEM5 simulator on four distinct big.LITTLE configurations and a development board with ARM Cortex-A73/A53 processors and mixed workloads composed of PARSEC and SPLASH2 benchmarks. Compared to the state-of-the art Linux CFS and AMP-aware schedulers, we demonstrate performance gains of up to 25% and 5% to 15% on average,together with an average 5% energy saving depending on the hardware setup.
KW - Assymetric multicore processors
KW - Operating system
KW - Scheduling
KW - Performance model
KW - Energy efficiency
U2 - 10.1109/TPDS.2020.3045279
DO - 10.1109/TPDS.2020.3045279
M3 - Article
VL - 32
SP - 1224
EP - 1237
JO - IEEE Transactions on Parallel and Distributed Systems
JF - IEEE Transactions on Parallel and Distributed Systems
SN - 1045-9219
IS - 5
ER -
Research output: Contribution to journal › Article › peer-review
Research output: Contribution to journal › Article › peer-review
ID: 271737406