A platform for research: civil engineering, architecture and urbanism
To distribute or not to distribute: The question of load balancing for performance or energy
Heterogeneous systems are nowadays a common choice in the path to Exascale. Through the use of accelerators they offer outstanding energy efficiency. The programming of these devices employs the host-device model, which is suboptimal as CPU remains idle during kernel executions, but still consumes energy. Making the CPU contribute computin effort might improve the performance and energy consumption of the system. This paper analyses the advantages of this approach and sets the limits of when its beneficial. The claims are supported by a set of models that determine how to share a single data-parallel task between the CPU and the accelerator for optimum performance, energy consumption or efficiency. Interestingly, the models show that optimising performance does not always mean optimum energy or efficiency as well. The paper experimentally validates the models, which represent an invaluable tool for programmers when faced with the dilemma of whether to distribute their workload in these systems. ; This work has been supported by the University of Cantabria (CVE-2014-18166), the Spanish Science and Technology Commission (TIN2016-76635-C2-2-R), the European Research Council (G.A. No 321253) and the European HiPEAC Network of Excellence. The Mont-Blanc project has received funding from the European Unions Horizon 2020 research and innovation programme under grant agreement No 671697. ; Peer Reviewed ; Postprint (author's final draft)
To distribute or not to distribute: The question of load balancing for performance or energy
Heterogeneous systems are nowadays a common choice in the path to Exascale. Through the use of accelerators they offer outstanding energy efficiency. The programming of these devices employs the host-device model, which is suboptimal as CPU remains idle during kernel executions, but still consumes energy. Making the CPU contribute computin effort might improve the performance and energy consumption of the system. This paper analyses the advantages of this approach and sets the limits of when its beneficial. The claims are supported by a set of models that determine how to share a single data-parallel task between the CPU and the accelerator for optimum performance, energy consumption or efficiency. Interestingly, the models show that optimising performance does not always mean optimum energy or efficiency as well. The paper experimentally validates the models, which represent an invaluable tool for programmers when faced with the dilemma of whether to distribute their workload in these systems. ; This work has been supported by the University of Cantabria (CVE-2014-18166), the Spanish Science and Technology Commission (TIN2016-76635-C2-2-R), the European Research Council (G.A. No 321253) and the European HiPEAC Network of Excellence. The Mont-Blanc project has received funding from the European Unions Horizon 2020 research and innovation programme under grant agreement No 671697. ; Peer Reviewed ; Postprint (author's final draft)
To distribute or not to distribute: The question of load balancing for performance or energy
Stafford, Esteban (author) / Pérez, Borja (author) / Bosque, Jose L. (author) / Beivide Palacio, Ramon (author) / Valero Cortés, Mateo (author) / Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors / Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
2017-01-01
Miscellaneous
Electronic Resource
English
DDC:
690
Katcon to distribute automotive composites
British Library Online Contents | 2016
ECC to distribute Swancor resins
British Library Online Contents | 2009
VELOX to distribute heat stabilizers
British Library Online Contents | 2016
Velox to distribute glass fiber reinforcements
British Library Online Contents | 2018
Matrix Composites to distribute core materials
British Library Online Contents | 2016