Rate-based thermal, power, and co-location aware resource management for heterogeneous data centers

Academic Article

Abstract

  • © 2017 Elsevier Inc. Today's data centers contain large numbers of compute nodes that require substantial power, and therefore require a large amount of cooling resources to operate at a reliable temperature. The high power consumption of the computing and cooling systems produces extraordinary electricity costs, requiring some data center operators to be constrained by a specified electricity budget. In addition, the processors within these systems contain a large number of cores with shared resources (e.g., last-level cache), heavily affecting the performance of tasks that are co-located on cores and contend for these resources. This problem is only exacerbated as processors move to the many-core realm. These issues lead to interesting performance-power tradeoffs; by considering resource management in a holistic fashion, the performance of the computing system can be maximized while satisfying power and temperature constraints. In this work, the performance of the system is quantified as the total reward earned from completing tasks by their individual deadlines. By designing three resource allocation techniques, we perform a rigorous analysis on thermal, power, and co-location aware resource management using two different facility configurations, three different workload environments, and a sensitivity analysis of the power and thermal constraints.
  • Authors

    Digital Object Identifier (doi)

    Author List

  • Oxley MA; Jonardi E; Pasricha S; Maciejewski AA; Siegel HJ; Burns PJ; Koenig GA
  • Start Page

  • 126
  • End Page

  • 139
  • Volume

  • 112