The
Metric of Scale: Real-World Experiences
via Infrastructural
Software
Everyone involved in Grand Challenge HPC is concerned with the
metric of scale. Application developers covet linear speedup as a function of
processor count. System vendors are driven by scale in the development of
next-generation architectures. Interconnect vendors aim to maximize bandwidth
while minimizing latency irrespective of scale. Programming models, plus the
compilers and tools that support them, are also designed with scale in mind.
The case is no different for infrastructural software: Scale is a gating
factor. Drawing on real-world experience in managing shared and
distributed-memory applications, illustrations of the incremental and
monumental impact of scale is demonstrated. Although these capability-oriented
workloads are very effective in illustrating the challenge of scale for
infrastructural software, there is another class of workloads that manifest
scale quite differently. With the focus on parallel processing, application
throughput exposes the metric of scale for capacity-oriented workloads. Again
drawing on real-world experience, incremental and monumental ramifications of
capacity-driven scale are presented. With respect to infrastructural software,
scale has been a shrewd motivator in the past, and will continue to create
challenges and opportunities in the future.