Improving Collective I/O Performance Using Non-volatile Memory Devices

Title: Improving Collective I/O Performance Using Non-volatile Memory Devices
Publication Type: Journal Article
Year of Publication: 2016
Authors
Journal: 2016 IEEE International Conference on Cluster Computing (CLUSTER)
Pagination: 120
ISSN: 978-1-5090-3653-0
Abstract

Collective I/O is a parallel I/O technique designed to deliver high-performance data access to scientific applications running on high-end computing clusters. In collective I/O, write performance is highly dependent on the storage system response time and is limited by the slowest writer. The storage system response time, in conjunction with the need for global synchronisation during every round of data exchange and write, severely impacts collective I/O performance. Future exascale systems will have an increasing number of processor cores, while the number of storage servers will remain relatively small. Therefore, the storage system concurrency level will further increase, worsening the global synchronisation problem. Nowadays, high performance computing nodes also have access to locally attached solid state drives, effectively providing an additional tier in the storage hierarchy. Unfortunately, this tier is not always fully integrated. In this paper we propose a set of MPI-IO hint extensions that enable users to take advantage of fast, locally attached storage devices to boost collective I/O performance by increasing parallelism and reducing the impact of global synchronisation in the ROMIO implementation. We demonstrate that using local storage resources can greatly improve collective write performance compared to using only the global parallel file system, but that performance can also decrease if the ratio between aggregators and compute nodes is too small.
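The abstract's point that collective writes are "limited by the slowest writer" in every synchronised round can be illustrated with a toy cost model. The sketch below is not the paper's method or ROMIO code; the function names and the latency table are hypothetical, and it assumes each aggregator's write time per round is known in advance. It compares a schedule with a global barrier after every round against an idealised one in which aggregators proceed independently and synchronise only once at the end.

```python
def synchronised_time(latencies):
    """Total time when every round ends with a global barrier.

    latencies[r][a] is the (hypothetical) write time of aggregator a
    in round r. Each round costs as much as its slowest writer.
    """
    return sum(max(round_latencies) for round_latencies in latencies)


def decoupled_time(latencies):
    """Total time when aggregators run independently and only
    synchronise once, after their last round (idealised assumption)."""
    per_aggregator = [sum(column) for column in zip(*latencies)]
    return max(per_aggregator)


# Three rounds, three aggregators; a different aggregator is slow each round.
example = [
    [1, 4, 2],
    [3, 1, 1],
    [2, 2, 5],
]

print(synchronised_time(example))  # 4 + 3 + 5 = 12
print(decoupled_time(example))     # max(6, 7, 8) = 8
```

In this model the per-round barrier can never be faster than the decoupled schedule, because the sum of per-round maxima is at least the maximum of per-aggregator sums. The gap grows when the slow writer changes from round to round, which is the situation the abstract attributes to a highly concurrent, shared storage system.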

DOI: 10.1109/CLUSTER.2016.37