An implementation of a plasma physics application for distributed-memory supercomputers using a directive-based programming framework
Resumen
To extract performance from supercomputers, programmers in the High Performance Computing (HPC) community are often required to use a combination of frameworks to take advantage of the multiple levels of parallelism. However, over the years, efforts have been made to simplify this situation by creating frameworks that can take advantage of multiple levels. This often means that the programmer has to learn a new library. On the other hand, there are frameworks that were created by extending the capabilities of established paradigms. In this paper, we explore one of this libraries, OpenMP Cluster. As its name implies, it extends the OpenMP API, which allows seasoned programmers to take advantage of their experience to use just one API to program in sharedmemory and distributed-memory parallelism. In this paper, we took an existing plasma physics code that was programmed with MPI+OpenMP and ported it over to OpenMP Cluster. We also show that under certain conditions, the performance of OpenMP Cluster is similar to that of the MPI+OpenMP code.
Referencias bibliográficas
Allmann-Rahn, F., Lautenbach, S., Deisenhofer, M., & Grauer, R. (2024, March). The muphyII Code: Multiphysics Plasma Simulation on Large HPC Systems. Computer Physics Communications, 296, 109064. doi:https://doi.org/10.1016/j.cpc.2023.109064
Choi, J. Y., Chang, C.-S., Dominski, J., Klasky, S., Merlo, G., Suchyta, E., . . . Wood, C. (2018). Coupling Exascale Multiphysics Applications: Methods and Lessons Learned. 2018 IEEE International Conference on e-Science and Grid Computing (pp. 442-452). Amsterdam, Netherlands: IEEE. doi:10.1109/eScience.2018.00133
Coto-Vílchez, F., Vargas, V. I., Solano-Piedra, R., Rojas-Quesada, M. A., Araya-Solano, L. A., Ramírez, A. A., . . . Arias, S. (2020, July 8). Progress on the small modular stellarator SCR-1: new diagnostics and heating scenarios. Journal of Plasma Physics, 86(4), 815860401. doi:10.1017/S0022377820000677
Di Francia Rosso, P. H., & Francesquini, E. (2022). OCFTL: An MPI Implementation-Independent Fault Tolerance Library for Task-Based Applications. In I. Gitler, C. J. Barrios Hernández, & M. Esteban (Ed.), High Performance Computing. 8th Latin American Conference, CARLA 2021, Guadalajara, Mexico, October 6–8, 2021, Revised Selected Papers. 1540, pp. 131-147. Springer, Cham. doi:10.1007/978-3-031-04209-6_10
Jiménez, D., Campos-Duarte, L., Solano-Piedra, R., Araya-Solano, L. A., Meneses, E., & Vargas, I. (2020). BS-SOLCTRA: Towards a Parallel Magnetic Plasma Confinement Simulation Framework for Modular Stellarator Devices. In J. L. Crespo-Mariño, & E. Meneses-Rojas (Ed.), High Performance Computing. 6th Latin American Conference, CARLA 2019, Turrialba, Costa Rica, September 25–27, 2019, Revised Selected Papers. 1087, pp. 33-48. Springer, Cham. doi:10.1007/978-3-030-41005-6_3
Jiménez, D., Herrera-Mora, J., Rampp, M., Laure, E., & Meneses, E. (2022). Implementing a GPU-Portable Field Line Tracing Application with OpenMP Offload. In P. Navaux, C. J. Barrios H, C. Osthoff, & G. Guerrero (Ed.), High Performance Computing. 9th Latin American Conference, CARLA 2022, Porto Alegre, Brazil, September 26–30, 2022, Revised Selected Papers (pp. 31-46). Springer International Publishing. doi:10.1007/978-3-031-23821-5_3
Jiménez, D., Meneses, E., & Vargas, V. I. (2021, July 17). Adaptive Plasma Physics Simulations: Dealing with Load Imbalance using Charm++. PEARC '21: Practice and Experience in Advanced Research Computing. Article No. 3, pp. 1-8. New York, NY, USA: Association for Computing Machinery. doi:10.1145/3437359.3465566
Topcuoglu, H., Hariri, S., & Wu, M.-Y. (2002, March). Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Transactions on Parallel and Distributed Systems, 13(3), 260-274. doi:10.1109/71.993206
Yviquel, H., Pereira, M., Francesquini, E., Valarini, G., Leite, G., Rosso, P., . . . Araujo, G. (2023, January). The OpenMP Cluster Programming Model. ICPP Workshops '22: Workshop Proceedings of the 51st International Conference on Parallel Processing. Article No. 17, pp. 1-11. Bordeaux, France: Association for Computing Machinery. doi:10.1145/3547276.3548444
Descargas
Derechos de autor 2024 Revista Colombiana de Computación
Esta obra está bajo una licencia internacional Creative Commons Atribución-NoComercial-CompartirIgual 4.0.