Simplifying the parallelization of scientific codes by a function-centric approach in Python

Published in Computational Science & Discovery, 2010

Recommended citation: Jon K. Nilsen, Xiang Cai, Bjørn Høyland, and Hans Petter Langtangen (2010)."Simplifying the parallelization of scientific codes by a function-centric approach in Python."Computational Science & Discovery 3 (1) 1 - 25. https://doi.org/10.1088/1749-4699/3/1/015003

The purpose of this paper is to show how existing scientific software can be parallelized using a separate thin layer of Python code where all parallelization-specific tasks are implemented. We provide specific examples of such a Python code layer, which can act as templates for parallelizing a wide set of serial scientific codes. The use of Python for parallelization is motivated by the fact that the language is well suited for reusing existing serial codes programmed in other languages. The extreme flexibility of Python with regard to handling functions makes it very easy to wrap up decomposed computational tasks of a serial scientific application as Python functions. Many parallelization-specific components can be implemented as generic Python functions, which may take as input those wrapped functions that perform concrete computational tasks. The overall programming effort needed by this parallelization approach is limited, and the resulting parallel Python scripts have a compact and clean structure. The usefulness of the parallelization approach is exemplified by three different classes of application in natural and social sciences.

Download paper here