PyCOMPSs: Parallel computational workflows in Python

E Tejedor, Y Becerra, G Alomar… - … Journal of High …, 2017 - journals.sagepub.com
The International Journal of High Performance Computing …, 2017journals.sagepub.com
The use of the Python programming language for scientific computing has been gaining
momentum in the last years. The fact that it is compact and readable and its complete set of
scientific libraries are two important characteristics that favour its adoption. Nevertheless,
Python still lacks a solution for easily parallelizing generic scripts on distributed
infrastructures, since the current alternatives mostly require the use of APIs for message
passing or are restricted to embarrassingly parallel computations. In that sense, this paper …
The use of the Python programming language for scientific computing has been gaining momentum in the last years. The fact that it is compact and readable and its complete set of scientific libraries are two important characteristics that favour its adoption. Nevertheless, Python still lacks a solution for easily parallelizing generic scripts on distributed infrastructures, since the current alternatives mostly require the use of APIs for message passing or are restricted to embarrassingly parallel computations. In that sense, this paper presents PyCOMPSs, a framework that facilitates the development of parallel computational workflows in Python. In this approach, the user programs her script in a sequential fashion and decorates the functions to be run as asynchronous parallel tasks. A runtime system is in charge of exploiting the inherent concurrency of the script, detecting the data dependencies between tasks and spawning them to the available resources. Furthermore, we show how this programming model can be built on top of a Big Data storage architecture, where the data stored in the backend is abstracted and accessed from the application in the form of persistent objects.
Sage Journals
Showing the best result for this search. See all results