Speed-up and efficiency of parallelized Monte Carlo integration on homogeneous and heterogeneous clusters

Abstract

We evaluate the performance of parallelized Monte Carlo integration algorithms on homogeneous and heterogeneous clusters of the Structure and Dynamics Laboratory. In this study, intrinsic pseudorandom number generators for Fortran and Python were used and parallelization achieved by MPI libraries. On an example using 10^9 samples, the parallelized Python code proved to be scalable on a cluster of up to 44 processors. The Fortran parallelized code performed less well on scalability but had a much shorter execution time. It was also observed that the overhead cost of parallelization saturates as the number of processors used increased.