 |
We obtained a maximum performance of 200 MIOPS in MPI
and 400 MIOPS id OpenMP. This result shows the nature of shared memory
of the machine. We can see that the performance in OpenMP is the double
respect to MPI. |
 |
The performance of the
system using MPI scales up to 16 processes and then decreases, but using OpenMP it
scales up to 32 and remains constant. |
 |
Using MPI
we obtained a maximum speedup of 9, and using OpenMP we obtained a maximum
speedup of 16.
|
 |
The maximum SWaP in MPI was 0.33 and the maximum SWaP
in OpenMP was 0.66. |
 |
We can see the impact in performance of
communication time overhead.
Because of the operations involved in the sparse matrix
multiplication the processing units spend time in the
synchronization of the computation. This communication overhead
is the responsible of the decease of the performance compared to
dense matrix multiplication. |