Run Time Actions
__mp_slave_wait_for_work()
__mp_simple_sched(par_loop)
__mp_wait_for_completion()
When the parallel threads are spawn:
- they spin in __mp_slave_wait_for_work()
Master enters parallel region:
- passes the address of the _mpdo subroutine to the slaves
- at the end of parallel region all threads synchronize
- slaves go back to spin in __mp_slave_wait_for_work()
- master continues with serial execution