How do we know which model is better to use? It seems like the data-parallel model is better if the forall loop can be parallelized, since there's no overhead from barriers or mutexes?
Please log in to leave a comment.
How do we know which model is better to use? It seems like the data-parallel model is better if the forall loop can be parallelized, since there's no overhead from barriers or mutexes?