Collective Communication
Some deficiencies with our parallel trapezoidal rule program
Idle processors during the broadcast of input data and summation of the local results (global reduction)
O(P) communication cycles
Can we improve upon this?
Previous slide
Next slide
Back to first slide
View graphic version