Abstract
Efficient generation of communication sets and local index sets is important for the evaluation of array expressions in scientific languages such as Fortran-90 and High Performance Fortran implemented on distributed-memory machines. We show that for arrays affinely aligned with templates that are distributed across multiple processors with a block-cyclic distribution, the local memory access sequence and communication sets can be efficiently enumerated using closed forms. First, closed-form solutions are presented for arrays that are aligned with identity templates distributed using block or cyclic distributions. These closed forms are then used with a virtual processor approach to give an efficient solution for arrays with block-cyclic distributions. These results are extended to arrays affinely aligned to arbitrary templates that have regular distributions. We present performance results on an iPSC/860 processor that demonstrate the low runtime overhead of this scheme.
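To make the block-cyclic setting concrete, the following is a minimal sketch of the standard index mapping for a cyclic(k) distribution over P processors, together with a brute-force enumeration of the local indices touched by a regular section. It is not the closed-form method of the paper; it is only a reference against which such closed forms could be checked. The function names, the 0-based indexing, and the inclusive upper bound are assumptions made for illustration.

```python
def owner_and_local(i, k, P):
    """Map a 0-based global index i under a cyclic(k) distribution on P processors.

    Returns (owner processor, local index on that processor).
    k = 1 gives a pure cyclic distribution; k = ceil(N / P) gives a block
    distribution for an array of N elements.
    """
    block = i // k                      # which block of size k holds element i
    owner = block % P                   # blocks are dealt to processors round-robin
    local = (block // P) * k + i % k    # earlier local blocks, plus offset in block
    return owner, local


def local_accesses(lower, upper, stride, k, P, p):
    """Brute-force reference (assumed helper, not the paper's closed form).

    Lists, in access order, the local indices on processor p touched by the
    section lower:upper:stride (upper inclusive, stride > 0).
    """
    touched = []
    for i in range(lower, upper + 1, stride):
        owner, local = owner_and_local(i, k, P)
        if owner == p:
            touched.append(local)
    return touched
```

The brute-force loop visits every element of the section on every processor, which is exactly the cost that a closed-form enumeration is meant to avoid.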
Original language | English (US) |
---|---|
Article number | 4134228 |
Pages (from-to) | 301-305 |
Number of pages | 5 |
Journal | Proceedings of the International Conference on Parallel Processing |
Volume | 2 |
State | Published - 1993 |
Externally published | Yes |
Event | 1993 International Conference on Parallel Processing, ICPP 1993 - Syracuse, United States |
Duration | Aug 16 1993 → Aug 20 1993 |
ASJC Scopus subject areas
- Software
- General Mathematics
- Hardware and Architecture