Abstract
Software managed scratchpad memories (SPMs) provide improved performance and power in embedded processors by reducing required hardware resources. Performance depends strongly on the scheme used to map code and data onto the SPM, but generating optimal mappings can be extremely difficult. Here we address instruction mapping on SPMs and present a performance model and algorithm, "Code Overlay Generator" (COG), for producing high performance dynamic SPM code mappings. Our heuristic does not require profiling information, and is suitable for generating mapping solutions for large programs which are otherwise infeasible using previously proposed Integer Linear Programming (ILP) techniques. We compare our algorithm with a published heuristic and the code overlay mapping algorithm provided with the Cell Broadband Engine (CBE) Synergistic Processing Unit (SPU) compiler from IBM, spu-gcc. We find an average performance advantage of 34% compared to the previous algorithm, and 87% with respect to spu-gcc. We additionally show that our performance model enables improved tools for offline evaluation of code overlay performance and mapping selection.
Original language | English (US) |
---|---|
Title of host publication | 2010 IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, CODES+ISSS 2010 |
Pages | 287-296 |
Number of pages | 10 |
State | Published - 2010 |
Event | 8th IEEE/ACM International Conference on Hardware/Software-Co-Design and System Synthesis, CODES+ISSS 2010 - Scottsdale, AZ, United States Duration: Oct 24 2010 → Oct 29 2010 |
Other
Other | 8th IEEE/ACM International Conference on Hardware/Software-Co-Design and System Synthesis, CODES+ISSS 2010 |
---|---|
Country/Territory | United States |
City | Scottsdale, AZ |
Period | 10/24/10 → 10/29/10 |
Keywords
- Cell Broadband Engine
- Code Mapping
- Code Overlay
- Compiler
- Embedded Systems
- Scratchpad Memory
ASJC Scopus subject areas
- Hardware and Architecture
- Software
- Electrical and Electronic Engineering