loading...
Effective Instruction Prefetching in Chip Multiprocessors for Modern Commercial Applications
San Francisco, California February 12-February 16
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/HPCA.2005.1311th International Symposium on High- ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Lawrence Spracklen, Sun Microsystems Inc., Sunnyvale, CA
Yuan Chou, Sun Microsystems Inc., Sunnyvale, CA
Santosh G. Abraham, Sun Microsystems Inc., Sunnyvale, CA
In this paper, we study the instruction cache miss behavior of four modern commercial applications (a database workload, TPC-W, SPECjAppServer2002 and SPECweb99). These applications exhibit high instruction cache miss rates for both the L1 and L2 caches, and a sizable performance improvement can be achieved by eliminating these misses.
We show that it is important, not only to address sequential misses, but also misses due to branches and function calls. As a result, we propose an efficient discontinuity prefetching scheme that can be effectively combined with traditional sequential prefetching to address all forms of instruction cache misses.
Additionally, with the emergence of chip multiprocessors (CMPs), instruction prefetching schemes must take into account their effect on the shared L2 cache. Specifically, aggressive instruction cache prefetching can result in an increase in the number of L2 cache data misses. As a solution, we propose a scheme that does not install prefetches into the L2 cache unless they are proven to be useful.
Overall, we demonstrate that the combination of our proposed schemes is successful in reducing the instruction miss rate to only 10%-16% of the original miss rate and results in a 1.08X-1.37X performance improvement for the applications studied.
Citation:
Lawrence Spracklen, Yuan Chou, Santosh G. Abraham, "Effective Instruction Prefetching in Chip Multiprocessors for Modern Commercial Applications," hpca, pp.225-236, 11th International Symposium on High-Performance Computer Architecture (HPCA'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.