片上多核处理器共享末级缓存动静结合地址映射机制

计算机科学 ›› 2012, Vol. 39 ›› Issue (8): 304-310.

片上多核处理器共享末级缓存动静结合地址映射机制

曹非，刘志勇

(西北工业大学计算机学院西安710072)；(中国科学院计算技术研究所前澹研究中心北京100190)

出版日期:2018-11-16 发布日期:2018-11-16

Combined Method of Dynamic and Static Address Mapping for Shared Last Level Cache of CMP

Online:2018-11-16 Published:2018-11-16

摘要/Abstract

摘要： 片上多核处理器(CMP)通常采用私有或者共享的末级高速缓存(cache)结构，而共享末级cache一般使用静态地址映射机制。该机制将各处理器临时私有访问的数据映射于分布在其他处理器的末级cache中，使得各处理器对临时私有数据的访问延时增加。针对该问题，提出了一种动静结合的共享末级cache地址映射方法。该方法可将原来静态映射于其他处理器末级cache中的临时私有数据动态映射于访问者处理器的本地末级cache中，减少了大量静态映射所造成的长延时非本地末级cache访问，从而有效降低了整个共享末级cache的访问延时，在提高性能的同时降低了功耗和带宽使用。实验结果表明，动静结合的地址映射方式应用于采用环连接互连结构和侦听顺序环协议的CMP结构时，可获得的平均性能提升为9%，最大性能提升为38%。

关键词: 片上多核处理器，共享末级高速缓存，地址映射机制，环，侦听顺序环协议

Abstract: The shared last level cache(LLC) of CMP often uses a static address mapping method. This method may map some processor's temporary private data to other processor's last level cache. The processor needs longer access latency on these data than on the data mapped to local. This paper proposed a combined method of dynamic and static address mapping. The method can map most temporary data to their accessing processor's cache, so that these data's access latency can be reduced to loca lLLC access latency and the power and bandwidth of interconnection wasted for these data are saved. The experiment results show that the combined method of static and dynamic address mapping used in a CMP with a ring interconnection and SOR cache coherence protocol can obtain average performance increase of 9%，and the maximum is 38%.

Key words: CMP, Shared last level cache, Address mapping method, Ring, SOR cache coherence protocol

曹非，刘志勇. 片上多核处理器共享末级缓存动静结合地址映射机制[J]. 计算机科学, 2012, 39(8): 304-310. https://doi.org/

参考文献

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed