

Optoelectronics Forum No. 74: Building STT-RAM Cache Architectures to Improve Chip Multiprocessor Performance

Source: Wuhan National Research Center for Optoelectronics    Published: December 12, 2016

Speaker: Prof. Chita R. Das, The Pennsylvania State University, USA

Time: July 22, 2013, 10:00-11:30



Chita Das is a Distinguished Professor of Computer Science and Engineering at the Pennsylvania State University. His main areas of interest include CMPs and manycore architectures, performance evaluation, fault-tolerant computing, and clouds/datacenters. In particular, he has worked extensively on the design and analysis of interconnection networks and on-chip interconnects. He has published more than 200 papers in these areas, has received several best paper awards, and has served on many program committees and editorial boards. He is a Fellow of the IEEE.





Spin-Transfer Torque RAM (STT-RAM) is an emerging non-volatile memory technology with many attractive characteristics, such as high density, low leakage, and low read access latency. However, a major drawback of STT-RAM is its long write latency, which has impeded its widespread adoption for on-chip caches in place of traditional SRAM-based caches. By adopting suitable mechanisms that minimize the latency overhead of STT-RAM writes, it is possible to design energy-efficient, high-density caches for CMPs.

In this talk, I will discuss two complementary techniques to mitigate the write overhead of STT-RAM. The first approach centers on an elegant network-level solution. It is based on the observation that, instead of staggering requests to a write-busy STT-RAM bank, the network should schedule requests to other idle cache banks, effectively hiding the write latency.
While the first approach attempts to hide the STT-RAM write latency, our second approach focuses on reducing it by tuning the data-retention time. We argue that by relaxing the non-volatility of STT-RAM so that its data-retention time is in the range of milliseconds, we can optimize the on-chip cache architecture for CMPs. The advantages of both techniques over an SRAM-based cache architecture will be discussed.
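
The intuition behind the first technique can be illustrated with a small toy model. The sketch below is not the mechanism presented in the talk; it only compares a strictly in-order issue policy with one that lets requests to idle banks bypass a request stuck behind a write-busy bank. The latencies (`READ_CYCLES`, `WRITE_CYCLES`), the one-request-per-cycle issue limit, and the `simulate` function are all assumptions made for illustration.

```python
# Toy model: all requests arrive at cycle 0, at most one is issued per cycle.
READ_CYCLES = 3    # assumed read latency, for illustration only
WRITE_CYCLES = 30  # assumed (much longer) write latency, for illustration only


def simulate(requests, bank_aware):
    """Return the completion cycle of each request, in arrival order.

    requests:   list of (bank, is_write) tuples in arrival order.
    bank_aware: if False, a request to a busy bank blocks everything behind it;
                if True, the oldest request whose bank is idle is issued instead.
    """
    pending = list(enumerate(requests))
    busy_until = {}                    # bank -> cycle at which it becomes idle
    done_at = [0] * len(requests)
    cycle = 0
    while pending:
        for pos, (req_id, (bank, is_write)) in enumerate(pending):
            if busy_until.get(bank, 0) <= cycle:
                latency = WRITE_CYCLES if is_write else READ_CYCLES
                busy_until[bank] = cycle + latency
                done_at[req_id] = cycle + latency
                del pending[pos]       # safe: we leave the loop immediately
                break
            if not bank_aware:
                break                  # in-order policy: head of queue is blocked
        cycle += 1
    return done_at


if __name__ == "__main__":
    # One write to bank 0, then reads to banks 0, 1 and 2.
    reqs = [(0, True), (0, False), (1, False), (2, False)]
    print(simulate(reqs, bank_aware=False))  # [30, 33, 34, 35]
    print(simulate(reqs, bank_aware=True))   # [30, 33, 4, 5]
```

In this toy setting, the reads to banks 1 and 2 complete at cycles 4 and 5 instead of 34 and 35 once they are allowed to bypass the read waiting on the write-busy bank 0, while the read to bank 0 itself is not delayed.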