Reducing Register Ports Using Delayed Write-Back Queues And Operand Pre-Fetch

Nam Sung Kim, Trevor Mudge

(Paper #174)


Abstract

In high-performance wide-issue microprocessors the access time, energy and area of the register file are often critical to overall performance. This is because these pararmeters grow superlinearly as read and write ports are added to support wide-issue. This paper presents techniques to reduce the number of ports of a register file intended for a wide-issue microprocessor without noticeably impacting its IPC. Our results show that it is possible to replace a 16 read/8 write port file intended for an eight-issue processor with an 8 read/8 write port file so that the impact on IPC is around 1%. This is accomplished with the addition of some small auxiliary memory structures. Furthermore, the access time of the replacement (including the auxiliary structures) is such that if it were the critical path a 45-50% increase in clock speed would be possible. Finally, there is an energy per access savings of about 20% and an area savings of 40%, which has the potential for further savings by shortening global interconnect in the layout. An extension to the scheme that reduces the number of write ports from 8 to 6 is also presented. It suffers a modest penalty in terms of IPC, but shows further reduction in energy and area. Depending on implementation characteristics it could yield a further increase in performance.

Keywords:

Architecture