TY - GEN
T1 - Thermoelectric-based sustainable self-cooling for fine-grained processor hot spots
AU - Lee, Soochan
AU - Pandiyan, Dhinakaran
AU - Seo, Jae-sun
AU - Phelan, Patrick
AU - Wu, Carole-Jean
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2016/7/20
Y1 - 2016/7/20
N2 - This article proposes to tackle the high chip temperature problem caused by a few thermal hot spots in a processor using a different approach. In addition to using cooling mechanisms, such as metal heat sinks or fans, and dynamic thermal/power management solutions, this work investigates a new, fine-grained spot cooling technique for processors using thermoelectric generators (TEGs) and coolers (TECs) at the processor microarchitecture level. The proposed approach reduces the temperature of hot spots with TECs that are powered by TEG-based processor waste heat harvesting. The approach is sustainable in that it does not require additional power input for spot cooling while being able to lower the total chip temperature. The design takes advantage of two critical observations of processor temperature distribution and reliability. First, the temperature difference between the hot spots and cooler areas is typically tens of degrees on modern processors. Second, the reliability of the entire chip is limited by the highest of the hot spot temperatures. This presents an opportunity such that if we can maintain the same chip reliability by keeping the hot spot temperature constant or lower while allowing the temperature of the cooler chip area to increase to, but not exceed, the hot spot temperature, we would be able to harvest the processor waste heat without degrading reliability. We also show that the harvested electrical energy can be used to lower the temperature of processor hot spots, therefore reducing the overall chip temperature and improving reliability. The temperature evaluation results based on an open source processor temperature simulation tool-HotSpot 5.02-show that the proposed technique can effectively lower the chip temperature by an average of 7.7°C across the benchmarks studied in this paper. This temperature reduction corresponds to 29% meantime-to-failure MTTFEM and 5.7% MTTFTDDB reliability improvements. The proposed approach is a novel, self-sustaining spot cooling technique that does not require any power input and can effectively reduce the overall chip temperature for improved chip reliability.
AB - This article proposes to tackle the high chip temperature problem caused by a few thermal hot spots in a processor using a different approach. In addition to using cooling mechanisms, such as metal heat sinks or fans, and dynamic thermal/power management solutions, this work investigates a new, fine-grained spot cooling technique for processors using thermoelectric generators (TEGs) and coolers (TECs) at the processor microarchitecture level. The proposed approach reduces the temperature of hot spots with TECs that are powered by TEG-based processor waste heat harvesting. The approach is sustainable in that it does not require additional power input for spot cooling while being able to lower the total chip temperature. The design takes advantage of two critical observations of processor temperature distribution and reliability. First, the temperature difference between the hot spots and cooler areas is typically tens of degrees on modern processors. Second, the reliability of the entire chip is limited by the highest of the hot spot temperatures. This presents an opportunity such that if we can maintain the same chip reliability by keeping the hot spot temperature constant or lower while allowing the temperature of the cooler chip area to increase to, but not exceed, the hot spot temperature, we would be able to harvest the processor waste heat without degrading reliability. We also show that the harvested electrical energy can be used to lower the temperature of processor hot spots, therefore reducing the overall chip temperature and improving reliability. The temperature evaluation results based on an open source processor temperature simulation tool-HotSpot 5.02-show that the proposed technique can effectively lower the chip temperature by an average of 7.7°C across the benchmarks studied in this paper. This temperature reduction corresponds to 29% meantime-to-failure MTTFEM and 5.7% MTTFTDDB reliability improvements. The proposed approach is a novel, self-sustaining spot cooling technique that does not require any power input and can effectively reduce the overall chip temperature for improved chip reliability.
KW - Processor hot spot mitigation
KW - chip reliability
KW - thermoelectric-based cooling
KW - waste heat harvesting
UR - http://www.scopus.com/inward/record.url?scp=84983233968&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84983233968&partnerID=8YFLogxK
U2 - 10.1109/ITHERM.2016.7517635
DO - 10.1109/ITHERM.2016.7517635
M3 - Conference contribution
T3 - Proceedings of the 15th InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems, ITherm 2016
SP - 847
EP - 856
BT - Proceedings of the 15th InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems, ITherm 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems, ITherm 2016
Y2 - 31 May 2016 through 3 June 2016
ER -