Power Modeling and Optimization for GPGPUs

Li, Zhi

dc.contributor.advisor	Fu, Xin
dc.contributor.author	Li, Zhi
dc.date.accessioned	2013-07-14T15:44:36Z
dc.date.available	2013-07-14T15:44:36Z
dc.date.issued	2013-05-31
dc.date.submitted	2013
dc.identifier.other	http://dissertations.umi.com/ku:12738
dc.identifier.uri	http://hdl.handle.net/1808/11465
dc.description.abstract	Modern graphics processing units (GPUs) supports tens of thousands of parallel threads and delivers remarkably high computing throughput. General-Purpose computing on GPUs (GPGPUs) is becoming the attractive platform for general-purpose applications that request high computational performance such as scientific computing, financial applications, medical data processing, and so on. However, GPGPUs is facing severe power challenge due to the increasing number of cores placed on a single chip with decreasing feature size. In order to explore the power optimization techniques in GPGPUs, I first build a power model for GPGPUs, which is able to estimate both dynamic and leakage power of major microarchitecture structures in GPGPUs. I then target on the power-hungry structures (e.g. register file) to explore the energy-efficient GPGPUs. In order to hide the long latency operations, GPGPUs employs the fine-grained multi-threading among numerous active threads, leading to the sizeable register files with massive power consumption. The conventional method to reduce dynamic power consumption is the supply voltage scaling. And the inter-bank tunneling FETs (TFETs) is the promising candidate compared to CMOS for low voltage operations regarding to both leakage and performance. However, always executing at the low voltage will result in significant performance degradation. In this study, I propose the hybrid CMOS-TFET based register file and allocate TFET-based registers to threads whose execution progress can be delayed to some degree to avoid the memory contentions with other threads to reduce both dynamic and leakage power, and the CMOS-based registers are still used for threads requiring normal execution speed. My experimental results show that the proposed technique achieves 30% energy (including both dynamic and leakage) reduction in register files with negligible performance degradation compared to the baseline case equipped with naive power optimization technique.
dc.format.extent	56 pages
dc.language.iso	en
dc.publisher	University of Kansas
dc.rights	This item is protected by copyright and unless otherwise specified the copyright of this thesis/dissertation is held by the author.
dc.subject	Computer engineering
dc.subject	General-purpose computing on graphics processing units
dc.subject	Memory contention
dc.subject	Register file
dc.subject	Tunneling field effect transistors
dc.title	Power Modeling and Optimization for GPGPUs
dc.type	Thesis
dc.contributor.cmtemember	Fu, Xin
dc.contributor.cmtemember	Minden, Gary J.
dc.contributor.cmtemember	Kulkarni, Prasad
dc.thesis.degreeDiscipline	Electrical Engineering & Computer Science
dc.thesis.degreeLevel	M.E.
kusw.oastatus	na
kusw.oapolicy	This item does not meet KU Open Access policy criteria.
kusw.bibid	8085705
dc.rights.accessrights	openAccess

Files in this item

Name:: Li_ku_0099M_12738_DATA_1.pdf
Size:: 415.9Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

The University of Kansas prohibits discrimination on the basis of race, color, ethnicity, religion, sex, national origin, age, ancestry, disability, status as a veteran, sexual orientation, marital status, parental status, gender identity, gender expression and genetic information in the University’s programs and activities. The following person has been designated to handle inquiries regarding the non-discrimination policies: Director of the Office of Institutional Opportunity and Access, IOA@ku.edu, 1246 W. Campus Road, Room 153A, Lawrence, KS, 66045, (785)864-6414, 711 TTY.