OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li,...
-
Upload
elisa-rackley -
Category
Documents
-
view
212 -
download
0
Transcript of OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li,...
![Page 1: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/1.jpg)
OS-aware TuningImproving Instruction Cache
Energy Efficiency on System Workloads
Authors : Tao Li, John, L.K.Published in : Performance, Computing, and Communications
Conference, 2006. IPCCC 2006. 25th IEEE InternationalDate of Conference : 10-12 April 2006
第 31 組 9962111 邱鎮浩 , 9962306 蘇育毅 , 9962145 吳修億
![Page 2: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/2.jpg)
1.Introduction 2.Experimental Methodology 3.User/OS I-Cache Accesses Characterization 4.OS-aware I-Cache Tuning 5.Power and Performance Evaluation 6.Conclusions
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 302
Outline
![Page 3: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/3.jpg)
Dynamic power dissipation Due to charging and discharging highly
capacitive bit lines and sense amps Static power dissipation
電晶體越做越小 -> 自行導通
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 303
1.Introduction
![Page 4: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/4.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 304
半導體規格影響耗能
![Page 5: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/5.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 305
15 個 app 耗能表
![Page 6: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/6.jpg)
Dynamic 省電 : OS-aware cache way lookup
Static 省電 : OS-aware cache set drowsy mode
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 306
![Page 7: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/7.jpg)
Modified vision of SoftWatt SimOS OS: SGI-IRI 5.3 商業版 硬體 : 8-issue superscalar processor
split L1 Instruct. ,data cache unified L2 cache memory
實測 15 個 application
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 307
2.Experimental Methodology
![Page 8: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/8.jpg)
Vortex : database manipulation code Gcc : compiler code Sendmail : SMTP, 1KB~1.5MB Fileman : cp, rm, chmod, tar –cvf… …… OS activity ranges from 6% in compress,92%
in fileman
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 308
15 個 application
![Page 9: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/9.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 309
3.User/OS I-Cache Accesses Characterization
Dual mode
OS instruc.User instruc.
![Page 10: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/10.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3010
![Page 11: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/11.jpg)
A conventional 4-way set associative cache requires four tag comparisons and four data array read-outs for a cache access.
Nevertheless, during user execution, performing tag comparisons and data array read-outs for OS cache lines are unnecessary and waste extra dynamic power.
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 11 / 30
OS-aware Cache Way Lookup
![Page 12: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/12.jpg)
Cache way mode bit
Processor status register(PSR)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 12 / 30
OS-aware Cache Way Lookup
![Page 13: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/13.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 13 / 30
OS-aware Cache Way Lookup
![Page 14: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/14.jpg)
Once generated, they remain unchanged unless a cache line replacement.
The processor switches mode.
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 14 / 30
OS-aware Cache Way Lookup
![Page 15: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/15.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 15 / 30
OS-aware Cache Way Lookup
![Page 16: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/16.jpg)
Due to CMOS technology scaling, static power due to leakage current is gaining in importance in I-cache power dissipation.
These make efforts at leakage control essential to maintain control of I-cache power on current and next generations of processors.
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 16 / 30
OS-aware Cache Set Drowsy Mode
![Page 17: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/17.jpg)
State-preserving drowsy cache techniques.(L1)
Gated-Vdd technique.(L2)
Cache way mode bit
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 17 / 30
OS-aware Cache Set Drowsy Mode
![Page 18: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/18.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 18 / 30
OS-aware Cache Set Drowsy Mode
![Page 19: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/19.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 19 / 30
OS-aware Cache Set Drowsy Mode
![Page 20: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/20.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 20 / 30
OS-aware Cache Set Drowsy Mode
![Page 21: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/21.jpg)
Access-biased classification.
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 21 / 30
OS-aware Cache Set Drowsy Mode
![Page 22: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/22.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 22 / 30
OS-aware Cache Set Drowsy Mode
![Page 23: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/23.jpg)
使用 OS-aware cache way lookup 之後, dynamic power 省電的比例 user execution 能省下 29% OS execution 能省下 30%
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3023
5.Power and Performance Evaluation
合計共能省下 30% 的 dynamic power
![Page 24: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/24.jpg)
dynamic power 的省電與否與 cache way access 的減少有關聯
OS-aware cache way lookup 能省下約 30% 的dynamic power 而不影響其效能 power 而不影響其效能
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3024
5.Power and Performance Evaluation
![Page 25: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/25.jpg)
經測試之後 Residency-based drowsy mode scheme 能省下
約 5% - 50% 的 leakage power
Access-based drowsy mode scheme 藉由將部分的 cache 設成 drowsy state 以減少更多的 leakage power ,平均約能省下 37% 的 leakage power
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3025
5.Power and Performance Evaluation
![Page 26: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/26.jpg)
OS-aware cache set drowsy mode method 執行時間會略高於正常值( <1% )因: 誤將要用到的 cache set 設成 drowsy mode 的花費
相對較低 套用 cache set drowsy policies 能讓存取 drowsy
cache 的機率大幅降低
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3026
5.Power and Performance Evaluation
![Page 27: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/27.jpg)
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3027
![Page 28: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/28.jpg)
為了提升 I-cache 的用電效率,我們將從 user/OS I-cache 在系統執行時的存取方式著手,在 dual-mode operation 中觀察其省電能力
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3028
6.Conclusions
![Page 29: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/29.jpg)
OS-aware cache way lookup 藉由減少 parallel tag comparison 以及 data array readout 來降低 dynamic power 消耗
OS-aware tuning 能有效地減少因 CMOS 製程技術進步產生的額外 static power 消耗
OS-aware tuning 能降低 dynamic power 和static power 的消耗的目的,且只需做少許的硬體元件更動就能達成。
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3029
6.Conclusions
![Page 30: OS-aware Tuning Improving Instruction Cache Energy Efficiency on System Workloads Authors : Tao Li, John, L.K. Published in : Performance, Computing, and.](https://reader035.fdocuments.us/reader035/viewer/2022070306/5518a254550346c31f8b491a/html5/thumbnails/30.jpg)
Thanks for your listening
9962111 邱鎮浩 9962306 蘇育毅 9962145 吳修億 / 3030