Ta 01 Sharma Crimmins Pres User
Ta 01 Sharma Crimmins Pres User
Ta 01 Sharma Crimmins Pres User
SNUG 2018 1
Agenda
Introduction
Design Information
Merged Switching Activity Power Reduction
Physically Aware Timing Closure
Final Leakage Recovery
Conclusions
SNUG 2018 2
Introduction
SNUG 2018 3
Introduction
VPU SOC
NCE
SNUG 2018 4
Introduction
– Applications
• Drones (Navigation / Tracking)
• Security (Cat or Cat-Burglar!)
• Mobile (Facial Recognition)
• AR/VR Headsets
SNUG 2018 5
Introduction
SNUG 2018 6
Introduction
– Neural Compute Engine (NCE) executes the compute intensive neural algorithms
• Speed of response is critical (Real-Time)
• Dynamic and Leakage power consumption critical (Always-On)
SNUG 2018 7
Introduction
DC DC DC DC
ECO LOOPS
PT
SNUG 2018 8
Introduction
ICC2 routed
Extracted database x 12
Parasitics x12
StdCell LEF
STAR-RC
STAR-RC
STAR-RC
STAR-RC
Macro LEF
ICC2 Partitions
MACRO and ICC2 Partitions
Netlist and ICC2 Partitions
Std-Cell DEF x 12 ICC2
LEFs
PRIMETIME Primetime
ECO
Scripts x 12
SNUG 2018 9
Introduction
• Goals
SNUG 2018 10
Glossary
SNUG 2018 11
Design Information
SNUG 2018 12
Design Information
SNUG 2018 13
Design Information
• Myriad X SOC
SNUG 2018 14
Design Information
• Myriad X SOC
– 12 physical partitions
– Replicated SHAVE block x16 (MIM)
– Multi-million instances
– 18 Signoff PVT corners with Functional / BIST / Scan stuck@ / Scan @speed modes
– Power optimization was targeted on the standalone NCE partition
– Timing Signoff and Timing ECO’s were performed on a flat top-level design
SNUG 2018 15
Merged Switching Activity Power Reduction
SNUG 2018 16
Merged Switching Activity Power Reduction
• Flow Overview
SNUG 2018 17
Merged Switching Activity Power Reduction
SNUG 2018 18
Merged Switching Activity Power Reduction
SNUG 2018 19
Merged Switching Activity Power Reduction
SNUG 2018 20
Merged Switching Activity Power Reduction
read_saif merged_nce.saif
Moves clock pin power onto the data network
update_power (clock network was don’t-touched for power
} recovery)
SNUG 2018 21
Merged Switching Activity Power Reduction
SNUG 2018 22
Merged Switching Activity Power Reduction
% Combinational Power Improvement Vs Base Design for most used Deep Neural
Network Test-cases in the Neural Compute Engine
70
63%
60
51%
50
38% 41%
40
31%
28%
30
20
11% 13%
10
0
Internal Power Switching Power Leakage Power Total Gain
SNUG 2018 23
Physically Aware Timing Closure
SNUG 2018 24
Physically Aware Timing Closure
• Advanced Features
SNUG 2018 25
Physically Aware Timing Closure
• Advanced Features
SNUG 2018 26
Physically Aware Timing Closure
• Advanced Features
Ideal for our replicated
– Enable MIM ECO support “SHAVE” physical design
SNUG 2018 27
Physically Aware Timing Closure
• DRC Fixing
set_app_var eco_insert_buffer_search_distance_in_site_rows 8
Controls buffer
fix_eco_drc -type max_transition -verbose \ insertion location
-physical_mode open_site \
-methods {insert_buffer} –buffer_list {LVT_BUF_X4 LVT_BUF_X5...} \
-cell_type combinational
SNUG 2018 28
Physically Aware Timing Closure
• Setup Fixing
SNUG 2018 29
Physically Aware Timing Closure
• Hold Fixing
Targeted path-group
fixing
fix_eco_timing –type hold -physical_mode open_site \
–methods {size_cell} \
-group {shave_2_shave ...} –pba_mode exhaustive \
-cell_type combinational
Low Leakage High Vt cells
used for buffer insertion
fix_eco_timing –type hold -physical_mode open_site \
–methods {insert_buffer} \
–buffer_list {HVT_BUF_X1 HVT_BUF_X2 HVT_DEL05_X1 HVT_DEL10_X2 ...} \
-group {shave_2_shave ...} –pba_mode exhaustive \
-cell_type combinational
SNUG 2018 30
Physically Aware Timing Closure
• Advanced Features
Import physical clock
information
– Clock network ECO (Useful skew)
SNUG 2018 31
Physically Aware Timing Closure
SNUG 2018 32
Physically Aware Timing Closure
• Advanced Features
SNUG 2018 33
Physically Aware Timing Closure
SNUG 2018 34
Physically Aware Timing Closure
• Additional Steps
SNUG 2018 35
Physically Aware Timing Closure
SNUG 2018 37
Final Leakage Recovery
• Leakage Recovery
SNUG 2018 38
Final Leakage Recovery
remote_execute {
set_user_attribute [get_lib_cells */*HVT20*] threshold_voltage_group HVT20
set_user_attribute [get_lib_cells */*HVT16*] threshold_voltage_group HVT16
set_user_attribute [get_lib_cells */*SVT20*] threshold_voltage_group SVT20
set_user_attribute [get_lib_cells */*SVT16*] threshold_voltage_group SVT16
set_user_attribute [get_lib_cells */*LVT20*] threshold_voltage_group LVT20
set_user_attribute [get_lib_cells */*LVT16*] threshold_voltage_group LVT16
set_user_attribute [get_lib_cells */*ULT20*] threshold_voltage_group ULT20
set_user_attribute [get_lib_cells */*ULT16*] threshold_voltage_group ULT16
}
SNUG 2018 39
Final Leakage Recovery
remote_execute {
set_app_var power_enable_analysis true
set_app_var power_analysis_mode averaged
set_app_var eco_alternative_area_ratio_threshold 1
define_user_attribute cell_footprint -class lib_cell -import -type string
set_app_var eco_alternative_cell_attribute_restrictions {cell_footprint}
}
SNUG 2018 40
Final Leakage Recovery
• Leakage Recovery
SNUG 2018 41
Final Leakage Recovery
• Leakage Recovery
SNUG 2018 42
Final Leakage Recovery
21.0% 21.9%
14.9% 16.2%
7.0%
10.7%
8.7% 9.3% 11.5%
10.7%
7.6% 7.6%
5.1% 5.9%
SNUG 2018 43
Conclusions
SNUG 2018 44
Conclusions
• >30% combinational power reduction in the NCE achieved using the merged
switching activity flow
• Faster ECO Turn-Around-Time (TAT) during Signoff phase with PT ECO loops
completed in under 6 hours and 90% of DRC / Setup and 98% of Hold
violations fixed with 7 ECO loops.
SNUG 2018 45
Thank You
SNUG 2018 46