Building Better IP With RTL Architect NoC IP Physical Exploration by Arteris
Building Better IP With RTL Architect NoC IP Physical Exploration by Arteris
Building Better IP With RTL Architect NoC IP Physical Exploration by Arteris
¹Management estimates
CPU Subsystem Domain-specific Subsystems Machine Learning Subsystem Safety Island / Safety-
Accelerator Subsystem(s) Critical Subsystem
Application-specific IP Subsystem
CPU CPU
Accel DSP IP SRAM
DSU (L3 Cache) DSU (L3 Cache) Arteris Ncore Arteris FlexNoC Arteris FlexNoC AI Package Arteris FlexNoC
Resilience (Safety)
Memory Subsystem High Speed Wired Peripherals Wireless Subsystem Security Subsystem I/O Peripherals
Arteris Ncore® cache coherent interconnect IP Arteris FlexNoC® non-coherent interconnect IP Arteris CodaCache® last level cache
SNUG SILICON VALLEY 2023 3
Arteris Technology
– Power, Performance, and Area (PPA) factors impacting NoC and SoC iterations
– Impacts exacerbated by more advanced nodes: 7nm, 5nm, 3nm, etc.
EDA flow diagram source: Andrew B. Kahng, et al., “VLSI Physical Design: From Graph
Partitioning to Timing Closure,” Springer (2011)
SNUG SILICON VALLEY 2023 8
The EDA Flow & NoC Design
SNPS Platform Architect
# Targets
Chip (NoC)
Arteris NoC
Automation
RTL
Digital
Implementation
of Networks on
Chip
EDA flow diagram source: Andrew B. Kahng, et al., “VLSI Physical Design: From Graph
Partitioning to Timing Closure,” Springer (2011)
SNUG SILICON VALLEY 2023 9
The EDA Flow & NoC Design
Failure to close
timing results in
weeks long delays
EDA flow diagram source: Andrew B. Kahng, et al., “VLSI Physical Design: From Graph
Partitioning to Timing Closure,” Springer (2011)
SNUG SILICON VALLEY 2023 10
Interconnect Timing Design Challenges
Interconnect RC Trend
• No standard methodology for timing 0,3
8,0E+05
Capcitance
Resistance
6,0E+05
0,2
0,0E+00 0,1
32 nm 22 nm 15 nm 11 nm 7 nm
Technology Node
Endpoint (NIU)
Pipeline
4
3
1
2
0
#ToughToDoManually
Clock Cycles #AutomationNeeded
Transport delay = 𝑭𝑭 (foundry, routing stack, type of driving cell, process voltage, temperature, …)
SNUG SILICON VALLEY 2023 12
The Customer PPA Struggle
NoC RTL is different for every refinement
Change NoC
requirements Architecture
for RTL Development
Customer uses Arteris tools
to design the Network on
Arteris AE
Chip (NoC)
Team supports
customer’s
RTL implementation
RTL
Timing is not efforts generation &
met
Customer uses Synopsys export
EDA flow diagram source: Andrew B. Kahng, et al., “VLSI Physical Design: From Graph
Partitioning to Timing Closure,” Springer (2011)
SNUG SILICON VALLEY 2023 13
A Customer Example
Manual Flow With Some Layout Awareness & Guidance
NoC Topology Architecture
Software
Guidance to P&R
given manually
Specification
# Initiators
NoC Topology Manual update of constraints for P&R Architecture
NoC
# Clock Domains, Critical Paths
Manually Co-optimized
# Targets
RTL
Synthesis RTL
+ Constraints Pipeline Pipeline Verilog
Insertion Insertion
P&R Gate-Level
Timing Layout
Manual 14 – 35 days 70 Days (Customer Example)
RTL
Synthesis RTL
+ Constraints Pipeline Pipeline Verilog
Insertion Insertion
P&R Gate-Level
Abstract to RTL based estimations
Timing Layout
Manual 14 – 35 days 70 Days (Customer Example)
P&R Gate-Level
Timing
Layout
Earlier
Estimation 14 – 35 days Savings
from RTL
SNUG SILICON VALLEY 2023 16
Physical Exploration with RTL Architect
Limitations of the Existing Flow
Customer
Gate-Level
PPA Estimates
• Customers provide gate-centric reports that
don’t pinpoint the problem with the RTL.
• It takes multiple iterations to converge P&R PPA Report Layout
• RTL Architect
– Provides reliable timing
– Allows assessments early
– … to fix high logic levels
High Metrics
Correlation 3X+
Faster
Runtime
P&R Gate-Level
Timing Layout
Manual 14 – 35 days 70 Days (Customer Example)
Automated
Specification
# Initiators
Constraints for P&R
NoC Topology
NoC Architecture
# Clock Domains, Critical Paths
Co-optimized
# Targets
Generated Automated
Layout RTL with RTL
.def import Constraints
Pipeline Insertion Co-Optimize NoC IP Verilog
from P&R Tools with
also: Vizio/Photo
Digital
P&R Implementation Gate-Level
Timing Layout
FlexNoc 5 10 – 25 days 23 days Up to 5x Faster Physical Closure