netlist → (area, power, leakage) regression on 26 OpenCores sky130 modules

| channel | Pearson r | interpretation |
|---|---|---|
| log10(area_um2) | +0.78 | strong — cell count dominates |
| log10(pwr_total_w) | +0.66 | real signal; vector-power records lift this |
| log10(pwr_leakage_w) | +0.19 | weak — leakage is ~0 at sky130 typical w/o per-cell Vt features |
| log10(n_cells) | +0.81 | trivial — model recovers cell-count from structural features |

This surrogate is good enough to demonstrate the end-to-end RL plumbing (matmul Verilog → yosys → graph → GNN → reward → GRPO update). However, the power correlation (+0.66) and the broken leakage channel (+0.19) make the reward too noisy to steer Qwen toward higher-throughput matmul designs any better than the real-synth reward did in our prior session. Two pre-Phase-3 builds would lift correlations into the 0.85+ range, where RL would actually benefit: (a) a per-module dependency tracker to grow the dataset to ~150 modules, and (b) leakage augmentation via Vt-mix synth runs.
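
For reference, the per-channel numbers in the table can be reproduced along the following lines. This is a minimal sketch, not the evaluation script used for these runs: the helper names `pearson_r` and `log10_channel_r` and the `floor` clamp are assumptions made for illustration, and the inputs are assumed to be raw (non-log) measured and predicted values for one channel.

```python
import math


def pearson_r(xs, ys):
    """Plain Pearson correlation; returns NaN if either input is constant."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # Correlation is undefined when one side has no variance.
        return float("nan")
    return cov / math.sqrt(var_x * var_y)


def log10_channel_r(measured, predicted, floor=1e-15):
    """Pearson r in log10 space for one channel (e.g. area_um2).

    `floor` is a hypothetical clamp so that zero or near-zero values
    (such as leakage) do not produce log10(0).
    """
    log_measured = [math.log10(max(v, floor)) for v in measured]
    log_predicted = [math.log10(max(v, floor)) for v in predicted]
    return pearson_r(log_measured, log_predicted)


if __name__ == "__main__":
    # Synthetic values, for illustration only (not from the 26-module dataset).
    measured = [1.0, 10.0, 100.0, 1000.0]
    predicted = [2.0 * v for v in measured]
    print(log10_channel_r(measured, predicted))
```

Working in log10 space means a constant multiplicative error in the prediction does not reduce r, since it only shifts every log value by the same offset. A channel whose measured values are all clamped to the floor has zero variance, so r is undefined there rather than merely low.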