fix(vibe): warm-up runs a real text forward (absorb torch lazy-init) · cec9dec3
Loading weights wasn't enough: the first forward pass still cost ~30s because of torch's one-time graph/thread init. Warm-up now runs a throwaway _embed_texts() call, so the first real user query takes ~1.5s instead of ~30s.
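
A minimal sketch of the warm-up pattern described above, not the actual change. Only `_embed_texts()` is named in this commit; the `Embedder` class, `load_weights()`, `warm_up()`, `init_count`, and `timed_first_query()` are hypothetical names for illustration. The one-time torch init is simulated with a flag, so the sketch runs without torch.

```python
import time


class Embedder:
    """Hypothetical stand-in for the real model wrapper (assumption)."""

    def __init__(self):
        self._weights_loaded = False
        self._runtime_ready = False  # simulates torch's one-time lazy init
        self.init_count = 0          # how many times the one-time cost was paid

    def load_weights(self):
        # Loading weights alone does not trigger the lazy init.
        self._weights_loaded = True

    def _embed_texts(self, texts):
        if not self._weights_loaded:
            raise RuntimeError("weights not loaded")
        if not self._runtime_ready:
            # One-time cost, paid by whichever forward call comes first.
            self.init_count += 1
            self._runtime_ready = True
        # Toy embedding: a one-element vector holding the text length.
        return [[float(len(t))] for t in texts]

    def warm_up(self):
        self.load_weights()
        # Throwaway forward pass: the result is discarded, only the
        # side effect (absorbing the one-time init) matters.
        self._embed_texts(["warm-up"])


def timed_first_query(embedder, text):
    """Embed one text and return (vectors, elapsed seconds)."""
    start = time.perf_counter()
    vectors = embedder._embed_texts([text])
    return vectors, time.perf_counter() - start
```

The design point is that the warm-up has to exercise the same code path as a real query. Calling `load_weights()` alone leaves `_runtime_ready` unset, which mirrors the ~30s first forward pass observed before this fix.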
PLN (Algolia) authored cec9dec3