VIA-SD: Verification via Intra-Model Routing for Speculative Decoding

Yuchen Xian, Yang He, Yunqiu Xu, Yi Yang

Thursday at 04:00

arXiv:2606.12243v1 (announce type: cross)

Abstract: Speculative decoding (SD) addresses the high inference costs of LLMs by having lightweight drafters generate candidates for large verifiers to validate in parallel. Existing draft-verify methods use binary decisions: accept or fully recompute. Yet we find that many rejected tokens can be verified...
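For readers new to the draft-verify loop the abstract describes, here is a minimal sketch of the conventional binary baseline (accept a drafted token or fall back to the verifier), not of VIA-SD itself, whose routing mechanism is not described in the truncated abstract. Everything in it is an assumption made for illustration: the two toy "models" `drafter_next` and `verifier_next` are hypothetical deterministic functions, decoding is greedy, and the verifier is called in a loop where a real system would score all drafted positions in one parallel forward pass.

```python
from typing import Callable, List

# A "model" here is just a function from a token context to its greedy next token.
NextToken = Callable[[List[int]], int]


def verifier_next(ctx: List[int]) -> int:
    """Hypothetical stand-in for the large verifier model."""
    return (sum(ctx) * 31 + len(ctx)) % 50


def drafter_next(ctx: List[int]) -> int:
    """Hypothetical cheap drafter: agrees with the verifier except
    when the context length is a multiple of 4."""
    guess = verifier_next(ctx)
    return guess if len(ctx) % 4 else (guess + 1) % 50


def speculative_step(ctx: List[int], draft: NextToken,
                     verify: NextToken, k: int) -> List[int]:
    """Draft k tokens, then keep the longest prefix the verifier agrees with.
    The first mismatching token is replaced by the verifier's own token."""
    proposed: List[int] = []
    for _ in range(k):
        proposed.append(draft(ctx + proposed))

    accepted: List[int] = []
    for tok in proposed:
        target = verify(ctx + accepted)  # done in parallel in real systems
        if tok == target:
            accepted.append(tok)         # binary decision: accept
        else:
            accepted.append(target)      # binary decision: reject, recompute
            break
    return accepted


def generate(prompt: List[int], draft: NextToken, verify: NextToken,
             n_new: int, k: int = 4) -> List[int]:
    """Repeat speculative steps until n_new tokens have been produced."""
    out = list(prompt)
    while len(out) - len(prompt) < n_new:
        out.extend(speculative_step(out, draft, verify, k))
    return out[:len(prompt) + n_new]


def greedy(prompt: List[int], verify: NextToken, n_new: int) -> List[int]:
    """Reference: plain token-by-token decoding with the verifier alone."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(verify(out))
    return out
```

Under these assumptions the speculative output matches verifier-only greedy decoding token for token; the saving comes from the verifier confirming several drafted tokens per call. The sketch omits the extra "bonus" token that many implementations emit when all k drafts are accepted.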
