Offline
day
week
DSpark: Speculative decoding accelerates LLM inference [pdf]
775 points
|
github.com
|
aurenvale
|
33hrs