Document Preview Unavailable

The Synergy of Speculative Decoding and Batching in Serving Large Language Models

Su, Qidong; Giannoula, Christina; Pekhimenko, Gennady.  arXiv.org, Oct 28, 2023.

You might have access to this document