Academic Publication Splitwise: Efficient Generative LLM Inference Using Phase Splitting
Commercial Realization
Startups and Open Source tools heavily associated with the concepts explored in this paper.
Academic Publication Startups and Open Source tools heavily associated with the concepts explored in this paper.