sglang
SGLang is a fast serving framework for large language models and vision language models.
5k stars
53 watching
374 forks
Language: Python
last commit: 5 days ago
Linked from 1 awesome list
Topics: cuda, inference, llama, llama2, llama3, llama3-1, llava, llm, llm-serving, moe, pytorch, transformer, vlm