sglang

SGLang is a fast serving framework for large language models and vision language models.

GitHub

5k stars
53 watching
374 forks
Language: Python
last commit: 5 days ago
Linked from 1 awesome list

cudainferencellamallama2llama3llama3-1llavallmllm-servingmoepytorchtransformervlm

Backlinks from these awesome lists: