lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
4k stars
35 watching
383 forks
Language: Python
last commit: 5 days ago
Linked from 1 awesome list
codellamacuda-kernelsdeepspeedfastertransformerinternlmllamallama2llama3llmllm-inferenceturbomind