lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

GitHub

4k stars
35 watching
383 forks
Language: Python
last commit: 5 days ago
Linked from 1 awesome list

codellamacuda-kernelsdeepspeedfastertransformerinternlmllamallama2llama3llmllm-inferenceturbomind

Backlinks from these awesome lists: