ms-swift

LLM framework

A framework for efficient fine-tuning and deployment of large language models

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

GitHub

5k stars

24 watching

409 forks

Language: Python

last commit: 8 months ago

Linked from 1 awesome list

agentdeploydpoemu3-geninternvlligerllamallama3llmloralora-gaminicpm-vmodelscopemultimodalpeftpre-trainingqwen2qwen2-vlsftvllm

Screenshot of modelscope/ms-swift website

swift.readthedocs.io/zh-cn/latest/

Backlinks from these awesome lists:

ethicalml/awesome-production-machine-learning

Related projects:

Repository	Description	Stars
young-geng/easylm	A framework for training and serving large language models using JAX/Flax	2,428
optimalscale/lmflow	A toolkit for fine-tuning and inferring large machine learning models	8,312
alpha-vllm/llama2-accessory	An open-source toolkit for pretraining and fine-tuning large language models	2,732
huggingface/peft	An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters	16,699
mlabonne/llm-course	A comprehensive course and resource package on building and deploying Large Language Models (LLMs)	40,053
modeltc/lightllm	A Python-based framework for serving large language models with low latency and high scalability.	2,691
qwenlm/qwen2.5	A large language model series with various sizes and variants for text generation and understanding.	10,959
zjh-819/llmdatahub	A curated collection of high-quality datasets for training large language models.	2,708
opengvlab/llama-adapter	An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy	5,775
meta-llama/llama-recipes	Provides tools and examples for fine-tuning the Meta Llama model and building applications with it	15,578
haotian-liu/llava	A system that uses large language and vision models to generate and process visual instructions	20,683
sgl-project/sglang	A fast serving framework for large language models and vision language models.	6,551
swift-ai/swift-ai	A high-performance deep learning library written in Swift for Apple platforms.	6,032
thunlp/plmpapers	Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers.	3,331
eleutherai/lm-evaluation-harness	Provides a unified framework to test generative language models on various evaluation tasks.	7,200