ms-swift

LLM framework

A framework for efficient fine-tuning and deployment of large language models

Fine-tune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...) using either PEFT or full-parameter training.

GitHub

5k stars
24 watching
409 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list

Topics: agent, deploy, dpo, emu3-gen, internvl, liger, llama, llama3, llm, lora, lora-ga, minicpm-v, modelscope, multimodal, peft, pre-training, qwen2, qwen2-vl, sft, vllm

Related projects:

Repository | Description | Stars
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,428
optimalscale/lmflow | A toolkit for fine-tuning and inferring large machine learning models | 8,312
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,732
huggingface/peft | An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,699
mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 40,053
modeltc/lightllm | A Python-based framework for serving large language models with low latency and high scalability | 2,691
qwenlm/qwen2.5 | A large language model series with various sizes and variants for text generation and understanding | 10,959
zjh-819/llmdatahub | A curated collection of high-quality datasets for training large language models | 2,708
opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,775
meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,578
haotian-liu/llava | A system that uses large language and vision models to generate and process visual instructions | 20,683
sgl-project/sglang | A fast serving framework for large language models and vision language models | 6,551
swift-ai/swift-ai | A high-performance deep learning library written in Swift for Apple platforms | 6,032
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers | 3,331
eleutherai/lm-evaluation-harness | Provides a unified framework to test generative language models on various evaluation tasks | 7,200