ms-swift
LLM framework
A framework for fine-tuning and deploying large language models
Use PEFT or full-parameter training to fine-tune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
4k stars
23 watching
377 forks
Language: Python
Last commit: 3 days ago
Linked from 1 awesome list
Tags: agent, deploy, dpo, internvl, liger, llama, llama3, llava, llm, lora, megatron, minicpm-v, modelscope, multimodal, peft, pre-training, qwen2, qwen2-vl, reflection, sft
Related projects:
Repository | Description | Stars |
---|---|---|
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,409 |
optimalscale/lmflow | A toolkit for finetuning large language models and providing efficient inference capabilities | 8,273 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,720 |
huggingface/peft | A library of parameter-efficient methods for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,437 |
mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 39,120 |
modeltc/lightllm | A lightweight, scalable, high-speed inference and serving framework for large language models. | 2,609 |
qwenlm/qwen2.5 | A large language model series with various sizes and variants for text generation and understanding. | 9,710 |
zjh-819/llmdatahub | A curated collection of high-quality datasets for training large language models. | 2,635 |
opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,754 |
meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,126 |
haotian-liu/llava | A system that uses large language and vision models to generate and process visual instructions | 20,232 |
sgl-project/sglang | A framework for serving large language models and vision models with efficient runtime and flexible interface. | 6,082 |
swift-ai/swift-ai | A high-performance deep learning library written in Swift for Apple platforms. | 6,029 |
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. | 3,328 |
eleutherai/lm-evaluation-harness | Provides a unified framework to test generative language models on various evaluation tasks. | 6,970 |