ms-swift
LLM framework
A framework for efficient fine-tuning and deployment of large language models
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
5k stars
24 watching
409 forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list
agentdeploydpoemu3-geninternvlligerllamallama3llmloralora-gaminicpm-vmodelscopemultimodalpeftpre-trainingqwen2qwen2-vlsftvllm
Related projects:
Repository | Description | Stars |
---|---|---|
| A framework for training and serving large language models using JAX/Flax | 2,428 |
| A toolkit for fine-tuning and inferring large machine learning models | 8,312 |
| An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |
| An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,699 |
| A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 40,053 |
| A Python-based framework for serving large language models with low latency and high scalability. | 2,691 |
| A large language model series with various sizes and variants for text generation and understanding. | 10,959 |
| A curated collection of high-quality datasets for training large language models. | 2,708 |
| An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,775 |
| Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,578 |
| A system that uses large language and vision models to generate and process visual instructions | 20,683 |
| A fast serving framework for large language models and vision language models. | 6,551 |
| A high-performance deep learning library written in Swift for Apple platforms. | 6,032 |
| Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. | 3,331 |
| Provides a unified framework to test generative language models on various evaluation tasks. | 7,200 |