MiniCPM
Language Model
A language model designed to surpass the capabilities of GPT-3.5-Turbo on various tasks such as text generation, tool calling, and long-text processing
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
7k stars
76 watching
454 forks
Language: Jupyter Notebook
last commit: 18 days ago

Related projects:
Repository | Description | Stars |
---|---|---|
paddlepaddle/paddlenlp | A comprehensive NLP and LLM development suite supporting a wide range of tasks and models | 12,130 |
openbmb/minicpm-v | A multimodal language model designed to understand images, videos, and text inputs and generate high-quality text outputs. | 12,619 |
ggerganov/llama.cpp | Enables efficient inference of large language models using optimized C/C++ implementations and various backend frameworks | 68,190 |
liguodongiot/llm-action | A comprehensive resource sharing project focused on large language model (LLM) engineering and applications, covering various aspects from training to inference and deployment. | 10,677 |
opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,754 |
cvi-szu/linly | A collection of pre-trained language models for Chinese text processing and dialogue generation. | 3,029 |
facico/chinese-vicuna | An instruction-following Chinese LLaMA-based model project aimed at training and fine-tuning models on specific hardware configurations for efficient deployment. | 4,142 |
hiyouga/llama-factory | A unified platform for fine-tuning multiple large language models with various training approaches and methods | 34,436 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,722 |
lyogavin/airllm | A Python library that optimizes inference memory usage for large language models on limited GPU resources. | 5,259 |
haotian-liu/llava | A system that uses large language and vision models to generate and process visual instructions | 20,359 |
dvlab-research/mgm | An open-source framework for training large language models with vision capabilities. | 3,211 |
meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,288 |
jittor/jittorllms | A high-performance deep learning framework designed to efficiently deploy large models on low-end hardware. | 2,374 |
ymcui/chinese-llama-alpaca | Develops and deploys large language models for natural language processing tasks in Chinese, particularly for text encoding and decoding. | 18,395 |