MiniCPM
Language Model
A language model designed to surpass the capabilities of GPT-3.5-Turbo on various tasks such as text generation, tool calling, and long-text processing
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
7k stars
77 watching
461 forks
Language: Jupyter Notebook
last commit: 3 months ago Related projects:
Repository | Description | Stars |
---|---|---|
paddlepaddle/paddlenlp | A comprehensive NLP and LLM library that provides an easy-to-use interface for a wide range of tasks, including text classification, neural search, question answering, information extraction, and more. | 12,224 |
openbmb/minicpm-v | A multimodal language model designed to understand images, videos, and text inputs and generate high-quality text outputs. | 12,870 |
ggerganov/llama.cpp | Enables LLM inference with minimal setup and high performance on various hardware platforms | 69,185 |
liguodongiot/llm-action | Sharing technical knowledge and practical experience on large language models | 11,871 |
opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,775 |
cvi-szu/linly | A collection of pre-trained language models for Chinese text processing and dialogue generation. | 3,034 |
facico/chinese-vicuna | An instruction-following Chinese LLaMA-based model project aimed at training and fine-tuning models on specific hardware configurations for efficient deployment. | 4,152 |
hiyouga/llama-factory | A tool for efficiently fine-tuning large language models across multiple architectures and methods. | 36,219 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |
lyogavin/airllm | Optimizes large language model inference on limited GPU resources | 5,446 |
haotian-liu/llava | A system that uses large language and vision models to generate and process visual instructions | 20,683 |
dvlab-research/mgm | An open-source framework for training large language models with vision capabilities. | 3,229 |
meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,578 |
jittor/jittorllms | A high-performance deep learning framework designed to efficiently deploy large models on low-end hardware. | 2,389 |
ymcui/chinese-llama-alpaca | Develops and deploys large language models for natural language processing tasks in Chinese, particularly for text encoding and decoding. | 18,513 |