MiniCPM

Language Model

A language model designed to surpass the capabilities of GPT-3.5-Turbo on various tasks such as text generation, tool calling, and long-text processing

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

GitHub

7k stars
76 watching
454 forks
Language: Jupyter Notebook
last commit: 18 days ago

Related projects:

Repository Description Stars
paddlepaddle/paddlenlp A comprehensive NLP and LLM development suite supporting wide-range of tasks and models 12,130
openbmb/minicpm-v A multimodal language model designed to understand images, videos, and text inputs and generate high-quality text outputs. 12,619
ggerganov/llama.cpp Enables efficient inference of large language models using optimized C/C++ implementations and various backend frameworks 68,190
liguodongiot/llm-action A comprehensive resource sharing project focused on large language model (LLM) engineering and applications, covering various aspects from training to inference and deployment. 10,677
opengvlab/llama-adapter An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy 5,754
cvi-szu/linly A collection of pre-trained language models for Chinese text processing and dialogue generation. 3,029
facico/chinese-vicuna An instruction-following Chinese LLaMA-based model project aimed at training and fine-tuning models on specific hardware configurations for efficient deployment. 4,142
hiyouga/llama-factory A unified platform for fine-tuning multiple large language models with various training approaches and methods 34,436
alpha-vllm/llama2-accessory An open-source toolkit for pretraining and fine-tuning large language models 2,722
lyogavin/airllm A Python library that optimizes inference memory usage for large language models on limited GPU resources. 5,259
haotian-liu/llava A system that uses large language and vision models to generate and process visual instructions 20,359
dvlab-research/mgm An open-source framework for training large language models with vision capabilities. 3,211
meta-llama/llama-recipes Provides tools and examples for fine-tuning the Meta Llama model and building applications with it 15,288
jittor/jittorllms A high-performance deep learning framework designed to efficiently deploy large models on low-end hardware. 2,374
ymcui/chinese-llama-alpaca Develops and deploys large language models for natural language processing tasks in Chinese, particularly for text encoding and decoding. 18,395