MiniCPM

Language Model

A language model designed to surpass GPT-3.5-Turbo on tasks such as text generation, tool calling, and long-text processing.

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

GitHub

7k stars
77 watching
461 forks
Language: Jupyter Notebook
last commit: 3 months ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| paddlepaddle/paddlenlp | A comprehensive NLP and LLM library that provides an easy-to-use interface for a wide range of tasks, including text classification, neural search, question answering, information extraction, and more. | 12,224 |
| openbmb/minicpm-v | A multimodal language model designed to understand images, videos, and text inputs and generate high-quality text outputs. | 12,870 |
| ggerganov/llama.cpp | Enables LLM inference with minimal setup and high performance on various hardware platforms | 69,185 |
| liguodongiot/llm-action | Sharing technical knowledge and practical experience on large language models | 11,871 |
| opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,775 |
| cvi-szu/linly | A collection of pre-trained language models for Chinese text processing and dialogue generation. | 3,034 |
| facico/chinese-vicuna | An instruction-following Chinese LLaMA-based model project aimed at training and fine-tuning models on specific hardware configurations for efficient deployment. | 4,152 |
| hiyouga/llama-factory | A tool for efficiently fine-tuning large language models across multiple architectures and methods. | 36,219 |
| alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |
| lyogavin/airllm | Optimizes large language model inference on limited GPU resources | 5,446 |
| haotian-liu/llava | A system that uses large language and vision models to generate and process visual instructions | 20,683 |
| dvlab-research/mgm | An open-source framework for training large language models with vision capabilities. | 3,229 |
| meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,578 |
| jittor/jittorllms | A high-performance deep learning framework designed to efficiently deploy large models on low-end hardware. | 2,389 |
| ymcui/chinese-llama-alpaca | Develops and deploys large language models for natural language processing tasks in Chinese, particularly for text encoding and decoding. | 18,513 |