MiniCPM

Language Model

A language model designed to surpass GPT-3.5-Turbo on tasks such as text generation, tool calling, and long-text processing.

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

GitHub

7k stars

77 watching

461 forks

Language: Jupyter Notebook

Last commit: about 1 year ago

Related projects:

Repository	Description	Stars
paddlepaddle/paddlenlp	A comprehensive NLP and LLM library that provides an easy-to-use interface for a wide range of tasks, including text classification, neural search, question answering, information extraction, and more.	12,224
openbmb/minicpm-v	A multimodal language model designed to understand images, videos, and text inputs and generate high-quality text outputs.	12,870
ggerganov/llama.cpp	Enables LLM inference with minimal setup and high performance on various hardware platforms.	69,185
liguodongiot/llm-action	Sharing technical knowledge and practical experience on large language models.	11,871
opengvlab/llama-adapter	An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy.	5,775
cvi-szu/linly	A collection of pre-trained language models for Chinese text processing and dialogue generation.	3,034
facico/chinese-vicuna	An instruction-following Chinese LLaMA-based model project aimed at training and fine-tuning models on specific hardware configurations for efficient deployment.	4,152
hiyouga/llama-factory	A tool for efficiently fine-tuning large language models across multiple architectures and methods.	36,219
alpha-vllm/llama2-accessory	An open-source toolkit for pretraining and fine-tuning large language models.	2,732
lyogavin/airllm	Optimizes large language model inference on limited GPU resources.	5,446
haotian-liu/llava	A system that uses large language and vision models to generate and process visual instructions.	20,683
dvlab-research/mgm	An open-source framework for training large language models with vision capabilities.	3,229
meta-llama/llama-recipes	Provides tools and examples for fine-tuning the Meta Llama model and building applications with it.	15,578
jittor/jittorllms	A high-performance deep learning framework designed to efficiently deploy large models on low-end hardware.	2,389
ymcui/chinese-llama-alpaca	Develops and deploys large language models for natural language processing tasks in Chinese, particularly for text encoding and decoding.	18,513