MiniCPM
Language Model
A language model designed to surpass the capabilities of GPT-3.5-Turbo on tasks such as text generation, tool calling, and long-text processing.
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
7k stars
77 watching
461 forks
Language: Jupyter Notebook
last commit: 4 months ago

Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A comprehensive NLP and LLM library that provides an easy-to-use interface for a wide range of tasks, including text classification, neural search, question answering, information extraction, and more. | 12,224 |
| | A multimodal language model designed to understand images, videos, and text inputs and generate high-quality text outputs. | 12,870 |
| | Enables LLM inference with minimal setup and high performance on various hardware platforms | 69,185 |
| | Sharing technical knowledge and practical experience on large language models | 11,871 |
| | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,775 |
| | A collection of pre-trained language models for Chinese text processing and dialogue generation. | 3,034 |
| | An instruction-following Chinese LLaMA-based model project aimed at training and fine-tuning models on specific hardware configurations for efficient deployment. | 4,152 |
| | A tool for efficiently fine-tuning large language models across multiple architectures and methods. | 36,219 |
| | An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |
| | Optimizes large language model inference on limited GPU resources | 5,446 |
| | A system that uses large language and vision models to generate and process visual instructions | 20,683 |
| | An open-source framework for training large language models with vision capabilities. | 3,229 |
| | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,578 |
| | A high-performance deep learning framework designed to efficiently deploy large models on low-end hardware. | 2,389 |
| | Develops and deploys large language models for natural language processing tasks in Chinese, particularly for text encoding and decoding. | 18,513 |