chinese-llm-benchmark

LLM benchmark

A comprehensive benchmarking platform for evaluating the capabilities of large language models.

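Chinese LLM capability leaderboard: currently covers 128 large models, including commercial models such as chatgpt, gpt-4o, Google gemini, Baidu ERNIE Bot (文心一言), Alibaba Tongyi Qianwen (通义千问), Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as qwen2.5, llama3.1, glm4, InternLM2.5 (书生), openbuddy, and AquilaChat. It provides not only a ranked leaderboard of capability scores but also the raw outputs of every model.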

GitHub

3k stars
33 watching
129 forks
last commit: 6 days ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| tigerresearch/tigerbot | Develops and deploys large language models for natural language processing tasks, including text generation, question answering, and more. | 2,242 |
| zhayujie/chatgpt-on-wechat | An implementation of a conversational AI model integrated with WeChat's messaging platform to provide automated customer support and chat services. | 31,233 |
| thudm/glm-4 | Develops and releases pre-trained models for conversational AI tasks with enhanced capabilities on long text generation, multimodal interaction, and domain adaptation. | 5,277 |
| memochou1993/gpt-ai-assistant | An AI-powered chat application using OpenAI and LINE APIs | 7,428 |
| father-bot/chatgpt_telegram_bot | A Telegram bot using ChatGPT's API to provide a chat interface with low latency and various features | 5,184 |
| openbmb/minicpm-v | A multimodal language model designed to understand images, videos, and text inputs and generate high-quality text outputs. | 12,619 |
| openchatai/openchat | A tool for building and managing custom chatbots using large language models | 5,195 |
| opengvlab/internvl | A pioneering open-source alternative to commercial multimodal models with a family of large-scale language and vision models. | 6,014 |
| n3d1117/chatgpt-telegram-bot | A Telegram bot integrating with OpenAI APIs to provide answers | 3,078 |
| cledev-limited/cledev.openai | An unofficial .NET SDK providing a Blazor Server Playground and API access to the OpenAI chat-gpt service. | 115 |
| bhaskatripathi/pdfgpt | A solution to chat with the contents of a PDF file using GPT capabilities | 6,971 |
| qwenlm/qwen | This repository provides large language models and chat capabilities based on pre-trained Chinese models. | 14,164 |
| orionstarai/orionstar-yi-34b-chat | A high-quality chat model trained on 15W+ high-quality data to provide excellent conversational experiences. | 259 |
| xx-net/xx-net | A proxy tool designed to bypass internet censorship and restrictions by disguising traffic as ordinary network activity. | 33,063 |