LLMeBench
LLM benchmarker
A benchmarking framework for large language models
Benchmarking Large Language Models
81 stars
13 watching
18 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list
benchmarking, large-language-models, llm, multilingual
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A tool for evaluating the performance of large language model APIs | 678 |
| | A benchmark for evaluating large language models in multiple languages and formats | 93 |
| | A lightweight, multilingual language model with a long context length | 920 |
| | Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously | 15 |
| | An open-source benchmarking framework for evaluating cross-style visual capability of large multimodal models | 84 |
| | Measures the understanding of massive multitask Chinese datasets using large language models | 87 |
| | Develops large language models for text understanding and generation tasks | 85 |
| | A benchmark for evaluating large language models' ability to process multimodal input | 322 |
| | A multilingual large language model developed by XVERSE Technology Inc. | 50 |
| | A framework for training GPT4-style language models with multimodal inputs using large datasets and pre-trained models | 231 |
| | An image-context reasoning benchmark designed to challenge large vision-language models and help improve their accuracy | 259 |
| | A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks | 73 |
| | A large-scale language model for the scientific domain, trained on the RedPajama arXiv split | 125 |
| | A curated list of large machine learning models tracked over time | 341 |
| | An open-source toolkit for building and evaluating large language models | 267 |