ARES
RAG model evaluator
A tool for automatically evaluating RAG models by generating synthetic data and fine-tuning classifiers
Automated Evaluation of RAG Systems
499 stars
11 watching
54 forks
Language: Python
Last commit: 4 months ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | An evaluation tool for Retrieval-augmented Generation methods | 141 |
| | A framework for evaluating and diagnosing retrieval-augmented generation systems | 630 |
| | Evaluates language models using standardized benchmarks and prompting techniques. | 2,059 |
| | A framework for evaluating language models on NLP tasks | 326 |
| | An evaluation framework for large language models trained with instruction tuning methods | 535 |
| | An evaluation framework for machine learning models and datasets, providing standardized metrics and tools for comparing model performance. | 2,063 |
| | An evaluation of packer type estimation and detection tools to improve malware analysis capabilities | 11 |
| | A build-time code evaluation tool for JavaScript | 127 |
| | Tools and evaluation framework for accelerating the development of large multimodal models by providing an efficient way to assess their performance | 2,164 |
| | Evaluating and improving large multimodal models through in-context learning | 21 |
| | Evaluates foundation models on human-centric tasks with diverse exams and question types | 714 |
| | An interactive toolkit for working with weather radar data using Python and atmospheric radar algorithms | 520 |
| | A Python package that enables the creation and management of Retrieval Augmented Generation applications with filtering capabilities. | 229 |
| | A Python package providing metrics and tools for evaluating image segmentation models | 282 |
| | A package providing tools and metrics for evaluating Earth system models | 104 |