ARES

RAG model evaluator

A tool for automatically evaluating RAG models by generating synthetic data and fine-tuning classifiers

Automated Evaluation of RAG Systems

GitHub

499 stars
11 watching
54 forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
gomate-community/rageval An evaluation tool for Retrieval-augmented Generation methods 141
amazon-science/ragchecker A framework for evaluating and diagnosing retrieval-augmented generation systems 630
openai/simple-evals Evaluates language models using standardized benchmarks and prompting techniques. 2,059
allenai/olmo-eval A framework for evaluating language models on NLP tasks 326
declare-lab/instruct-eval An evaluation framework for large language models trained with instruction tuning methods 535
huggingface/evaluate An evaluation framework for machine learning models and datasets, providing standardized metrics and tools for comparing model performance. 2,063
ffri/packerdetectiontoolevaluation An evaluation of packer type estimation and detection tools to improve malware analysis capabilities 11
kentcdodds/preval.macro A build-time code evaluation tool for JavaScript 127
evolvinglmms-lab/lmms-eval Tools and evaluation framework for accelerating the development of large multimodal models by providing an efficient way to assess their performance 2,164
mshukor/evalign-icl Evaluating and improving large multimodal models through in-context learning 21
ruixiangcui/agieval Evaluates foundation models on human-centric tasks with diverse exams and question types 714
arm-doe/pyart An interactive toolkit for working with weather radar data using Python and atmospheric radar algorithms 520
whyhow-ai/rule-based-retrieval A Python package that enables the creation and management of Retrieval Augmented Generation applications with filtering capabilities. 229
martinkersner/py-img-seg-eval A Python package providing metrics and tools for evaluating image segmentation models 282
pcmdi/pcmdi_metrics A package providing tools and metrics for evaluating Earth system models 104