ARES

RAG model evaluator

A tool for automatically evaluating RAG models by generating synthetic data and fine-tuning classifiers

Automated Evaluation of RAG Systems

499 stars

11 watching

54 forks

Language: Python

last commit: over 1 year ago

Linked from 1 awesome list

Screenshot of stanford-futuredata/ARES website

ares-ai.vercel.app/

Backlinks from these awesome lists:

ethicalml/awesome-production-machine-learning

Related projects:

Repository	Description	Stars
gomate-community/rageval	An evaluation tool for Retrieval-augmented Generation methods	141
amazon-science/ragchecker	A framework for evaluating and diagnosing retrieval-augmented generation systems	630
openai/simple-evals	Evaluates language models using standardized benchmarks and prompting techniques.	2,059
allenai/olmo-eval	A framework for evaluating language models on NLP tasks	326
declare-lab/instruct-eval	An evaluation framework for large language models trained with instruction tuning methods	535
huggingface/evaluate	An evaluation framework for machine learning models and datasets, providing standardized metrics and tools for comparing model performance.	2,063
ffri/packerdetectiontoolevaluation	An evaluation of packer type estimation and detection tools to improve malware analysis capabilities	11
kentcdodds/preval.macro	A build-time code evaluation tool for JavaScript	127
evolvinglmms-lab/lmms-eval	Tools and evaluation framework for accelerating the development of large multimodal models by providing an efficient way to assess their performance	2,164
mshukor/evalign-icl	Evaluating and improving large multimodal models through in-context learning	21
ruixiangcui/agieval	Evaluates foundation models on human-centric tasks with diverse exams and question types	714
arm-doe/pyart	An interactive toolkit for working with weather radar data using Python and atmospheric radar algorithms	520
whyhow-ai/rule-based-retrieval	A Python package that enables the creation and management of Retrieval Augmented Generation applications with filtering capabilities.	229
martinkersner/py-img-seg-eval	A Python package providing metrics and tools for evaluating image segmentation models	282
pcmdi/pcmdi_metrics	A package providing tools and metrics for evaluating Earth system models	104