ESMValTool
Model evaluator
A community-developed tool for evaluating climate models and providing diagnostic metrics.
ESMValTool: A community diagnostic and performance metrics tool for routine evaluation of Earth system models in CMIP
230 stars
32 watching
128 forks
Language: NCL
Last commit: 1 day ago
Linked from 1 awesome list
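ESMValTool evaluations are driven by recipes that pair CMIP datasets with preprocessing steps and diagnostic scripts. Below is a minimal sketch in the v2-era YAML recipe format (the snapshot above lists NCL as the main language, reflecting the older v1 codebase); the dataset facets, time range, author tag, and diagnostic script path are illustrative placeholders, not taken from this repository.

```yaml
# Minimal illustrative ESMValTool v2 recipe (all specific values are hypothetical).
documentation:
  title: Annual-mean near-surface temperature example
  description: Annual-mean near-surface air temperature from one CMIP6 model.
  authors:
    - doe_john  # placeholder; real recipes reference registered author entries

datasets:
  # Illustrative CMIP6 dataset entry; facets will differ per experiment.
  - {dataset: BCC-ESM1, project: CMIP6, exp: historical, ensemble: r1i1p1f1, grid: gn}

preprocessors:
  annual_mean:
    annual_statistics:
      operator: mean

diagnostics:
  tas_annual:
    description: Plot annual-mean tas.
    variables:
      tas:
        mip: Amon
        preprocessor: annual_mean
        start_year: 2000
        end_year: 2005
    scripts:
      plot:
        script: examples/diagnostic.py  # hypothetical diagnostic script path
```

With ESMValTool v2 installed, a recipe like this would be run with `esmvaltool run recipe.yml`: the tool locates the requested data, applies the preprocessor chain, and passes the result to the diagnostic script, which produces the plots and metrics.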
Related projects:
Repository | Description | Stars
---|---|---
chenllliang/mmevalpro | A benchmarking framework for evaluating large multimodal models with rigorous metrics and an efficient evaluation pipeline | 22
evolvinglmms-lab/lmms-eval | Tools and an evaluation framework for accelerating the development of large multimodal models by providing an efficient way to assess their performance | 2,164
mshukor/evalign-icl | Evaluating and improving large multimodal models through in-context learning | 21
escomp/cesm | Provides tools and infrastructure for managing and running the Community Earth System Model | 348
jpmml/jpmml-evaluator-spark | A library for evaluating predictive models stored in PMML format within Apache Spark | 94
dtcenter/metplus | Provides a Python scripting infrastructure for evaluating and visualizing meteorological model performance | 99
open-compass/vlmevalkit | An evaluation toolkit for large vision-language models | 1,514
allenai/olmo-eval | A framework for evaluating language models on NLP tasks | 326
huggingface/evaluate | An evaluation framework for machine learning models and datasets, providing standardized metrics and tools for comparing model performance | 2,063
edublancas/sklearn-evaluation | A tool for evaluating and visualizing machine learning model performance | 3
declare-lab/instruct-eval | An evaluation framework for large language models trained with instruction-tuning methods | 535
pcmdi/pcmdi_metrics | A package for objective comparison of climate models with observations and with each other | 104
modelscope/evalscope | A framework for efficiently evaluating and benchmarking large models | 308
openai/simple-evals | Evaluates language models using standardized benchmarks and prompting techniques | 2,059
maluuba/nlg-eval | A toolset for evaluating and comparing natural language generation models | 1,350