polish-sentence-evaluation
Sentence evaluator
An evaluation framework for sentence representations in Polish language
Evaluation of Sentence Representations in Polish
22 stars
8 watching
3 forks
Language: Python
last commit: about 2 years ago
Linked from 1 awesome list
natural-language-processingpolish-languagesentence-embeddingsword-embeddings
Related projects:
Repository | Description | Stars |
---|---|---|
sdadas/polish-nlp-resources | Pre-trained models and resources for Natural Language Processing in Polish | 329 |
ermlab/polish-word-embeddings-review | An evaluation framework for Polish word embeddings prepared by various research groups using analogy tasks. | 4 |
obss/jury | A comprehensive toolkit for evaluating NLP experiments offering automated metrics and efficient computation. | 187 |
openai/simple-evals | Evaluates language models using standardized benchmarks and prompting techniques. | 2,059 |
allenai/olmo-eval | A framework for evaluating language models on NLP tasks | 326 |
nicklockwood/expression | A Swift framework for evaluating mathematical expressions at runtime on multiple platforms | 832 |
huggingface/evaluate | An evaluation framework for machine learning models and datasets, providing standardized metrics and tools for comparing model performance. | 2,063 |
rlancemartin/auto-evaluator | An evaluation tool for question-answering systems using large language models and natural language processing techniques | 1,065 |
binwang28/sbert-wk-sentence-embedding | A method to generate sentence embeddings from pre-trained language models | 178 |
krrishdholakia/betterprompt | An API for evaluating the quality of text prompts used in Large Language Models (LLMs) based on perplexity estimation | 43 |
facebookresearch/senteval | Tool for evaluating the quality of sentence embeddings as features in various downstream tasks. | 2,086 |
dfki-nlp/gevalm | Evaluates German transformer language models with syntactic agreement tests | 7 |
stanford-futuredata/ares | A tool for automatically evaluating RAG models by generating synthetic data and fine-tuning classifiers | 499 |
lartpang/pysodevaltoolkit | A comprehensive Python toolbox for evaluating salient object detection and camouflaged object detection tasks | 168 |
pkunlp-icler/pca-eval | An open-source benchmark and evaluation tool for assessing multimodal large language models' performance in embodied decision-making tasks | 99 |