polish-sentence-evaluation
Sentence evaluator
An evaluation framework for sentence representations in Polish language
Evaluation of Sentence Representations in Polish
22 stars
8 watching
3 forks
Language: Python
last commit: almost 2 years ago
Linked from 1 awesome list
natural-language-processingpolish-languagesentence-embeddingsword-embeddings
Related projects:
Repository | Description | Stars |
---|---|---|
sdadas/polish-nlp-resources | Pre-trained models and resources for Natural Language Processing in Polish | 323 |
ermlab/polish-word-embeddings-review | An evaluation framework for Polish word embeddings prepared by various research groups using analogy tasks. | 4 |
obss/jury | A comprehensive toolkit for evaluating NLP experiments offering automated metrics and efficient computation. | 188 |
openai/simple-evals | A library for evaluating language models using standardized prompts and benchmarking tests. | 1,939 |
allenai/olmo-eval | An evaluation framework for large language models. | 310 |
nicklockwood/expression | A Swift framework for evaluating mathematical expressions at runtime on multiple platforms | 830 |
huggingface/evaluate | An evaluation framework for machine learning models and datasets, providing standardized metrics and tools for comparing model performance. | 2,034 |
rlancemartin/auto-evaluator | An evaluation tool for question-answering systems using large language models and natural language processing techniques | 1,063 |
binwang28/sbert-wk-sentence-embedding | A method to generate sentence embeddings from pre-trained language models | 177 |
krrishdholakia/betterprompt | An API for evaluating the quality of text prompts used in Large Language Models (LLMs) based on perplexity estimation | 38 |
facebookresearch/senteval | Tool for evaluating the quality of sentence embeddings as features in various downstream tasks. | 2,087 |
dfki-nlp/gevalm | Evaluates German transformer language models with syntactic agreement tests | 7 |
stanford-futuredata/ares | A tool for automatically evaluating RAG models by generating synthetic data and fine-tuning classifiers | 483 |
lartpang/pysodevaltoolkit | A comprehensive Python toolbox for evaluating salient object detection and camouflaged object detection tasks | 167 |
pkunlp-icler/pca-eval | An open-source benchmark and evaluation tool for assessing multimodal large language models' performance in embodied decision-making tasks | 100 |