bleurt
NLG evaluation metric
BLEURT is an evaluation metric for Natural Language Generation based on transfer learning.
698 stars
13 watching
85 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| maluuba/nlg-eval | A toolset for evaluating and comparing natural language generation models | 1,349 |
| benhamner/metrics | Provides implementations of various supervised machine learning evaluation metrics in multiple programming languages. | 1,627 |
| ssymmetry/bbt-fincuge-applications | Creating a comprehensive platform for natural language processing in the financial industry by developing and publishing large-scale datasets, pre-trained models, and evaluation benchmarks. | 241 |
| mlgroupjlu/llm-eval-survey | A repository of papers and resources for evaluating large language models. | 1,433 |
| bllip/bllip-parser | A statistical natural language parser used to generate grammatically correct sentences from unstructured text input. | 227 |
| microsoft/prophetnet | A collection of research implementations and models for natural language generation | 691 |
| i-gallegos/fair-llm-benchmark | Compiles bias evaluation datasets and provides access to original data sources for large language models | 110 |
| thiagocf05/webnlg | Provides intermediate representations of data for NLG tasks like Discourse Ordering and Lexicalization | 69 |
| nlgranger/seqtools | A Python library to manipulate and transform indexable data | 48 |
| dluebke/bpelstats | A tool for calculating and analyzing BPEL metrics | 0 |
| simplenlg/simplenlg | Generates Natural Language from syntactic forms using morphological and grammatical rules | 811 |
| intellabs/fastrag | A framework for efficient and optimized retrieval augmented generative pipelines using state-of-the-art LLMs and Information Retrieval. | 1,336 |
| lartpang/pysodmetrics | A library providing an implementation of various metrics for object segmentation and saliency detection in computer vision. | 144 |
| opennlg/openba | A pre-trained language model designed for various NLP tasks, including dialogue generation, code completion, and retrieval. | 94 |
| google-research/deep_ope | A set of pre-trained reinforcement learning policies and benchmarking data for offline model selection in reinforcement learning. | 85 |