massive
Multilingual NLU dataset toolkit
A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset
Tools and Modeling Code for the MASSIVE dataset
541 stars
17 watching
57 forks
Language: Python
last commit: about 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 476 |
| A lightweight, multilingual language model with a long context length | 920 |
| This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 89 |
| Provides a collection of datasets for natural language processing in Ukrainian. | 57 |
| A multilingual NLP toolkit providing various natural language processing tasks | 65 |
| A collection of linguistic datasets and benchmarks for natural language understanding tasks | 8 |
| A series of large language models trained from scratch to excel in multiple NLP tasks | 7,743 |
| A guide to using pre-trained large language models in source code analysis and generation | 1,789 |
| A collection of natural language processing models and tools for collaboration on a joint project between BAAI and JDAI. | 254 |
| A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. | 825 |
| A linguistic framework for natural language processing tasks. | 216 |
| A tool that simplifies the process of preparing and manipulating natural language processing datasets | 243 |
| Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. | 37 |
| A collection of language resources extracted from publicly available sources. | 7 |
| Measures the understanding of massive multitask Chinese datasets using large language models | 87 |