massive
Multilingual NLU dataset toolkit
A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset
Tools and Modeling Code for the MASSIVE dataset
541 stars
17 watching
57 forks
Language: Python
Last commit: about 3 years ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 476 |
| | A lightweight, multilingual language model with a long context length | 920 |
| | This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 89 |
| | Provides a collection of datasets for natural language processing in Ukrainian. | 57 |
| | A multilingual NLP toolkit supporting a range of natural language processing tasks | 65 |
| | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 8 |
| | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,743 |
| | A guide to using pre-trained large language models in source code analysis and generation | 1,789 |
| | A collection of natural language processing models and tools from a joint project between BAAI and JDAI. | 254 |
| | A comprehensive toolkit for Natural Language Processing tasks in Indic languages, providing pre-trained models and datasets. | 825 |
| | A linguistic framework for natural language processing tasks. | 216 |
| | A tool that simplifies the process of preparing and manipulating natural language processing datasets | 243 |
| | Develops and publishes large multilingual language models with an advanced mixture-of-experts architecture. | 37 |
| | A collection of language resources extracted from publicly available sources. | 7 |
| | A benchmark that measures how well large language models handle massive multitask understanding in Chinese | 87 |