Unicoder
Multilingual language models
This repository provides pre-trained models and code for understanding and generation tasks in multiple languages.
Unicoder model for understanding and generation.
88 stars
11 watching
14 forks
Language: Python
last commit: 12 months ago Related projects:
Repository | Description | Stars |
---|---|---|
bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 904 |
xverse-ai/xverse-moe-a36b | Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. | 36 |
nttcslab-nlp/doc_lm | This repository contains source files and training scripts for language models. | 12 |
eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 475 |
microsoft/wavecoder | A system for generating and improving code through large language models | 51 |
microsoft/mpnet | Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning. | 288 |
tiger-ai-lab/uniir | Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks. | 110 |
xverse-ai/xverse-13b | A large language model developed to support multiple languages and applications | 649 |
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 180 |
alexa/massive | A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset | 538 |
ibm-granite/granite-3.0-language-models | A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. | 214 |
juliastrings/utf8proc | A C library for processing UTF-8 Unicode data | 1,058 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |
lang/unicode_utils | Utilities for working with Unicode strings in Ruby | 113 |