Unicoder

Multilingual language models

This repository provides pre-trained models and code for understanding and generation tasks in multiple languages.

Unicoder model for understanding and generation.

GitHub

89 stars
11 watching
13 forks
Language: Python
last commit: about 1 year ago

Related projects:

Repository Description Stars
bilibili/index-1.9b A lightweight, multilingual language model with a long context length 920
xverse-ai/xverse-moe-a36b Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. 37
nttcslab-nlp/doc_lm This repository contains source files and training scripts for language models. 12
eleutherai/polyglot Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. 476
microsoft/wavecoder A system for generating and improving code through large language models 53
microsoft/mpnet Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning. 288
tiger-ai-lab/uniir Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks. 114
xverse-ai/xverse-13b A large language model developed to support multiple languages and applications 648
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 182
alexa/massive A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset 541
ibm-granite/granite-3.0-language-models A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. 232
juliastrings/utf8proc A C library for processing UTF-8 Unicode data 1,069
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,789
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591
lang/unicode_utils Utilities for working with Unicode strings in Ruby 113