Unicoder

Multilingual language models

This repository provides pre-trained models and code for understanding and generation tasks in multiple languages.

Unicoder model for understanding and generation.

GitHub

88 stars
11 watching
14 forks
Language: Python
last commit: 12 months ago

Related projects:

Repository Description Stars
bilibili/index-1.9b A lightweight, multilingual language model with a long context length 904
xverse-ai/xverse-moe-a36b Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. 36
nttcslab-nlp/doc_lm This repository contains source files and training scripts for language models. 12
eleutherai/polyglot Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. 475
microsoft/wavecoder A system for generating and improving code through large language models 51
microsoft/mpnet Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning. 288
tiger-ai-lab/uniir Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks. 110
xverse-ai/xverse-13b A large language model developed to support multiple languages and applications 649
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 180
alexa/massive A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset 538
ibm-granite/granite-3.0-language-models A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. 214
juliastrings/utf8proc A C library for processing UTF-8 Unicode data 1,058
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,782
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591
lang/unicode_utils Utilities for working with Unicode strings in Ruby 113