Unicoder

Multilingual language models

This repository provides pre-trained Unicoder models and code for multilingual understanding and generation tasks.

GitHub: 89 stars, 11 watching, 13 forks

Language: Python

Last commit: over 2 years ago

Related projects:

Repository	Description	Stars
bilibili/index-1.9b	A lightweight, multilingual language model with a long context length	920
xverse-ai/xverse-moe-a36b	Develops and publishes large multilingual language models with an advanced mixture-of-experts architecture.	37
nttcslab-nlp/doc_lm	This repository contains source files and training scripts for language models.	12
eleutherai/polyglot	Large language models designed to perform well in multiple languages and address performance issues with current multilingual models.	476
microsoft/wavecoder	A system for generating and improving code through large language models	53
microsoft/mpnet	Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning.	288
tiger-ai-lab/uniir	Trains and evaluates a universal multimodal retrieval model to perform various information retrieval tasks.	114
xverse-ai/xverse-13b	A large language model developed to support multiple languages and applications	648
ieit-yuan/yuan2.0-m32	A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation	182
alexa/massive	A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset	541
ibm-granite/granite-3.0-language-models	A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources.	232
juliastrings/utf8proc	A C library for processing UTF-8 Unicode data	1,069
vhellendoorn/code-lms	A guide to using pre-trained large language models in source code analysis and generation	1,789
shawn-ieitsystems/yuan-1.0	Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing	591
lang/unicode_utils	Utilities for working with Unicode strings in Ruby	113