Pangea

Multimodal LLM

An open-source multilingual multimodal large language model designed to understand and generate content across diverse languages and cultural contexts.

This is the repository for the paper "Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages".

GitHub

92 stars

3 watching

3 forks

Language: Python

last commit: over 1 year ago

Related projects:

Repository	Description	Stars
lyuchenyang/macaw-llm	A multi-modal language model that integrates image, video, audio, and text data to improve language understanding and generation	1,568
damo-nlp-mt/polylm	A polyglot large language model designed to address limitations in current LLM research and provide better multilingual instruction-following capability.	77
pleisto/yuren-baichuan-7b	A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks	73
alpha-vllm/wemix-llm	A LLaMA-based multimodal language model with various instruction-following and multimodal variants.	17
phellonchen/x-llm	A framework that enables large language models to process and understand multimodal inputs from various sources such as images and speech.	308
ailab-cvc/seed	An implementation of a multimodal language model with capabilities for comprehension and generation	585
deeplangai/lingowhale-8b	An open bilingual LLM developed using the LingoWhale model, trained on a large dataset of high-quality Chinese and English text, and fine-tuned for specific tasks such as conversation generation.	134
vita-mllm/vita	A large multimodal language model designed to process and analyze video, image, text, and audio inputs in real-time.	1,005
luogen1996/lavin	An open-source implementation of a vision-language instructed large language model	513
damo-nlp-sg/m3exam	A benchmark for evaluating large language models in multiple languages and formats	93
mbzuai-oryx/groundinglmm	An end-to-end trained model capable of generating natural language responses integrated with object segmentation masks for interactive visual conversations	797
bilibili/index-1.9b	A lightweight, multilingual language model with a long context length	920
bytedance/lynx-llm	A framework for training GPT4-style language models with multimodal inputs using large datasets and pre-trained models	231
yfzhang114/slime	Develops large multimodal models for high-resolution understanding and analysis of text, images, and other data types.	143
orionstarai/orion	A family of large language models designed to handle multilingual text and provide strong performance in various tasks such as chat, long context, and retrieval augmented generation.	789