Visual-Chinese-LLaMA-Alpaca

Visual Model

Develops a multimodal Chinese language model with visual capabilities

多模态中文LLaMA&Alpaca大语言模型（VisualCLA）

GitHub

429 stars

9 watching

36 forks

Language: Python

last commit: about 3 years ago

alpacachinesellamallmloramultimodalnlpvision-language

Screenshot of airaria/Visual-Chinese-LLaMA-Alpaca website

github.com/airaria/Visual-Chinese-LLaMA-Alpaca

Related projects:

Repository	Description	Stars
lc1332/chinese-alpaca-lora	Develops and maintains a Chinese language model finetuned on LLaMA, used for text generation and summarization tasks.	711
linksoul-ai/chinese-llama-2-7b	A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data	2,235
fittentech/openllama-chinese	A Chinese language large language model built from OpenLLaMA and fine-tuned on various datasets for multilingual text generation.	65
wisconsinaivision/vip-llava	A system designed to enable large multimodal models to understand arbitrary visual prompts	302
lyuchenyang/macaw-llm	A multi-modal language model that integrates image, video, audio, and text data to improve language understanding and generation	1,568
andrewzhe/lawyer-llama	An AI model trained on legal data to provide answers and explanations in Chinese law	871
michael-wzhu/chinese-llama2	A custom Chinese version of the Meta Llama 2 model for improved Chinese language support and application	748
jerry1993-tech/cornucopia-llama-fin-chinese	A Chinese finance-focused large language model fine-tuning framework	596
ailab-cvc/seed	An implementation of a multimodal language model with capabilities for comprehension and generation	585
pleisto/yuren-baichuan-7b	A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks	73
openlmlab/openchinesellama	An incremental pre-trained Chinese large language model based on the LLaMA-7B model	234
lxtgh/omg-seg	Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model.	1,336
dvlab-research/llama-vid	An image-based language model that uses large language models to generate visual and text features from videos	748
llava-vl/llava-interactive-demo	An all-in-one demo for interactive image processing and generation	353
llava-vl/llava-plus-codebase	A platform for training and deploying large language and vision models that can use tools to perform tasks	717