Visual-Chinese-LLaMA-Alpaca

Visual Model

Develops a multimodal Chinese language model with visual capabilities

Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)

GitHub

424 stars
9 watching
36 forks
Language: Python
last commit: over 1 year ago
Topics: alpaca, chinese, llama, llm, lora, multimodal, nlp, vision-language

Related projects:

Repository | Description | Stars
lc1332/chinese-alpaca-lora | Develops and maintains a Chinese language model finetuned on LLaMA, used for text generation and summarization tasks | 711
linksoul-ai/chinese-llama-2-7b | A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data | 2,228
fittentech/openllama-chinese | A Chinese large language model built from OpenLLaMA and fine-tuned on various datasets for multilingual text generation | 64
wisconsinaivision/vip-llava | A system designed to enable large multimodal models to understand arbitrary visual prompts | 294
lyuchenyang/macaw-llm | A multi-modal language model that integrates image, video, audio, and text data to improve language understanding and generation | 1,550
andrewzhe/lawyer-llama | An AI model trained on legal data to provide answers and explanations in Chinese law | 851
michael-wzhu/chinese-llama2 | A custom Chinese version of the Meta Llama 2 model for improved Chinese language support and application | 747
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework | 589
ailab-cvc/seed | An implementation of a multimodal language model with capabilities for comprehension and generation | 576
pleisto/yuren-baichuan-7b | A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks | 72
openlmlab/openchinesellama | An incremental pre-trained Chinese large language model based on the LLaMA-7B model | 234
lxtgh/omg-seg | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model | 1,300
dvlab-research/llama-vid | An image-based language model that uses large language models to generate visual and text features from videos | 733
llava-vl/llava-interactive-demo | An all-in-one demo for interactive image processing and generation | 351
llava-vl/llava-plus-codebase | A platform for training and deploying large language and vision models that can use tools to perform tasks | 704