Visual-Chinese-LLaMA-Alpaca
Visual Model
Develops a multimodal Chinese language model with visual capabilities
多模态中文LLaMA&Alpaca大语言模型(VisualCLA)
424 stars
9 watching
36 forks
Language: Python
last commit: over 1 year ago alpacachinesellamallmloramultimodalnlpvision-language
Related projects:
Repository | Description | Stars |
---|---|---|
lc1332/chinese-alpaca-lora | Develops and maintains a Chinese language model finetuned on LLaMA, used for text generation and summarization tasks. | 711 |
linksoul-ai/chinese-llama-2-7b | A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data | 2,228 |
fittentech/openllama-chinese | A Chinese language large language model built from OpenLLaMA and fine-tuned on various datasets for multilingual text generation. | 64 |
wisconsinaivision/vip-llava | A system designed to enable large multimodal models to understand arbitrary visual prompts | 294 |
lyuchenyang/macaw-llm | A multi-modal language model that integrates image, video, audio, and text data to improve language understanding and generation | 1,550 |
andrewzhe/lawyer-llama | An AI model trained on legal data to provide answers and explanations in Chinese law | 851 |
michael-wzhu/chinese-llama2 | A custom Chinese version of the Meta Llama 2 model for improved Chinese language support and application | 747 |
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework | 589 |
ailab-cvc/seed | An implementation of a multimodal language model with capabilities for comprehension and generation | 576 |
pleisto/yuren-baichuan-7b | A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks | 72 |
openlmlab/openchinesellama | An incremental pre-trained Chinese large language model based on the LLaMA-7B model | 234 |
lxtgh/omg-seg | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,300 |
dvlab-research/llama-vid | An image-based language model that uses large language models to generate visual and text features from videos | 733 |
llava-vl/llava-interactive-demo | An all-in-one demo for interactive image processing and generation | 351 |
llava-vl/llava-plus-codebase | A platform for training and deploying large language and vision models that can use tools to perform tasks | 704 |