Multimodal-GPT

Multimodal Chatbot

Trains a multimodal chatbot that combines visual and language instructions to generate responses

Multimodal-GPT

GitHub

1k stars
13 watching
126 forks
Language: Python
last commit: over 1 year ago
flamingogptgpt-4llamamultimodaltransformervision-and-language

Related projects:

Repository Description Stars
360cvgroup/seechat A multimodal chatbot with computer vision capabilities integrated into a single model 98
llmkira/openaibot A chatbot platform that integrates with various messaging services and provides a plugin-based architecture for customization and extensibility 1,954
karlsoderby/upython-chat-gpt Connects to ChatGPT API via MicroPython to retrieve responses and display them on an OLED screen. 27
opengvlab/multi-modality-arena An evaluation platform for comparing multi-modality models on visual question-answering tasks 467
fengyuli-dev/multimedia-gpt Enables OpenAI GPT to process multimedia inputs like images and audio with text output 184
openrobotlab/pointllm This project develops a large language model capable of understanding and generating information about 3D point clouds. 647
open-mmlab/mmengine Provides a flexible and configurable framework for training deep learning models with PyTorch. 1,179
franalgaba/chatgpt-telegram-bot-serverless An AWS Lambda-based Telegram bot for interacting with ChatGPT 318
toshiakit/matgpt A MATLAB application providing an interface to access OpenAI's ChatGPT API 202
hemulgm/chatgpt A native application allowing users to interact with the GPT chat model on various platforms. 415
pnkvalavala/repochat An interactive chatbot for GitHub repositories using LLMs for conversational interaction and information retrieval 275
oceanlvr/chatgpt-probot A GitHub application built on top of ChatGPT and Probot to enable user interactions with a conversational bot. 379
ghys/habot A chatbot for openHAB using machine-learning natural language processing 15
openmotionlab/motiongpt Develops a unified model to generate high-quality motions and text descriptions from human motion data 1,505
abbey4799/cutegpt A conversational language model developed to improve understanding of complex instructions and Chinese vocabulary. 62