Multimodal-GPT

Multimodal Chatbot

Trains a multimodal chatbot that combines visual and language instructions to generate responses

1k stars
13 watching
125 forks
Language: Python
last commit: over 1 year ago
Topics: flamingo, gpt, gpt-4, llama, multimodal, transformer, vision-and-language

Related projects:

Repository | Description | Stars
360cvgroup/seechat | A multimodal chatbot with computer vision capabilities integrated into a single model | 99
llmkira/openaibot | A chatbot platform that integrates with various messaging services and provides a plugin-based architecture for customization and extensibility | 1,956
karlsoderby/upython-chat-gpt | Connects to the ChatGPT API via MicroPython to retrieve responses and display them on an OLED screen | 27
opengvlab/multi-modality-arena | An evaluation platform for comparing multi-modality models on visual question-answering tasks | 478
fengyuli-dev/multimedia-gpt | Enables OpenAI GPT to process multimedia inputs like images and audio with text output | 184
openrobotlab/pointllm | An open-source software framework that enables large language models to process and understand point cloud data, facilitating multimodal interactions | 670
open-mmlab/mmengine | Provides a flexible and configurable framework for training deep learning models with PyTorch | 1,196
franalgaba/chatgpt-telegram-bot-serverless | An AWS Lambda-based Telegram bot for interacting with ChatGPT | 320
toshiakit/matgpt | A MATLAB application providing an interface to access OpenAI's ChatGPT API | 203
hemulgm/chatgpt | A native application allowing users to interact with the GPT chat model on various platforms | 421
pnkvalavala/repochat | An interactive chatbot for GitHub repositories using LLMs for conversational interaction and information retrieval | 283
oceanlvr/chatgpt-probot | A GitHub application built on top of ChatGPT and Probot to enable user interactions with a conversational bot | 379
ghys/habot | A chatbot for openHAB using machine-learning natural language processing | 15
openmotionlab/motiongpt | Develops a unified model to generate high-quality motions and text descriptions from human motion data | 1,531
abbey4799/cutegpt | A conversational language model developed to improve understanding of complex instructions and Chinese vocabulary | 62