multimedia-gpt

Multimedia processor

Enables OpenAI GPT to process multimedia inputs like images and audio with text output

Empowering your ChatGPT with vision and audio inputs.

GitHub

184 stars
3 watching
13 forks
Language: Python
last commit: over 1 year ago
chatbotchatgptgptopenai-api

Related projects:

Repository Description Stars
open-mmlab/multimodal-gpt Trains a multimodal chatbot that combines visual and language instructions to generate responses 1,478
ftp27/fastlane-plugin-translate_gpt A fastlane plugin that enables automatic translation of iOS and Android app strings using the OpenAI GPT API. 48
toshiakit/matgpt A MATLAB application providing an interface to access OpenAI's ChatGPT API 203
dwisiswant0/chatgptui An interactive tool to communicate with a language model using a text-based interface 90
forkpath/openai-feishu-bot An OpenAI-powered chatbot integrated with Feishu's messaging platform, enabling text and image-based conversations. 21
okgodoit/openai-api-dotnet An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services 1,870
hello-simpleai/chatgpt-comparison-detection A repository providing datasets and detectors for comparing human-generated content with ChatGPT-generated content 1,264
williamfzc/chat-gpt-ppt Automates the creation of PowerPoint presentations using ChatGPT as a backend. 909
kejunmao/ai-anything An open-source toolset for creating custom ChatGPT interfaces 568
stoerr/codevelopergptengine Provides read/write file access and executes actions on local files using ChatGPT as an OpenAI GPT action 12
imevro/chatgpt_repl A CLI tool for interacting with the ChatGPT API using its own model 12
ailab-cvc/gpt4tools An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. 762
whatwewant/chatgpt-for-chatbot-feishu A Go-based project integrating ChatGPT with Feishu, enabling private work assistants or employee assistants. 322
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,167
adamlui/chatgpt-widescreen Enhances chat interfaces with widescreen and fullscreen modes to reduce scrolling and improve user experience 130