multimedia-gpt

Multimedia processor

Enables OpenAI GPT to process multimedia inputs like images and audio with text output

Empowering your ChatGPT with vision and audio inputs.

GitHub

184 stars
3 watching
13 forks
Language: Python
last commit: about 1 year ago
chatbotchatgptgptopenai-api

Related projects:

Repository Description Stars
open-mmlab/multimodal-gpt Trains a multimodal chatbot that combines visual and language instructions to generate responses 1,477
ftp27/fastlane-plugin-translate_gpt A fastlane plugin that enables automatic translation of iOS and Android app strings using the OpenAI GPT API. 46
toshiakit/matgpt A MATLAB application providing an interface to access OpenAI's ChatGPT API 202
dwisiswant0/chatgptui An interactive tool to communicate with a language model using a text-based interface 91
forkpath/openai-feishu-bot An OpenAI-powered chatbot integrated with Feishu's messaging platform, enabling text and image-based conversations. 21
okgodoit/openai-api-dotnet An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services 1,860
hello-simpleai/chatgpt-comparison-detection A repository providing datasets and detectors for comparing human-generated content with ChatGPT-generated content 1,257
williamfzc/chat-gpt-ppt Automates the creation of PowerPoint presentations using ChatGPT as a backend. 906
kejunmao/ai-anything An open-source toolset for creating custom ChatGPT interfaces 566
stoerr/codevelopergptengine Provides read/write file access and executes actions on local files using ChatGPT as an OpenAI GPT action 11
imevro/chatgpt_repl A CLI tool for interacting with the ChatGPT API using its own model 12
ailab-cvc/gpt4tools An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. 760
whatwewant/chatgpt-for-chatbot-feishu A Go-based project integrating ChatGPT with Feishu, enabling private work assistants or employee assistants. 320
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,160
adamlui/chatgpt-widescreen Enhances chat sessions with widescreen and fullscreen modes for improved user experience 127