multimedia-gpt
Multimedia processor
Enables OpenAI GPT to process multimedia inputs like images and audio with text output
Empowering your ChatGPT with vision and audio inputs.
184 stars
3 watching
13 forks
Language: Python
last commit: over 1 year ago
Tags: chatbot, chatgpt, gpt, openai-api
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Trains a multimodal chatbot that combines visual and language instructions to generate responses | 1,478 |
| | A fastlane plugin that enables automatic translation of iOS and Android app strings using the OpenAI GPT API | 48 |
| | A MATLAB application providing an interface to access OpenAI's ChatGPT API | 203 |
| | An interactive tool to communicate with a language model using a text-based interface | 90 |
| | An OpenAI-powered chatbot integrated with Feishu's messaging platform, enabling text and image-based conversations | 21 |
| | An unofficial .NET wrapper around OpenAI's GPT-3 API for accessing various text and vision processing services | 1,870 |
| | A repository providing datasets and detectors for comparing human-generated content with ChatGPT-generated content | 1,264 |
| | Automates the creation of PowerPoint presentations using ChatGPT as a backend | 909 |
| | An open-source toolset for creating custom ChatGPT interfaces | 568 |
| | Provides read/write file access and executes actions on local files using ChatGPT as an OpenAI GPT action | 12 |
| | A CLI tool for interacting with the ChatGPT API using its own model | 12 |
| | An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings | 762 |
| | A Go-based project integrating ChatGPT with Feishu, enabling private work assistants or employee assistants | 322 |
| | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture | 2,167 |
| | Enhances chat interfaces with widescreen and fullscreen modes to reduce scrolling and improve user experience | 130 |