phonix
Video captioner
Generates captions for videos using OpenAI's Whisper API
Generate captions for videos using the power of OpenAI's Whisper API
39 stars
2 watching
3 forks
Language: Python
last commit: 9 months ago
Linked from 1 awesome list
openaiopenai-apiopenai-whispervideo-srtvideo-to-captionvideo-to-textwhisper
Related projects:
Repository | Description | Stars |
---|---|---|
m1guelpf/yt-whisper | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model | 1,373 |
xiadingz/video-caption.pytorch | PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 402 |
purfview/whisper-standalone-win | Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
softcatala/whisper-ctranslate2 | An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing. | 938 |
showlab/vlog | Transforms video content into a long document containing visual and audio information that can be used for chat or other applications. | 545 |
lumingyin/quickcaption | Automated captioning and transcription tool for video and audio files | 74 |
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 408 |
ninthwalker/nowshowing | Generates email and web page summaries of new media added to Plex | 73 |
arthurfdlr/whisper-youtube | Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 369 |
lainiwa/ph-marks | Tools for bookmarking and interacting with Pornhub video content from the command line. | 11 |
trakt/plex-trakt-scrobbler | Automates data synchronization between Plex media server and Trakt.tv profile | 1,456 |
phoboslab/pl_mpeg | A single-file C library for decoding MPEG1 Video and MP2 Audio | 807 |
illyism/openai-whisper-api | An OpenAI speech-to-text API service built with Node.js and Typescript, running on Docker and Google Cloud Run. | 110 |
hypjudy/sparkles | Develops multimodal instruction-following models for open-ended dialogues across multiple images | 43 |
drmonkeyninja/arc_youtube | An embeddable Youtube video player plugin for Textpattern CMS. | 5 |