VLog
Video doc generator
Transforms video content into a long document containing visual and audio information that can be used for chat or other applications.
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
538 stars
6 watching
26 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list
chatgptlangchainlarge-language-modelvideo-languagewhisper
Related projects:
Repository | Description | Stars |
---|---|---|
showlab/show-1 | This project enables text-to-video generation by combining pixel and latent diffusion models | 1,103 |
damo-nlp-sg/videollama2 | An audio-visual language model designed to understand and generate video content | 871 |
m1guelpf/yt-whisper | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model | 1,365 |
0voice/ffmpeg_develop_doc | A repository aggregating online ffmpeg learning resources and documentation for developing multimedia software. | 1,945 |
venuv/langchain_yt_tools | Custom tools to extract text from YouTube video transcripts | 62 |
mbzuai-oryx/video-chatgpt | A video conversation model that generates meaningful conversations about videos using large vision and language models | 1,213 |
aspiers/ly2video | Converts music represented by a GNU LilyPond file into a video containing a horizontally scrolling music staff synchronized with audio rendering. | 158 |
antoine77340/howto100m | Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset | 250 |
timothycrosley/portray | Automates the creation of documentation websites for Python projects with minimal configuration | 862 |
platisd/phonix | Generates captions for videos using OpenAI's Whisper API | 37 |
pku-yuangroup/video-bench | Evaluates and benchmarks large language models' video understanding capabilities | 117 |
techgaun/gulp-apidoc | Generates documentation for RESTful web APIs | 5 |
context-labs/autodoc | Tool for auto-generating codebase documentation using Large Language Models | 1,967 |
transitive-bullshit/ffmpeg-generate-video-preview | Generates image strips or GIFs from video files | 152 |
opengvlab/internvideo | Developing video foundation models and datasets for multimodal understanding and applications | 1,413 |