VLog

Video doc generator

Transforms video content into a long document containing visual and audio information that can be used for chat or other applications.

Transforms a video into a document using ChatGPT, CLIP, BLIP2, GRIT, Whisper, and LangChain.
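The idea of turning a video into one long document can be sketched as merging timestamped visual captions and audio transcript segments into a single chronological text. The sketch below is only an illustration under stated assumptions, not VLog's actual implementation: the `Segment` type, `format_timestamp`, and `build_video_document` are hypothetical names, and the hardcoded strings stand in for the output of a captioning model and a speech-to-text model.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """Hypothetical container for one timestamped piece of video content."""
    start: float  # start time in seconds
    end: float    # end time in seconds
    source: str   # "visual" (caption) or "audio" (transcript)
    text: str


def format_timestamp(seconds: float) -> str:
    """Render whole seconds as MM:SS."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def build_video_document(visual: List[Segment], audio: List[Segment]) -> str:
    """Merge visual and audio segments into one chronologically ordered text."""
    # Sort by start time; ties are broken alphabetically by source name.
    merged = sorted(visual + audio, key=lambda s: (s.start, s.source))
    lines = []
    for seg in merged:
        span = f"[{format_timestamp(seg.start)}-{format_timestamp(seg.end)}]"
        lines.append(f"{span} ({seg.source}) {seg.text}")
    return "\n".join(lines)


# Placeholder data standing in for model outputs (assumed, not real results).
visual_segments = [
    Segment(0, 5, "visual", "A person opens a laptop."),
    Segment(5, 10, "visual", "A code editor is shown on screen."),
]
audio_segments = [
    Segment(2, 6, "audio", "Today we look at video documents."),
]

doc = build_video_document(visual_segments, audio_segments)
print(doc)
```

A text document of this shape can then be passed to a language model as context for question answering about the video, which is the "chat or other applications" use the description mentions.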

GitHub

538 stars
6 watching
26 forks
Language: Python
Last commit: over 1 year ago
Linked from 1 awesome list

chatgpt, langchain, large-language-model, video-language, whisper

Related projects:

| Repository | Description | Stars |
|---|---|---|
| showlab/show-1 | This project enables text-to-video generation by combining pixel and latent diffusion models | 1,103 |
| damo-nlp-sg/videollama2 | An audio-visual language model designed to understand and generate video content | 871 |
| m1guelpf/yt-whisper | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model | 1,365 |
| 0voice/ffmpeg_develop_doc | A repository aggregating online ffmpeg learning resources and documentation for developing multimedia software | 1,945 |
| venuv/langchain_yt_tools | Custom tools to extract text from YouTube video transcripts | 62 |
| mbzuai-oryx/video-chatgpt | A video conversation model that generates meaningful conversations about videos using large vision and language models | 1,213 |
| aspiers/ly2video | Converts music represented by a GNU LilyPond file into a video containing a horizontally scrolling music staff synchronized with audio rendering | 158 |
| antoine77340/howto100m | Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset | 250 |
| timothycrosley/portray | Automates the creation of documentation websites for Python projects with minimal configuration | 862 |
| platisd/phonix | Generates captions for videos using OpenAI's Whisper API | 37 |
| pku-yuangroup/video-bench | Evaluates and benchmarks large language models' video understanding capabilities | 117 |
| techgaun/gulp-apidoc | Generates documentation for RESTful web APIs | 5 |
| context-labs/autodoc | Tool for auto-generating codebase documentation using Large Language Models | 1,967 |
| transitive-bullshit/ffmpeg-generate-video-preview | Generates image strips or GIFs from video files | 152 |
| opengvlab/internvideo | Developing video foundation models and datasets for multimodal understanding and applications | 1,413 |