howto100m

Text-Video Embedding Toolkit

Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset

Code for the HowTo100M paper

254 stars

5 watching

37 forks

Language: Python

Last commit: over 5 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

danieljf24/awesome-video-text-retrieval

Related projects:

Repository	Description	Stars
antoine77340/mixture-of-embedding-experts	An open-source implementation of the Mixture-of-Embeddings-Experts model in PyTorch for video-text retrieval tasks.	118
jwieting/para-nmt-50m	A collection of pre-trained models and code for training paraphrastic sentence embeddings from large machine translation datasets.	102
pku-yuangroup/video-bench	Evaluates and benchmarks large language models' video understanding capabilities	121
kronoscode/django-magicembed	Provides a tool to easily embed videos and generate thumbnails in Django web applications.	19
showlab/show-1	This project enables text-to-video generation using a combination of pixel and latent diffusion models.	1,110
snailedlt/markdown-videos	Embeds YouTube and Vimeo videos into GitHub markdown with ease using an API and website	82
nlprinceton/text_embedding	A utility class for generating and evaluating document representations using word embeddings.	54
jwieting/acl2017	A codebase for training and using models of sentence embeddings.	33
materialsintelligence/mat2vec	Unsupervised word embeddings capture latent knowledge from materials science literature	624
florianmai/word2mat	A framework for learning sentence embeddings from matrices	21
samirhodzic/ngx-embed-video	A library for embedding video content from YouTube, Vimeo, and Dailymotion in web applications.	56
showlab/vlog	Transforms video content into a long document containing visual and audio information that can be used for chat or other applications.	545
gink03/alt-i2v	An implementation of a deep learning-based image representation learning approach using a modified fully connected layer and transfer learning from VGG16	34
jwieting/paragram-word	Trains word embeddings from a paraphrase database to represent semantic relationships between words.	30
jayleicn/tvqa	PyTorch implementation of a video question answering system based on the TVQA dataset	172