video-caption.pytorch

Video captioner

PyTorch implementation of video captioning, combining deep learning and computer vision techniques.

pytorch implementation of video captioning

GitHub

402 stars
11 watching
131 forks
Language: Python
last commit: over 5 years ago
deep-learningpytorchvideo-captioning

Related projects:

Repository Description Stars
chaoyuaw/pytorch-coviar A PyTorch implementation of a compressed video action recognition system 502
yiwuzhong/sub-gc A PyTorch implementation of image captioning models via scene graph decomposition. 96
kacky24/stylenet A PyTorch implementation of a framework for generating captions with styles for images and videos. 63
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,236
nickjiang2378/vl-interp This project provides an official PyTorch implementation of a method to interpret and edit vision-language representations to mitigate hallucinations in image captions. 46
ruotianluo/imagecaptioning.pytorch A Python-based framework for training and testing image captioning models using PyTorch. 1,458
boyuanjiang/context_encoder_pytorch A PyTorch implementation of a deep learning model for inpainting images using contextual information 366
clementpinard/sfmlearner-pytorch Pytorch implementation of unsupervised depth and ego-motion learning from video sequences 1,022
ruotianluo/self-critical.pytorch An implementation of Self-critical Sequence Training for Image Captioning and related techniques. 998
kaixhin/grokking-pytorch An introduction to using PyTorch for deep learning tasks 1,194
xiayandi/pytorch_text_classification An implementation of convolutional neural networks for text classification using PyTorch 66
zhanghang1989/pytorch-encoding A Python framework for building deep learning models with optimized encoding layers and batch normalization. 2,044
zijundeng/pytorch-semantic-segmentation Provides PyTorch implementations of various models and pipelines for semantic segmentation in deep learning. 1,729
yjxiong/tsn-pytorch A PyTorch implementation of Temporal Segment Networks (TSN) for action recognition and video understanding. 1,066
xxradon/igcv3-pytorch Reimplements MobileNet-V2 and IGCV3 using PyTorch for efficient deep learning. 19