T2Vid
Text to image generator
A data augmentation method that improves the training efficiency of video large language models (Video-LLMs) by translating long text instructions into sequences of synthetic images.
Repository for the paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"
44 stars
2 watching
0 forks
Language: Jupyter Notebook
Last commit: about 1 month ago