T2Vid

Text to image generator

A data augmentation method for improving the efficiency of training large language models on video data by generating synthetic images from long text instructions.

Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"

GitHub

44 stars
2 watching
0 forks
Language: Jupyter Notebook
last commit: about 1 month ago