T2Vid

Text to image generator

A data augmentation method for improving the efficiency of training large language models on video data by generating synthetic images from long text instructions.

Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"

GitHub

44 stars

2 watching

0 forks

Language: Jupyter Notebook

last commit: over 1 year ago