ETBench

Video texter

A comprehensive framework for understanding and generating text from videos in real-time.

👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)

GitHub

42 stars
3 watching
0 forks
Language: Python
last commit: 3 months ago