Emu3

Multimodal model trainer

A suite of multimodal models trained solely with next-token prediction to excel at both generation and perception tasks.

Next-Token Prediction is All You Need

GitHub

2k stars
33 watching
76 forks
Language: Python
Last commit: 3 months ago