Emu3
Multimodal model trainer
A suite of multimodal models trained solely with next-token prediction to excel in generation and perception tasks
Next-Token Prediction is All You Need
2k stars
33 watching
76 forks
Language: Python
last commit: 3 months ago