Emu3
Multimodal model trainer
A suite of multimodal models trained solely with next-token prediction to excel in generation and perception tasks
Next-Token Prediction is All You Need
2k stars
32 watching
71 forks
Language: Python
Last commit: about 1 month ago