Emu3

Multimodal model trainer

A suite of multimodal models trained solely with next-token prediction to excel in generation and perception tasks

Next-Token Prediction is All You Need

GitHub

2k stars
32 watching
71 forks
Language: Python
last commit: about 1 month ago