HunyuanDiT

Diffusion Transformer

A PyTorch-based diffusion transformer for text-to-image generation with fine-grained Chinese language understanding

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

GitHub

3k stars
43 watching
295 forks
Language: Python
last commit: about 1 month ago

Related projects:

Repository | Description | Stars
tencent/tencent-hunyuan-large | This project makes a large language model accessible for research and development | 1,114
tencent/tnn | A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms. | 4,415
huawei-noah/efficient-ai-backbones | A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab. | 4,054
kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,611
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,022
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,028
doubiiu/tooncrafter | Generates cartoon-style videos from two images using pre-trained diffusion models | 5,353
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,515
huawei-noah/pretrained-ipt | This project develops a pre-trained transformer model for image processing tasks such as denoising, super-resolution, and deraining. | 448
tencentarc/gfpgan | An algorithm for restoring damaged or obscured faces in images | 35,898
pku-yuangroup/video-llava | This project enables large language models to perform visual reasoning capabilities on images and videos simultaneously by learning united visual representations before projection. | 2,990
clovaai/stargan-v2 | A Python implementation of an image-to-image translation model for generating diverse images across multiple domains. | 3,500
thu-ml/tianshou | A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. | 7,968
doubiiu/dynamicrafter | This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. | 2,580