HunyuanDiT

Transformer model

A PyTorch model definition and inference/sampling code repository for a powerful diffusion transformer with fine-grained Chinese understanding

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

GitHub

4k stars

42 watching

318 forks

Language: Jupyter Notebook

last commit: 8 months ago

Screenshot of Tencent/HunyuanDiT website

dit.hunyuan.tencent.com/

Related projects:

Repository	Description	Stars
tencent/tencent-hunyuan-large	This project makes a large language model accessible for research and development	1,245
tencent/tnn	A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms.	4,435
huawei-noah/efficient-ai-backbones	A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab.	4,098
kimiyoung/transformer-xl	Implementations of a neural network architecture for language modeling	3,619
huggingface/transformers	A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects.	136,357
huawei-noah/pretrained-language-model	A collection of pre-trained language models and optimization techniques for efficient natural language processing	3,039
doubiiu/tooncrafter	Generates cartoon-style videos from two images using pre-trained diffusion models	5,447
sjtu-ipads/powerinfer	An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs	8,011
facebookresearch/metaseq	A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms.	6,519
huawei-noah/pretrained-ipt	This project develops a pre-trained transformer model for image processing tasks such as denoising, super-resolution, and deraining.	451
tencentarc/gfpgan	An algorithm for restoring damaged or obscured faces in images	36,009
pku-yuangroup/video-llava	A deep learning framework for generating videos from text inputs and visual features.	3,071
clovaai/stargan-v2	A Python implementation of an image-to-image translation model for generating diverse images across multiple domains.	3,513
thu-ml/tianshou	A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents.	8,069
doubiiu/dynamicrafter	This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors.	2,668