HunyuanDiT

Transformer model

A PyTorch model definition and inference/sampling code repository for a powerful diffusion transformer with fine-grained Chinese understanding

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

GitHub

4k stars
42 watching
318 forks
Language: Jupyter Notebook
last commit: about 1 month ago

Related projects:

Repository Description Stars
tencent/tencent-hunyuan-large This project makes a large language model accessible for research and development 1,245
tencent/tnn A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms. 4,435
huawei-noah/efficient-ai-backbones A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab. 4,098
kimiyoung/transformer-xl Implementations of a neural network architecture for language modeling 3,619
huggingface/transformers A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. 136,357
huawei-noah/pretrained-language-model A collection of pre-trained language models and optimization techniques for efficient natural language processing 3,039
doubiiu/tooncrafter Generates cartoon-style videos from two images using pre-trained diffusion models 5,447
sjtu-ipads/powerinfer An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs 8,011
facebookresearch/metaseq A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. 6,519
huawei-noah/pretrained-ipt This project develops a pre-trained transformer model for image processing tasks such as denoising, super-resolution, and deraining. 451
tencentarc/gfpgan An algorithm for restoring damaged or obscured faces in images 36,009
pku-yuangroup/video-llava A deep learning framework for generating videos from text inputs and visual features. 3,071
clovaai/stargan-v2 A Python implementation of an image-to-image translation model for generating diverse images across multiple domains. 3,513
thu-ml/tianshou A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. 8,069
doubiiu/dynamicrafter This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. 2,668