HunyuanDiT
Transformer model
A PyTorch model definition and inference/sampling code repository for a powerful diffusion transformer with fine-grained Chinese understanding
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
4k stars
42 watching
318 forks
Language: Jupyter Notebook
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
tencent/tencent-hunyuan-large | This project makes a large language model accessible for research and development | 1,245 |
tencent/tnn | A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms. | 4,435 |
huawei-noah/efficient-ai-backbones | A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab. | 4,098 |
kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,619 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,039 |
doubiiu/tooncrafter | Generates cartoon-style videos from two images using pre-trained diffusion models | 5,447 |
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 8,011 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,519 |
huawei-noah/pretrained-ipt | This project develops a pre-trained transformer model for image processing tasks such as denoising, super-resolution, and deraining. | 451 |
tencentarc/gfpgan | An algorithm for restoring damaged or obscured faces in images | 36,009 |
pku-yuangroup/video-llava | A deep learning framework for generating videos from text inputs and visual features. | 3,071 |
clovaai/stargan-v2 | A Python implementation of an image-to-image translation model for generating diverse images across multiple domains. | 3,513 |
thu-ml/tianshou | A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. | 8,069 |
doubiiu/dynamicrafter | This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. | 2,668 |