HunyuanDiT
Diffusion Transformer
A PyTorch-based diffusion transformer model for generating images with fine-grained Chinese understanding and text-to-image synthesis
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
3k stars
43 watching
295 forks
Language: Python
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
tencent/tencent-hunyuan-large | This project makes a large language model accessible for research and development | 1,114 |
tencent/tnn | A high-performance neural network inference framework supporting various deep learning frameworks and hardware platforms. | 4,415 |
huawei-noah/efficient-ai-backbones | A collection of efficient AI backbone architectures developed by Huawei Noah's Ark Lab. | 4,054 |
kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,611 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,022 |
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,028 |
doubiiu/tooncrafter | Generates cartoon-style videos from two images using pre-trained diffusion models | 5,353 |
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,515 |
huawei-noah/pretrained-ipt | This project develops a pre-trained transformer model for image processing tasks such as denoising, super-resolution, and deraining. | 448 |
tencentarc/gfpgan | An algorithm for restoring damaged or obscured faces in images | 35,898 |
pku-yuangroup/video-llava | This project enables large language models to perform visual reasoning capabilities on images and videos simultaneously by learning united visual representations before projection. | 2,990 |
clovaai/stargan-v2 | A Python implementation of an image-to-image translation model for generating diverse images across multiple domains. | 3,500 |
thu-ml/tianshou | A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. | 7,968 |
doubiiu/dynamicrafter | This project generates animated videos from open-domain images by leveraging pre-trained video diffusion priors. | 2,580 |