vision_transformer
Vision transformer framework
Provides pre-trained models and code for training Vision Transformers and MLP-Mixers using JAX/Flax
11k stars
105 watching
1k forks
Language: Jupyter Notebook
Last commit: 7 months ago
Linked from 3 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
google-research/big_vision | Supports large-scale vision model training on GPU machines or Google Cloud TPUs using scalable input pipelines. | 2,370 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,747 |
google-research/nested-transformer | An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency. | 193 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,517 |
google-research/text-to-text-transfer-transformer | Provides tools and libraries for training and fine-tuning large language models using transformer architectures. | 6,196 |
poloclub/transformer-explainer | An interactive visualization tool to help users understand how large language models like GPT work. | 3,468 |
nvidia/megatron-lm | A framework for training large language models using scalable and optimized GPU techniques. | 10,685 |
labmlai/annotated_deep_learning_paper_implementations | Implementations of various deep learning algorithms and techniques with accompanying documentation. | 56,762 |
huggingface/trl | A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. | 10,208 |
yitu-opensource/t2t-vit | A deep learning framework for training vision transformers from scratch on image data. | 1,151 |
dmlc/gluon-cv | A toolkit for building and deploying deep learning models in computer vision. | 5,845 |
donnyyou/torchcv | A comprehensive PyTorch-based framework for computer vision tasks. | 2,250 |
google-research/big_transfer | Pre-trained models and code for fine-tuning image recognition tasks using deep learning frameworks. | 1,515 |
huggingface/transformers.js | Runs machine learning models directly in the browser without server-side support. | 12,240 |
vision-cair/minigpt-4 | Enables vision-language understanding by fine-tuning large language models on visual data. | 25,454 |