vision_transformer

Vision transformer framework

Provides pre-trained models and code for training Vision Transformers and MLP-Mixers using JAX/Flax.

GitHub

11k stars
105 watching
1k forks
Language: Jupyter Notebook
Last commit: 7 months ago
Linked from 3 awesome lists



Related projects:

Repository | Description | Stars
google-research/big_vision | Supports large-scale vision model training on GPU machines or Google Cloud TPUs using scalable input pipelines. | 2,370
huggingface/transformers | A collection of pre-trained machine learning models for natural language and computer vision tasks that developers can fine-tune and deploy in their own projects. | 135,747
google-research/nested-transformer | An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency. | 193
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,517
google-research/text-to-text-transfer-transformer | Provides tools and libraries for training and fine-tuning large language models using transformer architectures. | 6,196
poloclub/transformer-explainer | An interactive visualization tool that helps users understand how large language models like GPT work. | 3,468
nvidia/megatron-lm | A framework for training large language models using scalable and optimized GPU techniques. | 10,685
labmlai/annotated_deep_learning_paper_implementations | Implementations of various deep learning algorithms and techniques with accompanying documentation. | 56,762
huggingface/trl | A library for training transformer language models with reinforcement learning, using various optimization techniques and fine-tuning methods. | 10,208
yitu-opensource/t2t-vit | A deep learning framework for training vision transformers from scratch on image data. | 1,151
dmlc/gluon-cv | A toolkit for building and deploying deep learning models in computer vision. | 5,845
donnyyou/torchcv | A comprehensive PyTorch-based framework for computer vision tasks. | 2,250
google-research/big_transfer | Pre-trained models and code for fine-tuning on image recognition tasks using deep learning frameworks. | 1,515
huggingface/transformers.js | Runs machine learning models directly in the browser without server-side support. | 12,240
vision-cair/minigpt-4 | Enables vision-language understanding by fine-tuning large language models on visual data. | 25,454