bigBatch
Batch trainer
A tool for training neural networks using large batch sizes and analyzing the trade-offs between longer training periods and better generalization performance.
Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training of neural networks"
148 stars
8 watching
24 forks
Language: Python
last commit: over 7 years ago Related projects:
Repository | Description | Stars |
---|---|---|
openbmb/bmtrain | A toolkit for training large models in a distributed manner while keeping code simple and efficient. | 563 |
bigscience-workshop/megatron-deepspeed | A collection of tools and scripts for training large transformer language models at scale | 1,335 |
openbmb/cpm-live | A live training platform for large-scale deep learning models, allowing community participation and collaboration in model development and deployment. | 511 |
jbloomaus/saelens | A tool for training and analyzing sparse autoencoders to improve the understanding of neural networks and create safer AI systems. | 461 |
martinkersner/train-deeplab | Trains DeepLab for semantic image segmentation using annotated data and convolutional neural networks | 172 |
madrylab/robustness | A library for training and evaluating neural networks with a focus on adversarial robustness. | 918 |
eladhoffer/convnet.pytorch | A PyTorch implementation of various deep convolutional networks for efficient training and evaluation on diverse datasets. | 347 |
neuralhydrology/neuralhydrology | A Python library for training neural networks with focus on hydrological applications using PyTorch. | 364 |
zhanghang1989/pytorch-encoding | A Python framework for building deep learning models with optimized encoding layers and batch normalization. | 2,041 |
maxpumperla/elephas | Enables distributed deep learning with Keras and Spark for scalable model training | 1,574 |
intelligent-machine-learning/dlrover | An automatic distributed deep learning system that simplifies the training of large AI models | 1,270 |
bobazooba/xllm | A tool for training and fine-tuning large language models using advanced techniques | 380 |
ahmedfgad/neuralgenetic | Tools and techniques for training neural networks using genetic algorithms | 240 |
microsoft/megatron-deepspeed | Research tool for training large transformer language models at scale | 1,895 |
6-billionaires/trading-gym | A platform for training reinforcement learning agents to trade in financial markets. | 230 |