bigBatch

Batch trainer

A tool for training neural networks using large batch sizes and analyzing the trade-offs between longer training periods and better generalization performance.

Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training of neural networks"

GitHub

148 stars
8 watching
24 forks
Language: Python
last commit: over 7 years ago

Related projects:

Repository Description Stars
openbmb/bmtrain A toolkit for training large models in a distributed manner while keeping code simple and efficient. 563
bigscience-workshop/megatron-deepspeed A collection of tools and scripts for training large transformer language models at scale 1,335
openbmb/cpm-live A live training platform for large-scale deep learning models, allowing community participation and collaboration in model development and deployment. 511
jbloomaus/saelens A tool for training and analyzing sparse autoencoders to improve the understanding of neural networks and create safer AI systems. 461
martinkersner/train-deeplab Trains DeepLab for semantic image segmentation using annotated data and convolutional neural networks 172
madrylab/robustness A library for training and evaluating neural networks with a focus on adversarial robustness. 918
eladhoffer/convnet.pytorch A PyTorch implementation of various deep convolutional networks for efficient training and evaluation on diverse datasets. 347
neuralhydrology/neuralhydrology A Python library for training neural networks with focus on hydrological applications using PyTorch. 364
zhanghang1989/pytorch-encoding A Python framework for building deep learning models with optimized encoding layers and batch normalization. 2,041
maxpumperla/elephas Enables distributed deep learning with Keras and Spark for scalable model training 1,574
intelligent-machine-learning/dlrover An automatic distributed deep learning system that simplifies the training of large AI models 1,270
bobazooba/xllm A tool for training and fine-tuning large language models using advanced techniques 380
ahmedfgad/neuralgenetic Tools and techniques for training neural networks using genetic algorithms 240
microsoft/megatron-deepspeed Research tool for training large transformer language models at scale 1,895
6-billionaires/trading-gym A platform for training reinforcement learning agents to trade in financial markets. 230