bigBatch
Batch trainer
A tool for training neural networks using large batch sizes and analyzing the trade-offs between longer training periods and better generalization performance.
Code used to generate the results in the paper "Train longer, generalize better: closing the generalization gap in large batch training of neural networks".
148 stars
8 watching
24 forks
Language: Python
Last commit: over 7 years ago
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A toolkit for training large models in a distributed manner while keeping code simple and efficient. | 570 |
| | A collection of tools and scripts for training large transformer language models at scale. | 1,342 |
| | A live training platform for large-scale deep learning models, allowing community participation and collaboration in model development and deployment. | 511 |
| | A tool for training and analyzing sparse autoencoders to improve the understanding of neural networks and create safer AI systems. | 526 |
| | Trains a DeepLab model for semantic image segmentation using annotated data and various training procedures. | 172 |
| | A library for training and evaluating neural networks with a focus on adversarial robustness. | 921 |
| | A PyTorch implementation of various deep convolutional networks for efficient training and evaluation on diverse datasets. | 347 |
| | A Python library for training neural networks with a focus on hydrological applications, using PyTorch. | 372 |
| | A Python framework for building deep learning models with optimized encoding layers and batch normalization. | 2,044 |
| | Enables distributed deep learning with Keras and Spark for scalable model training. | 1,574 |
| | Automates large-scale deep learning training on distributed clusters, providing fault tolerance and fast recovery from failures. | 1,302 |
| | A tool for training and fine-tuning large language models using advanced techniques. | 387 |
| | Trains artificial neural networks using a genetic algorithm. | 241 |
| | Research tool for training large transformer language models at scale. | 1,926 |
| | A platform for training reinforcement learning agents to trade in financial markets. | 230 |