electra
Language model training
A method for pre-training transformer networks to learn language representations from text data without labeled supervision
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
2k stars
59 watching
354 forks
Language: Python
last commit: 11 months ago deep-learningnlptensorflow
Related projects:
Repository | Description | Stars |
---|---|---|
| Trains and evaluates a Chinese language model using adversarial training on a large corpus. | 140 |
| Provides pre-trained Chinese language models based on the ELECTRA framework for natural language processing tasks | 1,405 |
| This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
| An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks | 6,251 |
| An implementation of a machine learning-based communications system using deep learning techniques. | 127 |
| A Python module for creating character-level or word-level neural networks for text generation and training on various datasets | 4,944 |
| Analyzing knowledge development and evolution in large language models during training | 2,309 |
| Enables distributed deep learning with Keras and Spark for scalable model training | 1,574 |
| Implements Google's Text-to-Image Neural Network in PyTorch using a cascading DDPM architecture with dynamic clipping and noise level conditioning. | 8,127 |
| An explanation of key concepts and advancements in the field of Machine Learning | 7,352 |
| Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. | 6,997 |
| Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. | 3,189 |
| Implementations of various deep learning algorithms and techniques with accompanying documentation | 57,177 |
| A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
| Creates synthetic high-resolution spatiotemporal data for renewable energy resources using generative adversarial networks. | 88 |