electra
Language model training
A method for pre-training transformer networks to learn language representations from text data without labeled supervision
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
2k stars
59 watching
354 forks
Language: Python
last commit: over 1 year ago deep-learningnlptensorflow
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Trains and evaluates a Chinese language model using adversarial training on a large corpus. | 140 |
| | Provides pre-trained Chinese language models based on the ELECTRA framework for natural language processing tasks | 1,405 |
| | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
| | An implementation of Google's 2018 BERT model in PyTorch, allowing pre-training and fine-tuning for natural language processing tasks | 6,251 |
| | An implementation of a machine learning-based communications system using deep learning techniques. | 127 |
| | A Python module for creating character-level or word-level neural networks for text generation and training on various datasets | 4,944 |
| | Analyzing knowledge development and evolution in large language models during training | 2,309 |
| | Enables distributed deep learning with Keras and Spark for scalable model training | 1,574 |
| | Implements Google's Text-to-Image Neural Network in PyTorch using a cascading DDPM architecture with dynamic clipping and noise level conditioning. | 8,127 |
| | An explanation of key concepts and advancements in the field of Machine Learning | 7,352 |
| | Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. | 6,997 |
| | Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning. | 3,189 |
| | Implementations of various deep learning algorithms and techniques with accompanying documentation | 57,177 |
| | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
| | Creates synthetic high-resolution spatiotemporal data for renewable energy resources using generative adversarial networks. | 88 |