bandit-nmt
NMT framework
A framework for integrating policy gradient methods into neural machine translation models and evaluating their performance under simulated human feedback.
136 stars
13 watching
26 forks
Language: Python
last commit: about 7 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| A comprehensive catalog of various neural machine translation implementations using different deep learning frameworks. | 359 |
| A framework for federated learning that leverages the neural tangent kernel to address statistical heterogeneity in distributed machine learning. | 3 |
| An implementation of a sequence-to-sequence model with attention mechanism using LSTMs and character embeddings for neural machine translation | 1,263 |
| This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 214 |
| A framework providing a generalized strategy holder for text classification | 11 |
| An open-source software framework that integrates human advice into gradient boosting decision trees for improved performance in machine learning tasks. | 8 |
| A Pytorch implementation of a neural network model for machine translation | 47 |
| A PyTorch package implementing multi-task deep neural networks for natural language understanding | 2,238 |
| An implementation of neural network components and optimization methods for text analysis, including rationales for neural predictions. | 355 |
| An implementation of Community Preserving Network Embedding using deep learning and matrix factorization techniques | 121 |
| Assesses generalization of multi-agent reinforcement learning algorithms to novel social situations | 637 |
| An approach to train and optimize machine learning models in a decentralized setting by convexifying the optimization process | 4 |
| An environment and framework for training reinforcement learning agents to make trading decisions on cryptocurrency markets. | 165 |
| A PyTorch implementation of 2D convolutional neural networks for sequence-to-sequence prediction in machine translation | 502 |
| A framework for parallel population-based reinforcement learning | 507 |