bandit-nmt
NMT framework
A framework for integrating policy gradient methods into neural machine translation models and evaluating their performance under simulated human feedback.
136 stars
13 watching
26 forks
Language: Python
last commit: over 7 years ago Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A comprehensive catalog of various neural machine translation implementations using different deep learning frameworks. | 359 |
| | A framework for federated learning that leverages the neural tangent kernel to address statistical heterogeneity in distributed machine learning. | 3 |
| | An implementation of a sequence-to-sequence model with attention mechanism using LSTMs and character embeddings for neural machine translation | 1,263 |
| | This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 214 |
| | A framework providing a generalized strategy holder for text classification | 11 |
| | An open-source software framework that integrates human advice into gradient boosting decision trees for improved performance in machine learning tasks. | 8 |
| | A Pytorch implementation of a neural network model for machine translation | 47 |
| | A PyTorch package implementing multi-task deep neural networks for natural language understanding | 2,238 |
| | An implementation of neural network components and optimization methods for text analysis, including rationales for neural predictions. | 355 |
| | An implementation of Community Preserving Network Embedding using deep learning and matrix factorization techniques | 121 |
| | Assesses generalization of multi-agent reinforcement learning algorithms to novel social situations | 637 |
| | An approach to train and optimize machine learning models in a decentralized setting by convexifying the optimization process | 4 |
| | An environment and framework for training reinforcement learning agents to make trading decisions on cryptocurrency markets. | 165 |
| | A PyTorch implementation of 2D convolutional neural networks for sequence-to-sequence prediction in machine translation | 502 |
| | A framework for parallel population-based reinforcement learning | 507 |