bandit-nmt

NMT framework

A framework for integrating policy gradient methods into neural machine translation models and evaluating their performance under simulated human feedback.

GitHub

136 stars

13 watching

26 forks

Language: Python

last commit: over 7 years ago

Related projects:

Repository	Description	Stars
jonsafari/nmt-list	A comprehensive catalog of various neural machine translation implementations using different deep learning frameworks.	359
kai-yue/ntk-fed	A framework for federated learning that leverages the neural tangent kernel to address statistical heterogeneity in distributed machine learning.	3
harvardnlp/seq2seq-attn	An implementation of a sequence-to-sequence model with attention mechanism using LSTMs and character embeddings for neural machine translation	1,263
ethanyanjiali/minchatgpt	This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2.	214
mustafaturan/omnicat	A framework providing a generalized strategy holder for text classification	11
harshakokel/kigb	An open-source software framework that integrates human advice into gradient boosting decision trees for improved performance in machine learning tasks.	8
kefirski/bytenet	A Pytorch implementation of a neural network model for machine translation	47
namisan/mt-dnn	A PyTorch package implementing multi-task deep neural networks for natural language understanding	2,238
taolei87/rcnn	An implementation of neural network components and optimization methods for text analysis, including rationales for neural predictions.	355
benedekrozemberczki/m-nmf	An implementation of Community Preserving Network Embedding using deep learning and matrix factorization techniques	121
google-deepmind/meltingpot	Assesses generalization of multi-agent reinforcement learning algorithms to novel social situations	637
yaodongyu/tct	An approach to train and optimize machine learning models in a decentralized setting by convexifying the optimization process	4
kkuette/tradzqai	An environment and framework for training reinforcement learning agents to make trading decisions on cryptocurrency markets.	165
elbayadm/attn2d	A PyTorch implementation of 2D convolutional neural networks for sequence-to-sequence prediction in machine translation	502
sjtu-marl/malib	A framework for parallel population-based reinforcement learning	507