lstm

LSTM model trainer

A package for training a neural network language model using a variant of the Long Short-Term Memory (LSTM) architecture.

665 stars
48 watching
283 forks
Language: Lua
Last commit: over 6 years ago
Linked from 2 awesome lists


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| wojzaremba/algorithm-learning | A framework to learn simple algorithms from examples by generating and visualizing intermediate solutions | 180 |
| bobazooba/xllm | A tool for training and fine-tuning large language models using advanced techniques | 387 |
| wojciechz/learning_to_execute | This software trains an LSTM-based neural network to predict Python code snippets | 479 |
| huashiyiqike/lstm-matlab | An implementation of the Long Short-Term Memory (LSTM) algorithm in MATLAB | 252 |
| cwxcode/lstm-matlab | A MATLAB implementation of a Long Short-Term Memory neural network architecture | 51 |
| csuhan/onellm | A framework for training and fine-tuning multimodal language models on various data types | 601 |
| yoonkim/lstm-char-cnn | An LSTM language model with a convolutional neural network over characters as input to predict words | 826 |
| openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture | 2,167 |
| volcengine/vescale | A PyTorch-based framework for training large language models in parallel on multiple devices | 679 |
| robromijnders/lstm_tsc | An implementation of a Long Short-Term Memory network for time-series classification using Python and TensorFlow | 410 |
| dasguptar/treelstm.pytorch | An implementation of Tree-LSTM networks in PyTorch for semantic similarity tasks using tree structures | 551 |
| gt-vision-lab/vqa_lstm_cnn | A Visual Question Answering model using a deeper LSTM and normalized CNN architecture | 377 |
| luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 513 |
| taketwo/llm-ollama | Provides an interface to model-based natural language understanding and generation on an Ollama server | 171 |
| ermlab/politbert | Trains a language model using a RoBERTa architecture on high-quality Polish text data | 33 |