recurrentgemma

Language model

An implementation of a fast and efficient language model architecture

Open-weights language model from Google DeepMind, based on the Griffin architecture.

GitHub

607 stars
18 watching
26 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list

Related projects:

| Repository | Description | Stars |
|---|---|---|
| deepseek-ai/deepseek-moe | A large language model with improved efficiency and performance compared to similar models | 1,006 |
| ibm-granite/granite-3.0-language-models | A collection of lightweight, state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources | 214 |
| elanmart/psmm | An implementation of a neural network model for character-level language modeling | 50 |
| gmftbygmftby/science-llm | A large-scale language model for the scientific domain, trained on the RedPajama arXiv split | 122 |
| l0sg/relational-rnn-pytorch | An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word-level language modeling | 244 |
| deepseek-ai/deepseek-llm | A large language model trained on a massive dataset for various applications | 1,450 |
| google-deepmind/functa | A repository containing code for a meta-learning experiment on image datasets | 149 |
| google-deepmind/jraph | A lightweight library for working with graph neural networks in JAX | 1,375 |
| google-deepmind/einshape | A unified reshaping library for JAX and other frameworks | 99 |
| google-research/flan | A repository providing tools and datasets to fine-tune language models for specific tasks | 1,474 |
| openai/pixel-cnn | A generative model with tractable likelihood and easy sampling, allowing for efficient data generation | 1,921 |
| openai/finetune-transformer-lm | Code and model for improving language understanding through generative pre-training with a transformer-based architecture | 2,160 |
| vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
| google-deepmind/narrativeqa | A dataset collection providing text documents with corresponding summaries and questions | 458 |
| google-deepmind/meltingpot | A suite that assesses generalization of multi-agent reinforcement learning algorithms to novel social situations | 620 |