recurrentgemma

Language model

An implementation of a fast and efficient language model architecture

Open-weights language model from Google DeepMind, based on the Griffin architecture.

613 stars

18 watching

26 forks

Language: Python

last commit: about 1 year ago

Linked from 1 awesome list

Backlinks from these awesome lists:

hannibal046/awesome-llm

Related projects:

Repository	Description	Stars
deepseek-ai/deepseek-moe	A large language model with improved efficiency and performance compared to similar models	1,024
ibm-granite/granite-3.0-language-models	A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources.	232
elanmart/psmm	An implementation of a neural network model for character-level language modeling.	50
gmftbygmftby/science-llm	A large-scale language model for the scientific domain, trained on the RedPajama arXiv split	125
l0sg/relational-rnn-pytorch	An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling	245
deepseek-ai/deepseek-llm	A large language model trained on a massive dataset for various applications	1,512
google-deepmind/functa	A repository containing code for a meta-learning experiment on image datasets	150
google-deepmind/jraph	A lightweight library for working with graph neural networks in JAX.	1,380
google-deepmind/einshape	A unified reshaping library for JAX and other frameworks.	100
google-research/flan	A repository providing tools and datasets to fine-tune language models for specific tasks	1,484
openai/pixel-cnn	A generative model with tractable likelihood and easy sampling, allowing for efficient data generation.	1,921
openai/finetune-transformer-lm	This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture.	2,167
vhellendoorn/code-lms	A guide to using pre-trained large language models in source code analysis and generation	1,789
google-deepmind/narrativeqa	A dataset collection providing text documents with corresponding summaries and questions.	463
google-deepmind/meltingpot	Assesses generalization of multi-agent reinforcement learning algorithms to novel social situations	637