recurrentgemma
Language model
An implementation of a fast and efficient language model architecture
Open weights language model from Google DeepMind, based on Griffin.
607 stars
18 watching
26 forks
Language: Python
last commit: 5 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
deepseek-ai/deepseek-moe | A large language model with improved efficiency and performance compared to similar models | 1,006 |
ibm-granite/granite-3.0-language-models | A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. | 214 |
elanmart/psmm | An implementation of a neural network model for character-level language modeling. | 50 |
gmftbygmftby/science-llm | A large-scale language model for scientific domain training on redpajama arXiv split | 122 |
l0sg/relational-rnn-pytorch | An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 244 |
deepseek-ai/deepseek-llm | A large language model trained on a massive dataset for various applications | 1,450 |
google-deepmind/functa | A repository containing code for a meta-learning experiment on image datasets | 149 |
google-deepmind/jraph | A lightweight library for working with graph neural networks in jax. | 1,375 |
google-deepmind/einshape | A unified reshaping library for JAX and other frameworks. | 99 |
google-research/flan | A repository providing tools and datasets to fine-tune language models for specific tasks | 1,474 |
openai/pixel-cnn | A generative model with tractable likelihood and easy sampling, allowing for efficient data generation. | 1,921 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,160 |
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
google-deepmind/narrativeqa | A dataset collection providing text documents with corresponding summaries and questions. | 458 |
google-deepmind/meltingpot | Assesses generalization of multi-agent reinforcement learning algorithms to novel social situations | 620 |