recurrentgemma
Language model
An implementation of a fast and efficient language model architecture
Open weights language model from Google DeepMind, based on Griffin.
613 stars
18 watching
26 forks
Language: Python
last commit: 8 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A large language model with improved efficiency and performance compared to similar models | 1,024 |
| A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. | 232 |
| An implementation of a neural network model for character-level language modeling. | 50 |
| A large-scale language model for scientific domain training on redpajama arXiv split | 125 |
| An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling | 245 |
| A large language model trained on a massive dataset for various applications | 1,512 |
| A repository containing code for a meta-learning experiment on image datasets | 150 |
| A lightweight library for working with graph neural networks in jax. | 1,380 |
| A unified reshaping library for JAX and other frameworks. | 100 |
| A repository providing tools and datasets to fine-tune language models for specific tasks | 1,484 |
| A generative model with tractable likelihood and easy sampling, allowing for efficient data generation. | 1,921 |
| This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
| A guide to using pre-trained large language models in source code analysis and generation | 1,789 |
| A dataset collection providing text documents with corresponding summaries and questions. | 463 |
| Assesses generalization of multi-agent reinforcement learning algorithms to novel social situations | 637 |