levanter
Language model trainer
A framework for training large language models that prioritizes legibility, scalability, and reproducibility
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
527 stars
14 watching
85 forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A framework for training and utilizing large language models with knowledge augmentation capabilities | 1,251 |
| A framework for training and fine-tuning multimodal language models on various data types | 601 |
| A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. | 232 |
| An implementation of a convolutional Conditional Random Field model for semantic segmentation tasks. | 564 |
| Developing a Large Language Model capable of processing 3D representations as inputs | 979 |
| A lightweight model framework for validating and serializing data. | 2 |
| A flexible data model and API for representing linguistic data in a language-independent and theory-neutral way. | 15 |
| A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. | 1,557 |
| A lightweight, multilingual language model with a long context length | 920 |
| A pre-trained language model designed for various NLP tasks, including dialogue generation, code completion, and retrieval. | 94 |
| A framework for grounding language models to images and handling multimodal inputs and outputs | 478 |
| An implementation of a fast and efficient language model architecture | 613 |
| Provides pre-trained machine learning models for natural language processing tasks using Clojure and the clj-djl framework. | 0 |
| A tool for evaluating the performance of large language model APIs | 678 |
| A framework for configuring and running machine learning experiments on top of Jax. | 461 |