lit-llama
Language Model
An implementation of a large language model using the nanoGPT architecture
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
6k stars
69 watching
520 forks
Language: Python
last commit: 3 months ago
Linked from 2 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,754 |
meta-llama/llama-stack | Provides a set of standardized APIs and tools to build generative AI applications | 4,591 |
hiyouga/llama-factory | A unified platform for fine-tuning multiple large language models with various training approaches and methods | 34,436 |
ggerganov/llama.cpp | Enables efficient inference of large language models using optimized C/C++ implementations and various backend frameworks | 67,866 |
meta-llama/llama3 | Provides pre-trained and instruction-tuned Llama 3 language models and tools for loading and running inference | 27,138 |
meta-llama/llama | A collection of tools and utilities for deploying, fine-tuning, and utilizing large language models. | 56,437 |
meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,126 |
scisharp/llamasharp | A C#/.NET library to efficiently run Large Language Models (LLMs) on local devices | 2,673 |
meta-llama/codellama | Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks | 16,039 |
tloen/alpaca-lora | Tuning a large language model on consumer hardware using low-rank adaptation | 18,651 |
meta-llama/purplellama | A set of tools to help developers build responsibly with open generative AI models. | 2,716 |
run-llama/llama_index | A data framework for augmenting Large Language Models (LLMs) with private data | 36,776 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,720 |
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,409 |
nomic-ai/gpt4all | An open-source Python client for running Large Language Models (LLMs) locally on any device. | 70,694 |