lit-llama

Language Model

An implementation of a large language model using the nanoGPT architecture

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

GitHub

6k stars
69 watching
520 forks
Language: Python
last commit: 3 months ago
Linked from 2 awesome lists


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
opengvlab/llama-adapter An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy 5,754
meta-llama/llama-stack Provides a set of standardized APIs and tools to build generative AI applications 4,591
hiyouga/llama-factory A unified platform for fine-tuning multiple large language models with various training approaches and methods 34,436
ggerganov/llama.cpp Enables efficient inference of large language models using optimized C/C++ implementations and various backend frameworks 67,866
meta-llama/llama3 Provides pre-trained and instruction-tuned Llama 3 language models and tools for loading and running inference 27,138
meta-llama/llama A collection of tools and utilities for deploying, fine-tuning, and utilizing large language models. 56,437
meta-llama/llama-recipes Provides tools and examples for fine-tuning the Meta Llama model and building applications with it 15,126
scisharp/llamasharp A C#/.NET library to efficiently run Large Language Models (LLMs) on local devices 2,673
meta-llama/codellama Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks 16,039
tloen/alpaca-lora Tuning a large language model on consumer hardware using low-rank adaptation 18,651
meta-llama/purplellama A set of tools to help developers build responsibly with open generative AI models. 2,716
run-llama/llama_index A data framework for augmenting Large Language Models (LLMs) with private data 36,776
alpha-vllm/llama2-accessory An open-source toolkit for pretraining and fine-tuning large language models 2,720
young-geng/easylm A framework for training and serving large language models using JAX/Flax 2,409
nomic-ai/gpt4all An open-source Python client for running Large Language Models (LLMs) locally on any device. 70,694