lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.
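
The LoRA fine-tuning mentioned above works by freezing the pretrained weights and training only small low-rank adapter matrices added alongside them. As a minimal sketch of that general idea in plain PyTorch (not lit-llama's actual API; the class and hyperparameter names here are hypothetical):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: y = Wx + (alpha/r) * B(Ax)."""
    def __init__(self, in_features: int, out_features: int, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)
        self.linear.weight.requires_grad_(False)  # freeze the pretrained weight
        if self.linear.bias is not None:
            self.linear.bias.requires_grad_(False)
        self.lora_a = nn.Parameter(torch.randn(r, in_features) * 0.01)  # down-projection A
        self.lora_b = nn.Parameter(torch.zeros(out_features, r))        # up-projection B, zero-init
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Base output from the frozen weight plus the scaled low-rank correction.
        return self.linear(x) + (x @ self.lora_a.T @ self.lora_b.T) * self.scaling

layer = LoRALinear(512, 512, r=8)
x = torch.randn(4, 512)
print(layer(x).shape)  # torch.Size([4, 512]); only lora_a and lora_b receive gradients
```

Because B is zero-initialized, the adapter starts as a no-op and the model's initial outputs match the pretrained ones; only the two small matrices are updated during fine-tuning.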

GitHub
6k stars · 67 watching · 518 forks
Language: Python
Last commit: 29 days ago
Linked from 2 awesome lists

