llama-recipes
LLM toolkit
Provides tools and examples for fine-tuning the Meta Llama model and building applications with it
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
16k stars
200 watching
2k forks
Language: Jupyter Notebook
last commit: 2 months ago aifinetuninglangchainllamallama2llmmachine-learningpythonpytorchvllm
Related projects:
Repository | Description | Stars |
---|---|---|
| A collection of tools and utilities for deploying, fine-tuning, and utilizing large language models. | 56,832 |
| Provides pre-trained and instruction-tuned Llama 3 language models and tools for loading and running inference | 27,527 |
| Enables LLM inference with minimal setup and high performance on various hardware platforms | 69,185 |
| Provides pre-packaged building blocks for generative AI applications with standardized APIs and service-oriented design. | 5,164 |
| An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,775 |
| Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks | 16,097 |
| An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |
| An efficient C#/.NET library for running Large Language Models (LLMs) on local devices | 2,750 |
| An implementation of a large language model using the nanoGPT architecture | 6,013 |
| A tool for efficiently fine-tuning large language models across multiple architectures and methods. | 36,219 |
| A minimalistic C implementation of the Llama 2 language model inference engine. | 17,593 |
| A data framework for augmenting Large Language Models (LLMs) with private data | 37,371 |
| Tuning a large language model on consumer hardware using low-rank adaptation | 18,710 |
| A system that uses large language and vision models to generate and process visual instructions | 20,683 |
| A data framework for integrating large language models into applications with custom data | 1,997 |