llama-stack
AI toolkit
Provides a set of standardized APIs and tools to build generative AI applications
Composable building blocks to build Llama Apps
5k stars
130 watching
579 forks
Language: Python
last commit: 3 days ago Related projects:
Repository | Description | Stars |
---|---|---|
meta-llama/llama | A collection of tools and utilities for deploying, fine-tuning, and utilizing large language models. | 56,437 |
meta-llama/llama3 | Provides pre-trained and instruction-tuned Llama 3 language models and tools for loading and running inference | 27,138 |
meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,126 |
lightning-ai/lit-llama | An implementation of a large language model using the nanoGPT architecture | 5,993 |
meta-llama/codellama | Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks | 16,039 |
scisharp/llamasharp | A C#/.NET library to efficiently run Large Language Models (LLMs) on local devices | 2,673 |
hiyouga/llama-factory | A unified platform for fine-tuning multiple large language models with various training approaches and methods | 34,436 |
run-llama/llama_index | A data framework for augmenting Large Language Models (LLMs) with private data | 36,776 |
ggerganov/llama.cpp | Enables efficient inference of large language models using optimized C/C++ implementations and various backend frameworks | 67,866 |
llmware-ai/llmware | A framework for building enterprise LLM-based applications using small, specialized models | 6,651 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,720 |
confident-ai/deepeval | A framework for evaluating large language models | 3,669 |
run-llama/llamaindexts | A data framework for integrating large language models into applications with custom data | 1,937 |
microsoft/lmops | A research initiative focused on developing fundamental technology to improve the performance and efficiency of large language models. | 3,695 |
opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,754 |