llama-stack

AI toolkit

Provides a set of standardized APIs and tools to build generative AI applications

Composable building blocks to build Llama Apps

GitHub

5k stars
130 watching
579 forks
Language: Python
last commit: 3 days ago

Related projects:

Repository Description Stars
meta-llama/llama A collection of tools and utilities for deploying, fine-tuning, and utilizing large language models. 56,437
meta-llama/llama3 Provides pre-trained and instruction-tuned Llama 3 language models and tools for loading and running inference 27,138
meta-llama/llama-recipes Provides tools and examples for fine-tuning the Meta Llama model and building applications with it 15,126
lightning-ai/lit-llama An implementation of a large language model using the nanoGPT architecture 5,993
meta-llama/codellama Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks 16,039
scisharp/llamasharp A C#/.NET library to efficiently run Large Language Models (LLMs) on local devices 2,673
hiyouga/llama-factory A unified platform for fine-tuning multiple large language models with various training approaches and methods 34,436
run-llama/llama_index A data framework for augmenting Large Language Models (LLMs) with private data 36,776
ggerganov/llama.cpp Enables efficient inference of large language models using optimized C/C++ implementations and various backend frameworks 67,866
llmware-ai/llmware A framework for building enterprise LLM-based applications using small, specialized models 6,651
alpha-vllm/llama2-accessory An open-source toolkit for pretraining and fine-tuning large language models 2,720
confident-ai/deepeval A framework for evaluating large language models 3,669
run-llama/llamaindexts A data framework for integrating large language models into applications with custom data 1,937
microsoft/lmops A research initiative focused on developing fundamental technology to improve the performance and efficiency of large language models. 3,695
opengvlab/llama-adapter An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy 5,754