llama-stack

AI toolkit

Provides pre-packaged building blocks for generative AI applications with standardized APIs and service-oriented design.

Composable building blocks to build Llama Apps

GitHub

5k stars

138 watching

659 forks

Language: Python

last commit: about 1 year ago

Related projects:

Repository	Description	Stars
meta-llama/llama	A collection of tools and utilities for deploying, fine-tuning, and utilizing large language models.	56,832
meta-llama/llama3	Provides pre-trained and instruction-tuned Llama 3 language models and tools for loading and running inference	27,527
meta-llama/llama-recipes	Provides tools and examples for fine-tuning the Meta Llama model and building applications with it	15,578
lightning-ai/lit-llama	An implementation of a large language model using the nanoGPT architecture	6,013
meta-llama/codellama	Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks	16,097
scisharp/llamasharp	An efficient C#/.NET library for running Large Language Models (LLMs) on local devices	2,750
hiyouga/llama-factory	A tool for efficiently fine-tuning large language models across multiple architectures and methods.	36,219
run-llama/llama_index	A data framework for augmenting Large Language Models (LLMs) with private data	37,371
ggerganov/llama.cpp	Enables LLM inference with minimal setup and high performance on various hardware platforms	69,185
llmware-ai/llmware	A framework for building enterprise LLM-based applications using small, specialized models	8,303
alpha-vllm/llama2-accessory	An open-source toolkit for pretraining and fine-tuning large language models	2,732
confident-ai/deepeval	A framework for evaluating large language models	4,003
run-llama/llamaindexts	A data framework for integrating large language models into applications with custom data	1,997
microsoft/lmops	A research initiative focused on developing fundamental technology to improve the performance and efficiency of large language models.	3,747
opengvlab/llama-adapter	An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy	5,775