llama-stack

AI toolkit

Provides pre-packaged building blocks for generative AI applications, exposed through standardized APIs and a service-oriented design.

Composable building blocks to build Llama Apps

GitHub

5k stars
138 watching
659 forks
Language: Python
Last commit: 3 months ago

Related projects:

Repository | Description | Stars
meta-llama/llama | A collection of tools and utilities for deploying, fine-tuning, and utilizing large language models | 56,832
meta-llama/llama3 | Provides pre-trained and instruction-tuned Llama 3 language models and tools for loading and running inference | 27,527
meta-llama/llama-recipes | Provides tools and examples for fine-tuning the Meta Llama model and building applications with it | 15,578
lightning-ai/lit-llama | An implementation of a large language model using the nanoGPT architecture | 6,013
meta-llama/codellama | Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks | 16,097
scisharp/llamasharp | An efficient C#/.NET library for running Large Language Models (LLMs) on local devices | 2,750
hiyouga/llama-factory | A tool for efficiently fine-tuning large language models across multiple architectures and methods | 36,219
run-llama/llama_index | A data framework for augmenting Large Language Models (LLMs) with private data | 37,371
ggerganov/llama.cpp | Enables LLM inference with minimal setup and high performance on various hardware platforms | 69,185
llmware-ai/llmware | A framework for building enterprise LLM-based applications using small, specialized models | 8,303
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models | 2,732
confident-ai/deepeval | A framework for evaluating large language models | 4,003
run-llama/llamaindexts | A data framework for integrating large language models into applications with custom data | 1,997
microsoft/lmops | A research initiative focused on developing fundamental technology to improve the performance and efficiency of large language models | 3,747
opengvlab/llama-adapter | An implementation of a method for fine-tuning language models to follow instructions with high efficiency and accuracy | 5,775