GPTCache

LLM Cache

A semantic cache for LLMs that reduces the cost and latency of LLM API calls by storing responses and reusing them for semantically similar queries. Fully integrated with LangChain and llama_index.
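The idea of a semantic cache can be sketched in a few lines of Python. This is a toy illustration, not GPTCache's API: `SemanticCache`, `embed`, `cosine`, and `cached_call` are hypothetical names, the bag-of-words "embedding" stands in for a real embedding model, and the 0.8 threshold is an arbitrary assumption.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding (assumption): lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class SemanticCache:
    """Hypothetical in-memory cache keyed by prompt similarity."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold
        self.entries = []  # list of (embedding, response) pairs

    def get(self, prompt):
        """Return the stored response of the most similar prompt, or None."""
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vec, response in self.entries:
            score = cosine(query, vec)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def put(self, prompt, response):
        self.entries.append((embed(prompt), response))


def cached_call(cache, prompt, llm):
    """Call `llm` only when no sufficiently similar prompt is cached."""
    hit = cache.get(prompt)
    if hit is not None:
        return hit
    response = llm(prompt)
    cache.put(prompt, response)
    return response
```

Here `llm` is any callable that takes a prompt and returns a string. A repeated or closely reworded prompt is served from the cache, so the paid API call is skipped. A real deployment would typically replace the linear scan with a vector index.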

GitHub

7k stars

58 watching

511 forks

Language: Python

Last commit: almost 2 years ago

Linked from 4 awesome lists

Topics: aigc, autogpt, babyagi, chatbot, chatgpt, chatgpt-api, dolly, gpt, langchain, llama, llama-index, llm, memcache, milvus, openai, redis, semantic-search, similarity-search, vector-search

Repository: zilliztech/GPTCache

gptcache.readthedocs.io

Related projects:

Repository	Description	Stars
llm-workflow-engine/llm-workflow-engine	A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4.	3,674
modeltc/lightllm	A Python-based framework for serving large language models with low latency and high scalability.	2,691
tmc/langchaingo	Provides a Go implementation of LangChain for generating text based on large language models.	5,155
mooler0410/llmspracticalguide	A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP.	9,551
mlabonne/llm-course	A comprehensive course and resource package on building and deploying Large Language Models (LLMs).	40,053
pathwaycom/llm-app	Provides pre-built AI application templates to integrate Large Language Models (LLMs) with various data sources for scalable RAG and enterprise search.	7,426
langchain-ai/langchainjs	A framework for building context-aware reasoning applications using language models and composability.	12,923
langchain-ai/langchain	A framework for developing applications powered by large language models.	96,146
langchain-ai/chat-langchain	A chatbot that leverages LangChain and LangGraph to generate real-time answers to user queries from a vector store of pre-loaded documentation.	5,531
nomic-ai/gpt4all	An open-source Python client for running Large Language Models (LLMs) locally on any device.	71,176
dicklesworthstone/swiss_army_llama	A FastAPI service that facilitates semantic text search using precomputed embeddings and advanced similarity measures.	947
nlpxucan/wizardlm	Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework.	9,295
rasbt/llms-from-scratch	Developing and pretraining a GPT-like Large Language Model from scratch.	35,405
langchain4j/langchain4j	A unified Java API for integrating Large Language Models (LLMs) and vector stores into Java applications.	5,082
alpha-vllm/llama2-accessory	An open-source toolkit for pretraining and fine-tuning large language models.	2,732