GPTCache

LLM Cache

A semantic cache that reduces the cost and latency of LLM API calls by storing responses and reusing them for semantically similar queries.

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
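
The idea behind a semantic cache can be sketched in a few lines of Python. This is an illustration of the general technique only, not GPTCache's actual API. The names `embed`, `cosine`, `SemanticCache`, and `cached_call` are hypothetical, and the bag-of-words "embedding" is a stand-in assumption for the neural embedding model and vector store a real system would likely use.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding (assumption): lowercase bag-of-words counts.

    A real semantic cache would use a learned embedding model here.
    """
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Hypothetical in-memory cache keyed by prompt similarity."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # minimum similarity that counts as a hit
        self.entries = []           # list of (embedding, response) pairs

    def get(self, prompt):
        """Return the stored response of the most similar prompt, or None."""
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self.entries:  # linear scan, fine for a sketch
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def put(self, prompt, response):
        """Store a response under the embedding of its prompt."""
        self.entries.append((embed(prompt), response))


def cached_call(cache, prompt, llm):
    """Call `llm` only on a cache miss; `llm` is any callable taking a prompt."""
    hit = cache.get(prompt)
    if hit is not None:
        return hit
    response = llm(prompt)
    cache.put(prompt, response)
    return response
```

With this sketch, calling `cached_call` twice with prompts that differ only in case or punctuation should invoke the underlying model once, while an unrelated prompt falls below the threshold and triggers a fresh call. The threshold value is a tuning choice: set it too low and unrelated questions share answers, too high and the cache rarely hits.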

GitHub

7k stars
58 watching
502 forks
Language: Python
Last commit: 2 months ago
Linked from 4 awesome lists

Topics: aigc, autogpt, babyagi, chatbot, chatgpt, chatgpt-api, dolly, gpt, langchain, llama, llama-index, llm, memcache, milvus, openai, redis, semantic-search, similarity-search, vector-search

Related projects:

Repository | Description | Stars
llm-workflow-engine/llm-workflow-engine | A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4. | 3,659
modeltc/lightllm | An LLM inference and serving framework providing a lightweight design, scalability, and high-speed performance for large language models. | 2,609
tmc/langchaingo | Provides a Go implementation of LangChain for generating text based on large language models. | 4,635
mooler0410/llmspracticalguide | A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP. | 9,489
mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs). | 39,120
pathwaycom/llm-app | Pre-built templates for integrating large language models into enterprise applications with real-time data APIs and various data sources. | 4,642
langchain-ai/langchainjs | A framework for building context-aware reasoning applications using language models and composability. | 12,711
langchain-ai/langchain | A framework for building applications powered by large language models (LLMs) with tools for development, productionization, and deployment. | 94,887
langchain-ai/chat-langchain | A chatbot that leverages LangChain and LangGraph to generate real-time answers to user queries from a vector store of pre-loaded documentation. | 5,457
nomic-ai/gpt4all | An open-source Python client for running Large Language Models (LLMs) locally on any device. | 70,694
dicklesworthstone/swiss_army_llama | A FastAPI service that facilitates semantic text search using precomputed embeddings and advanced similarity measures. | 941
nlpxucan/wizardlm | Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework. | 9,268
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch. | 32,908
langchain4j/langchain4j | A unified Java API for integrating large language models and vector stores into applications. | 4,873
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models. | 2,720