GPTCache

LLM Cache

A semantic cache designed to reduce the cost and improve the speed of LLM API calls by storing responses.

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

GitHub

7k stars
58 watching
511 forks
Language: Python
last commit: 4 months ago
Linked from 4 awesome lists

aigcautogptbabyagichatbotchatgptchatgpt-apidollygptlangchainllamallama-indexllmmemcachemilvusopenairedissemantic-searchsimilarity-searchvector-search

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
llm-workflow-engine/llm-workflow-engine A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4. 3,674
modeltc/lightllm A Python-based framework for serving large language models with low latency and high scalability. 2,691
tmc/langchaingo Provides a Go implementation of LangChain for generating text based on large language models. 5,155
mooler0410/llmspracticalguide A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP 9,551
mlabonne/llm-course A comprehensive course and resource package on building and deploying Large Language Models (LLMs) 40,053
pathwaycom/llm-app Provides pre-built AI application templates to integrate Large Language Models (LLMs) with various data sources for scalable RAG and enterprise search. 7,426
langchain-ai/langchainjs A framework for building context-aware reasoning applications using language models and composability 12,923
langchain-ai/langchain A framework for developing applications powered by large language models. 96,146
langchain-ai/chat-langchain A chatbot that leverages LangChain and LangGraph to generate real-time answers to user queries from a vector store of pre-loaded documentation. 5,531
nomic-ai/gpt4all An open-source Python client for running Large Language Models (LLMs) locally on any device. 71,176
dicklesworthstone/swiss_army_llama A FastAPI service that facilitates semantic text search using precomputed embeddings and advanced similarity measures. 947
nlpxucan/wizardlm Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework 9,295
rasbt/llms-from-scratch Developing and pretraining a GPT-like Large Language Model from scratch 35,405
langchain4j/langchain4j A unified Java API for integrating Large Language Models (LLMs) and vector stores into Java applications 5,082
alpha-vllm/llama2-accessory An open-source toolkit for pretraining and fine-tuning large language models 2,732