GPTCache
LLM Cache
A semantic cache for LLMs that reduces the cost and improves the speed of LLM API calls by storing responses. Fully integrated with LangChain and llama_index.
7k stars
58 watching
502 forks
Language: Python
Last commit: 2 months ago
Linked from 4 awesome lists
aigc, autogpt, babyagi, chatbot, chatgpt, chatgpt-api, dolly, gpt, langchain, llama, llama-index, llm, memcache, milvus, openai, redis, semantic-search, similarity-search, vector-search
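To make the idea of a semantic cache concrete, here is a minimal sketch in Python. It is not GPTCache's API: the names `SemanticCache`, `embed`, `cosine`, and `fake_llm` are hypothetical, the bag-of-words "embedding" stands in for a real embedding model, and the linear scan stands in for a vector store. It only illustrates the general pattern of returning a stored response when a new prompt is similar enough to an earlier one.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding (assumption): lowercase bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Hypothetical cache: reuse a response when a prompt is similar enough."""

    def __init__(self, llm, threshold=0.8):
        self.llm = llm              # callable: prompt -> response
        self.threshold = threshold  # minimum similarity for a cache hit
        self.entries = []           # list of (embedding, response) pairs
        self.hits = 0
        self.misses = 0

    def ask(self, prompt):
        query = embed(prompt)
        best_score, best_response = 0.0, None
        # Linear scan; a real system would query a vector index instead.
        for vector, response in self.entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        if best_response is not None and best_score >= self.threshold:
            self.hits += 1
            return best_response
        # Cache miss: call the (expensive) model and store the result.
        self.misses += 1
        response = self.llm(prompt)
        self.entries.append((query, response))
        return response


def fake_llm(prompt):
    """Stand-in for a paid LLM API call."""
    return f"answer to: {prompt}"


if __name__ == "__main__":
    cache = SemanticCache(fake_llm)
    cache.ask("What is a semantic cache?")
    cache.ask("what is a semantic cache")   # served from the cache
    cache.ask("How do I bake bread?")       # unrelated, calls the model
    print(cache.hits, cache.misses)
```

The similarity threshold is the main design choice in a sketch like this: set it too low and unrelated prompts share answers, too high and near-duplicates still trigger a model call.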
Related projects:
Repository | Description | Stars |
---|---|---|
llm-workflow-engine/llm-workflow-engine | A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4. | 3,659 |
modeltc/lightllm | An LLM inference and serving framework providing a lightweight design, scalability, and high-speed performance for large language models. | 2,609 |
tmc/langchaingo | Provides a Go implementation of LangChain for generating text based on large language models. | 4,635 |
mooler0410/llmspracticalguide | A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP. | 9,489 |
mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs). | 39,120 |
pathwaycom/llm-app | Pre-built templates for integrating large language models into enterprise applications with real-time data APIs and various data sources. | 4,642 |
langchain-ai/langchainjs | A framework for building context-aware reasoning applications using language models and composability. | 12,711 |
langchain-ai/langchain | A framework for building applications powered by large language models (LLMs) with tools for development, productionization, and deployment. | 94,887 |
langchain-ai/chat-langchain | A chatbot that leverages LangChain and LangGraph to generate real-time answers to user queries from a vector store of pre-loaded documentation. | 5,457 |
nomic-ai/gpt4all | An open-source Python client for running Large Language Models (LLMs) locally on any device. | 70,694 |
dicklesworthstone/swiss_army_llama | A FastAPI service that facilitates semantic text search using precomputed embeddings and advanced similarity measures. | 941 |
nlpxucan/wizardlm | Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework. | 9,268 |
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch. | 32,908 |
langchain4j/langchain4j | A unified Java API for integrating large language models and vector stores into applications. | 4,873 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models. | 2,720 |