GPTCache
LLM Cache
A semantic cache designed to reduce the cost and improve the speed of LLM API calls by storing responses.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
7k stars
58 watching
511 forks
Language: Python
last commit: 4 months ago
Linked from 4 awesome lists
Topics: aigc, autogpt, babyagi, chatbot, chatgpt, chatgpt-api, dolly, gpt, langchain, llama, llama-index, llm, memcache, milvus, openai, redis, semantic-search, similarity-search, vector-search
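The description above says a semantic cache saves cost and time by storing responses and reusing them for similar prompts. The sketch below illustrates that general idea only; it is not GPTCache's implementation or API. The names `ToySemanticCache`, `embed`, `cosine`, and `fake_llm` are hypothetical, the bag-of-words "embedding" stands in for a real embedding model, and the 0.8 threshold is an arbitrary assumption.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: bag-of-words counts (a stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToySemanticCache:
    """Hypothetical cache: reuse a stored response when a prompt is similar enough."""

    def __init__(self, llm, threshold=0.8):
        self.llm = llm              # callable that produces a response for a prompt
        self.threshold = threshold  # assumed similarity cutoff for a cache hit
        self.entries = []           # list of (embedding, response) pairs
        self.hits = 0
        self.misses = 0

    def ask(self, prompt):
        query = embed(prompt)
        # Linear scan for the most similar stored prompt (a real system
        # would typically use a vector index instead).
        best = max(self.entries, key=lambda e: cosine(query, e[0]), default=None)
        if best is not None and cosine(query, best[0]) >= self.threshold:
            self.hits += 1
            return best[1]
        # Cache miss: call the model and store the result for later reuse.
        self.misses += 1
        response = self.llm(prompt)
        self.entries.append((query, response))
        return response


calls = []


def fake_llm(prompt):
    """Stand-in for a paid LLM API call; records each invocation."""
    calls.append(prompt)
    return f"answer #{len(calls)}"


cache = ToySemanticCache(fake_llm, threshold=0.8)
first = cache.ask("What is a semantic cache?")
second = cache.ask("what is a semantic cache")   # near-duplicate: served from cache
third = cache.ask("How do I install Redis?")     # unrelated: triggers a new call
print(first, second, third, len(calls))
```

In this toy run the second prompt reuses the first response, so the stand-in model is called only twice for three prompts. Whether a hit is acceptable in practice depends on the embedding quality and the chosen threshold.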
Related projects:
Repository | Description | Stars |
---|---|---|
llm-workflow-engine/llm-workflow-engine | A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4. | 3,674 |
modeltc/lightllm | A Python-based framework for serving large language models with low latency and high scalability. | 2,691 |
tmc/langchaingo | Provides a Go implementation of LangChain for generating text based on large language models. | 5,155 |
mooler0410/llmspracticalguide | A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP. | 9,551 |
mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 40,053 |
pathwaycom/llm-app | Provides pre-built AI application templates to integrate Large Language Models (LLMs) with various data sources for scalable RAG and enterprise search. | 7,426 |
langchain-ai/langchainjs | A framework for building context-aware reasoning applications using language models and composability. | 12,923 |
langchain-ai/langchain | A framework for developing applications powered by large language models. | 96,146 |
langchain-ai/chat-langchain | A chatbot that leverages LangChain and LangGraph to generate real-time answers to user queries from a vector store of pre-loaded documentation. | 5,531 |
nomic-ai/gpt4all | An open-source Python client for running Large Language Models (LLMs) locally on any device. | 71,176 |
dicklesworthstone/swiss_army_llama | A FastAPI service that facilitates semantic text search using precomputed embeddings and advanced similarity measures. | 947 |
nlpxucan/wizardlm | Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework. | 9,295 |
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch. | 35,405 |
langchain4j/langchain4j | A unified Java API for integrating Large Language Models (LLMs) and vector stores into Java applications. | 5,082 |
alpha-vllm/llama2-accessory | An open-source toolkit for pretraining and fine-tuning large language models. | 2,732 |