GPTCache
LLM Cache
A semantic cache designed to reduce the cost and improve the speed of LLM API calls by storing responses.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
7k stars
58 watching
511 forks
Language: Python
last commit: 5 months ago
Linked from 4 awesome lists
aigcautogptbabyagichatbotchatgptchatgpt-apidollygptlangchainllamallama-indexllmmemcachemilvusopenairedissemantic-searchsimilarity-searchvector-search
Related projects:
Repository | Description | Stars |
---|---|---|
| A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4. | 3,674 |
| A Python-based framework for serving large language models with low latency and high scalability. | 2,691 |
| Provides a Go implementation of LangChain for generating text based on large language models. | 5,155 |
| A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP | 9,551 |
| A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 40,053 |
| Provides pre-built AI application templates to integrate Large Language Models (LLMs) with various data sources for scalable RAG and enterprise search. | 7,426 |
| A framework for building context-aware reasoning applications using language models and composability | 12,923 |
| A framework for developing applications powered by large language models. | 96,146 |
| A chatbot that leverages LangChain and LangGraph to generate real-time answers to user queries from a vector store of pre-loaded documentation. | 5,531 |
| An open-source Python client for running Large Language Models (LLMs) locally on any device. | 71,176 |
| A FastAPI service that facilitates semantic text search using precomputed embeddings and advanced similarity measures. | 947 |
| Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework | 9,295 |
| Developing and pretraining a GPT-like Large Language Model from scratch | 35,405 |
| A unified Java API for integrating Large Language Models (LLMs) and vector stores into Java applications | 5,082 |
| An open-source toolkit for pretraining and fine-tuning large language models | 2,732 |