swiss_army_llama
LLM toolset
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
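The idea of searching over precomputed embeddings can be illustrated with a minimal sketch. This is not the project's API: the function names `cosine_similarity` and `search`, the in-memory `index` dictionary, and the toy three-dimensional vectors are all assumptions made for illustration, and cosine similarity stands in for whichever similarity measures the service actually offers.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query_vec, index, top_k=3):
    """Rank precomputed document embeddings by similarity to the query embedding."""
    scored = [(doc_id, cosine_similarity(query_vec, vec)) for doc_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical precomputed embeddings; a real model would produce much longer vectors.
index = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.0, 1.0, 0.0],
    "doc_c": [0.7, 0.7, 0.0],
}

results = search([1.0, 0.1, 0.0], index, top_k=2)
for doc_id, score in results:
    print(f"{doc_id}: {score:.3f}")
```

Because the document vectors are computed once and stored, each query only needs one embedding call plus a similarity pass over the stored vectors.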
941 stars
16 watching
52 forks
Language: Python
last commit: about 2 months ago
Linked from 2 awesome lists
embedding-similarity, embedding-vectors, embeddings, llama2, llamacpp, semantic-search
Related projects:
Repository | Description | Stars |
---|---|---|
snunez1/llama.cl | A Common Lisp port of a Large Language Model (LLM) implementation | 35 |
run-llama/llamaindexts | A data framework for integrating large language models into applications with custom data | 1,960 |
victordibia/llmx | An API that provides a unified interface to multiple chat fine-tuned large language models | 79 |
melih-unsal/demogpt | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,710 |
msoedov/langcorn | A framework for serving large language models with a robust and efficient API | 906 |
jetxu-llm/llama-github | A Python library that uses LLMs to retrieve relevant code snippets and information from GitHub based on user queries | 242 |
damo-nlp-sg/m3exam | A benchmark for evaluating large language models in multiple languages and formats | 92 |
pors/langchain-chat-websockets | An open-source chat application using LangChain and FastAPI that enables real-time conversations with an LLM | 89 |
linksoul-ai/chinese-llama-2-7b | A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data | 2,228 |
aiplanethub/beyondllm | An open-source toolkit for building and evaluating large language models | 263 |
km1994/llmsninestorydemontower | Exploring various LLMs and their applications in natural language processing and related areas | 1,798 |
ai-hypercomputer/maxtext | A high-performance LLM written in Python/JAX for training and inference on Google Cloud TPUs and GPUs | 1,529 |
deepseek-ai/deepseek-moe | A large language model with improved efficiency and performance compared to similar models | 1,006 |
blackhc/llm-strategy | Decouples software implementation from underlying logic using LLMs to automate parsing of structured data | 388 |
ngxson/wllama | A WebAssembly binding for a large language model inference engine | 437 |