swiss_army_llama

LLM toolset

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
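
To make "semantic text search using precomputed embeddings" concrete, here is a minimal sketch of the general idea. It is not taken from swiss_army_llama: the function names (`cosine_similarity`, `search`), the toy three-dimensional vectors, and the in-memory dictionary index are all assumptions made for illustration, and plain cosine similarity stands in for whatever similarity measures the project actually provides.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query_vec, index, top_k=3):
    """Rank precomputed document embeddings by similarity to the query embedding."""
    scored = [(doc_id, cosine_similarity(query_vec, vec)) for doc_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical precomputed embeddings; a real service would get these from a model.
index = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.0, 1.0, 0.0],
    "doc_c": [0.7, 0.7, 0.0],
}

results = search([1.0, 0.1, 0.0], index, top_k=2)
for doc_id, score in results:
    print(f"{doc_id}: {score:.3f}")
```

Because the document vectors are computed ahead of time, each query only needs one embedding call followed by a scan (or an approximate nearest-neighbour lookup) over the stored vectors.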

GitHub

947 stars

16 watching

55 forks

Language: Python

last commit: over 1 year ago

Linked from 2 awesome lists

Topics: embedding-similarity, embedding-vectors, embeddings, llama2, llamacpp, semantic-search

Related projects:

Repository	Description	Stars
snunez1/llama.cl	A Common Lisp port of a Large Language Model (LLM) implementation	36
run-llama/llamaindexts	A data framework for integrating large language models into applications with custom data	1,997
victordibia/llmx	An API that provides a unified interface to multiple chat fine-tuned large language models	79
melih-unsal/demogpt	A comprehensive toolset for building Large Language Model (LLM) based applications	1,733
msoedov/langcorn	A framework for serving large language models with a robust and efficient API	909
jetxu-llm/llama-github	Empowers AI-driven applications to retrieve relevant code snippets and repository information from GitHub using LLM-powered question analysis	258
damo-nlp-sg/m3exam	A benchmark for evaluating large language models in multiple languages and formats	93
pors/langchain-chat-websockets	An open-source chat application using LangChain and FastAPI that enables real-time conversations with an LLM	91
linksoul-ai/chinese-llama-2-7b	A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data	2,235
aiplanethub/beyondllm	An open-source toolkit for building and evaluating large language models	267
km1994/llmsninestorydemontower	Exploring various LLMs and their applications in natural language processing and related areas	1,854
ai-hypercomputer/maxtext	A high-performance LLM written in Python/JAX for training and inference on Google Cloud TPUs and GPUs	1,557
deepseek-ai/deepseek-moe	A large language model with improved efficiency and performance compared to similar models	1,024
blackhc/llm-strategy	Decouples software implementation from underlying logic using LLMs to automate parsing of structured data	392
ngxson/wllama	A WebAssembly binding for the LLaMA model that enables on-browser inference without requiring a backend or GPU	465