 infinity
 infinity 
 Embedding API
 A high-throughput, low-latency API for serving text and multimodal embeddings from various models.
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
2k stars
 18 watching
 119 forks
 
Language: Python 
last commit: 11 months ago 
Linked from   2 awesome lists  
  bert-embeddingsllmtext-embeddings 
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | An API to connect LLMs with vector databases for search and retrieval of data. | 499 | 
|  | A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635 | 
|  | A FastAPI-based framework for serving machine learning models in production-ready applications | 412 | 
|  | An API client library for interacting with multiple Fediverse platforms using a single interface. | 255 | 
|  | Makes SimFin data easily accessible in R by wrapping the SimFin Web-API | 19 | 
|  | A client-side library for embedding Mastodon feeds into web pages without requiring a backend server | 82 | 
|  | A Python package providing an interface to interact with Lemmy API instances | 50 | 
|  | A framework to build auto-documented REST APIs with Flask and marshmallow | 672 | 
|  | A modern API for working with files and directories in Emacs. | 686 | 
|  | A data-driven micro web framework for building RESTful APIs in Haskell | 104 | 
|  | Provides fast and efficient word embeddings for natural language processing. | 223 | 
|  | A simple API wrapper that allows developers to easily interact with APIs by appending URL parts to the base endpoint. | 462 | 
|  | An implementation of a non-parameterized approach for building sentence representations | 19 | 
|  | Implementation of Poincaré Embeddings algorithm in PyTorch for hierarchical representation learning | 1,684 | 
|  | Provides an interface to interact with Facebook's Graph API | 137 |