infinity
Embedding API
A high-throughput, low-latency API for serving text and multimodal embeddings from various models.
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali.
2k stars
18 watching
119 forks
Language: Python
Last commit: 2 months ago
Linked from 2 awesome lists
bert-embeddings, llm, text-embeddings
Related projects:
| Description | Stars |
|---|---|
| An API to connect LLMs with vector databases for search and retrieval of data. | 499 |
| A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635 |
| A FastAPI-based framework for serving machine learning models in production-ready applications | 412 |
| An API client library for interacting with multiple Fediverse platforms using a single interface. | 255 |
| Makes SimFin data easily accessible in R by wrapping the SimFin Web-API | 19 |
| A client-side library for embedding Mastodon feeds into web pages without requiring a backend server | 82 |
| A Python package providing an interface to interact with Lemmy API instances | 50 |
| A framework to build auto-documented REST APIs with Flask and marshmallow | 672 |
| A modern API for working with files and directories in Emacs. | 686 |
| A data-driven micro web framework for building RESTful APIs in Haskell | 104 |
| Provides fast and efficient word embeddings for natural language processing. | 223 |
| A simple API wrapper that allows developers to easily interact with APIs by appending URL parts to the base endpoint. | 462 |
| An implementation of a non-parameterized approach for building sentence representations | 19 |
| Implementation of Poincaré Embeddings algorithm in PyTorch for hierarchical representation learning | 1,684 |
| Provides an interface to interact with Facebook's Graph API | 137 |