infinity

Embedding API

A high-throughput, low-latency API for serving text and multimodal embeddings from various models.

Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali.
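As a rough illustration of what calling such an embedding API might look like, the sketch below builds a JSON request and parses the reply using only the Python standard library. It assumes the server exposes an OpenAI-style POST /embeddings route. The base URL, the model name, and the helper names (build_request, parse_embeddings, cosine, embed) are hypothetical placeholders, not taken from the project's documentation.

```python
import json
import math
import urllib.request

# Assumption: a locally running server with an OpenAI-style POST /embeddings route.
BASE_URL = "http://localhost:7997"  # hypothetical address, adjust to your deployment


def build_request(texts, model):
    """Build (but do not send) an OpenAI-style embeddings request."""
    body = json.dumps({"model": model, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/embeddings",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_embeddings(response_body):
    """Extract the vectors from an OpenAI-style response, ordered by index."""
    items = sorted(json.loads(response_body)["data"], key=lambda d: d["index"])
    return [item["embedding"] for item in items]


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def embed(texts, model):
    """Send the request and return one vector per input text (needs a live server)."""
    with urllib.request.urlopen(build_request(texts, model)) as resp:
        return parse_embeddings(resp.read())
```

With a server running, embed(["first text", "second text"], "your-model-id") would return two vectors that can be compared with cosine; the model identifier here is a placeholder for whichever model the server was started with.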

GitHub

2k stars
18 watching
119 forks
Language: Python
Last commit: about 1 month ago
Linked from 2 awesome lists

bert-embeddings, llm, text-embeddings

Related projects:

| Repository | Description | Stars |
|---|---|---|
| different-ai/embedbase | An API to connect LLMs with vector databases for search and retrieval of data | 499 |
| plasticityai/magnitude | A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635 |
| eightbec/fastapi-ml-skeleton | A FastAPI-based framework for serving machine learning models in production-ready applications | 412 |
| h3poteto/megalodon | An API client library for interacting with multiple Fediverse platforms using a single interface | 255 |
| matthiasgomolka/simfinapi | Makes SimFin data easily accessible in R by wrapping the SimFin Web-API | 19 |
| sampsyo/emfed | A client-side library for embedding Mastodon feeds into web pages without requiring a backend server | 82 |
| fedihosting-foundation/plemmy | A Python package providing an interface to interact with Lemmy API instances | 50 |
| marshmallow-code/flask-smorest | A framework to build auto-documented REST APIs with Flask and marshmallow | 672 |
| rejeep/f.el | A modern API for working with files and directories in Emacs | 686 |
| monadicsystems/okapi | A data-driven micro web framework for building RESTful APIs in Haskell | 104 |
| vzhong/embeddings | Provides fast and efficient word embeddings for natural language processing | 223 |
| inf0rmer/blanket | A simple API wrapper that allows developers to easily interact with APIs by appending URL parts to the base endpoint | 462 |
| fursovia/geometric_embedding | An implementation of a non-parameterized approach for building sentence representations | 19 |
| facebookresearch/poincare-embeddings | Implementation of Poincaré Embeddings algorithm in PyTorch for hierarchical representation learning | 1,684 |
| mweibel/facebook.ex | Provides an interface to interact with Facebook's Graph API | 137 |