infinity

Embedding API

A high-throughput, low-latency API for serving text and multimodal embeddings from various models.

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

GitHub

1k stars
18 watching
113 forks
Language: Python
last commit: 5 days ago
Linked from 2 awesome lists

bert-embeddingsllmtext-embeddings

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
different-ai/embedbase An API to connect LLMs with vector databases for search and retrieval of data. 498
plasticityai/magnitude A fast and efficient utility package for utilizing vector embeddings in machine learning models 1,627
eightbec/fastapi-ml-skeleton A FastAPI-based framework for serving machine learning models in production-ready applications 394
h3poteto/megalodon A Fediverse API client library for NodeJS and browsers providing unified interface to Mastodon, Pleroma, Friendica, and Firefish servers. 250
matthiasgomolka/simfinapi Makes SimFin data easily accessible in R by wrapping the SimFin Web-API 19
sampsyo/emfed A client-side library for embedding Mastodon feeds into web pages without requiring a backend server 78
fedihosting-foundation/plemmy A Python package providing an interface to interact with Lemmy API instances 49
marshmallow-code/flask-smorest A framework for building REST APIs with Flask and automatic API documentation 667
rejeep/f.el A modern API for working with files and directories in Emacs. 685
monadicsystems/okapi A data-driven micro web framework for building RESTful APIs in Haskell 104
vzhong/embeddings Provides fast and efficient word embeddings for natural language processing. 223
inf0rmer/blanket A simple API wrapper that allows developers to easily interact with APIs by appending URL parts to the base endpoint. 462
fursovia/geometric_embedding An implementation of a non-parameterized approach for building sentence representations 19
facebookresearch/poincare-embeddings Implementation of Poincaré Embeddings algorithm in PyTorch for hierarchical representation learning 1,681
mweibel/facebook.ex Provides an interface to interact with Facebook's Graph API 138