text-embeddings-inference

Embedding Server

A toolkit for deploying and serving text embedding models with high-performance inference.

A blazing fast inference solution for text embeddings models

GitHub

Stars: 3k
Watching: 36
Forks: 190
Language: Rust
Last commit: 2 months ago
Linked from 1 awesome list

Tags: ai, embeddings, huggingface, llm, ml

Related projects:

Repository | Description | Stars
huggingface/text-generation-inference | A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation | 9,456
embeddings-benchmark/mteb | Provides tools and benchmarks for evaluating text embedding models | 2,021
princeton-nlp/simcse | An open source framework for learning sentence embeddings using contrastive learning | 3,457
plasticityai/magnitude | A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635
huggingface/peft | An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,699
layumi/image-text-embedding | A repository providing code and tools for building image-text embedding models using convolutional neural networks and word embeddings | 287
huggingface/tokenizers | A toolkit providing optimized tokenizers for natural language processing tasks in various programming languages | 9,156
neuml/txtai | An all-in-one embeddings database for semantic search, LLM orchestration and language model workflows | 9,709
mnqu/pte | An implementation of the Predictive Text Embedding model for learning word representations from large-scale heterogeneous text networks | 96
ukplab/sentence-transformers | Provides dense vector representations for text using transformer networks | 15,556
fminference/flexllmgen | Generates large language model outputs in high-throughput mode on single GPUs | 9,236
alexandres/lexvec | An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks | 803
brightmart/text_classification | An NLP project offering various text classification models and techniques for deep learning exploration | 7,881
michaelfeil/infinity | A high-throughput, low-latency API for serving text and multimodal embeddings from various models | 1,586
wangyuxinwhy/uniem | Develops unified sentence embedding models for NLP tasks | 840