text-embeddings-inference

Text Embedding Inference Tool

A blazing fast inference solution for text embeddings models.

A blazing fast inference solution for text embeddings models

GitHub

3k stars
34 watching
177 forks
Language: Rust
last commit: 19 days ago
Linked from 1 awesome list

aiembeddingshuggingfacellmml

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
huggingface/text-generation-inference A toolkit for deploying and serving Large Language Models. 9,106
embeddings-benchmark/mteb A benchmarking suite for evaluating text embedding models across various NLP tasks and datasets. 1,952
princeton-nlp/simcse An open source framework for learning sentence embeddings using contrastive learning. 3,434
plasticityai/magnitude A fast and efficient utility package for utilizing vector embeddings in machine learning models 1,627
huggingface/peft An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters 16,505
layumi/image-text-embedding A repository providing code and tools for building image-text embedding models using convolutional neural networks and word embeddings. 287
huggingface/tokenizers A toolkit providing optimized tokenizers for natural language processing tasks in various programming languages. 9,051
neuml/txtai An all-in-one embeddings database for semantic search, LLM orchestration and language model workflows 9,369
mnqu/pte An implementation of the Predictive Text Embedding model for learning word representations from large-scale heterogeneous text networks. 96
ukplab/sentence-transformers Provides dense vector representations for text using transformer networks 15,382
fminference/flexllmgen Generates large language model outputs in high-throughput mode on single GPUs 9,192
alexandres/lexvec An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks 803
brightmart/text_classification An NLP project offering various text classification models and techniques for deep learning exploration 7,861
michaelfeil/infinity A high-throughput, low-latency API for serving text and multimodal embeddings from various models. 1,464
wangyuxinwhy/uniem Develops unified sentence embedding models for NLP tasks 833