text-embeddings-inference
Embedding Server
A toolkit for deploying and serving text embeddings models with high-performance inference capabilities.
A blazing fast inference solution for text embeddings models
3k stars
36 watching
190 forks
Language: Rust
last commit: 2 months ago
Linked from 1 awesome list
aiembeddingshuggingfacellmml
Related projects:
Repository | Description | Stars |
---|---|---|
| A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation | 9,456 |
| Provides tools and benchmarks for evaluating text embedding models | 2,021 |
| An open source framework for learning sentence embeddings using contrastive learning. | 3,457 |
| A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635 |
| An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,699 |
| A repository providing code and tools for building image-text embedding models using convolutional neural networks and word embeddings. | 287 |
| A toolkit providing optimized tokenizers for natural language processing tasks in various programming languages. | 9,156 |
| An all-in-one embeddings database for semantic search, LLM orchestration and language model workflows | 9,709 |
| An implementation of the Predictive Text Embedding model for learning word representations from large-scale heterogeneous text networks. | 96 |
| Provides dense vector representations for text using transformer networks | 15,556 |
| Generates large language model outputs in high-throughput mode on single GPUs | 9,236 |
| An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks | 803 |
| An NLP project offering various text classification models and techniques for deep learning exploration | 7,881 |
| A high-throughput, low-latency API for serving text and multimodal embeddings from various models. | 1,586 |
| Develops unified sentence embedding models for NLP tasks | 840 |