tensorzero
LLM optimizer
A tool that creates a feedback loop for optimizing large language model applications by combining a model gateway with data analytics and machine learning capabilities.
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
569 stars
9 watching
26 forks
Language: Rust
last commit: 7 days ago
Linked from 1 awesome list
ai, ai-engineering, anthropic, artificial-intelligence, deep-learning, genai, generative-ai, gpt, large-language-models, llama, llm, llmops, llms, machine-learning, ml, ml-engineering, mlops, openai, python, rust
Related projects:
Repository | Description | Stars |
---|---|---|
kvcache-ai/ktransformers | A flexible framework for LLM inference optimizations with support for multiple models and architectures | 736 |
nvidia/tensorflow | An optimized version of TensorFlow to support newer hardware and libraries for NVIDIA GPU users | 996 |
lucfra/far-ho | A package for optimizing hyperparameters and meta-learning using gradient-based methods in TensorFlow | 187 |
lyogavin/anima | An optimization technique for large language models allowing them to run on limited hardware resources without significant performance loss | 6 |
aiplanethub/beyondllm | An open-source toolkit for building and evaluating large language models | 261 |
ai-hypercomputer/maxtext | A high-performance LLM written in Python/JAX for training and inference on Google Cloud TPUs and GPUs | 1,529 |
lge-arc-advancedai/auptimizer | Automates the model building and deployment process by optimizing hyperparameters and compressing models for edge computing | 200 |
brml/climin | A framework for optimizing machine learning functions using gradient-based optimization methods | 180 |
intel/neural-compressor | Tools and techniques for optimizing large language models on various frameworks and hardware platforms | 2,226 |
luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 508 |
bobazooba/xllm | A tool for training and fine-tuning large language models using advanced techniques | 380 |
deepseek-ai/deepseek-moe | A large language model with improved efficiency and performance compared to similar models | 1,006 |
iglaweb/tfprofiler | An app for profiling and optimizing the performance of TensorFlow Lite models on mobile devices | 27 |
damo-nlp-sg/m3exam | A benchmark for evaluating large language models in multiple languages and formats | 92 |
deepseek-ai/deepseek-llm | A large language model trained on a massive dataset for various applications | 1,450 |