tensorzero

LLM optimizer

A tool that creates a feedback loop to optimize large language models by integrating model gateways and providing data analytics and machine learning capabilities.

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

GitHub

569 stars
9 watching
26 forks
Language: Rust
last commit: 7 days ago
Linked from 1 awesome list

aiai-engineeringanthropicartificial-intelligencedeep-learninggenaigenerative-aigptlarge-language-modelsllamallmllmopsllmsmachine-learningmlml-engineeringmlopsopenaipythonrust

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
kvcache-ai/ktransformers A flexible framework for LLM inference optimizations with support for multiple models and architectures 736
nvidia/tensorflow An optimized version of TensorFlow to support newer hardware and libraries for NVIDIA GPU users 996
lucfra/far-ho A package for optimizing hyperparameters and meta-learning using gradient-based methods in TensorFlow. 187
lyogavin/anima An optimization technique for large language models allowing them to run on limited hardware resources without significant performance loss. 6
aiplanethub/beyondllm An open-source toolkit for building and evaluating large language models 261
ai-hypercomputer/maxtext A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs. 1,529
lge-arc-advancedai/auptimizer Automates model building and deployment process by optimizing hyperparameters and compressing models for edge computing. 200
brml/climin A framework for optimizing machine learning functions using gradient-based optimization methods. 180
intel/neural-compressor Tools and techniques for optimizing large language models on various frameworks and hardware platforms. 2,226
luogen1996/lavin An open-source implementation of a vision-language instructed large language model 508
bobazooba/xllm A tool for training and fine-tuning large language models using advanced techniques 380
deepseek-ai/deepseek-moe A large language model with improved efficiency and performance compared to similar models 1,006
iglaweb/tfprofiler An app for profiling and optimizing the performance of TensorFlow Lite models on mobile devices 27
damo-nlp-sg/m3exam A benchmark for evaluating large language models in multiple languages and formats 92
deepseek-ai/deepseek-llm A large language model trained on a massive dataset for various applications 1,450