snowflake-arctic
LLM inference stack
A project providing optimized stacks for fine-tuning and inference of large language models, focusing on low-latency and high-throughput performance.
525 stars
6 watching
47 forks
Language: Python
last commit: 8 months ago Related projects:
Repository | Description | Stars |
---|---|---|
| Detects whether a given text sequence is part of the training data used to train a large language model. | 23 |
| A large language model with improved efficiency and performance compared to similar models | 1,024 |
| A framework for Bayesian quantification of uncertainty in large-scale ice sheet models. | 5 |
| An open-source implementation of a vision-language instructed large language model | 513 |
| Measures the performance of deep learning models in various deployment scenarios. | 1,256 |
| A large-scale language model for scientific domain training on redpajama arXiv split | 125 |
| A framework to perform time series forecasting using machine learning models on large datasets. | 924 |
| A framework for leveraging language models in production code | 216 |
| A benchmark for evaluating large language models in multiple languages and formats | 93 |
| A benchmarking framework for federated machine learning tasks across various domains and datasets | 856 |
| A flexible framework for LLM inference optimizations with support for multiple models and architectures | 771 |
| Provides a set of tools and components to simplify the integration of Large Language Models into Dart/Flutter applications | 441 |
| A Common Lisp port of a Large Language Model (LLM) implementation | 36 |
| A platform for conducting high-performance federated learning simulations in Python. | 185 |
| A pre-trained language model for multiple natural language processing tasks with support for few-shot learning and transfer learning. | 656 |