maxtext
LLM framework
A high-performance LLM written in Python/JAX for training and inference on Google Cloud TPUs and GPUs.
A simple, performant and scalable Jax LLM!
2k stars
37 watching
294 forks
Language: Python
last commit: 5 days ago
Linked from 2 awesome lists
gpt, large-language-models, llm
Related projects:
Repository | Description | Stars |
---|---|---|
opengvlab/lamm | A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. | 301 |
aiplanethub/beyondllm | An open-source toolkit for building and evaluating large language models | 261 |
wgryc/phasellm | A framework for managing and testing large language models to evaluate their performance and optimize user experiences. | 448 |
jina-ai/thinkgpt | A Python library to augment large language models by enabling them to think and reason more effectively | 1,544 |
deepseek-ai/deepseek-llm | A large language model trained on a massive dataset for various applications | 1,450 |
melih-unsal/demogpt | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,710 |
samholt/l2mac | Automates large code generation and writing tasks using a large language model framework | 70 |
google/paxml | A framework for configuring and running machine learning experiments on top of JAX. | 457 |
deepseek-ai/deepseek-moe | A large language model with improved efficiency and performance compared to similar models | 1,006 |
luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 508 |
damo-nlp-sg/m3exam | A benchmark for evaluating large language models in multiple languages and formats | 92 |
internlm/lagent | A lightweight framework for building agent-based applications using LLMs and transformer architectures | 1,865 |
huaizhengzhang/ai-system-school | A curated collection of research papers, articles, and resources on machine learning systems, including design principles, infrastructure, and best practices. | 2,690 |
ailab-cvc/seed | An implementation of a multimodal language model with capabilities for comprehension and generation | 576 |
blackhc/llm-strategy | Decouples software implementation from underlying logic using LLMs to automate parsing of structured data | 388 |