maxtext
LLM framework
A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs.
A simple, performant and scalable Jax LLM!
2k stars
40 watching
304 forks
Language: Python
last commit: about 1 month ago
Linked from 2 awesome lists
gptlarge-language-modelsllm
Related projects:
Repository | Description | Stars |
---|---|---|
opengvlab/lamm | A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. | 305 |
aiplanethub/beyondllm | An open-source toolkit for building and evaluating large language models | 267 |
wgryc/phasellm | A framework for managing and testing large language models to evaluate their performance and optimize user experiences. | 451 |
jina-ai/thinkgpt | A Python library to augment large language models by enabling them to think and reason more effectively | 1,550 |
deepseek-ai/deepseek-llm | A large language model trained on a massive dataset for various applications | 1,512 |
melih-unsal/demogpt | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,733 |
samholt/l2mac | Automates large code generation and writing tasks using a large language model framework | 79 |
google/paxml | A framework for configuring and running machine learning experiments on top of Jax. | 461 |
deepseek-ai/deepseek-moe | A large language model with improved efficiency and performance compared to similar models | 1,024 |
luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 513 |
damo-nlp-sg/m3exam | A benchmark for evaluating large language models in multiple languages and formats | 93 |
internlm/lagent | A lightweight framework for building agent-based applications using LLMs and transformer architectures | 1,924 |
huaizhengzhang/ai-system-school | A curated collection of research papers, articles, and resources on machine learning systems, including design principles, infrastructure, and best practices. | 2,710 |
ailab-cvc/seed | An implementation of a multimodal language model with capabilities for comprehension and generation | 585 |
blackhc/llm-strategy | Decouples software implementation from underlying logic using LLMs to automate parsing of structured data | 392 |