maxtext

LLM framework

A high-performance LLM written in Python/Jax for training and inference on Google Cloud TPUs and GPUs.

A simple, performant and scalable Jax LLM!

GitHub

2k stars
37 watching
294 forks
Language: Python
last commit: 6 days ago
Linked from 2 awesome lists

gptlarge-language-modelsllm

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
opengvlab/lamm A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. 301
aiplanethub/beyondllm An open-source toolkit for building and evaluating large language models 263
wgryc/phasellm A framework for managing and testing large language models to evaluate their performance and optimize user experiences. 448
jina-ai/thinkgpt A Python library to augment large language models by enabling them to think and reason more effectively 1,548
deepseek-ai/deepseek-llm A large language model trained on a massive dataset for various applications 1,450
melih-unsal/demogpt A comprehensive toolset for building Large Language Model (LLM) based applications 1,710
samholt/l2mac Automates large code generation and writing tasks using a large language model framework 70
google/paxml A framework for configuring and running machine learning experiments on top of Jax. 457
deepseek-ai/deepseek-moe A large language model with improved efficiency and performance compared to similar models 1,006
luogen1996/lavin An open-source implementation of a vision-language instructed large language model 508
damo-nlp-sg/m3exam A benchmark for evaluating large language models in multiple languages and formats 92
internlm/lagent A lightweight framework for building agent-based applications using LLMs and transformer architectures 1,865
huaizhengzhang/ai-system-school A curated collection of research papers, articles, and resources on machine learning systems, including design principles, infrastructure, and best practices. 2,696
ailab-cvc/seed An implementation of a multimodal language model with capabilities for comprehension and generation 582
blackhc/llm-strategy Decouples software implementation from underlying logic using LLMs to automate parsing of structured data 388