LLMs-from-scratch
GPT model development
Developing and pretraining a GPT-like Large Language Model from scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
35k stars
382 watching
4k forks
Language: Jupyter Notebook
last commit: 30 days ago
Linked from 2 awesome lists
chatgptgptlarge-language-modelsllmpythonpytorch
Related projects:
Repository | Description | Stars |
---|---|---|
mooler0410/llmspracticalguide | A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP | 9,551 |
mlabonne/llm-course | A comprehensive course and resource package on building and deploying Large Language Models (LLMs) | 40,053 |
zjh-819/llmdatahub | A curated collection of high-quality datasets for training large language models. | 2,708 |
llm-workflow-engine/llm-workflow-engine | A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4. | 3,674 |
pytorch/torchtitan | A native PyTorch library for training large language models using distributed parallelism and optimization techniques. | 2,765 |
rdspring1/pytorch_gbw_lm | Trains a large-scale PyTorch language model on the 1-Billion Word dataset | 123 |
nlpxucan/wizardlm | Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework | 9,295 |
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,428 |
openbmb/bmtools | Tools and platform for building and extending large language models | 2,907 |
nomic-ai/gpt4all | An open-source Python client for running Large Language Models (LLMs) locally on any device. | 71,176 |
peremartra/large-language-model-notebooks-course | A practical course teaching large language models and their applications through hands-on projects using OpenAI API and Hugging Face library. | 1,338 |
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. | 3,331 |
paulpierre/rasagpt | An open-source chatbot platform built on Rasa and Langchain with support for LLM indexing and Telegram integration. | 2,372 |
eth-sri/lmql | A language that enables efficient and constraint-guided programming with large language models | 3,731 |
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. | 8,487 |