LLMs-from-scratch

GPT model development

Developing and pretraining a GPT-like Large Language Model from scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

GitHub

35k stars
382 watching
4k forks
Language: Jupyter Notebook
last commit: 30 days ago
Linked from 2 awesome lists

chatgptgptlarge-language-modelsllmpythonpytorch

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mooler0410/llmspracticalguide A curated list of resources to help developers navigate the landscape of large language models and their applications in NLP 9,551
mlabonne/llm-course A comprehensive course and resource package on building and deploying Large Language Models (LLMs) 40,053
zjh-819/llmdatahub A curated collection of high-quality datasets for training large language models. 2,708
llm-workflow-engine/llm-workflow-engine A command-line tool and workflow manager for interacting with large language models like ChatGPT/GPT4. 3,674
pytorch/torchtitan A native PyTorch library for training large language models using distributed parallelism and optimization techniques. 2,765
rdspring1/pytorch_gbw_lm Trains a large-scale PyTorch language model on the 1-Billion Word dataset 123
nlpxucan/wizardlm Large pre-trained language models trained to follow complex instructions using an evolutionary instruction framework 9,295
young-geng/easylm A framework for training and serving large language models using JAX/Flax 2,428
openbmb/bmtools Tools and platform for building and extending large language models 2,907
nomic-ai/gpt4all An open-source Python client for running Large Language Models (LLMs) locally on any device. 71,176
peremartra/large-language-model-notebooks-course A practical course teaching large language models and their applications through hands-on projects using OpenAI API and Hugging Face library. 1,338
thunlp/plmpapers Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. 3,331
paulpierre/rasagpt An open-source chatbot platform built on Rasa and Langchain with support for LLM indexing and Telegram integration. 2,372
eth-sri/lmql A language that enables efficient and constraint-guided programming with large language models 3,731
brexhq/prompt-engineering Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. 8,487