LoRA
Parameter reduction
A method for adapting large language models by freezing the pretrained weights and training small low-rank adaptation matrices, which sharply reduces the number of trainable parameters
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
11k stars
70 watching
686 forks
Language: Python
last commit: 3 months ago
Linked from 1 awesome list
adaptation, deberta, deep-learning, gpt-2, gpt-3, language-model, lora, low-rank, pytorch, roberta
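LoRA keeps the pretrained weight W0 frozen and learns a low-rank update BA on top of it, so only the small A and B matrices are trained and stored. The sketch below follows the typical loralib workflow described in the repository's README; the rank r=16 and the 768-dimensional layer are illustrative values, not defaults.

```python
# A minimal sketch of the loralib workflow; layer sizes and rank are
# illustrative choices, not library defaults.
import torch
import loralib as lora

# Swap nn.Linear for lora.Linear: the pretrained weight W0 stays frozen
# and a rank-r update B @ A is learned on top of it.
layer = lora.Linear(768, 768, r=16)
model = torch.nn.Sequential(layer)

# Set requires_grad=False on everything except the LoRA matrices
# (parameters whose names contain "lora_").
lora.mark_only_lora_as_trainable(model)

# ... train as usual ...

# Save only the small LoRA state dict instead of the full model weights.
torch.save(lora.lora_state_dict(model), "ckpt_lora.pt")
```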
Related projects:
Repository | Description | Stars |
---|---|---|
tloen/alpaca-lora | Tuning a large language model on consumer hardware using low-rank adaptation | 18,651 |
phoebussi/alpaca-cot | Provides a unified interface for fine-tuning large language models with parameter-efficient methods and a collection of instruction-tuning data | 2,619 |
huggingface/peft | An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,437 |
huggingface/lerobot | A platform providing pre-trained models, datasets, and tools for robotics, with a focus on imitation learning and reinforcement learning | 7,518 |
git-cloner/llama2-lora-fine-tuning | Fine-tuning the LLaMA 2 chat model on a large dataset using DeepSpeed and LoRA | 167 |
adapter-hub/adapters | A unified library for parameter-efficient and modular transfer learning in NLP tasks | 2,577 |
lich99/chatglm-finetune-lora | A codebase for fine-tuning the ChatGLM-6B language model using low-rank adaptation (LoRA), with finetuned weights provided | 724 |
meta-llama/codellama | Provides inference code and tools for fine-tuning large language models, specifically designed for code generation tasks | 16,039 |
microsoft/flaml | Automates machine learning workflows and optimizes model performance using large language models and efficient algorithms | 3,919 |
optimalscale/lmflow | A toolkit for finetuning large language models and providing efficient inference capabilities | 8,273 |
ermlab/politbert | Trains a language model using a RoBERTa architecture on high-quality Polish text data | 33 |
rasbt/llms-from-scratch | Developing and pretraining a GPT-like Large Language Model from scratch | 32,908 |
wybiral/micropython-lora | A MicroPython library for controlling Semtech SX127x LoRa radio modules over SPI | 36 |
peremartra/large-language-model-notebooks-course | A practical course on large language models and their applications, taught through hands-on projects using the OpenAI API and the Hugging Face library | 1,281 |
rdspring1/pytorch_gbw_lm | Trains a large-scale PyTorch language model on the 1-Billion Word dataset | 123 |