OneLLM

Language model trainer

A framework for training and fine-tuning multimodal language models on various data types

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

GitHub

601 stars

11 watching

33 forks

Language: Python

last commit: 10 months ago

Linked from 1 awesome list

Backlinks from these awesome lists:

bradyfu/awesome-multimodal-large-language-models

Related projects:

Repository	Description	Stars
microsoft/mpnet	Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning.	288
yunwentechnology/unilm	This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese.	439
bobazooba/xllm	A tool for training and fine-tuning large language models using advanced techniques	387
elanmart/psmm	An implementation of a neural network model for character-level language modeling.	50
openai/finetune-transformer-lm	This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture.	2,167
vhellendoorn/code-lms	A guide to using pre-trained large language models in source code analysis and generation	1,789
bilibili/index-1.9b	A lightweight, multilingual language model with a long context length	920
llava-vl/llava-plus-codebase	A platform for training and deploying large language and vision models that can use tools to perform tasks	717
bytedance/lynx-llm	A framework for training GPT4-style language models with multimodal inputs using large datasets and pre-trained models	231
openai/lm-human-preferences	Training methods and tools for fine-tuning language models using human preferences	1,240
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
yiren-jian/blitext	Develops and trains models for vision-language learning with decoupled language pre-training	24
pleisto/yuren-baichuan-7b	A multi-modal large language model that integrates natural language and visual capabilities with fine-tuning for various tasks	73
lyuchenyang/macaw-llm	A multi-modal language model that integrates image, video, audio, and text data to improve language understanding and generation	1,568
luogen1996/lavin	An open-source implementation of a vision-language instructed large language model	513