align-anything
Model aligner
Aligns large multimodal models with human intentions and values using various algorithms and fine-tuning methods.
Align Anything: Training All-modality Model with Feedback
270 stars
9 watching
53 forks
Language: Python
last commit: 3 months ago chameleondpolarge-language-modelsmultimodalrlhfvision-language-model
Related projects:
Repository | Description | Stars |
---|---|---|
| Extending pretraining models to handle multiple modalities by aligning language and video representations | 751 |
| An MLLM architecture designed to align visual and textual embeddings through structural alignment | 575 |
| This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 214 |
| This project proposes a novel method for calibrating attention distributions in multimodal models to improve contextualized representations of image-text pairs. | 30 |
| Evaluates and aligns the values of Chinese large language models with safety and responsibility standards | 481 |
| Aligns large language models' behavior through fine-grained correctional human feedback to improve trustworthiness and accuracy. | 245 |
| Evaluates and benchmarks large language models' video understanding capabilities | 121 |
| An open-source benchmark and evaluation tool for assessing multimodal large language models' performance in embodied decision-making tasks | 99 |
| Scripts for aligning laboratory speech production data in Inuktitut | 3 |
| A large vision-language model using a mixture-of-experts architecture to improve performance on multi-modal learning tasks | 2,023 |
| Tools for aligning laboratory speech production data to forced audio alignment using HTK and SoX. | 333 |
| Evaluating and improving large multimodal models through in-context learning | 21 |
| Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously. | 15 |
| A Python library that uses machine learning and natural language processing to improve translation accuracy by aligning source and target languages | 0 |
| An evaluation platform for comparing multi-modality models on visual question-answering tasks | 478 |