maestro
fine-tuner
A tool to streamline the fine-tuning of multimodal models for vision-language tasks, including PaliGemma, Florence-2, and Qwen2-VL
1k stars
20 watching
102 forks
Language: Python
Last commit: 10 days ago
Topics: captioning, fine-tuning, florence-2, multimodal, objectdetection, paligemma, phi-3-vision, transformers, vision-and-language, vqa
Related projects:
Repository | Description | Stars |
---|---|---|
spandan-madan/pytorch_fine_tuning_tutorial | Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch. | 279 |
ys-zong/vlguard | Improves the safety and helpfulness of large language models by fine-tuning them on safety-critical tasks. | 45 |
n-waves/multifit | Reproduces results from a paper on efficient multilingual language model fine-tuning, using a framework rewritten on top of the fastai library. | 284 |
nicholas-leonard/drmad | A toolbox for efficient hyperparameter tuning in deep learning using Bayesian optimization and automatic differentiation. | 23 |
circleradon/osprey | A new approach to fine-grained visual understanding that uses pixel-wise mask regions in language instructions. | 770 |
zygmuntz/hyperband | A hyperparameter tuning framework with support for multiple machine learning models and algorithms. | 593 |
creafz/pytorch-cnn-finetune | A PyTorch-based framework for fine-tuning pre-trained convolutional neural networks on various architectures and datasets. | 724 |
codefuse-ai/mftcoder | A framework for fine-tuning large language models with multiple tasks to improve their accuracy and efficiency. | 637 |
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework. | 589 |
google-research/flan | A repository providing tools and datasets to fine-tune language models for specific tasks. | 1,474 |
salt-nlp/llavar | An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets. | 258 |
vlf-silkie/vlfeedback | An annotated preference dataset and training framework for improving large vision language models. | 85 |
icoz69/stablellava | A tool for generating and evaluating multimodal large language models with visual instruction tuning capabilities. | 91 |
guopengf/auto-fedrl | A reinforcement learning-based framework for optimizing hyperparameters in distributed machine learning environments. | 15 |
chendelong1999/polite-flamingo | Develops training methods to improve the politeness and natural flow of multimodal large language models. | 63 |