maestro

fine-tuner

A tool to streamline fine-tuning of multimodal models for vision-language tasks

Streamlines the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL.

GitHub

1k stars

20 watching

103 forks

Language: Python

Last commit: 6 months ago

Topics: captioning, fine-tuning, florence-2, multimodal, objectdetection, paligemma, phi-3-vision, transformers, vision-and-language, vqa

maestro.roboflow.com

Related projects:

Repository	Description	Stars
spandan-madan/pytorch_fine_tuning_tutorial	Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch.	279
ys-zong/vlguard	Improves the safety and helpfulness of large language models by fine-tuning them on safety-critical tasks.	47
n-waves/multifit	Reproduces results from a paper on efficient multilingual language model fine-tuning, using a rewritten framework built on the fastai library.	284
nicholas-leonard/drmad	A toolbox for efficient hyperparameter tuning in deep learning using Bayesian optimization and automatic differentiation.	23
circleradon/osprey	Presents a new approach to fine-grained visual understanding using pixel-wise mask regions in language instructions.	781
zygmuntz/hyperband	A hyperparameter tuning framework with support for multiple machine learning models and algorithms.	594
creafz/pytorch-cnn-finetune	A PyTorch-based framework for fine-tuning pre-trained convolutional neural networks on various architectures and datasets.	726
codefuse-ai/mftcoder	A framework for fine-tuning large language models on multiple tasks to improve their accuracy and efficiency.	647
jerry1993-tech/cornucopia-llama-fin-chinese	A fine-tuning framework for Chinese finance-focused large language models.	596
google-research/flan	A repository providing tools and datasets to fine-tune language models for specific tasks.	1,484
salt-nlp/llavar	An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets.	259
vlf-silkie/vlfeedback	An annotated preference dataset and training framework for improving large vision language models.	88
icoz69/stablellava	A tool for generating and evaluating multimodal large language models with visual instruction tuning capabilities.	93
guopengf/auto-fedrl	A reinforcement learning-based framework for optimizing hyperparameters in distributed machine learning environments.	15
chendelong1999/polite-flamingo	Develops training methods to improve the politeness and natural flow of multimodal large language models.	63