maestro

fine-tuner

A tool to streamline fine-tuning of multimodal models for vision-language tasks

streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL

GitHub

1k stars
20 watching
102 forks
Language: Python
last commit: 10 days ago
captioningfine-tuningflorence-2multimodalobjectdetectionpaligemmaphi-3-visiontransformersvision-and-languagevqa

Related projects:

Repository Description Stars
spandan-madan/pytorch_fine_tuning_tutorial Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch. 279
ys-zong/vlguard Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks 45
n-waves/multifit Reproduces results from a paper on efficient multi-lingual language model fine-tuning using a rewritten framework on top of the fastai library 284
nicholas-leonard/drmad A toolbox for efficient hyperparameter tuning in deep learning using Bayesian optimization and automatic differentiation 23
circleradon/osprey This project presents a new approach to fine-grained visual understanding using pixel-wise mask regions in language instructions 770
zygmuntz/hyperband A hyperparameter tuning framework with support for multiple machine learning models and algorithms. 593
creafz/pytorch-cnn-finetune A PyTorch-based framework for fine-tuning pre-trained convolutional neural networks on various architectures and datasets. 724
codefuse-ai/mftcoder A framework for fine-tuning large language models with multiple tasks to improve their accuracy and efficiency 637
jerry1993-tech/cornucopia-llama-fin-chinese A Chinese finance-focused large language model fine-tuning framework 589
google-research/flan A repository providing tools and datasets to fine-tune language models for specific tasks 1,474
salt-nlp/llavar An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets. 258
vlf-silkie/vlfeedback An annotated preference dataset and training framework for improving large vision language models. 85
icoz69/stablellava A tool for generating and evaluating multimodal Large Language Models with visual instruction tuning capabilities 91
guopengf/auto-fedrl A reinforcement learning-based framework for optimizing hyperparameters in distributed machine learning environments. 15
chendelong1999/polite-flamingo Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models 63