maestro
fine-tuner
A tool to streamline fine-tuning of multimodal models for vision-language tasks
streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
1k stars
20 watching
103 forks
Language: Python
last commit: 2 months ago captioningfine-tuningflorence-2multimodalobjectdetectionpaligemmaphi-3-visiontransformersvision-and-languagevqa
Related projects:
Repository | Description | Stars |
---|---|---|
| Provides guidance on fine-tuning pre-trained models for image classification tasks using PyTorch. | 279 |
| Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks | 47 |
| Reproduces results from a paper on efficient multi-lingual language model fine-tuning using a rewritten framework on top of the fastai library | 284 |
| A toolbox for efficient hyperparameter tuning in deep learning using Bayesian optimization and automatic differentiation | 23 |
| This project presents a new approach to fine-grained visual understanding using pixel-wise mask regions in language instructions | 781 |
| A hyperparameter tuning framework with support for multiple machine learning models and algorithms. | 594 |
| A PyTorch-based framework for fine-tuning pre-trained convolutional neural networks on various architectures and datasets. | 726 |
| A framework for fine-tuning large language models with multiple tasks to improve their accuracy and efficiency | 647 |
| A Chinese finance-focused large language model fine-tuning framework | 596 |
| A repository providing tools and datasets to fine-tune language models for specific tasks | 1,484 |
| An open-source project that enhances visual instruction tuning for text-rich image understanding by integrating GPT-4 models with multimodal datasets. | 259 |
| An annotated preference dataset and training framework for improving large vision language models. | 88 |
| A tool for generating and evaluating multimodal Large Language Models with visual instruction tuning capabilities | 93 |
| A reinforcement learning-based framework for optimizing hyperparameters in distributed machine learning environments. | 15 |
| Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models | 63 |