mmf
Multimodal framework
A modular framework for building vision and language multimodal research projects using PyTorch.
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
6k stars
114 watching
937 forks
Language: Python
last commit: 3 months ago captioningdeep-learningdialoghateful-memesmulti-taskingmultimodalpretrained-modelspytorchtextvqavqa
Related projects:
Repository | Description | Stars |
---|---|---|
| An open-source software framework for jointly optimizing face recognition models in federated learning settings. | 14 |
| A machine learning framework designed to be extensible and agnostic, with support for multiple backends and linear algebra libraries. | 1,114 |
| A platform for collaboration, data management, and reproducibility in machine learning development | 1,440 |
| An ML framework for accelerating research and its integration into production workflows | 264 |
| A unified framework for improving privacy and reducing communication overhead in distributed machine learning models. | 12 |
| A platform for object detection and segmentation tasks using machine learning algorithms | 30,778 |
| An approach to mitigating catastrophic forgetting in federated class incremental learning for vision tasks using a generative model and data-free methods | 15 |
| This repository provides an end-to-end language model capable of generating coherent text based on both spoken and written inputs. | 845 |
| An open-source framework for training machine learning models across decentralized data | 2,323 |
| A framework for implementing machine learning algorithms in Swift to make it easier for developers to incorporate ML into their projects. | 152 |
| Implementation of federated learning algorithms for distributed machine learning on private client data | 37 |
| Provides an infrastructure for machine learning in R, enabling users to focus on experiments without writing lengthy wrappers and boilerplate code. | 1,648 |
| A toolkit for training custom sequence-to-sequence models for various NLP tasks | 30,675 |
| Provides state-of-the-art video understanding codebase with efficient training methods and pre-trained models for various tasks | 6,680 |
| A framework for building federated AI systems with customization, extensibility, and flexibility in mind. | 5,219 |