M3DBench

3D dataset

An open-source software project providing a comprehensive 3D instruction-following dataset with multi-modal prompts for training large language models.

[ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.

GitHub

57 stars
5 watching
2 forks
Language: Python
last commit: about 2 months ago
3ddatasetinstruction-tuningllmmlmmulti-modal

Related projects:

Repository Description Stars
fuenwang/layoutmp3d Provides a 3D layout dataset and annotation tools for training and testing 3D layout models. 27
open-mmlab/mmhuman3d Provides a modular framework and tools for working with 3D human parametric models in computer vision and graphics 1,240
umass-foundation-model/3d-llm Developing a Large Language Model capable of processing 3D representations as inputs 961
open-compass/mmbench A collection of benchmarks to evaluate the multi-modal understanding capability of large vision language models. 163
drorlab/atom3d Enables machine learning on three-dimensional molecular structure by providing tools and datasets for working with 3D molecular data 303
damo-nlp-sg/m3exam A benchmark for evaluating large language models in multiple languages and formats 92
oscarmcnulty/gta-3d-dataset A dataset of 2D images and 3D data generated from the Grand Theft Auto game engine for object localization research. 134
open-mmlab/mmengine Provides a flexible and configurable framework for training deep learning models with PyTorch. 1,179
openbmb/bmlist A curated list of large machine learning models tracked over time 341
alexa/massive A collection of tools and modeling code for a large multilingual Natural Language Understanding dataset 538
opendatalab/mllm-dataengine Automates data generation and model training for improving MLLM capabilities 36
open3da/ll3da An interactive system for understanding and interacting with 3D environments using natural language. 248
deepmimo/deepmimo-matlab Provides MATLAB code and dataset for training machine learning models in millimeter wave and massive MIMO systems 156
tri-ml/ddad A collection of data and tools for training algorithms to estimate dense depth in urban environments. 493
openbmb/viscpm A family of large multimodal models supporting multimodal conversational capabilities and text-to-image generation in multiple languages 1,089