LL3DA

3D assistant

An interactive system for understanding and interacting with 3D environments using natural language.

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

GitHub

248 stars
6 watching
9 forks
Language: Python
last commit: 4 months ago
3d3d-models3d-to-textcvpr2024gptinstruction-tuninglanguage-modelllmmulti-modalscene-understanding

Related projects:

Repository Description Stars
umass-foundation-model/3d-llm Developing a Large Language Model capable of processing 3D representations as inputs 961
dvlab-research/llmga An implementation of a multimodal generation assistant using large language models and various image editing techniques. 461
openm3d/m3dbench An open-source software project providing a comprehensive 3D instruction-following dataset with multi-modal prompts for training large language models. 57
openfl/away3d An open source platform for developing interactive 3D graphics. 208
batra-mlp-lab/visdial A system for an AI agent to engage in natural dialog about visual content using a combination of encoder and decoder architectures. 228
agenta-ai/agenta A developer platform for building and deploying large language models 1,275
airaria/visual-chinese-llama-alpaca Develops a multimodal Chinese language model with visual capabilities 424
aidc-ai/ovis An architecture designed to align visual and textual embeddings in multimodal learning 517
openrobotlab/pointllm This project develops a large language model capable of understanding and generating information about 3D point clouds. 647
gulvarol/surreal This project involves generating synthetic human data to train 3D models of human appearance and behavior. 588
harfang3d/harfang3d An all-in-one 3D visualization library for C++, Python, Lua, and Go. 576
opengvlab/lamm A framework and benchmark for training and evaluating multi-modal large language models, enabling the development of AI agents capable of seamless interaction between humans and machines. 301
microsoft/llava-med A research project aimed at building large language and vision models for biomedical applications with capabilities comparable to GPT-4. 1,556
pfirsich/kaun A Lua module for 3D graphics intended to provide a low-level API for abstracting away OpenGL details and enabling advanced techniques without the need for significant modifications to an existing game engine. 7
groverburger/g3d Simplifies 3D rendering in the LÖVE game engine using Lua. 570