MuVI

Multi-view modeler

A software framework for multi-view latent variable modeling with domain-informed structured sparsity.

A multi-view latent variable model with domain-informed structured sparsity for integrating noisy feature sets.

GitHub

29 stars
5 watching
2 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
tsb0601/mmvp | An evaluation framework for multimodal language models' visual capabilities using image and question benchmarks. | 288
yuliang-liu/monkey | A toolkit for building conversational AI models that can process images and text inputs. | 1,825
mcs07/molvs | Tool for validating and standardizing chemical structures to improve data quality and facilitate comparisons. | 159
vita-mllm/vita | A large multimodal language model designed to process and analyze video, image, text, and audio inputs in real-time. | 961
nvlabs/eagle | Develops high-resolution multimodal LLMs by combining vision encoders and various input resolutions. | 539
yuweihao/mm-vet | Evaluates the capabilities of large multimodal models using a set of diverse tasks and metrics. | 267
opengvlab/multi-modality-arena | An evaluation platform for comparing multi-modality models on visual question-answering tasks. | 467
yfzhang114/slime | Develops large multimodal models for high-resolution understanding and analysis of text, images, and other data types. | 137
xverse-ai/xverse-v-13b | A large multimodal model for visual question answering, trained on a dataset of 2.1B image-text pairs and 8.2M instruction sequences. | 77
pku-yuangroup/moe-llava | Develops a neural network architecture for multi-modal learning with large vision-language models. | 1,980
zhourax/vega | Develops a multimodal task and dataset to assess vision-language models' ability to handle interleaved image-text inputs. | 33
freedomintelligence/mllm-bench | Evaluates and compares the performance of multimodal large language models on various tasks. | 55
subho406/omninet | An implementation of a unified architecture for multi-modal multi-task learning using PyTorch. | 512
chenllliang/mmevalpro | A benchmarking framework for evaluating Large Multimodal Models by providing rigorous metrics and an efficient evaluation pipeline. | 22
mlpc-ucsd/bliva | A multimodal LLM designed to handle text-rich visual questions. | 269