IMAD
Dialogue analyzer
A toolkit for analyzing and generating multi-modal dialogue with images
[AINL 2023] IMAD: IMage Augmented multi-modal Dialogue
4 stars
1 watching
0 forks
Language: Python
Last commit: over 1 year ago
Topics: dataset, deep-learning, dialogue-systems, image2text, multimodal, multimodal-deep-learning
Related projects:
Repository | Description | Stars |
---|---|---|
zhourax/vega | Develops a multimodal task and dataset to assess vision-language models' ability to handle interleaved image-text inputs. | 33 |
vimalabs/vima | An implementation of a general-purpose robot learning model using multimodal prompts | 781 |
multimodal-art-projection/omnibench | Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously. | 15 |
bfelbo/deepmoji | A deep learning model for analyzing sentiment and emotion in text based on emojis. | 1,525 |
thindil/ada-bundle | Provides tools and features to support development in the Ada programming language within Vim/NeoVim text editors. | 7 |
yapplabs/ember-modal-dialog | An Ember addon for building modal dialogs using a consistent pattern and layout approach. | 390 |
llava-vl/llava-interactive-demo | An all-in-one demo for interactive image processing and generation | 353 |
huaizhengzhang/awsome-deep-learning-for-video-analysis | A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques. | 767 |
pku-yuangroup/languagebind | Extends pretrained models to handle multiple modalities by aligning language and video representations. | 751 |
vinhnx/inkchatgpt | An application that enables users to upload documents and converse with an AI-powered language model. | 9 |
bestivictory/ilgnet | A deep learning-based framework for image aesthetics assessment using a convolutional neural network structure | 112 |
hypjudy/sparkles | Develops multimodal instruction-following models for open-ended dialogues across multiple images | 43 |
vcciv/blvd | A large-scale 5D semantics benchmark for autonomous driving | 171 |
millionintegrals/vel | A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
ailab-cvc/gpt4tools | An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. | 762 |