IMAD
Dialogue analyzer
A toolkit for analyzing and generating multi-modal dialogue with images
[AINL 2023] IMAD: IMage Augmented multi-modal Dialogue
4 stars
1 watching
0 forks
Language: Python
last commit: over 1 year ago
Topics: dataset, deep-learning, dialogue-systems, image2text, multimodal, multimodal-deep-learning
Related projects:
Repository | Description | Stars |
---|---|---|
zhourax/vega | Develops a multimodal task and dataset to assess vision-language models' ability to handle interleaved image-text inputs. | 33 |
vimalabs/vima | An implementation of a general-purpose robot learning model using multimodal prompts | 774 |
multimodal-art-projection/omnibench | Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously. | 14 |
bfelbo/deepmoji | A deep learning model for analyzing sentiment and emotion in text based on emojis. | 1,518 |
thindil/ada-bundle | Provides tools and features to support development in the Ada programming language within Vim/NeoVim text editors. | 7 |
yapplabs/ember-modal-dialog | An Ember addon providing components to implement modal dialogs with consistent layout and hierarchy | 390 |
llava-vl/llava-interactive-demo | An all-in-one demo for interactive image processing and generation | 351 |
huaizhengzhang/awsome-deep-learning-for-video-analysis | A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques. | 763 |
pku-yuangroup/languagebind | Extends pretrained models to handle multiple modalities by aligning language and video representations. | 723 |
vinhnx/inkchatgpt | An application that enables users to upload documents and converse with an AI-powered language model. | 9 |
bestivictory/ilgnet | A deep learning-based framework for image aesthetics assessment using a convolutional neural network structure | 112 |
hypjudy/sparkles | Develops multimodal instruction-following models for open-ended dialogues across multiple images | 41 |
vcciv/blvd | A large-scale 5D semantics benchmark for autonomous driving | 170 |
millionintegrals/vel | A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
ailab-cvc/gpt4tools | An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. | 760 |