IMAD

Dialogue analyzer

A toolkit for analyzing and generating multi-modal dialogue with images

[AINL 2023] IMAD: IMage Augmented multi-modal Dialogue

GitHub

4 stars
1 watching
0 forks
Language: Python
last commit: over 1 year ago
datasetdeep-learningdialogue-systemsimage2textmultimodalmultimodal-deep-learning

Related projects:

Repository Description Stars
zhourax/vega Develops a multimodal task and dataset to assess vision-language models' ability to handle interleaved image-text inputs. 33
vimalabs/vima An implementation of a general-purpose robot learning model using multimodal prompts 774
multimodal-art-projection/omnibench Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously. 14
bfelbo/deepmoji A deep learning model for analyzing sentiment and emotion in text based on emojis. 1,518
thindil/ada-bundle Provides tools and features to support development in the Ada programming language within Vim/NeoVim text editors. 7
yapplabs/ember-modal-dialog An Ember addon providing components to implement modal dialogs with consistent layout and hierarchy 390
llava-vl/llava-interactive-demo An all-in-one demo for interactive image processing and generation 351
huaizhengzhang/awsome-deep-learning-for-video-analysis A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques. 763
pku-yuangroup/languagebind Extending pretraining models to handle multiple modalities by aligning language and video representations 723
vinhnx/inkchatgpt An application that enables users to upload documents and converse with an AI-powered language model. 9
bestivictory/ilgnet A deep learning-based framework for image aesthetics assessment using a convolutional neural network structure 112
hypjudy/sparkles Develops multimodal instruction-following models for open-ended dialogues across multiple images 41
vcciv/blvd A large-scale 5D semantics benchmark for autonomous driving 170
millionintegrals/vel A collection of modular deep learning components that can be easily configured and reused in various applications. 276
ailab-cvc/gpt4tools An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings. 760