IMAD

Dialogue analyzer

A toolkit for analyzing and generating multi-modal dialogue with images

[AINL 2023] IMAD: IMage Augmented multi-modal Dialogue

GitHub

4 stars

1 watching

0 forks

Language: Python

last commit: about 3 years ago

datasetdeep-learningdialogue-systemsimage2textmultimodalmultimodal-deep-learning

Related projects:

Repository	Description	Stars
zhourax/vega	Develops a multimodal task and dataset to assess vision-language models' ability to handle interleaved image-text inputs.	33
vimalabs/vima	An implementation of a general-purpose robot learning model using multimodal prompts	781
multimodal-art-projection/omnibench	Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously.	15
bfelbo/deepmoji	A deep learning model for analyzing sentiment and emotion in text based on emojis.	1,525
thindil/ada-bundle	Provides tools and features to support development in the Ada programming language within Vim/NeoVim text editors.	7
yapplabs/ember-modal-dialog	An Ember addon for building modal dialogs using a consistent pattern and layout approach.	390
llava-vl/llava-interactive-demo	An all-in-one demo for interactive image processing and generation	353
huaizhengzhang/awsome-deep-learning-for-video-analysis	A collection of resources and tools for video analysis using deep learning and multi-modal learning techniques.	767
pku-yuangroup/languagebind	Extending pretraining models to handle multiple modalities by aligning language and video representations	751
vinhnx/inkchatgpt	An application that enables users to upload documents and converse with an AI-powered language model.	9
bestivictory/ilgnet	A deep learning-based framework for image aesthetics assessment using a convolutional neural network structure	112
hypjudy/sparkles	Develops multimodal instruction-following models for open-ended dialogues across multiple images	43
vcciv/blvd	A large-scale 5D semantics benchmark for autonomous driving	171
millionintegrals/vel	A collection of modular deep learning components that can be easily configured and reused in various applications.	276
ailab-cvc/gpt4tools	An intelligent system that enables automatic control and utilization of visual foundation models to interact with images in conversational settings.	762