Ask-Anything

Conversational AI model builder

A platform for building conversational AI models that understand and respond to video and image inputs.

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

GitHub

3k stars
37 watching
252 forks
Language: Python
last commit: 3 months ago
Linked from 4 awesome lists

big-model, captioning-videos, chat, chatgpt, foundation-models, gradio, langchain, large-language-models, large-model, stablelm, video, video-question-answering, video-understanding

Related projects:

Repository | Description | Stars
clovaai/deep-text-recognition-benchmark | Provides a benchmarking framework and implementation for deep learning-based text recognition models | 3,755
cbica/captk | Software platform for analyzing medical images and predicting cancer outcomes through machine learning | 181
pytorch/captum | Provides tools and algorithms to understand how machine learning models make predictions | 4,931
mbi/django-simple-captcha | Adds captchas to forms in web applications | 1,383
trekjs/captcha | A pure JavaScript solution for generating and validating image-based CAPTCHAs without relying on specific libraries or frameworks | 467
yongliang-wu/explorecfg | This project explores how varying configurations affect the performance of image captioning models | 27
cos120/captcha_crack | A system for solving opt-in CAPTCHAs by detecting text within images and classifying it using a trained model | 653
kostya/benchmarks | A collection of benchmarking tests for various programming languages | 2,814
thekvs/cpp-serializers | Compares performance of various data serialization libraries in C++ | 730
fabianwennink/iconcaptcha-php | A captcha solution designed to be fast and user-friendly, providing an easy alternative to traditional captchas | 140
texnomic/hcaptcha | A package that enables server-side verification of hCaptcha responses in ASP.NET Core Blazor applications | 7
contextualai/lens | Enhances language models to generate text based on visual descriptions of images | 351
neurotechx/moabb | A comprehensive benchmarking framework for evaluating brain-computer interface algorithms on EEG datasets | 695
ptigas/simple-captcha-solver | A CAPTCHA solver that extracts and matches text from an image by calculating pixel differences between the input image and pre-computed letter masks | 556
webspiderutils/verification_code | A comprehensive collection of tools and techniques for breaking and recognizing various types of CAPTCHAs used in online services | 1,042