Ask-Anything
Conversational AI model builder
A platform for building conversational AI models that understand and respond to video and image inputs.
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
3k stars
37 watching
252 forks
Language: Python
Last commit: 3 months ago
Linked from 4 awesome lists
big-model, captioning-videos, chat, chatgpt, foundation-models, gradio, langchain, large-language-models, large-model, stablelm, video, video-question-answering, video-understanding
Related projects:
Repository | Description | Stars |
---|---|---|
clovaai/deep-text-recognition-benchmark | Provides a benchmarking framework and implementation for deep learning-based text recognition models | 3,755 |
cbica/captk | Software platform for analyzing medical images and predicting cancer outcomes through machine learning | 181 |
pytorch/captum | Provides tools and algorithms to understand how machine learning models make predictions | 4,931 |
mbi/django-simple-captcha | Adds captchas to forms in web applications | 1,383 |
trekjs/captcha | A pure JavaScript solution for generating and validating image-based CAPTCHAs without relying on specific libraries or frameworks | 467 |
yongliang-wu/explorecfg | This project explores how varying configurations affect the performance of image captioning models | 27 |
cos120/captcha_crack | A system for solving opt-in CAPTCHAs by detecting text within images and classifying it using a trained model | 653 |
kostya/benchmarks | A collection of benchmarking tests for various programming languages | 2,814 |
thekvs/cpp-serializers | Compares performance of various data serialization libraries in C++ | 730 |
fabianwennink/iconcaptcha-php | A captcha solution designed to be fast and user-friendly, providing an easy alternative to traditional captchas | 140 |
texnomic/hcaptcha | A package that enables server-side verification of hCaptcha responses in ASP.NET Core Blazor applications | 7 |
contextualai/lens | Enhances language models to generate text based on visual descriptions of images | 351 |
neurotechx/moabb | A comprehensive benchmarking framework for evaluating brain-computer interface algorithms on EEG datasets | 695 |
ptigas/simple-captcha-solver | A CAPTCHA solver that extracts and matches text from an image by calculating pixel differences between the input image and pre-computed letter masks | 556 |
webspiderutils/verification_code | A comprehensive collection of tools and techniques for breaking and recognizing various types of CAPTCHAs used in online services | 1,042 |