vision-agent
Vision coder
An agent framework to generate vision code
1k stars
20 watching
131 forks
Language: Python
Last commit: 8 days ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
google-research/parti | An autoregressive text-to-image generation model that generates photorealistic images from text prompts and leverages advances in large language models | 1,548 |
graphic-design-ai/graphist | A software system that uses AI to generate graphic compositions from unordered sets of design elements | 98 |
deepseek-ai/deepseek-vl | A multimodal AI model that enables real-world vision-language understanding applications | 2,077 |
jolibrain/joligen | An integrated framework for training custom generative AI models | 244 |
google/sg2im | An end-to-end neural network model that generates images from scene graphs by passing the input graph through multiple network stages | 1,300 |
aofei/cameron | An avatar generator for creating images based on input URLs or paths | 121 |
mansimov/text2image | A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words | 592 |
byungkwanlee/moai | Improves performance of vision language tasks by integrating computer vision capabilities into large language models | 311 |
sigil-wen/dream-with-vision-pro | A tool to generate 3D models from text descriptions using AI-powered rendering and scaling | 186 |
rbbrdckybk/ai-art-generator | Automates large batches of AI-generated artwork locally using GPU acceleration | 634 |
ibm/max-fast-neural-style-transfer | A service for generating new images by mixing the content of an input image with the style of another image | 50 |
ashual/scene_generation | A PyTorch implementation of a deep learning-based method for generating interactive scenes with specified object attributes and relations | 187 |
jtoy/sketchnet | Generates code in a visual programming language using images as input | 40 |
wpiroboticsprojects/grip | A computer vision framework for robotics applications that simplifies the creation of vision systems and generates code in multiple programming languages | 379 |
baaivision/eve | A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities | 230 |