vision-agent

Vision coder

An agent framework to generate vision code

Vision agent

GitHub

1k stars
20 watching
131 forks
Language: Python
last commit: 8 days ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
google-research/parti An autoregressive text-to-image generation model that generates photorealistic images from text prompts and leverages advances in large language models. 1,548
graphic-design-ai/graphist A software system that uses AI to generate graphic compositions from unordered sets of design elements 98
deepseek-ai/deepseek-vl A multimodal AI model that enables real-world vision-language understanding applications 2,077
jolibrain/joligen An integrated framework for training custom generative AI models 244
google/sg2im An end-to-end neural network model that generates images from scene graphs by processing input graph information through multiple layers of networks 1,300
aofei/cameron An avatar generator for creating images based on input URLs or paths 121
mansimov/text2image A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. 592
byungkwanlee/moai Improves performance of vision language tasks by integrating computer vision capabilities into large language models 311
sigil-wen/dream-with-vision-pro A tool to generate 3D models from text descriptions using AI-powered rendering and scaling 186
rbbrdckybk/ai-art-generator Automates large batches of AI-generated artwork locally using GPU acceleration. 634
ibm/max-fast-neural-style-transfer A service for generating new images by mixing the content of an input image with the style of another image. 50
ashual/scene_generation A PyTorch implementation of a deep learning-based method for generating interactive scenes with specified object attributes and relations 187
jtoy/sketchnet Generates code in a visual programming language using images as input 40
wpiroboticsprojects/grip A computer vision framework for robotics applications that simplifies the creation of vision systems and generates code in multiple programming languages. 379
baaivision/eve A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities 230