vision-agent

Vision generator

An agent-based library for generating vision code

Vision agent

GitHub

2k stars
23 watching
179 forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
google-research/parti An autoregressive text-to-image generation model that generates photorealistic images from text prompts and leverages advances in large language models. 1,554
graphic-design-ai/graphist A software system that uses AI to generate graphic compositions from unordered sets of design elements 102
deepseek-ai/deepseek-vl A multimodal AI model that enables real-world vision-language understanding applications 2,145
jolibrain/joligen An integrated framework for training custom generative AI models 246
google/sg2im An end-to-end neural network model that generates images from scene graphs by processing input graph information through multiple layers of networks 1,302
aofei/cameron An avatar generator for creating images based on input URLs or paths 123
mansimov/text2image A model that generates image patches from natural language descriptions by iteratively drawing and attending to relevant words. 594
byungkwanlee/moai Improves performance of vision language tasks by integrating computer vision capabilities into large language models 314
sigil-wen/dream-with-vision-pro A tool to generate 3D models from text descriptions using AI-powered rendering and scaling 189
rbbrdckybk/ai-art-generator Automates large batches of AI-generated artwork locally using GPU acceleration. 633
ibm/max-fast-neural-style-transfer A service for generating new images by mixing the content of an input image with the style of another image. 51
ashual/scene_generation A PyTorch implementation of a deep learning-based method for generating interactive scenes with specified object attributes and relations 188
jtoy/sketchnet Generates code in a visual programming language using images as input 40
wpiroboticsprojects/grip A computer vision framework for robotics applications that simplifies the creation of vision systems and generates code in multiple programming languages. 380
baaivision/eve A PyTorch implementation of an encoder-free vision-language model that can be fine-tuned for various tasks and modalities 246