sam2

Visual Segmentation Model

An open-source software project providing code and tools for running inference with a deep learning model designed for visual segmentation in images and videos.

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

GitHub

12k stars
75 watching
1k forks
Language: Jupyter Notebook
last commit: about 1 month ago

Related projects:

Repository Description Stars
facebookresearch/segment-anything This project provides code and tools for running inference with a visual segmentation model that can generate object masks from input prompts. 47,627
facebookresearch/detectron2 A platform for object detection and segmentation tasks using machine learning algorithms 30,539
facebookresearch/co-tracker A model for tracking any point on a video using transformer-based architecture and optical flow benefits 3,820
facebookresearch/audio2photoreal Generating photorealistic avatars from audio 2,709
casia-iva-lab/fastsam A deep learning model for fast object segmentation 7,497
facebookresearch/slowfast Provides state-of-the-art video understanding codebase with efficient training methods and pre-trained models for various tasks 6,623
facebookresearch/dinov2 A PyTorch implementation of a self-supervised learning method for learning robust visual features without supervision. 9,211
chaoningzhang/mobilesam Lightweight and efficient deep learning model for image segmentation on mobile devices 4,820
facebookresearch/metaseq A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. 6,515
mkocabas/vibe A video pose and shape estimation method that predicts body parameters for each frame of an input video. 2,897
facebookresearch/mmf A modular framework for building vision and language multimodal research projects using PyTorch. 5,500
facebookresearch/imagebind An AI framework that combines data from multiple sources into a single embedding space, enabling various applications such as cross-modal retrieval and generation. 8,362
enginbozkurt/carlasimulatordatacollector Automates data collection from the CARLA simulator for use in semantic segmentation training. 26
simonkohl/probabilistic_unet Reimplementation of a neural network model for conditional segmentation of ambiguous images 546
zhengpeng7/birefnet An implementation of a deep learning-based image segmentation model for high-resolution images 1,319