sam2
Visual Segmentation Model
An open-source software project providing code and tools for running inference with a deep learning model designed for visual segmentation in images and videos.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
12k stars
75 watching
1k forks
Language: Jupyter Notebook
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
facebookresearch/segment-anything | This project provides code and tools for running inference with a visual segmentation model that can generate object masks from input prompts. | 47,627 |
facebookresearch/detectron2 | A platform for object detection and segmentation tasks using machine learning algorithms | 30,539 |
facebookresearch/co-tracker | A model for tracking any point on a video using transformer-based architecture and optical flow benefits | 3,820 |
facebookresearch/audio2photoreal | Generating photorealistic avatars from audio | 2,709 |
casia-iva-lab/fastsam | A deep learning model for fast object segmentation | 7,497 |
facebookresearch/slowfast | Provides state-of-the-art video understanding codebase with efficient training methods and pre-trained models for various tasks | 6,623 |
facebookresearch/dinov2 | A PyTorch implementation of a self-supervised learning method for learning robust visual features without supervision. | 9,211 |
chaoningzhang/mobilesam | Lightweight and efficient deep learning model for image segmentation on mobile devices | 4,820 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,515 |
mkocabas/vibe | A video pose and shape estimation method that predicts body parameters for each frame of an input video. | 2,897 |
facebookresearch/mmf | A modular framework for building vision and language multimodal research projects using PyTorch. | 5,500 |
facebookresearch/imagebind | An AI framework that combines data from multiple sources into a single embedding space, enabling various applications such as cross-modal retrieval and generation. | 8,362 |
enginbozkurt/carlasimulatordatacollector | Automates data collection from the CARLA simulator for use in semantic segmentation training. | 26 |
simonkohl/probabilistic_unet | Reimplementation of a neural network model for conditional segmentation of ambiguous images | 546 |
zhengpeng7/birefnet | An implementation of a deep learning-based image segmentation model for high-resolution images | 1,319 |