sam2
Video Segmentation Framework
A software framework for video segmentation in images and videos using AI models
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
13k stars
73 watching
1k forks
Language: Jupyter Notebook
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
facebookresearch/segment-anything | This project provides code and tools for running inference with a visual segmentation model that can generate object masks from input prompts. | 48,092 |
facebookresearch/detectron2 | A platform for object detection and segmentation tasks using machine learning algorithms | 30,778 |
facebookresearch/co-tracker | A model for tracking any point on a video using transformer-based architecture and optical flow benefits | 3,978 |
facebookresearch/audio2photoreal | Generating photorealistic avatars from audio | 2,715 |
casia-iva-lab/fastsam | A deep learning model for fast object segmentation | 7,575 |
facebookresearch/slowfast | Provides state-of-the-art video understanding codebase with efficient training methods and pre-trained models for various tasks | 6,680 |
facebookresearch/dinov2 | A PyTorch implementation of a self-supervised learning method for learning robust visual features without supervision. | 9,425 |
chaoningzhang/mobilesam | Lightweight and efficient deep learning model for image segmentation on mobile devices | 4,895 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,519 |
mkocabas/vibe | A video pose and shape estimation method that predicts body parameters for each frame of an input video. | 2,911 |
facebookresearch/mmf | A modular framework for building vision and language multimodal research projects using PyTorch. | 5,510 |
facebookresearch/imagebind | An AI framework that combines data from multiple sources into a single embedding space, enabling various applications such as cross-modal retrieval and generation. | 8,424 |
enginbozkurt/carlasimulatordatacollector | Automates data collection from the CARLA simulator for use in semantic segmentation training. | 26 |
simonkohl/probabilistic_unet | Reimplementation of a neural network model for conditional segmentation of ambiguous images | 548 |
zhengpeng7/birefnet | An open-source implementation of an image segmentation model that combines background removal and object detection capabilities. | 1,484 |