monoloco
3D Vision Framework
A software framework for 3D vision and computer vision tasks using deep learning and 2D keypoints.
A 3D vision library from 2D keypoints: monocular and stereo 3D detection for humans, social distancing, and body orientation.
431 stars
22 watching
81 forks
Language: Python
last commit: over 2 years ago
Linked from 1 awesome list
3d-deep-learning3d-detection3d-object-detection3d-visioncomputer-visioncovid-19deep-learninghuman-pose-estimationiccv2019icra2021kitti-datasetmachine-learningobject-detectionopenpifpafpifpafpose-estimationpytorchuncertainty
Related projects:
Repository | Description | Stars |
---|---|---|
| A deep learning library for image segmentation and object detection using PyTorch. | 1,054 |
| Develops a framework for multi-view 3D object detection and perception from camera images using position embedding transformation. | 881 |
| A MATLAB project that uses deep learning and stereo cameras to estimate 3D human pose from image data | 43 |
| A framework for unsupervised depth and ego-motion estimation from monocular videos using deep learning | 1,977 |
| A software framework for capturing 3D human body and facial features from single images using machine learning models. | 1,789 |
| An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 222 |
| Automatically inferring 2D and 3D change detection maps from bitemporal optical images without relying on DSMs. | 29 |
| A library for immediate mode rendering of basic 3D primitives and UI tools with platform and graphics API agnosticism for VR support. | 1,213 |
| Develops robot navigation policies in crowded spaces using reinforcement learning and attention mechanisms. | 607 |
| A deep learning framework for training multi-modal models with vision and language capabilities. | 1,299 |
| Provides a consistent interface to multiple 3D computer graphics application APIs. | 138 |
| A computer vision library for detecting and tracking human presence in images and videos using convolutional neural networks. | 1,800 |
| Estimates object poses and positions in 3D space from RGB images | 1,031 |
| A Python 3D graphics library designed to be easy to use and follow the structure of Three.js. | 113 |
| An implementation of a monocular visual inertial odometry algorithm based on Multi-State Constraint Kalman Filter for accurate and robust localization | 737 |