monoloco

3D vision toolkit

A library for 3D vision tasks using 2D keypoints

A library for 3D vision from 2D keypoints: monocular and stereo 3D detection of humans, social distancing monitoring, and body orientation estimation.

GitHub

428 stars
22 watching
81 forks
Language: Python
Last commit: over 2 years ago
Linked from 1 awesome list

3d-deep-learning, 3d-detection, 3d-object-detection, 3d-vision, computer-vision, covid-19, deep-learning, human-pose-estimation, iccv2019, icra2021, kitti-dataset, machine-learning, object-detection, openpifpaf, pifpaf, pose-estimation, pytorch, uncertainty

Related projects:

Repository | Description | Stars
ayoolaolafenwa/pixellib | A deep learning library for image segmentation and object detection using PyTorch. | 1,049
megvii-research/petr | Develops a framework for multi-view 3D object detection and perception from camera images using position embedding transformation. | 874
matlab-deep-learning/pose-estimation-3d-with-stereo-camera | A MATLAB project that uses deep learning and stereo cameras to estimate 3D human pose from image data | 43
tinghuiz/sfmlearner | An unsupervised learning framework for depth and ego-motion estimation from monocular videos | 1,967
vchoutas/smplify-x | A software framework for capturing 3D human body and facial features from single images using machine learning models. | 1,777
damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 209
vmarsocci/3dcd | Automatically inferring 2D and 3D change detection maps from bitemporal optical images without relying on DSMs. | 28
john-chapman/im3d | A library for immediate mode rendering of basic 3D primitives and UI tools with platform and graphics API agnosticism for VR support. | 1,210
vita-epfl/crowdnav | Develops robot navigation policies in crowded spaces using reinforcement learning and attention mechanisms. | 598
nvlabs/prismer | A deep learning framework for training multi-modal models with vision and language capabilities. | 1,298
blurstudio/cross3d | Provides a consistent interface to multiple 3D computer graphics application APIs. | 138
mpatacchiola/deepgaze | A computer vision library for detecting and tracking human presence in images and videos using convolutional neural networks. | 1,790
nvlabs/deep_object_pose | Estimates object poses and positions in 3D space from RGB images | 1,028
stemkoski/three.py | A Python 3D graphics library designed to be easy to use and follow the structure of Three.js. | 113
petworm/larvio | An implementation of a monocular visual inertial odometry algorithm based on Multi-State Constraint Kalman Filter for accurate and robust localization | 735