monoloco

3D Vision Framework

A software framework for 3D vision and computer vision tasks using deep learning and 2D keypoints.

A 3D vision library from 2D keypoints: monocular and stereo 3D detection for humans, social distancing, and body orientation.

GitHub

431 stars

22 watching

81 forks

Language: Python

last commit: about 3 years ago

Linked from 1 awesome list

3d-deep-learning3d-detection3d-object-detection3d-visioncomputer-visioncovid-19deep-learninghuman-pose-estimationiccv2019icra2021kitti-datasetmachine-learningobject-detectionopenpifpafpifpafpose-estimationpytorchuncertainty

Screenshot of vita-epfl/monoloco website

vita.epfl.ch/monoloco

Backlinks from these awesome lists:

ly0n/awesome-robotic-tooling

Related projects:

Repository	Description	Stars
ayoolaolafenwa/pixellib	A deep learning library for image segmentation and object detection using PyTorch.	1,054
megvii-research/petr	Develops a framework for multi-view 3D object detection and perception from camera images using position embedding transformation.	881
matlab-deep-learning/pose-estimation-3d-with-stereo-camera	A MATLAB project that uses deep learning and stereo cameras to estimate 3D human pose from image data	43
tinghuiz/sfmlearner	A framework for unsupervised depth and ego-motion estimation from monocular videos using deep learning	1,977
vchoutas/smplify-x	A software framework for capturing 3D human body and facial features from single images using machine learning models.	1,789
damo-nlp-sg/vcd	An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs	222
vmarsocci/3dcd	Automatically inferring 2D and 3D change detection maps from bitemporal optical images without relying on DSMs.	29
john-chapman/im3d	A library for immediate mode rendering of basic 3D primitives and UI tools with platform and graphics API agnosticism for VR support.	1,213
vita-epfl/crowdnav	Develops robot navigation policies in crowded spaces using reinforcement learning and attention mechanisms.	607
nvlabs/prismer	A deep learning framework for training multi-modal models with vision and language capabilities.	1,299
blurstudio/cross3d	Provides a consistent interface to multiple 3D computer graphics application APIs.	138
mpatacchiola/deepgaze	A computer vision library for detecting and tracking human presence in images and videos using convolutional neural networks.	1,800
nvlabs/deep_object_pose	Estimates object poses and positions in 3D space from RGB images	1,031
stemkoski/three.py	A Python 3D graphics library designed to be easy to use and follow the structure of Three.js.	113
petworm/larvio	An implementation of a monocular visual inertial odometry algorithm based on Multi-State Constraint Kalman Filter for accurate and robust localization	737