PETR

Position Embedding Framework

Develops a framework for multi-view 3D object detection and perception from camera images using position embedding transformation.

[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

GitHub

871 stars
13 watching
131 forks
Language: Python
last commit: about 1 year ago
Linked from 1 awesome list

3d-position-embeddingmulti-cameramulti-task-learningobject-detectionsegmentation

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
megvii-research/tlc Improves image restoration performance by converting global operations to local ones during inference 231
vita-epfl/monoloco A library for 3D vision tasks using 2D keypoints 428
gink03/alt-i2v An implementation of a deep learning-based image representation learning approach using a modified fully connected layer and transfer learning from VGG16 34
gamrix/cs231n_proj This project focuses on manipulating 3D views using deep learning techniques. 6
plasticityai/magnitude A fast and efficient utility package for utilizing vector embeddings in machine learning models 1,627
jshilong/gpt4roi Training and deploying large language models on computer vision tasks using region-of-interest inputs 506
appinho/sarosperceptionkitti A ROS package for processing and analyzing KITTI vision data to enable object detection, tracking, and evaluation in autonomous vehicles. 246
icetttb/planetr3d An implementation of a deep learning-based method for 3D plane recovery from images. 93
vmarsocci/3dcd Automatically inferring 2D and 3D change detection maps from bitemporal optical images without relying on DSMs. 28
esri/pyprt Python bindings for CityEngine's procedural runtime for generating 3D models 64
megviirobot/camlasercalibratool Automates extrinsic calibration of cameras and 2D lasers in robotics using ROS 662
petworm/larvio An implementation of a monocular visual inertial odometry algorithm based on Multi-State Constraint Kalman Filter for accurate and robust localization 735
vita-epfl/crowdnav Develops robot navigation policies in crowded spaces using reinforcement learning and attention mechanisms. 598
lavi-lab/visual-table A project that generates visual representations tailored for general visual reasoning, leveraging hierarchical scene descriptions and instance-level world knowledge. 14
yunishi3/3d-fcr-alphagan This project aims to develop a generative model for 3D multi-object scenes using a novel network architecture inspired by auto-encoding and generative adversarial networks. 103