This is an unofficial inplementation of [Bird's Eye View Object Detection Algorithm for self-driving Cars] PIXOR in Pytorch. A large part of this project is based on the work here. Thanks to @Ankita Kalra. This work is still ongoing.Origin project is based on the work here.Thanks to @philip-huang.
https://github.com/overfitover/pixor_ovo
python3.5+
Pytorch
(tested on 0.4.1)opencv-python
shapely
matplotlib
tensorboardX
- Clone this repository.
-
Download the 3D KITTI detection dataset from here. Data to download include:
- Velodyne point clouds (29 GB): input data to VoxelNet
- Training labels of object data set (5 MB): input label to VoxelNet
- Camera calibration matrices of object data set (16 MB): for visualization of predictions
- Left color images of object data set (12 GB): for visualization of predictions
-
Split the training set into training and validation set according to the protocol here. And rearrange the folders to have the following structure:
└── KITTI
├── training <-- training data
| ├── image_2
| ├── label_2
| └── velodyne
└── validation <--- evaluation data
| ├── image_2
| ├── label_2
| └── velodyne
|
|__ train.txt
|
|__ val.txt
|
|__ trainval.txt
$ python run_training.py
- There is a pre-trained model for car in
pretrained_models/model_90.pth
.
$ python run_inference.py
by the way 在model里面有7层se_module,你可以尝试不同的组合是否可以提升网络的效果。
- datagen.py maybe has a problem. h w l x y z w h l y z x.
- improve the performances
- reproduce results for
Pedestrian
andCyclist
- provide SummaryWriter()
- provide run_evaluate.py