This is the implementation of the paper: Light-Weight Edge-Guided Self-supervised Monocular Depth Estimation [arXiv]. This work builds on the Monodepth project. Please cite our paper if you use our results. Thanks.
@article{kuo2019arXiv,
  author  = {Kuo-Shiuan Peng and Gregory Ditzler and Jerzy Rozenblit},
  title   = {Edge-Guided Occlusion Fading Reduction for a Light-Weighted Self-Supervised Monocular Depth Estimation},
  journal = {arXiv},
  pages   = {1911.11705},
  year    = {2019}
}
Our work focuses on network optimization and on reducing occlusion fading via post-processing. We introduce an Atrous Spatial Pyramid Pooling (ASPP) module into DispNet to improve performance and reduce computational cost, both in parameters and in inference time. The proposed Light-Weight DispNet is shown below.
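For readers unfamiliar with ASPP, the sketch below shows a generic ASPP block in TensorFlow 1.x. The branch rates, filter counts, and placement inside the Light-Weight DispNet are illustrative assumptions, not the exact configuration used in our code.

import tensorflow as tf

def aspp_block(x, filters, rates=(6, 12, 18)):
    """Generic ASPP: parallel atrous (dilated) 3x3 convolutions plus a 1x1
    branch, concatenated along channels and fused with a 1x1 convolution.
    Rates and filter counts here are illustrative placeholders."""
    branches = [tf.layers.conv2d(x, filters, 1, padding='same',
                                 activation=tf.nn.elu)]
    for r in rates:
        branches.append(tf.layers.conv2d(x, filters, 3, padding='same',
                                         dilation_rate=r,
                                         activation=tf.nn.elu))
    merged = tf.concat(branches, axis=3)  # stack the parallel branches
    return tf.layers.conv2d(merged, filters, 1, padding='same',
                            activation=tf.nn.elu)

The dilated branches enlarge the receptive field without extra parameters or downsampling, which is what allows the encoder to stay light-weight.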
The proposed network is lighter, faster, and more accurate than the conventional DispNet. Furthermore, we also address the occlusion fading issue of self-supervised depth estimation methods. We propose an Edge-Guided post-processing method, evolved from the post-processing of Godard et al. [ref], that produces depth estimates with minimal halo effects. We detect the clear edges and the occlusion fading regions using an edge detector. The flip trick is then used to keep the clear edges and remove the occlusion fading, yielding the final result. The architecture of the proposed Edge-Guided post-processing method is shown below:
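For context, the flip trick we build on is the standard post-processing of Godard et al.: disparity is estimated once on the input image and once on its horizontally flipped copy, the second map is flipped back, and the two maps are blended so that each map's occlusion-faded border is replaced by the other's. A minimal NumPy sketch of that baseline is below; our Edge-Guided variant replaces the fixed border ramps with masks derived from an edge detector (see the paper for the exact formulation).

import numpy as np

def flip_post_process(disp, disp_from_flipped):
    """Baseline flip post-processing (Godard et al.).
    disp: disparity predicted on the input image, shape (h, w).
    disp_from_flipped: disparity predicted on the horizontally flipped input,
    already flipped back to the original orientation, shape (h, w)."""
    h, w = disp.shape
    mean_disp = 0.5 * (disp + disp_from_flipped)
    # Linear ramp that down-weights each map near the border where its
    # occlusion fading appears; EG-PP replaces these fixed ramps with
    # edge-detector-based masks.
    ramp, _ = np.meshgrid(np.linspace(0, 1, w), np.linspace(0, 1, h))
    l_mask = 1.0 - np.clip(20 * (ramp - 0.05), 0, 1)
    r_mask = np.fliplr(l_mask)
    return (r_mask * disp + l_mask * disp_from_flipped
            + (1.0 - l_mask - r_mask) * mean_disp)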
The performance is visualized as follows:
G.T. is the ground truth, PP is the post-processing of Godard et al., and EG-PP is the proposed method. Please refer to our paper for all the details. This work is implemented using TensorFlow 1.5, CUDA 10.0, cuDNN 7.6, and Anaconda/Python 3.7 under Ubuntu 18.04 LTS. There may be some warnings from TensorFlow 1.5, but they do not affect the simulation.
Please download the KITTI and Cityscapes datasets and convert the input images to JPEG format in your own path. Then create links to the local directories inside the project as follows.
mkdir dataset
ln -s ~/path/to/kitti/ ./dataset/
ln -s ~/path/to/cityscapes/ ./dataset/
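If your copies of the datasets store images as PNG, a minimal conversion sketch is below; it assumes Pillow is installed, and the root path and JPEG quality are placeholders to adapt to your own layout.

import glob, os
from PIL import Image

# Convert every PNG under a dataset root to JPEG (placeholder path/quality).
root = os.path.expanduser('~/path/to/kitti')
for png_path in glob.glob(os.path.join(root, '**', '*.png'), recursive=True):
    jpg_path = os.path.splitext(png_path)[0] + '.jpg'
    Image.open(png_path).convert('RGB').save(jpg_path, 'JPEG', quality=95)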
We have prepared two bash scripts to train our models on the KITTI and Cityscapes datasets. After preparing the datasets, please run the corresponding bash file as follows (taking the KITTI dataset as an example):
sh ./bash/bash_train_kitti.sh
Please configure the model and output file paths as you prefer.
We have prepared two bash scripts to evaluate the performance on the KITTI and Eigen splits of the KITTI dataset. Please change the variables in the scripts to run the evaluation. You will get results similar to those reported in the paper.
- Example with the vggASPP model trained on KITTI.
sh ./bash/bash_evaluate_kitti.sh
You will receive results like the following:
now testing 200 files
done.
Total time: 4.48
Inference FPS: 42.44
writing disparities.
done.
>>>
>>> Kitti: Native Evaluation
abs_rel, sq_rel, rms, log_rms, d1_all, a1, a2, a3
0.1134, 1.1636, 5.734, 0.201, 27.379, 0.853, 0.945, 0.979
>>> Kitti: Post-Processing Evaluation
abs_rel, sq_rel, rms, log_rms, d1_all, a1, a2, a3
0.1079, 1.0259, 5.464, 0.192, 26.395, 0.857, 0.949, 0.982
>>> Kitti: Edge-Guided Post-Processing Evaluation
abs_rel, sq_rel, rms, log_rms, d1_all, a1, a2, a3
0.1077, 1.0238, 5.387, 0.189, 26.152, 0.860, 0.951, 0.983
- We skip the first 10 test files when computing FPS due to the instability of the first few iterations.
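The metric columns above (abs_rel, sq_rel, rms, log_rms, a1-a3) are the standard monocular depth evaluation metrics; d1_all is the KITTI disparity error rate. A minimal NumPy sketch of the standard depth metrics, assuming gt and pred are already masked and scaled ground-truth and predicted depth arrays, is:

import numpy as np

def depth_metrics(gt, pred):
    """Standard depth evaluation metrics (Eigen et al. style); gt and pred
    are 1-D arrays of valid, matched ground-truth / predicted depths."""
    thresh = np.maximum(gt / pred, pred / gt)
    a1 = (thresh < 1.25).mean()
    a2 = (thresh < 1.25 ** 2).mean()
    a3 = (thresh < 1.25 ** 3).mean()
    abs_rel = np.mean(np.abs(gt - pred) / gt)
    sq_rel = np.mean(((gt - pred) ** 2) / gt)
    rms = np.sqrt(np.mean((gt - pred) ** 2))
    log_rms = np.sqrt(np.mean((np.log(gt) - np.log(pred)) ** 2))
    return abs_rel, sq_rel, rms, log_rms, a1, a2, a3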
We have prepared the trained models for reference here.
We have also prepared best-practice bash files for UA HPC users. The bash files are located in ./bash/ua_hpc_user/.
The sample command is:
sh ./bash/ua_hpc_user/bash_train_kitti_ua.sh