A Visual SLAM system improved with dynamic object removal and scene inpainting.
The project aims to segment dynamic objects in a video, remove them from the frames, and inpaint the removed regions with background, since dynamic objects should neither be used for camera localization nor appear in the final reconstructed map of the scene. The project focuses on two major aspects: Dynamic Object Segmentation and Video Inpainting. It improves the SLAM system with efficient and accurate learning-based methods built upon the baseline DynaSLAM: https://github.com/BertaBescos/DynaSLAM.
The project is divided into two phases:
- Dynamic Object Detection and Semantic Segmentation
- Video Inpainting
To detect dynamic objects of certain classes such as humans and birds, we fine-tune a pre-trained Transformer model called Segmenter, building upon the baseline code (Source Code).
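Once the fine-tuned model produces a per-pixel class label map, the labels for the chosen dynamic classes are collapsed into a binary mask that tells the SLAM front end which pixels to discard. The sketch below illustrates that step; the class IDs and class names are hypothetical, not the actual IDs in Segmenter's label set.

```python
# Toy sketch: per-pixel class labels -> binary dynamic-object mask.
# The IDs below are illustrative placeholders (NOT Segmenter's real IDs).
DYNAMIC_CLASSES = {12, 15}  # e.g. "person" and "bird" in some label set

def dynamic_mask(label_map):
    """label_map: 2-D list of class IDs -> 2-D list of 0/1 flags."""
    return [[1 if c in DYNAMIC_CLASSES else 0 for c in row] for row in label_map]

labels = [
    [0,  0, 12, 12],
    [0, 15, 15,  0],
    [0,  0,  0,  0],
]
mask = dynamic_mask(labels)
# Pixels flagged 1 are excluded from tracking and handed to the inpainter.
```

In the full pipeline this mask is computed per frame and passed both to the tracking stage (to drop dynamic keypoints) and to the inpainting stage (to mark the holes to fill).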
Install the environment according to the source above and place the segmenter folder in the directory where the notebooks below are run. Also, please refer to the instructions inside the .ipynb files for the config files that need to be replaced with the source configs before performing transfer learning.
The integrated code implementation for this task can be accessed through the Jupyter Notebook.
We also plot the loss and precision curves obtained when fine-tuning the pre-trained model on different datasets. The source notebook can be found here.
The Segmentation Results Before and After Fine Tuning:
To improve the background reconstruction quality after dynamic object removal over the existing DynaSLAM implementation, we use a video inpainting technique called ProPainter, building upon the baseline code (Source Code).
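The core idea of video inpainting that makes it suitable here is temporal propagation: a pixel hidden by a dynamic object in one frame is often visible in a neighboring frame, so the background can be borrowed rather than hallucinated. The toy sketch below shows that idea on grayscale 2-D lists; it is a minimal stand-in for ProPainter's learned, flow-guided propagation, and the `hole` sentinel is an assumption of this sketch.

```python
# Toy sketch of temporal propagation for inpainting: fill a masked
# (dynamic) pixel from a reference frame where the background is visible.
# Frames are 2-D lists of intensities; masks mark dynamic pixels with 1.

def propagate_fill(frame, mask, ref_frame, ref_mask, hole=-1):
    """Copy background from ref_frame into masked pixels of frame.

    Pixels dynamic in BOTH frames stay as `hole` (for a learned model
    like ProPainter to synthesize)."""
    out = [row[:] for row in frame]
    for y in range(len(frame)):
        for x in range(len(frame[0])):
            if mask[y][x]:
                out[y][x] = ref_frame[y][x] if not ref_mask[y][x] else hole
    return out
```

Real systems additionally align the reference frame with optical flow before copying, which is exactly the propagation module ProPainter improves.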
To run inference with ProPainter, sequences from the TUM RGB-D dataset or custom data require preprocessing (resizing, sequencing images, etc.). The code used for data preprocessing is provided here.
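As a rough illustration of the "sequencing" step: TUM RGB-D frames are named by timestamp, while a frame-folder consumer typically expects a zero-padded index sequence (`00000.png`, `00001.png`, ...). The sketch below renumbers frames in timestamp order; the padding width and naming scheme are assumptions of this sketch, not ProPainter's documented requirements, and resizing (not shown) would be done with an image library on top of this.

```python
# Sketch: copy timestamp-named TUM frames into an index-named sequence.
# Padding width and target naming are illustrative assumptions.
import shutil
from pathlib import Path

def sequence_frames(src_dir, dst_dir, pad=5):
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    # TUM filenames sort chronologically because they are timestamps.
    for i, f in enumerate(sorted(src.glob("*.png"))):
        shutil.copy(f, dst / f"{i:0{pad}d}.png")
```

Keeping the copies in a separate folder leaves the original timestamped dataset intact for the SLAM side of the pipeline.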
The Video Inpainting Results of an Image Compared with DynaSLAM:
The video result inferred on our custom dataset:
We would like to acknowledge the following works, on which our baseline code builds:
- DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes by Berta Bescos, Jose M. Facil, Javier Civera, and Jose Neira (Source)
- Segmenter: Transformer for Semantic Segmentation by Robin Strudel, Ricardo Garcia, Ivan Laptev, and Cordelia Schmid (Source)
- ProPainter: Improving Propagation and Transformer for Video Inpainting by Shangchen Zhou, Chongyi Li, Kelvin C. K. Chan, and Chen Change Loy (Source)