selim_sef-solution

dlindenbaum

Apr 30, 2018

dff88d6 · Apr 30, 2018

Name	Name	Last commit message	Last commit date
parent directory ..
datasets	datasets	updated selim_sef-solution	Apr 30, 2018
tools	tools	updated selim_sef-solution	Apr 30, 2018
Dockerfile	Dockerfile	updated selim_sef-solution	Apr 30, 2018
README.md	README.md	readme update	Apr 30, 2018
calculate_stats.py	calculate_stats.py	updated selim_sef-solution	Apr 30, 2018
docker-build.sh	docker-build.sh	updated selim_sef-solution	Apr 30, 2018
docker-remove.sh	docker-remove.sh	updated selim_sef-solution	Apr 30, 2018
docker-run.sh	docker-run.sh	updated selim_sef-solution	Apr 30, 2018
docker-stop.sh	docker-stop.sh	updated selim_sef-solution	Apr 30, 2018
download_models.sh	download_models.sh	updated selim_sef-solution	Apr 30, 2018
generate_submission.py	generate_submission.py	updated selim_sef-solution	Apr 30, 2018
inceptionv3_padding.py	inceptionv3_padding.py	updated selim_sef-solution	Apr 30, 2018
inceptionv3_padding_swish.py	inceptionv3_padding_swish.py	updated selim_sef-solution	Apr 30, 2018
linknet.py	linknet.py	updated selim_sef-solution	Apr 30, 2018
losses.py	losses.py	updated selim_sef-solution	Apr 30, 2018
model_name_encoder.py	model_name_encoder.py	updated selim_sef-solution	Apr 30, 2018
models.py	models.py	updated selim_sef-solution	Apr 30, 2018
params.py	params.py	updated selim_sef-solution	Apr 30, 2018
predict_all.py	predict_all.py	updated selim_sef-solution	Apr 30, 2018
preprocess_clahe.py	preprocess_clahe.py	updated selim_sef-solution	Apr 30, 2018
resnet50_padding.py	resnet50_padding.py	updated selim_sef-solution	Apr 30, 2018
test.sh	test.sh	updated selim_sef-solution	Apr 30, 2018
train.py	train.py	updated selim_sef-solution	Apr 30, 2018
train.sh	train.sh	updated selim_sef-solution	Apr 30, 2018
transormer.py	transormer.py	updated selim_sef-solution	Apr 30, 2018

README.md

Marathon Match - Solution Description

Overview

Congrats on winning this marathon match. As part of your final submission and in order to receive payment for this marathon match, please complete the following document.

1. Introduction

Tell us a bit about yourself, and why you have decided to participate in the contest.

Name: Selim Seferbekov
Handle: selim_sef

2. Solution Development

How did you solve the problem? What approaches did you try and what choices did you make, and why? Also, what alternative approaches did you consider?

I solved the task in two stages: 1. semantic segmentation of road centerlines 2. vectorization for the binary masks to get final road graph. To produce binary segmentation masks I used encoder-decoder architectures with skip connections similar to U-Net [Olaf et al, 2015] and Linknet [Chaurasia et al]. To produce road graphs I used skeletonization + graph generation with sknw library and some basic postprocessing.
Data Type: I decided to use MUL-Pansharpen images instead of RGB hoping that neural networks will find indices like road REA/BAI . Which in the end caused a lot of problems during training/testing due CPU/IO bottleneck. After the competition I think that it was better to use original data i.e. full size PAN and small MUL images with late fusion.
One model for each city or a shared model? I decided to use shared model and added one hot city encoding as additional channels.
Transfer learning or training from scratch: I used encoders pretrained on ImageNet and just initialized with He initialization additional input channels. Using pretrained encoders allows network to converge faster and produce better results even if it had less input channels originally.
Originaly I added a topology loss term [A. Mosinska et al] which visually improved masks significantly but to due bugs in graph generation I could not get any improvement on the leaderboard and decided not to use it.

3. Final Approach

Please provide a bulleted description of your final approach. What ideas/decisions/features have been found to be the most important for your solution performance:

For semantic segmentation I used different variation of Unet and Linknet architectures with InceptionV3 and Resnet50 encoders. I trained neworks with RmsProp optimizer and loss=bce+(1–soft dice). Using both crossentropy and soft dice in the loss is crucial to achieve good results in binary semantic segementation and to get better results with ensembling.
Mask posprocessing: Guassian smoothing and binary dilation. That helped to fill some small gaps in the masks. I also padded masks with reflection in order to produce better graph near the borders which gave +15k on the leaderboard.
Graph generation: I produced sekeletons and then simply used sknw library to get road graph. After than I simplified graphs to have less lines.
For validation I used the same 20% holdout set for all models.
I used contrast normalization (CLAHE) to preprocess images which gave a bit higher score than using original image with simple normalization.
The final solution has ensemble of 6 models to produce binary masks. The masks produced by these models are averaged and after that vectorized to obtain the final graph.

4. Open Source Resources, Frameworks and Libraries

Please specify the name of the open source resource along with a URL to where it's housed and it's license type:

Docker, https://www.docker.com (Apache License 2.0)
Tensorflow, https://www.tensorflow.org/ (Apache License 2.0)
Nvidia-docker, https://github.com/NVIDIA/nvidia-docker, ( BSD 3-clause)
Python 3, https://www.python.org/, ( PSFL (Python Software Foundation License))
Scikti-image, http://scikit-image.org/, ( BSD 3-clause)
Scikit-learn, http://scikit-learn.org/stable/, (BSD 3-clause)
Numpy, http://www.numpy.org/, (BSD)
Scipy, https://www.scipy.org/, (BSD)
Tqdm, https://github.com/noamraph/tqdm, ( The MIT License)
Keras, https://keras.io/, ( The MIT License)
Anaconda, https://www.continuum.io/Anaconda-Overview,( New BSD License)
OpenCV, https://opencv.org/ (BSD)
SKNW https://github.com/yxdragon/sknw (BSD 3-clause)
Simplification https://github.com/urschrei/simplification (MIT)

5. Potential Algorithm Improvements

Please specify any potential improvements that can be made to the algorithm:

Use more masks of different width for different roads to train networks
Somehow incorporate loss term that punishes topology violations.
For binary segmetation it is usually better to use simple architeture like Unet with VGG16 BN encoder. As vgg16bn is extremely slow I decided not to use it in this challenge.
Add posproccessing to connect gaps in the graph
Instead of using PAN sharpened images it could be benefitial to use original MUL and PAN images and fuse them with neural network.

6. Algorithm Limitations

Please specify any potential limitations with the algorithm:

The current approach doesn't handle bridges and multilevel intersections properly.

7. Deployment Guide

Please provide the exact steps required to build and deploy the code:

In this contest, a Dockerized version of the solution was required, which should run out of the box

8. Final Verification

Please provide instructions that explain how to train the algorithm and have it execute against sample data:

The algorithm can be executed by the instructions provided for the contest.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

selim_sef-solution

selim_sef-solution

README.md

Files

selim_sef-solution

Directory actions

More options

Directory actions

More options

Latest commit

History

selim_sef-solution

Folders and files

parent directory

README.md