Fix geospatial queries to use the MongoDB index #171
Merged
Resolves #170
TL;DR
This changes the existing MongoDB geospatial index to "2d" and reworks the query logic to use it.
Prior to this PR, MongoDB was not able to use the geospatial index for bounding box queries, which led to major performance degradation and high RAM usage on the MongoDB server instance. See #170 for details.
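The diff itself isn't part of this description, so as a hedged sketch only (TypeScript with the modern Node.js driver; the database, collection, field, and index names are assumptions), the index migration could look like this:

```typescript
import { MongoClient } from "mongodb";

// Sketch of the index change. The database, collection, field, and index
// names are assumptions for illustration; the PR's actual code isn't shown.
async function migrateToLegacy2dIndex(uri: string): Promise<void> {
  const client = new MongoClient(uri);
  await client.connect();
  try {
    const places = client.db("app").collection("places");
    // Drop the old 2dsphere index ("location_2dsphere" is the default
    // auto-generated name for an index on { location: "2dsphere" }).
    await places.dropIndex("location_2dsphere");
    // Create the legacy "2d" index that $polygon queries can use.
    await places.createIndex({ location: "2d" });
  } finally {
    await client.close();
  }
}
```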
Technical Stuff
MongoDB uses the `2dsphere` index only for queries with the `$geometry` operator, see https://docs.mongodb.com/manual/tutorial/query-a-2dsphere-index/. For "basic" `$polygon` queries only the legacy `2d` index can be used. To leverage the `2dsphere` index, a newer version of the MongoDB library would have to be used; unfortunately, the legacy MongoDB driver (1.x) that is currently used throughout the codebase cannot do that.
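For illustration, here is a hedged sketch of the reworked query shape, again in TypeScript with the modern Node.js driver (the codebase actually uses the legacy 1.x driver, and the field name is an assumption). A bounding box expressed as an axis-aligned `$polygon` under `$geoWithin` can be answered from the `2d` index, while `$geometry` would require `2dsphere`:

```typescript
import { Collection } from "mongodb";

// Bounding-box query as an axis-aligned $polygon under $geoWithin.
// The field name "location" (holding [lng, lat] pairs) is an assumption.
function findInBoundingBox(
  places: Collection,
  minLng: number,
  minLat: number,
  maxLng: number,
  maxLat: number,
) {
  const corners = [
    [minLng, minLat],
    [maxLng, minLat],
    [maxLng, maxLat],
    [minLng, maxLat],
  ];
  // $geoWithin with $polygon can be answered from the legacy 2d index;
  // $geometry (GeoJSON) would require a 2dsphere index instead.
  return places
    .find({ location: { $geoWithin: { $polygon: corners } } })
    .toArray();
}
```

For axis-aligned rectangles, `$box` under `$geoWithin` is an equivalent shorthand that the `2d` index can serve as well.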
Benchmarks
On my local machine, the query time for a bounding box query went down from ~400ms to ~30ms. With the index in use, the mongo instance also runs fine with 512MB of RAM in Docker.
Sorting Behavior Change
Additionally, this change skips sorting the result set by ID for bounding box queries to gain even more performance. This matters especially for large result sets, e.g. 500 records or more.
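To make the behavior change concrete, here is a minimal sketch under the same assumed names as above, skipping the sort only on the bounding box path:

```typescript
import { Collection } from "mongodb";

// Sketch of the conditional sort (assumed names): bounding-box queries
// return results in index order, skipping the sort-by-_id pass that gets
// expensive on large result sets.
function listPlaces(places: Collection, polygon?: number[][]) {
  if (polygon) {
    // Spatial path: no sort stage.
    return places
      .find({ location: { $geoWithin: { $polygon: polygon } } })
      .toArray();
  }
  // Non-spatial path keeps the stable ordering by _id.
  return places.find({}).sort({ _id: 1 }).toArray();
}
```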