Download sqlite default endpoint to geocode.earth and remove bundle support #487
FYI we don't currently build venue data. I could add that, but I assumed it was old, unmaintained and that no one was using it?
I agree with you. I also think that nobody uses venues 🤷♂️. Edit discussion: should we remove venues?
Actually Julian and I discussed it on a call today. I think WOF venues are 'soft deprecated': our experience with them is that the data is old and hasn't had new contributions for a long time. As such we (Geocode Earth) will not be publishing dist files for WOF venues; the official dist.whosonfirst.org downloads are probably as up-to-date as they will ever be. Regarding Pelias, it seems a pity to delete code which supports WOF venues since other developers may benefit from it, but if it becomes a maintenance burden then it may be ok to do so.
So... Maybe we just have a generic warning message that the file you requested isn't available on the host you specified and leave it at that for now?
If we want to be very thorough we could grab inventory files from both hosts and compare the last_modified time to find the best download. Maybe this is overkill? But it would allow 'fallback' to dist.whosonfirst.org for files not published by GE |
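The inventory comparison suggested above could look roughly like the sketch below. It is only an illustration: the inventory entry shape (`name`, `last_modified`) and the file names are assumptions, not the actual format published by either host, and `pickBestDownloads` is a hypothetical helper that does not exist in this repository.

```javascript
// Sketch only: entry shape { name, last_modified } and file names are assumed.
// For each file name, keep the host whose copy has the newest last_modified.
function pickBestDownloads(inventories) {
  const best = new Map();
  for (const [host, entries] of Object.entries(inventories)) {
    for (const entry of entries) {
      const modified = Date.parse(entry.last_modified);
      if (Number.isNaN(modified)) {
        continue; // skip entries without a usable timestamp
      }
      const current = best.get(entry.name);
      // Strict comparison: on a tie, the host listed first wins.
      if (!current || modified > current.modified) {
        best.set(entry.name, { host, modified });
      }
    }
  }
  return best;
}

// Hypothetical inventories: GE has the newer admin file, only WOF has venues.
const best = pickBestDownloads({
  'geocode.earth': [
    { name: 'admin.db.bz2', last_modified: '2020-06-01T00:00:00Z' }
  ],
  'dist.whosonfirst.org': [
    { name: 'admin.db.bz2', last_modified: '2020-01-01T00:00:00Z' },
    { name: 'venue.db.bz2', last_modified: '2019-03-01T00:00:00Z' }
  ]
});
```

With these inputs the admin file resolves to geocode.earth and the venue file falls back to dist.whosonfirst.org, which is the 'fallback' behaviour described in the comment.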
You are right... So, if and only if venues are enabled, we can download them from WOF? Maybe I have an elegant way to do this 🤔 I will try it tomorrow :)
Yes, this is overkill 😅 Since we want to promote SQLite over bundles, can we migrate only the SQLite download part to the GE data, with some deprecation warnings?
Updated, I added the backward compatibility for venues via WOF
Nice, I just checked on the build and it's finished, just syncing the files to the bucket, so it should be available shortly.
Cool, I'm currently testing this out (actually, on top of the minor fix in #484). It seems to work well so far but unfortunately we don't have a lot of automated testing around the download functionality :( :( I think if we are going to merge this and default to Geocode Earth's WOF data (which I fully support), then we need to do one of two things:
The first of these would mean that we lose compatibility with the data on
Also, I extracted the bugfix for the
With this PR, by default it will download What can we do ? Download all I have already synced the PR 😉
New commit to be compatible with WOF for full planet download |
Hi! Big news today! Who's on First has turned off their downloads at dist.whosonfirst.org. I haven't been following recent changes to the WOF importer super closely, but that really simplifies things for this PR, doesn't it? There's now no need at all to worry about backwards compatibility with
Oh! That's interesting! Yes, this simplifies the PR a little bit, but there are still users using custom URLs 😞.
We have no plans to publish venues distributions at this time. I call a vote to remove support for WOF venues in order to simplify the code. 👍 for yes 👎 for needs further discussion
This should be pushed forward. WOF just disabled their downloads. But the changes from the PR are not enough. The structure from https://geocode.earth/ is different and lots of things are not working anymore...
Hi @pixeldublu I started this PR in March and the WOF change was made last Friday. This PR was originally made for SQLite download only. Are you using the SQLite or the bundle download? Why is this PR not working for you?
For example, venues are not available anymore... and the bundles are also not available anymore...
Yes, check out our discussion here #487 (comment) If you need further discussion, feel free 🙂 The bundle was out of the scope of this PR. I will add it later.
I was trying to make individual db downloads last week and then this WOF download disable happened :) So I made some changes to my PR to use Geocode Earth and it works, but it's not mergeable yet. Looking for improvement ideas.
Big commit.... BREAKING CHANGE is coming! I dropped the support for bundle, the
Oh,
Only the download part has been deleted (download of the 40 GB SQLite files + generating a bundle-like architecture). The whosonfirst/src/components/sqliteStream.js Lines 73 to 91 in d7f5b51
That means it will work like a charm even for small areas 🙂 For example, I want only
{
  "datapath": "/opt/whosonfirst/",
  "importPostalcodes": true,
  "countries": "fr",
  "importPlace": 404227465,
  "sqlite": true
}
Import time is 2 seconds and the last log is:
Full France import time is 1 min 16 s and the last log is:
This is a small PR into #487 that allows country codes to be specified in upper case or lower case. Before this, specifying country codes in anything but lowercase would cause the downloader to skip all files.
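As an illustration of the case-insensitive matching described here, a minimal sketch follows. The helper names and the `-<code>-` file name pattern are assumptions made for the example; they are not taken from the downloader's actual code.

```javascript
// Sketch only: helper names and the file naming pattern are assumptions.
// Accept a single code or an array, and normalise every code to lower case.
function normalizeCountryCodes(countries) {
  const list = Array.isArray(countries) ? countries : [countries];
  return list
    .filter((code) => typeof code === 'string' && code.trim() !== '')
    .map((code) => code.trim().toLowerCase());
}

// Keep only the files whose name contains one of the requested country codes.
function filterByCountry(fileNames, countries) {
  const codes = normalizeCountryCodes(countries);
  return fileNames.filter((name) =>
    codes.some((code) => name.toLowerCase().includes(`-${code}-`))
  );
}

// Hypothetical file names, used only to exercise the filter.
const files = [
  'whosonfirst-data-admin-fr-latest.db.bz2',
  'whosonfirst-data-admin-de-latest.db.bz2',
  'whosonfirst-data-admin-us-latest.db.bz2'
];
const french = filterByCountry(files, 'FR');
const mixed = filterByCountry(files, ['Fr', 'DE']);
```

Normalising once, before any comparison, means `FR`, `Fr` and `fr` all select the same file instead of silently skipping everything.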
Bundles hosted by the Who's on First team at https://dist.whosonfirst.org have been removed. Data can now be downloaded based on a sponsorship from Geocode Earth. More info at https://geocode.earth/data
This helps ensure that in the future we don't break anything that depends on this module by exporting something that doesn't exist.
🎉
Now that the Who's on First downloader defaults to using SQLite downloads and supports country-specific bundles thanks to pelias/whosonfirst#487, we can start using that new functionality. This PR sets the `imports.whosonfirst.countryCode` property appropriately across all our current projects. This should not only help make all imports quite a bit faster, but it will require much less disk space for everyone. Since we're now sponsoring the Who's on First downloads here at Geocode Earth, the bandwidth savings also make us happy :) Along the way I made a few minor changes to remove some deprecated config options like `importVenues` and `sqlite`.
Well done everyone 😄 🚀
Well done mostly to you @Joxit for doing most of the work to start this PR and the one in Placeholder :) I just had some fun adding If there are any bugs they probably came in from all my rebasing, let us know :)
BREAKING CHANGE: drop support for wof bundles
BREAKING CHANGE: `imports.whosonfirst.sqlite` default to `true`
Background
The WOF databases are now hosted on geocode.earth, where both per-country and global admin databases are available. This PR updates the default endpoint to geocode.earth.
What's new ?
A new config is available: `config.imports.whosonfirst.countries`. This configuration takes advantage of the per-country databases: it lets you download only the countries you want. If the config is not present, the global databases are downloaded.
Minor fix introduced by #483: `bunzip2` and not `bunzip`
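The choice between per-country and global downloads could be sketched as follows. This is not the code from the PR: `resolveDownloadScope` is a hypothetical helper, and accepting an array of codes as well as a single string is an assumption made for the example.

```javascript
// Sketch only: resolveDownloadScope is hypothetical, not part of this PR.
// Returns which databases to fetch based on imports.whosonfirst.countries.
function resolveDownloadScope(config) {
  const wof = (config && config.imports && config.imports.whosonfirst) || {};
  const raw = wof.countries;
  if (raw === undefined || raw === null) {
    // Config not present: fall back to the global databases.
    return { scope: 'global', countries: [] };
  }
  // Assumption: both a single code ("fr") and a list (["fr", "de"]) are allowed.
  const countries = (Array.isArray(raw) ? raw : [raw])
    .map((code) => String(code).trim())
    .filter((code) => code !== '');
  return countries.length === 0
    ? { scope: 'global', countries: [] }
    : { scope: 'country', countries };
}

const perCountry = resolveDownloadScope({
  imports: { whosonfirst: { datapath: '/opt/whosonfirst/', countries: 'fr' } }
});
const planet = resolveDownloadScope({
  imports: { whosonfirst: { datapath: '/opt/whosonfirst/' } }
});
```

Here `perCountry` selects only the `fr` databases, while `planet` has no `countries` key and therefore resolves to the global download.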
Discussion
- In this PR, I did only the SQLite part; the bundle download is not compatible with geocode.earth. The old way was per placetype, now it's admin/postalcode/constituency. That means old users with a custom download URL will have some issues.
  - Breaking change
  - Or use both
- `importPlaces` are very useful for small imports. Now that per-country downloads are available, should we deprecate it?
  - `countries` and `importPlaces`?
  - Deprecated
- `importPlaces` extracts data in bundle format, should we deprecate this part too?

related: #460 #469 #477 pelias/wof-admin-lookup#289
Fixes #469
Closes #460
Fixes #437