I am trying to install Nominatim based off of a dockerized version of the tool found here: https://github.com/mediagis/nominatim-docker. I was successful with installing dockerised Nominatim with a smaller country locally. And the process ran pretty fast. Then, I got an EC2 instance with, 64GB memory, 1TB SSD, and 6 core processors to do an install with the entire planet (48GB). The most of the planet file got processed fast, in about 6hours and when processing the relations gave an error and failed. When I ran the planet file based on the same docker image, the object parsing (specially ways) were extremely slow. About 100 times slower than the initial run. This is the error I got when the planet ran faster:
Finally, I just took a pbf of North America (8GB) and tried the install. Object parsing related to North America went relatively fast, but it is still working on rankings and indexes. The install is going on for over 16 hours. I didn't do any PostgreSQL optimizations before running the install. When the object parsing is slow, I see the container is only using about 1% of the available memory. In Nominatim install instructions (here: https://nominatim.org/release-docs/latest/admin/Installation/) I see that with less config (32 GB RAM) a planet install is taking only 2 days. What am I doing wrong and how can I optimize the process? asked 22 Dec '19, 03:34 picmate |
The mediagis docker image is not particularly well suited for large size imports like North America or the planet. Read the original Nominatim installation instructions to get an idea about tuning and here in particular the section on PostgreSQL tuning, using flatnode files and notes on time and memory use during import. A machine of your size should manage a planet import in 2-3 days but only if you use flatnode files and a tuned postgres configuration. answered 22 Dec '19, 16:39 lonvia |
In the Docker file I don't see "--osm2pgsql-cache" set for the setup.php (https://nominatim.org/release-docs/latest/admin/Import-and-Update/#initial-import-of-the-data) That will give osm2pgsql more RAM to work with. answered 22 Dec '19, 04:17 mtmail Thanks. Can I know whether 20 ways/sec parsing time is what I should expect for way parsing when working with this type of instance configuration?
(22 Dec '19, 13:27)
picmate
1
I had around 6000k/s (node), 60k/s (way), 1000/s (relation) on the last planet import.
(22 Dec '19, 17:09)
mtmail
|