Skip to content

Latest commit

 

History

History
56 lines (47 loc) · 3.12 KB

installation.md

File metadata and controls

56 lines (47 loc) · 3.12 KB

Installation

Bootstrap a default version of Amundsen using Docker

The following instructions are for setting up a version of Amundsen using Docker.

  1. Install docker and docker-compose.

  2. Clone this repo and its submodules by running:

    $ git clone --recursive [email protected]:lyft/amundsen.git
  3. Enter the cloned directory and run:

    # For Neo4j Backend
    $ docker-compose -f docker-amundsen.yml up
    
    # For Atlas
    $ docker-compose -f docker-amundsen-atlas.yml up
  4. Ingest dummy data into Neo4j by doing the following: (Please skip if you are using Atlas backend)

    • Change directory to the amundsendatabuilder submodule.
    • Run the following commands in the amundsendatabuilder upstream directory:
     $ python3 -m venv venv
     $ source venv/bin/activate  
     $ pip3 install -r requirements.txt
     $ python3 setup.py install
     $ python3 example/scripts/sample_data_loader.py
  5. View UI at http://localhost:5000 and try to search test, it should return some result.

Atlas Note: Atlas takes some time to boot properly. So you may not be able to see the results immediately after docker-compose up command. Atlas would be ready once you'll have the following output in the docker output Amundsen Entity Definitions Created...

Verify setup

  1. You can verify dummy data has been ingested into Neo4j by by visiting http://localhost:7474/browser/ and run MATCH (n:Table) RETURN n LIMIT 25 in the query box. You should see two tables:
    1. hive.test_schema.test_table1
    2. dynamo.test_schema.test_table2
  2. You can verify the data has been loaded into the metadataservice by visiting:
    1. http://localhost:5000/table_detail/gold/hive/test_schema/test_table1
    2. http://localhost:5000/table_detail/gold/dynamo/test_schema/test_table2

Troubleshooting

  1. If the docker container doesn't have enough heap memory for Elastic Search, es_amundsen will fail during docker-compose.
    1. docker-compose error: es_amundsen | [1]: max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144]
    2. Increase the heap memory detailed instructions here
      1. Edit /etc/sysctl.conf
      2. Make entry vm.max_map_count=262144. Save and exit.
      3. Reload settings $ sysctl -p
      4. Restart docker-compose
  2. If docker-amundsen-local.yml stops because of org.elasticsearch.bootstrap.StartupException: java.lang.IllegalStateException: Failed to create node environment, then es_amundsen cannot write to .local/elasticsearch.
    1. chown -R 1000:1000 .local/elasticsearch
    2. Restart docker-compose