This reference architecture creates an AWS Service Catalog Portfolio called "Service Catalog - AWS Glue Reference Architecture" with one associated product. The AWS Service Catalog Product references a cloudformation template for the a Glue Crawler which can be launched by end users through AWS Service Catalog. The AWS Service Catalog Glue product creates a crawler and a glue database. The crawler can be used to crawl S3 data source to populate the glue data catalog.
Launch the Glue portfolio stack:
- clone this git repo:
git clone [email protected]:aws-samples/aws-service-catalog-reference-architectures.git
- Copy everything in the repo to an S3 bucket:
cd aws-service-catalog-reference-architectures
aws s3 cp . s3://[YOUR-BUCKET-NAME-HERE] --exclude "*" --include "*.json" --include "*.yml" --recursive
- In the AWS CloudFormation console choose "Create Stack" and supply the Portfolio S3 url:
https://s3.amazonaws.com/[YOUR-BUCKET-NAME-HERE]/glue/sc-portfolio-glue.json
- Set the LinkedRole1 and LinkedRole2 parameters to any additional end user roles you may want to link to the Portfolio.
- Set the CreateEndUsers parameter to No if you have already run a Portfolio stack from this repo (ServiceCatalogEndusers already exists).
- Change the RepoRootURL parameter to your bucket's root url:
https://s3.amazonaws.com/[YOUR-BUCKET-NAME-HERE]/