How to connect a Sagemaker Notebook to Glue Catalog

0

A customer wants to connect a Sagemaker notebook to Glue Catalog, but is not allowed to use developer endpoints because of security constraints.

I can't seem to find documentation on the Glue Catalog API that would allow this, or examples of how this might be done. Any links or pointers would be greatly appreciated.

asked 4 years ago2229 views
1 Answer
0
Accepted Answer

So there is the catalog API which allows you to describe databases, tables, etc. Documentation regarding the calls and data structures can be found here:

Boto3 for get_table

If they have a restrictive security posture (as suggested by the avoidance of Dev Endpoints) you may also suggest a Glue VPC-E: https://docs.aws.amazon.com/vpc/latest/userguide/vpce-interface.html

I would ask what are they accessing the catalog for, as the Dev Endpoint isn't entirely about the Glue Catalog, but about the compute resources andSparkMagic.

Also, think about steering them towards AWS Data Wrangler for interacting with Glue Catalog if they are using Pandas. Helpful snippets can be found here:

AWS
John_G
answered 4 years ago
AWS
EXPERT
reviewed 2 years ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions