By using AWS re:Post, you agree to the Terms of Use
/Is it possible to specify DB snapshot in AWS Lake Formation? /

Is it possible to specify DB snapshot in AWS Lake Formation?

0

When defining blueprints in AWS Lake Formation, can we specify a particular snapshot? Does Lake Formation always uses the recent snapshot by default?

2 Answers
0
Accepted Answer

AWS Lake Formation blueprints make a JDBC connection to your database and effectively do a "select *" on the tables you want to import. It's referred to as a database snapshot because it brings full tables over at a point in time. It does not use RDS backup snapshots to import data.

Lake Formation enables you to create a workflow from a blueprint, creating workflows is much simpler and more automated in Lake Formation. The other option is to create workflows in AWS Glue. Lake Formation provides the following types of blueprints:

Database snapshot – Loads or reloads data from all tables into the data lake from a JDBC source. You can exclude some data from the source based on an exclude pattern.

Incremental database – Loads only new data into the data lake from a JDBC source, based on previously set bookmarks. You specify the individual tables in the JDBC source database to include. For each table, you choose the bookmark columns and bookmark sort order to keep track of data that has previously been loaded. The first time that you run an incremental database blueprint against a set of tables, the workflow loads all data from the tables and sets bookmarks for the next incremental database blueprint run. You can therefore use an incremental database blueprint instead of the database snapshot blueprint to load all data, provided that you specify each table in the data source as a parameter.

Log file – Bulk loads data from log file sources, including AWS CloudTrail, Elastic Load Balancing logs, and Application Load Balancer logs.

This is not to be confused with the functionality in Amazon RDS that can take data from an RDS backup snapshot and load that into S3. For more information on this see: https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/USER_ExportSnapshot.html

answered a year ago
0

No, Lake Formation BluePrint read data from table and not from the previously created RDS backup snapshots.

answered 9 months ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions