Questions tagged with Data Lakes
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
working on a POC to understand how data Governed Tables compaction work, after governed table is created and data getting loaded into the table using a Glue job,
compaction is getting triggered...
1
answers
0
votes
320
views
asked 2 years agolg...
I am setting up a new data lake and have been tasked with creating the master data tables in the data bricks delta lake component. I'm trying to do this in a use-case agnostic way (or as agnostic as...
1
answers
0
votes
390
views
asked 2 years agolg...
When developing some Glue scripts from a successful Crawler run from a JDBC Oracle data source, I am encountering an error that I cannot resolve.
```
An error occurred while calling...
0
answers
0
votes
131
views
asked 2 years agolg...
Have created a DMS task to migrate data from MongoDB to S3 in parquet, and will be using parquet files in Glue. But the column names contain spaces in their names, due to which the parquet files are...
1
answers
0
votes
1797
views
asked 2 years agolg...
Hi,
I would like to know, when crawling the data from s3 in order to create a database; does the database must be a relational database ? It can have tables that no relation with other tables ?
1
answers
0
votes
254
views
asked 2 years agolg...
AWS Glue visual joblg...
Hi
I was trying to work on a simple problem where I am taking data from 3 csv files in s3(source) and after combining them, I am appending them to a table in postgre sql where my database is by...
1
answers
0
votes
338
views
asked 2 years agolg...
_temp lake formation blueprint pipeline tables appears to IAM user in Athena editor, although I didn't give this user permission on them below the policy granted to this IAM user,also in lake...
1
answers
0
votes
365
views
asked 2 years agolg...
Hello. Development Endpoint only supports Glue version <= 1.0. With upgraded Glue Versions, will Glue Version 1.0 eventually be deprecated?
I saw the following post related to development under Glue...
1
answers
0
votes
632
views
asked 2 years agolg...
Athena and Analyticslg...
1. What is the best way to Create a subset of factory location data
Current process: Query location data for specific factories, save in a new Athena table with a direct insert statement
2. Get...
0
answers
0
votes
82
views
asked 2 years agolg...
I have two S3 buckets with data tables, namely A and B, and a Glue job, that transforms data from A to B. Both tables contain a column called x. The Glue job performs a GroupBy operation on this...
1
answers
0
votes
1575
views
asked 2 years agolg...
**ERROR MESSAGE:** An error occurred while calling o518.pyWriteDynamicFrame. Unsupported case of DataType: com.amazonaws.services.glue.schema.types.StringType@235d3a6f and DynamicNode: integernode.
I...
1
answers
1
votes
1417
views
asked 2 years agolg...
Hi,
I have a database with around 40 tables. However, some end users don't need to see all tables in the database. I'm using Lake Formation Tagging and know that if a tag is added to the database...
1
answers
0
votes
606
views
asked 2 years agolg...