All Content tagged with Data Lakes

Content language: English

Select tags to filter
Sort by most recent
97 results
Hi all, I'm trying to create a Iceberg table through Athena. The detailed S3 bucket path looks like this 's3://my_bucket/company_id={company_id}/date={YYYY-MM-DD}/data.parquet However, the 'date' ...
1
answers
0
votes
61
views
asked a month ago
I am creating a data mesh architecture on datazone domain with associating multiple accounts from the Central Account but when I am adding IAM role as a user into root domain of Datazone it does not a...
1
answers
0
votes
105
views
asked 2 months ago
Don't miss us live on [Twitch.tv](https://bit.ly/4anH9WR) on Monday, November 25th to learn how you can Build Next-Gen Data Platforms using Apache Iceberg.
profile pictureAWS
EXPERT
published 5 months ago1 votes699 views
The purpose of this article is to provide a guide on leveraging AWS Lambda and the Python-Magic library to accurately detect and categorize file types. This approach enables businesses to build more r...
In AWS DMS, I have a Serverless replication, but I want to modify it now justs to add an extra table. No matter what I change, I get this error: Task Settings CloudWatchLogGroup or CloudWatchLogStream...
1
answers
0
votes
95
views
asked 6 months ago
This is the third time I've run into this error. I don't know is it a real issue or I just forget to set something up. So basically when I try to add data permission to all table in Lakeformation ba...
0
answers
0
votes
38
views
asked 7 months ago
Config of Redshift Cluster: - Enhanced VPC routing has enabled - Redshift subnet in the same subnet as S3 vpc endpoint Config of S3 - VPC endpoints created for S3 - Routing has configured to rout...
1
answers
0
votes
132
views
asked 7 months ago
hello, in planning phase of a Datalake project and came across LakeFormation which seems to be the preferred way. I understand that essentially it is a group of S3 buckets so resiliency & durability i...
1
answers
0
votes
438
views
asked 8 months ago
HIVE_CURSOR_ERROR: incorrect data check This query ran against the "dbreport" database, unless qualified by the query. Please post the error message on our forum or contact customer support with Que...
1
answers
0
votes
252
views
asked 8 months ago
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. the job errors out with the following error: ``` Error Category: UNCLASSIFIED_ERROR; An error occurr...
0
answers
0
votes
278
views
asked 9 months ago
Is it possible to integrate opentelemetry(ADOT) and promethus managed by AWS with a data lake. The data lake could be both on edge or on AWS? If yes then how does it work? How can oepntemetry and Pro...
1
answers
0
votes
262
views
asked 10 months ago
profile pictureAWS
published 10 months ago1 votes1.6K views
This article demonstrates the end-to-end data pipeline solution with data ingestion, data storage, data processing, data analytics and data visualization. Today several enterprises face numerous chall...