All Content tagged with Data Lakes

Content language: English

Select up to 5 tags to filter
Sort by most recent
95 results
Don't miss us live on [Twitch.tv](https://bit.ly/4anH9WR) on Monday, November 25th to learn how you can Build Next-Gen Data Platforms using Apache Iceberg.
profile pictureAWS
EXPERT
published 3 months ago1 votes447 views
The purpose of this article is to provide a guide on leveraging AWS Lambda and the Python-Magic library to accurately detect and categorize file types. This approach enables businesses to build more r...
In AWS DMS, I have a Serverless replication, but I want to modify it now justs to add an extra table. No matter what I change, I get this error: Task Settings CloudWatchLogGroup or CloudWatchLogStream...
1
answers
0
votes
76
views
asked 4 months ago
This is the third time I've run into this error. I don't know is it a real issue or I just forget to set something up. So basically when I try to add data permission to all table in Lakeformation ba...
0
answers
0
votes
36
views
asked 5 months ago
Config of Redshift Cluster: - Enhanced VPC routing has enabled - Redshift subnet in the same subnet as S3 vpc endpoint Config of S3 - VPC endpoints created for S3 - Routing has configured to rout...
1
answers
0
votes
122
views
asked 5 months ago
hello, in planning phase of a Datalake project and came across LakeFormation which seems to be the preferred way. I understand that essentially it is a group of S3 buckets so resiliency & durability i...
1
answers
0
votes
424
views
asked 6 months ago
HIVE_CURSOR_ERROR: incorrect data check This query ran against the "dbreport" database, unless qualified by the query. Please post the error message on our forum or contact customer support with Que...
1
answers
0
votes
233
views
asked 6 months ago
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. the job errors out with the following error: ``` Error Category: UNCLASSIFIED_ERROR; An error occurr...
0
answers
0
votes
273
views
asked 7 months ago
Is it possible to integrate opentelemetry(ADOT) and promethus managed by AWS with a data lake. The data lake could be both on edge or on AWS? If yes then how does it work? How can oepntemetry and Pro...
1
answers
0
votes
238
views
asked 7 months ago
profile pictureAWS
published 8 months ago1 votes1.6K views
This article demonstrates the end-to-end data pipeline solution with data ingestion, data storage, data processing, data analytics and data visualization. Today several enterprises face numerous chall...
At every 30 minutes it saying "ICEBERG_VACUUM_MORE_RUNS_NEEDED: Removed 20000 files in this round of vacuum" but when I calculate the my table metadata size it didn't changed before and after the vacu...
2
answers
0
votes
761
views
asked 10 months ago
Hi! I have been searching and playing around with services and cannot seem to find what I need. I am using the following architecture to guide me in building out my end-to-end solution: https://aws....
2
answers
0
votes
413
views
asked a year ago