Questions tagged with Extract Transform & Load Data
Trying to write the records from an S3 text file to Redshift. It runs when the record count is around 10,000, but it runs long and the connection eventually times out when trying to write the entire file (50K...
1 answer, 0 votes, 189 views, asked 9 months ago
I am trying to use Amazon Grafana with files on Amazon S3. For that, I need to change the date format of the original file to fit Grafana. I use the following command:
SELECT...
1 answer, 0 votes, 291 views, asked 9 months ago
Hi,
I have a list of Glue jobs that are up and running. Starting from 2023/08/14, I'm seeing a lot of errors from CoarseGrainedExecutorBackend like this:
**ERROR CoarseGrainedExecutorBackend:**...
1 answer, 0 votes, 1036 views, asked 9 months ago
Hi all, I created an EventBridge rule with the following event pattern that is supposed to match all Glue crawler state changes and send them to a Lambda function. However, the rule is only sending...
2 answers, 0 votes, 763 views, asked 9 months ago
Hi,
I am searching for a transformation engine that supports multi-tenancy with the following requirements:
* Each tenant must be transformed every 10 minutes.
* One tenant transformation transforms...
1 answer, 0 votes, 346 views, asked 9 months ago
Hello, we have an S3 bucket with various CSV files, an AWS Glue crawler to update the Data Catalog, and finally an AWS Glue job to move the data to Redshift. The handling of data and target table is...
0 answers, 0 votes, 128 views, asked 9 months ago
Hi all. I generated a batch transform. The process finished correctly with no issues. When I check the output data, the directory is empty.
This is the manifest file format (RecordIO)...
1 answer, 0 votes, 254 views, asked 9 months ago
Importing form-encoded/CSV data from a data logger using AWS API Gateway, Lambda function, and DynamoDB
Greetings. For the past few weeks I have been trying to fetch data from a solar data logger that sends data to my API Gateway, which is integrated with my Lambda function, but the problem...
4 answers, 0 votes, 428 views, asked 9 months ago
Hello everyone.
Data from the REST API, in JSON form, is loaded daily by Lambda into s3-bucket-1.
This data should then be stored in s3-bucket-2 as a flat Parquet table.
I did it in...
0 answers, 0 votes, 73 views, asked 9 months ago
Hey guys,
I want to run my PySpark script on EMR Serverless, but it has some dependencies/libraries that the script needs in order to run. Please suggest an optimized approach to import the...
1 answer, 0 votes, 397 views, asked 9 months ago
Hello,
I have gone through the recommended changes provided in [this](https://repost.aws/knowledge-center/glue-crawler-internal-service-exception) article. However, I continue to get the same...
1 answer, 0 votes, 240 views, asked 9 months ago
Hi there. I managed to convert CSV files to Parquet files using a Glue job. My crawler does see the Parquet files in the S3 bucket, crawls them, presents me with the proper schema, and adds for each...
1 answer, 0 votes, 499 views, asked 10 months ago