Questions tagged with Extract Transform & Load Data
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I want to extract data from my PostgreSQL on RDS using Aws data glue, transform the data and export the data to s3 bucket. how do i do that. i need an AWS tutorial on this.
2
answers
0
votes
1095
views
asked a year agolg...
I am writing a glue script to take data from s3(PSQL WAL LOGS) to write that data into a hudi data lake.
Whenever I am trying to do that I am getting unable to upsert data with commit time error with...
1
answers
0
votes
762
views
asked a year agolg...
```
df.toDF().write.format("jdbc").\
option("url", "").\
option("dbtable", f"public.{tableName}_staging").\
option("user", "").\
option("password", "").\
...
1
answers
0
votes
357
views
asked a year agolg...
HI,
I'm working with databrew to import some excel file to be then used by Athena. For some columns if I try to add an action I get the following error
`<ACTION>has not allowed characters in...
0
answers
0
votes
88
views
asked a year agolg...
First of all, I have already asked about this problem here in re:Post and on Database Administrators Stack Exchange...
0
answers
0
votes
127
views
asked a year agolg...
Hi, I want to run a job on EMR serverless which reads and writes data from postgresql. I downloaded the jar file and pushed it to s3 and set "spark.jars" in Spark properties in management console....
1
answers
0
votes
1099
views
asked a year agolg...
We have been able to connect to a Microsoft SQL Server DB using both Glue's DynamicFrame and Spark's own JDBC write option due to the Glue connection option.
However, we want to move this workload to...
1
answers
0
votes
666
views
asked a year agolg...
The goal is to create an ETL job that can be altered and executed by non-technical users in our organization, which is why we are sticking to only visuals and not code.
The problem is that the nodes...
1
answers
0
votes
814
views
asked a year agolg...
Hi,
I'm loading a set of files from S3 and filtering the dataset to files from the last hour.
See error in the Cloudwatch log below:
```
Processing parameterized path with the following...
0
answers
0
votes
108
views
asked a year agolg...
I'm attempting to write some ETL jobs in a jupyter notebook, however whenever I leave the page the notebook isn't listed on my written jobs. When I attempt to save the notebook manually I get this...
1
answers
0
votes
432
views
asked a year agolg...
Hi Folks,
I have a table called demo and it is cataloged in Glue. The table has three partition columns (col_year, col_month and col_day). I want to get the name of the partition columns...
1
answers
0
votes
512
views
asked a year agolg...
Hi AWS, I have a folder inside s3 bucket where the cost and usage report data is stored both in .csv and .csv.gz format. When I am creating the TABLE using that LOCATION the records are not printed in...
1
answers
0
votes
640
views
asked a year agolg...