Questions tagged with Extract Transform & Load Data
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
We are moving our content from one developer to another. I am trying to figure out not to stop my Images and pdf uploads from being rasterized.
My old developer had figured it out, but I can't.
Any...
1
answers
0
votes
219
views
asked a year agolg...
I am trying to create a Delta Table from spark sql using the Glue meta catalog.
I can correctly query a Delta table using the Glue metastore:
```
%%sql
select * from `my_table` VERSION AS OF 1 limit...
2
answers
0
votes
1686
views
asked a year agolg...
Hi Team,
I have a complex nested xml file which I want to read using AWS Glue and convert it to parquet format. I want to use pandas read_xml function to read the xml file. But, I get error lxml not...
1
answers
0
votes
325
views
asked a year agolg...
I read data from s3 using as follow.
```
sec_id_dyf = glueContext.create_dynamic_frame.from_options(
connection_type = 's3',
...
0
answers
0
votes
106
views
asked a year agolg...
Hello, I have been experimenting with Aws glue, and created some crawlers to crawl the data but the behavior wasn't what I expected,
Question 1) I had an S3 bucket
with 3...
1
answers
0
votes
354
views
asked a year agolg...
Looks like attempting to write to a Delta Lake table from a DynamicFrame is not working. The Visual Glue interface generates a script like:
```
s3 =...
2
answers
0
votes
606
views
asked a year agolg...
We are trying to use Glue to query and aggregate some Parquet files in S3.
We get this error related to schema mismatch:
```
An error occurred while calling o106.pyWriteDynamicFrame....
2
answers
0
votes
277
views
asked a year agolg...
Hello,
I am experience an issue when trying to use the Glue ETL on one of tables in my data catalogue. I am using the visual tool with a very simple SQL transformation on the table and when clicking...
0
answers
0
votes
96
views
asked a year agolg...
I have a large postgres table which is having issues replicating on DMS v3.5.1. The same task (and endpoints) work fine with v3.4.7, so I have rolled back to that for now. The job is a full load &...
1
answers
0
votes
992
views
asked a year agolg...
Hello,
I am trying to save job parameters that I would like to pass to individual jobs within the workflow. After adding them and clicking the 'Update' button and reopening the job parameters, all...
1
answers
0
votes
305
views
asked a year agolg...
I am looking to have metadata produced by crawlers during Glue Jobs record both source AND target information on the same data catalog table. From the research I have done, recording source metadata...
1
answers
0
votes
253
views
asked a year agolg...
In a AWS Glue job, I am getting issue when I am trying to execute sql query in below code:
```
source_dataset = glueContext.create_dynamic_frame.from_options(connection_type="oracle",...
2
answers
0
votes
2324
views
asked a year agolg...