Questions tagged with Extract Transform & Load Data
Good day,
I am trying to build a no-code/zero-code architecture. I was planning to use MWAA for orchestration, but I was tasked with looking at alternatives to keep it simple.
I was hoping Redshift query...
2 answers · 0 votes · 435 views · asked 4 months ago
Bug: SageMaker Canvas can't import Parquet files with numpy.nan/None/pandas.NA as first row value
I'm trying to create a tabular dataset in SageMaker Canvas Data Wrangler by importing a local Parquet file created with the pandas Python library. I succeed in loading the file and can preview it....
1 answer · 0 votes · 169 views · asked 4 months ago
I have an existing AWS Glue script that has been successfully running in Glue 2.4 for some time. I went in today to upgrade it to Glue 3.0 and am unable to connect to my database. I am simply reading...
1 answer · 0 votes · 512 views · asked 5 months ago
We're using the S3 Select SelectObjectContent API to convert CSV input to JSON output.
The input CSV files are very large, so we're passing chunks using ScanRange. Recently we ran into an issue with CSV files...
1 answer · 0 votes · 336 views · asked 5 months ago
Hi,
I am considering Glue to connect to a third-party application's database (Oracle) and bring over a data set (in excess of 1M rows) obtained by joining multiple tables at the source. The destination...
1 answer · 0 votes · 367 views · asked 5 months ago
I have multiple Visual ETL jobs configured correctly, but if I go back to the previous screen and then try to view a job again, the display editor loses the configuration and highlights some...
0 answers · 0 votes · 111 views · asked 5 months ago
I am working with a .sas7bdat file stored in my S3 bucket.
I want to convert the sas7bdat file to CSV, but in Glue Visual ETL I cannot see an option for the sas7bdat file format.
Can someone please help me...
1 answer · 0 votes · 329 views · asked 5 months ago
Hello,
While building a job in AWS Glue (Amazon S3, Change Schema, AWS Glue Data Catalog), I incurred a surprisingly high cost for the data preview session (AWS Glue GlueInteractiveSession): 91% of the total...
1 answer · 0 votes · 227 views · asked 5 months ago
I am importing the data dump file that I have downloaded from S3.
```
-----load schema
DECLARE
  v_hdnl NUMBER;
BEGIN
  v_hdnl := DBMS_DATAPUMP.OPEN(operation => 'IMPORT', job_mode => 'SCHEMA',...
```
1 answer · 0 votes · 1035 views · asked 5 months ago
Hello,
While trying to run the command `DELETE FROM "datasets"."us_spending"` in Athena, on a table from the AWS Glue Data Catalog, I got this error:
```
NOT_SUPPORTED: Cannot delete from non-managed Hive...
```
1 answer · 0 votes · 769 views · asked 5 months ago
Hello,
For an AWS Glue Data Catalog table, I ran a Glue job (structure: Amazon S3 -> Change Schema -> AWS Glue Data Catalog) and populated the table with only string records. All the actions were done from the...
1 answer · 0 votes · 185 views · asked 5 months ago
Hello
I am using PySpark in a Glue job to do ETL on a table sourced from S3, where the S3 data is in turn sourced from MySQL via DMS (table schema as below; columns 'op', 'row_updated_timestamp' & 'row_commit_timestamp' are...
1 answer · 0 votes · 142 views · asked 5 months ago