Questions tagged with Extract Transform & Load Data
I've tried to use a Glue Spark job for very basic partitioning of about 50 GB of GZIP-compressed JSON data.
The reason for trying a Glue job is that my data could have more than 100 partitions and it is not really...
2 answers, 0 votes, 1108 views, asked a year ago
How can I add an ODBC driver to my Glue **Python shell** job? I am trying to use the pyodbc library and can see with `pyodbc.drivers()` that the MySQL and PostgreSQL drivers are available; however, I would like to...
2 answers, 3 votes, 2166 views, asked a year ago
Does anyone know how to use Amazon's API to retrieve all available promotional codes given to affiliates? Or is there another way to do that?
2 answers, 0 votes, 1279 views, asked a year ago
Hi,
I am trying to perform an upsert on an Iceberg table.
The script below creates a table with raw data stored in Parquet format in an S3 bucket.
Then it creates an empty Iceberg table to be...
1 answer, 0 votes, 1483 views, asked a year ago
Pretty basic newbie Redshift question here. I want to upload historical test data to a Redshift database. The data as we have it is in multiple CSV files and is formatted in typical table format with...
2 answers, 0 votes, 319 views, asked a year ago
While processing a file through EMR, if the cluster is terminated, only a few records are updated. When processing it again, should we delete the file at the target location so we can process the file...
1 answer, 0 votes, 189 views, asked a year ago
When I created a crawler to crawl an RDS (Postgres), it was able to connect and crawl one table I specified. When I created a job, using the node type "AWS Glue Data Catalog table with PostgreSQL as...
2 answers, 0 votes, 549 views, asked a year ago
Cron job problem: the job runs Monday to Friday but stops on Saturday and Sunday. Please help with this problem.
0 answers, 0 votes, 97 views, asked a year ago
Hello Team, is there a limit to the number of tables which can be scanned using the Glue Crawler? I have a crawler which scans S3 buckets from a single source for data from January 2021 until December...
2 answers, 0 votes, 348 views, asked a year ago
Hello!
I'm facing an issue while trying to stream data from CloudWatch to OpenSearch using Firehose as middleware. The message that OpenSearch is giving me, after running the transformation Lambda...
1 answer, 0 votes, 462 views, asked a year ago
Hi all,
I have followed the instructions at https://docs.aws.amazon.com/athena/latest/ug/connect-data-source-serverless-app-repo.html to deploy Timestream as an additional data source to Athena and can...
1 answer, 0 votes, 343 views, asked a year ago
I've created a data validation box in my Glue ETL, which imports the following:
`from awsgluedq.transforms import EvaluateDataQuality`
To develop my script further, I've copied it to an AWS...
2 answers, 0 votes, 554 views, asked a year ago