Questions tagged with AWS Glue
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
I am trying to create two DynamicFrames based on a column that is a boolean. I have tried
`dyf.split_rows({'mybool': {'=': 'true'}}, 'is_true', 'is_not_true')`
`dyf.split_rows({'mybool': {'=':...
2
answers
0
votes
80
views
asked 16 days agolg...
I am writing this question after going through bunch of glue pricing documents. Essentially what I want to know is how glue divides visual job ETL components for pricing.
**Pipeline...
1
answers
0
votes
82
views
asked 16 days agolg...
AWS Glue Job Errorlg...
Im trying to convert CSV files in S3 to Parquet in another S3 bucket. So first I read the CSV files using a crawler, load the data into a Table, and then use a Job to convert from the Table to S3 in...
0
answers
0
votes
313
views
asked 17 days agolg...
I have a json file in s3 (sample below) in json lines format. I create a crawler in aws glue to read this file, which creates a table definition and produces a table schema as such ,
schema:
```
# ...
1
answers
0
votes
100
views
asked 18 days agolg...
I am setting up the Connection in AWS Glue, trying to connect to my dev db instance (AWS rds Aurora PostgreSQL), I double checked VPC, subnet and setup the inbound rule to allow incoming connections...
1
answers
0
votes
511
views
asked 19 days agolg...
Hello, can anyone give any advice on this.
I created the very simple test Glue job: Source - RDS Postgres, Destination - S3 bucket.
Run takes about 23 minuts and ends with timeout error.
In the log I...
3
answers
0
votes
604
views
asked 20 days agolg...
Hello,
We set up AWS DMS, where the source is MS SQL Server 2019, and the target is S3 (with parquet). Setting up CDC copying. And it is important for us to check that DDLs on source work as well:
1)...
0
answers
0
votes
229
views
asked 22 days agolg...
I am getting json files to my s3. For example:
```
{
"name" : "John",
"lastname": "Doe",
"meta" : {
"x": "a",
"y": "b",
"unwanted_field": {
"some":...
1
answers
0
votes
85
views
asked 22 days agolg...
say i have couple of json files in s3, I would to set up a crawler or a glue job, such that i can create table in aws rds (mysql or postgre) , such that in table 1, it creates a autogenerated id and...
1
answers
0
votes
920
views
asked 22 days agolg...
How to build AWS Glue ETL Jobs or Data Quality Jobs, if access to console is not allowed as per company policy. Does not having AWS Console access defeats the purpose AWS Glue? What features cannot be...
2
answers
0
votes
131
views
asked 23 days agolg...
I've been trying to test out Iceberg tables with Amazon Redshift Spectrum and have come across a major issue.
Here is my setup:
1. I create an iceberg table via spark (emr 7.0) and insert data across...
0
answers
1
votes
471
views
asked 23 days agolg...
Hi,
I want to read a tar file from s3, uncompress it and load it to another s3 bucket using Glue job. But I am facing "fileobj must implement read".
obj=s3.getObject(bucketname,key)
objbuffer =...
1
answers
0
votes
144
views
asked 24 days agolg...