AWS Data Pipeline stuck in "WAITING FOR RUNNER"
All of the AWS Data Pipelines I create to export data from a DynamoDB table to a .csv file in an S3 bucket are stuck in the "WAITING FOR RUNNER" state. I am following the tutorial here: https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-importexport-ddb.html
I also checked the value of "Runs on resource", which is "EmrClusterForBackup", and of "workerGroup", which is "df-0274180EXA6BQJAA9HV_@EmrClusterForBackup_2019-01-13T10:21:37".
Can you tell me if these values are correct?
Common causes are:
- Check the IAM permissions of the EMR EC2 role and make sure that it allows datapipeline:*
- Make sure that internet connectivity is available from the EMR/EC2 instances, either through a NAT or an internet gateway (IGW)
You can add an EC2 key pair to the EMR cluster, log in to the EMR master node over SSH, and then check the Task Runner log:
tail -100f /mnt/taskrunner/output/logs/taskrunner*.log
Thank you for your answer.
I think by "EMR EC2 role" you mean the DataPipelineDefaultResourceRole, the one I assign during the creation of the pipeline in the field "EC2 instance role"?
If so, I checked: it has Data Pipeline full access.
If I activate my pipeline, then switch to the EMR console and check the respective cluster, it says "Install Task Runner cancelled".
Do you have an idea what the issue could be?
Edited by: OlliHC on Jan 14, 2019 7:08 AM
Edited by: OlliHC on Jan 14, 2019 7:14 AM
Check your pipeline's terminateAfter timeout parameter and increase it to 2-3 times the normal run time, plus 15 minutes for the cluster launch.
For example, if the job usually runs for 10 minutes, then 3 x 10 + 15 = 45 minutes would be a good value for terminateAfter.
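The rule of thumb above can be written as a small calculation. This is only a sketch: the function name, the default multiplier of 3, and the 15-minute launch buffer are illustrative assumptions taken from the example in this answer, not values defined by AWS Data Pipeline.

```python
def recommended_terminate_after(usual_run_minutes, multiplier=3, launch_buffer_minutes=15):
    """Suggest a terminateAfter value in minutes.

    Hypothetical helper: multiplies the usual run time by a safety factor
    (2-3 per the advice above) and adds a buffer for the cluster launch.
    """
    if usual_run_minutes <= 0:
        raise ValueError("usual_run_minutes must be positive")
    if multiplier < 1:
        raise ValueError("multiplier must be at least 1")
    return usual_run_minutes * multiplier + launch_buffer_minutes


# A job that usually takes 10 minutes: 3 * 10 + 15
print(recommended_terminate_after(10))  # 45
```

With a more conservative factor of 2, the same job would get 2 x 10 + 15 = 35 minutes; the right factor depends on how much the run time varies.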
Just to be precise:
I fixed it.
The reason was that the EC2 instance limit of the respective account had been reached.
We contacted AWS Support and they increased our limit; now it works fine.