You can run any script as an EMR step by using the script runner (script-runner.jar):
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hadoop-script.html
In the Step Functions state machine, add a step that calls script-runner.jar and passes the S3 path of your script as its argument, for example:
{
  "StartAt": "Step_1",
  "States": {
    "Step_1": {
      "Type": "Task",
      "Resource": "arn:aws:states:::elasticmapreduce:addStep.sync",
      "Parameters": {
        "ClusterId.$": "$.ClusterId",
        "Step": {
          "Name": "1 - Step 1",
          "ActionOnFailure": "CONTINUE",
          "HadoopJarStep": {
            "Jar": "s3://elasticmapreduce/libs/script-runner/script-runner.jar",
            "Args": [
              "s3://xxx/scripts/step1.sh"
            ]
          }
        }
      },
      "End": true
    }
  }
}
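Put all of your commands in the step1.sh script. The script runs on the master node and can do any task you want, including your Sqoop commands. Outside us-east-1, the jar is normally referenced with a region prefix, as in s3://<region>.elasticmapreduce/libs/script-runner/script-runner.jar.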
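As a rough idea of what step1.sh could look like, here is a minimal sketch. The JDBC URL, table name, and target directory are all made-up placeholders (none of them come from the question), and the script assumes Sqoop is installed on the cluster; adjust the import options to whatever your job actually needs.

```shell
#!/bin/bash
# Hypothetical step1.sh, executed on the master node by script-runner.jar.
# Every value below is a placeholder (an assumption), not from the original answer.
set -eu

JDBC_URL="${JDBC_URL:-jdbc:mysql://db.example.com:3306/sales}"  # assumed source database
TABLE="${TABLE:-orders}"                                        # assumed table name
TARGET_DIR="${TARGET_DIR:-/user/hadoop/orders}"                 # assumed HDFS output path

# Prefix log lines so they are easy to find in the step's stdout.
log() { echo "[step1] $*"; }

# Import one table with a single mapper; add credentials and other options as needed.
import_table() {
  sqoop import \
    --connect "$JDBC_URL" \
    --table "$TABLE" \
    --target-dir "$TARGET_DIR" \
    --num-mappers 1
}

log "starting import of $TABLE"
if command -v sqoop >/dev/null 2>&1; then
  import_table
  log "import finished"
else
  # Sqoop is not on PATH (for example when trying the script off-cluster).
  log "sqoop not found on PATH; skipping import"
fi
```

Upload the file to the S3 path used in Args. Because of set -e, a failing sqoop command makes the script exit non-zero, so the EMR step is marked as failed. If a missing Sqoop should also fail the step, replace the else branch with exit 1.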