Workflow Stuck on "Set up compute" in CodeCatalyst for Over 8 Hours

0

Hi,

I am currently experiencing an issue with my workflow in AWS CodeCatalyst. The workflow gets stuck during a build action and does not proceed further. The last log entry I see is "Set up compute", after which there are no further logs. The issue has persisted for over 8 hours now. Timeout is set to 5 minutes.

Here are the details from the dashboard: Start time: About 15 hours ago Duration: 8 hours 3 seconds

And here are the configurations for the action where the workflow gets stuck: Compute type: EC2 Compute: Linux.Arm64.Large Operating system: Amazon Linux 2 Provisioning model: On-demand Machine type: 4 GB memory, 2 vCPUs Timeout: 5 minutes

Could you please assist me in identifying the cause of this issue and suggest possible solutions? I am eager to resolve this as soon as possible to ensure the smooth operation of my workflows.

Thank you for your assistance.

Robert

profile picture
asked 10 months ago316 views
4 Answers
1
Accepted Answer

The issue has been resolved. After waiting 8 hours for the build job to timeout and switching to a larger Compute type, as recommended by Didier, I encountered a "CLIENT_ERROR: Unable to pull customer's container image" error message. Once I directed it to the correct container image, it functioned as expected.

profile picture
answered 10 months ago
0

Hi, not sure but maybe related to insufficient size of EC2 instance. Did you try with xlarge instead of large ? Didier

profile pictureAWS
EXPERT
answered 10 months ago
  • Thank you for your response. I'm unable to proceed with testing because I'm stuck in this state. It's not possible to rerun the workflow until I manage to stop the currently running workflow. The timeout isn't working, and I can't figure out how to kill it.

0

Seems to be a combination of bugs.

  • UI bug, showing In progress, but the workflow actually stopped after 8 hours.
  • Timeout bug, Timeout configured to 5 minutes, but timeouts after 8 hours.

Not possible to configure Linux.Arm64.XLarge from the visual editor, not sure if this is due to restrictions or limitations in the UI but I'll figure it out.

profile picture
answered 10 months ago
0

@Robert Dahlborg What do you mean when you said you "directed it to the correct container image" ? Just change the image to the correct link? I also got the error: CLIENT_ERROR: Unable to pull customer's container image" error message but I don't know how to debug Here's the content I use for the field Image: {myAccountNumber}.dkr.ecr.eu-central-1.amazonaws.com/{myImage} I also setup the permission for ECR and Trust policy as mentioned in https://docs.aws.amazon.com/codecatalyst/latest/userguide/build-action-ref.html#build.configuration.container.image So I have no idea why it couldn't pull the image. If I use a public DockerHub or public Aws ECE it worked fine. Do you have any idea how to debug it since I can't see anything in the build log's except the error message above

Vu Hai
answered a month ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions