Timeout Error while uploading files to S3 Bucket using OpenShift Cluster

0

Hi AWS, I have a shell script in AWS OpenShift and it is having worker nodes backed by EC2 instances. The purpose of the script is to upload the files in Amazon S3 bucket. Having said that the job works fine in the morning but giving timeout errors during afternoon and night time which seems to be a weird issue. One more thing is the S3 bucket lives in another AWS account and both the bucket and the OpenShift cluster are in the ca-central-1 region.

Can you please help me figure out what is the reason behind this. At first glance I thought it's a proxy issue similar to AWS Lambda but I am not sure if that's really the case.

Please guide.

profile picture
Arjun
asked 10 months ago161 views
1 Answer
0

Hello Arjun,

I would like to just give general ideas of this issue since I am working with little information here. You mentioned the timeouts happen in the afternoons and at nights.

This brings up some questions like, does the write rate increase at these hours of the day or is the write consistent through out the day? If the rate of writing to bucket increases then it might just mean you are hitting default s3 limits.

For example, your application can achieve at least 3,500 PUT/COPY/POST/DELETE or 5,500 GET/HEAD requests per second per partitioned Amazon S3 prefix and in case you are hitting these limits, you most likely will get timeout errors as Amazon S3 scales to high request rates.

If scaling is the issue here, then you can basically implement retry mechanism for these put requests. Moreover, you can design your s3 to be performant by increasing the number of prefixes.

You can increase your read or write performance by using parallelization. For example, if you create 10 prefixes in an Amazon S3 bucket to parallelize reads, you could scale your read performance to 55,000 read requests per second. Similarly, you can scale write operations by writing to multiple prefixes.

The scaling, in the case of both read and write operations, happens gradually and is not instantaneous. While Amazon S3 is scaling to your new higher request rate, you may see some 503 (Slow Down) errors. These errors will dissipate when the scaling is complete.

Read more about this here: https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimizing-performance.html

answered a month ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions