What's the load time for FSx for Lustre from S3?

0

A customer is fronting S3 with FSx to interact with Sagemaker more efficiently. They will probably use scratch mode so they can turn on/off frequently, but I want to understand how long that initial load would be from S3 to see if it just makes sense to go the persistent route. I see in the FAQs that 'When linked to an S3 bucket, an FSx for Lustre file system transparently presents S3 objects as files and allows you to write changed data back to S3.' I'm assuming that means there is an initial load time from S3 to Lustre? Does it happen all at once or on an object-by-object basis, i.e. only when a certain object is requested for the first time?

What type of information can I give the customer about this initial latency around creating the file system?

AWS
已提問 3 年前檢視次數 1578 次
1 個回答
0
已接受的答案

It will take a few minutes to create the filesystem metadata representation of the S3 Bucket/prefix. The amount of time to create the filesystem depends on the number of objects in the filesystem. Once the file system is created, the data is lazy-loaded in the scratch FSx for Lustre based on file access, so the first access will be typical S3 latencies with subsequent access at sub-MS latencies for the scratch tier. Alternately, the customer can pre-populate the data (or a subset of the data) after the file system is created using the Lustre HSM commands: https://docs.aws.amazon.com/fsx/latest/LustreGuide/preload-file-contents-hsm.html

已回答 3 年前
profile picture
專家
已審閱 10 個月前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南