S3 File Gateway - large files and robocopy error

0

I have an S3 File Gateway on EC2 in the same region as an EC2 File server on Windows 2022. Both instances are r5b.xlarge

S3 Gateway Server has 4x 2000gb gp3 drives.

I am using robocopy on the Windows server to move 10's of TB of files to the S3 File gateway. On large files I often see errors like below 90gig+ files usually

2023/08/21 15:26:02 ERROR 64 (0x00000040) Copying File F:\somesourcefolder\somefile2.zip The specified network name is no longer available.

Robocopy command like below robocopy F:\someLocalsource \S3GatewayServer\SomeFolder /J /COPY:DAT /S /E /MT:8

Sometimes it retries and the copy works and sometimes it just goes onto the next files and skips the large file. There's no indication of the S3 gateway instance being down. The same copy works fine between 2 windows servers in the same region.

I can see on the gateway cloudwatch graph that the dump to S3 is happening at just about the same speed as the smb copy to the gateway from the file server so I don't think it's getting too swamped. I expect that the reported error is somewhat generic and doesn't really mean the network is down or the s3 gateway is down but don't know where to troubleshoot.

已提问 9 个月前415 查看次数
1 回答
0
已接受的回答

Hello,

Please check if the Cache%Dirty metric on the gateway to see if it is above 80% during the copy. Gateway starts throttling the reads and writes to the cache if the metric is above 80%. However, if the large file copy still fails even with the CachePercentDirty metric being less than 80 percent, then your File Gateway might be hitting a client-side session timeout.

For SMB, you can increase this timeout using the PowerShell command below. Running this command sets the timeout to 1800 seconds. The default value is 60 Seconds.

Set-SmbClientConfiguration -SessionTimeout 1800

Also, you can alternatively set the SMB session timeout on the SMB client by updating the registry setting at HKEY_LOCAL_MACHINE\SYSTEM\CURRENTCONTROLSET\SERVICES\ LANMANWORKSTATION\PARAMETERS\SESSTIMEOUT.

Ref - https://docs.aws.amazon.com/filegateway/latest/files3/troubleshooting-file-gateway-issues.html#backup-job-fails

If the above suggestion did not address the issue, please open a case with AWS support for further investigation.

Harshi
已回答 9 个月前
profile picture
专家
已审核 1 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则