跳至內容

Mode of retrieval of records and storage (S3 Glacier)

0

Use case: To store records in pdf format and access it for audit purposes(once in six months or a year, only a few records sample). Solution: S3 Glacier is chosen to meet the objective and the cost benefit.

Unknown; Since records are stored in pdf formats, some times instead of file name, certain fields in records need to be searched. Eg; name, address or phone number to identify the record. How to do this? (cost effective and efficient solution).

已提問 2 年前檢視次數 237 次
1 個回答
5
已接受的答案

Hello,

please find the solution it will be helpful for you.

To store and occasionally retrieve PDF records in a cost-effective manner using Amazon S3 Glacier, you can implement a streamlined solution involving several AWS services. First, upload your PDFs to an S3 bucket and set a lifecycle policy to transition them to S3 Glacier for long-term storage. To enable searching by specific fields like name, address, or phone number, use Amazon Textract to extract text from the PDFs upon upload. This can be automated with an AWS Lambda function that triggers on new uploads, extracts the relevant metadata, and stores it as JSON files in a separate folder within the same S3 bucket. For searching, leverage Amazon S3 Select to query the stored metadata directly within S3, avoiding the need for a complex database setup. When you need to access a specific PDF, initiate a restore request from S3 Glacier to temporarily move the file back to S3 Standard for download. This approach ensures cost-efficiency and simplifies the process of storing and retrieving records for audit purposes.

please find the below AWS Document If you get more information.

https://docs.aws.amazon.com/AmazonS3/latest/userguide/restoring-objects-retrieval-options.html

https://aws.amazon.com/s3/storage-classes/glacier/

專家
已回答 2 年前
專家
已審閱 2 年前
專家
已審閱 2 年前
專家
已審閱 2 年前
  • Thanks for the quick reply and detailed answer. I was wondering if Athena could do the job. The solution shared in the answer seems to be straight forward and very useful. Thanks very much.

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。